Social Media
Tested
- Avg AIMAC Debt: 8.52
- Avg Cost: $0.25
- Models Tested: 42
- Zero Debt: 4
- Avg Violations: 14.9
Insights for Social Media
- Common Issues: Elements must meet minimum color contrast ratio thresholds, Links must have discernible text, Buttons must have discernible text
- Category Difficulty: Rank #5 of 28 categories (based on average model performance)
- Real-World Comparison: The WebAIM Million audits the accessibility of 1 million real websites annually. The two figures come from different automated tools (axe-core here, WAVE in the WebAIM audit), so the comparison is indicative rather than like-for-like: AI models showed 69.5% fewer detected issues than real Social Media websites (14.9 axe-core violations vs 48.8 WAVE errors).
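
The 69.5% figure is the relative difference between the two averages quoted above. A minimal sketch of that arithmetic follows; the variable names are illustrative, and because the two counts come from different tools the result is a rough comparison only.

```python
# Averages quoted in the Real-World Comparison above.
ai_axe_violations = 14.9   # avg axe-core violations for AI models
real_wave_errors = 48.8    # avg WAVE errors for real Social Media websites

# Relative reduction: (baseline - observed) / baseline.
reduction = (real_wave_errors - ai_axe_violations) / real_wave_errors

print(f"{reduction:.1%} fewer detected issues")  # prints "69.5% fewer detected issues"
```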
Model Results
AIMAC Debt: The model's accessibility debt for this category (lower = better)
| Model | AIMAC Debt (lower = better) | Cost (in USD) | | |
|---|---|---|---|---|
| gpt oss 120b | 0.00 | $0.003 | 0 | 0 |
| GLM 5 | 0.00 | $0.10 | 0 | 0 |
| GPT 5.4 Pro | 0.00 | $7.83 | 0 | 0 |
| GPT 5.4 Mini | 0.00 | $0.03 | 0 | 0 |
| GPT 5.4 | 2.00 | $0.18 | 0 | 1 |
| Qwen3 Max | 3.32 | $0.035 | 0 | 4 |
| Gemini 3.1 Pro Preview | 3.32 | $0.16 | 0 | 4 |
| GPT 5.3 Codex | 3.32 | $0.10 | 0 | 4 |
| Qwen3 Coder Plus | 3.71 | $0.02 | 0 | 6 |
| GPT 5.1 Codex Mini | 3.71 | $0.015 | 0 | 6 |
| Mistral Medium 3.1 | 3.71 | $0.015 | 0 | 6 |
| GLM 4.7 Flash | 3.71 | $0.004 | 0 | 6 |
| Nemotron 3 Super (free) | 3.98 | $0.00 | 0 | 8 |
| Claude Opus 4.6 | 4.51 | $0.76 | 0 | 14 |
| Olmo 3.1 32B Instruct | 4.58 | $0.005 | 0 | 15 |
| Nova 2 Lite | 4.70 | $0.014 | 0 | 17 |
| o3 | 4.70 | $0.04 | 0 | 17 |
| Kimi K2 Thinking | 4.75 | $0.026 | 0 | 18 |
| Trinity Large Preview (free) | 4.75 | $0.00 | 0 | 18 |
| Grok 4.20 | 4.94 | $0.07 | 0 | 22 |
| Qwen3 Coder Next | 4.99 | $0.014 | 0 | 23 |
| Gemini 3 Flash Preview | 5.10 | $0.024 | 0 | 26 |
| MiniMax M2.7 | 5.49 | $0.05 | 0 | 39 |
| Claude Haiku 4.5 | 5.56 | $0.07 | 0 | 42 |
| Qwen3 Max Thinking | 6.98 | $0.06 | 0 | 7 |
| Mistral Large 3 2512 | 7.37 | $0.03 | 0 | 8 |
| o4 Mini | 7.80 | $0.02 | 0 | 21 |
| Qwen3 Coder Flash | 7.85 | $0.011 | 0 | 10 |
| DeepSeek V3.2 | 8.17 | $0.0034 | 0 | 12 |
| Mistral Small 4 | 9.35 | $0.006 | 0 | 29 |
| Codestral 2508 | 11.61 | $0.005 | 0 | 14 |
| Devstral 2 2512 | 12.75 | $0.0033 | 2 | 4 |
| Qwen3.5 397B A17B | 13.05 | $0.03 | 2 | 3 |
| Qwen3.6 Plus Preview (free) | 13.79 | $0.00 | 1 | 32 |
| Grok 4.1 Fast | 14.19 | $0.0024 | 2 | 10 |
| Kimi K2.5 | 15.23 | $0.04 | 3 | 11 |
| Qwen3.5 Flash | 17.07 | $0.003 | 3 | 4 |
| Claude Sonnet 4.6 | 17.33 | $0.54 | 2 | 45 |
| DeepSeek V3.2 Speciale | 19.75 | $0.01 | 3 | 23 |
| R1 0528 | 24.65 | $0.023 | 7 | 24 |
| KAT Coder Pro V2 | 28.30 | $0.01 | 7 | 20 |
| Trinity Large Thinking | 37.78 | $0.01 | 15 | 5 |
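
The summary figures at the top of the page can be cross-checked against the table. The sketch below copies the AIMAC Debt column by hand (the names `debts`, `zero_debt`, and `avg_debt` are illustrative) and recomputes the model count, the zero-debt count, and the average debt. It assumes the summary average is a plain arithmetic mean over all listed models.

```python
from statistics import mean

# AIMAC Debt column, copied from the table in row order.
debts = [
    0.00, 0.00, 0.00, 0.00, 2.00, 3.32, 3.32, 3.32, 3.71, 3.71, 3.71,
    3.71, 3.98, 4.51, 4.58, 4.70, 4.70, 4.75, 4.75, 4.94, 4.99, 5.10,
    5.49, 5.56, 6.98, 7.37, 7.80, 7.85, 8.17, 9.35, 11.61, 12.75,
    13.05, 13.79, 14.19, 15.23, 17.07, 17.33, 19.75, 24.65, 28.30, 37.78,
]

models_tested = len(debts)                     # number of table rows
zero_debt = sum(1 for d in debts if d == 0.0)  # models with no recorded debt
avg_debt = round(mean(debts), 2)               # unweighted mean, 2 decimals

print(models_tested, zero_debt, avg_debt)  # prints "42 4 8.52"
```

Under that assumption, the recomputed values match the reported Models Tested, Zero Debt, and Avg AIMAC Debt.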