Society
Tested
- Avg AIMAC Debt 6.57
- Avg Cost $0.14
- Models Tested 37
- Zero Debt 1
- Avg Violations 16.0
Insights for Society
- Common Issues: Elements must meet minimum color contrast ratio thresholds, Links must have discernible text, Select element must have an accessible name
- Category Difficulty: Rank #13 of 28 categories (based on average model performance)
- Real-World Comparison: The WebAIM Million audits accessibility across 1 million real websites annually. Using different automated tools, AI models showed 65.3% fewer detected issues than real Society websites (16.0 axe-core violations vs 46.2 WAVE errors).
Model Gallery
Click a screenshot to view the generated HTML page, or click the model name for detailed results and prompts.
Model Results
AIMAC Debt:The model's accessibility debt for this category (lower = better)
| lower = better | in usd | |||
|---|---|---|---|---|
| GPT 5.1 Codex Mini | 0.00 | $0.014 | 0 | 0 |
| MiniMax M1 | 2.00 | $0.025 | 0 | 1 |
| GPT 5.2 Codex | 2.00 | $0.06 | 0 | 1 |
| DeepSeek V3.2 Speciale | 3.05 | $0.007 | 0 | 3 |
| KAT Coder Pro V1 | 3.71 | $0.007 | 0 | 6 |
| Gemini 3 Pro Preview | 3.85 | $0.12 | 0 | 7 |
| o4 Mini | 3.98 | $0.018 | 0 | 8 |
| Qwen3 Max | 4.00 | $0.034 | 0 | 4 |
| Mistral Small 3.2 24B | 4.19 | $0.0009 | 0 | 10 |
| Qwen3 Coder 480B A35B | 4.37 | $0.01 | 0 | 12 |
| MiniMax M2.1 | 4.44 | $0.009 | 0 | 13 |
| gpt oss 120b | 4.51 | $0.004 | 0 | 14 |
| Claude Haiku 4.5 | 4.75 | $0.08 | 0 | 18 |
| Nova 2 Lite | 4.75 | $0.012 | 0 | 18 |
| Kimi K2 0905 | 4.75 | $0.013 | 0 | 18 |
| Claude Sonnet 4.5 | 4.85 | $0.22 | 0 | 20 |
| Qwen3 235B A22B Instruct 2507 | 4.85 | $0.01 | 0 | 20 |
| GLM 4.7 | 4.85 | $0.017 | 0 | 20 |
| Claude Opus 4.5 | 4.94 | $0.64 | 0 | 22 |
| GPT 5.2 Pro | 5.05 | $3.20 | 0 | 4 |
| Grok 4.1 Fast | 5.10 | $0.003 | 0 | 26 |
| Mistral Medium 3.1 | 5.10 | $0.014 | 0 | 26 |
| Kimi K2 Thinking | 5.54 | $0.025 | 0 | 41 |
| Gemini 3 Flash Preview | 6.63 | $0.04 | 0 | 6 |
| GPT 5 Mini | 7.10 | $0.023 | 0 | 27 |
| Devstral 2 2512 | 7.68 | $0.005 | 0 | 12 |
| Qwen3 Coder Flash | 8.09 | $0.01 | 0 | 13 |
| GLM 4.7 Flash | 8.09 | $0.004 | 0 | 13 |
| R1 | 8.17 | $0.03 | 0 | 12 |
| o3 | 8.37 | $0.04 | 0 | 16 |
| Mistral Large 3 2512 | 8.44 | $0.018 | 0 | 17 |
| GPT 5.2 | 9.78 | $0.26 | 0 | 55 |
| DeepSeek V3.2 | 9.91 | $0.0034 | 0 | 9 |
| Qwen3 Coder Plus | 13.75 | $0.015 | 3 | 0 |
| Gemini 2.5 Flash Lite | 14.06 | $0.0026 | 1 | 29 |
| GLM 4.5 Air | 15.96 | $0.011 | 1 | 22 |
| Codestral 2508 | 22.38 | $0.005 | 3 | 42 |