Science
Tested
- Avg AIMAC Debt 7.13
- Avg Cost $0.31
- Models Tested 42
- Zero Debt 5
- Avg Violations 13.4
Insights for Science
- Common Issues: Elements must meet minimum color contrast ratio thresholds, Links must have discernible text, Select element must have an accessible name
- Category Difficulty: Rank #12 of 28 categories (based on average model performance)
- Real-World Comparison: The WebAIM Million audits accessibility across 1 million real websites annually. Using different automated tools, AI models showed 70.3% fewer detected issues than real Science websites (13.4 axe-core violations vs 45.0 WAVE errors).
Model Gallery
Click a screenshot to view the generated HTML page, or click the model name for detailed results and prompts.
Model Results
AIMAC Debt:The model's accessibility debt for this category (lower = better)
| lower = better | in usd | |||
|---|---|---|---|---|
| GLM 4.7 Flash | 0.00 | $0.0035 | 0 | 0 |
| GLM 5 | 0.00 | $0.07 | 0 | 0 |
| GPT 5.3 Codex | 0.00 | $0.10 | 0 | 0 |
| GPT 5.4 | 0.00 | $0.17 | 0 | 0 |
| GPT 5.4 Mini | 0.00 | $0.027 | 0 | 0 |
| Qwen3 Coder Plus | 2.66 | $0.03 | 0 | 2 |
| Gemini 3 Flash Preview | 2.66 | $0.02 | 0 | 2 |
| o3 | 2.66 | $0.035 | 0 | 2 |
| DeepSeek V3.2 Speciale | 3.32 | $0.006 | 0 | 4 |
| KAT Coder Pro V2 | 3.53 | $0.023 | 0 | 5 |
| Kimi K2 Thinking | 3.85 | $0.03 | 0 | 7 |
| Grok 4.1 Fast | 3.85 | $0.0024 | 0 | 7 |
| Claude Opus 4.6 | 3.85 | $0.62 | 0 | 7 |
| Qwen3.5 Flash | 4.28 | $0.0045 | 0 | 11 |
| Kimi K2.5 | 4.51 | $0.04 | 0 | 14 |
| GPT 5.4 Pro | 4.51 | $10.59 | 0 | 14 |
| Qwen3 Coder Next | 4.70 | $0.015 | 0 | 17 |
| MiniMax M2.7 | 4.80 | $0.016 | 0 | 19 |
| Mistral Medium 3.1 | 4.94 | $0.014 | 0 | 22 |
| Trinity Large Preview (free) | 5.14 | $0.00 | 0 | 27 |
| Trinity Large Thinking | 5.24 | $0.007 | 0 | 30 |
| Claude Sonnet 4.6 | 5.27 | $0.51 | 0 | 31 |
| Claude Haiku 4.5 | 5.54 | $0.08 | 0 | 41 |
| gpt oss 120b | 6.53 | $0.004 | 0 | 7 |
| Mistral Large 3 2512 | 7.53 | $0.013 | 0 | 9 |
| Qwen3 Max | 7.71 | $0.04 | 0 | 10 |
| Nemotron 3 Super (free) | 7.75 | $0.00 | 0 | 20 |
| Qwen3.5 397B A17B | 7.87 | $0.05 | 0 | 14 |
| GPT 5.1 Codex Mini | 8.05 | $0.014 | 1 | 3 |
| o4 Mini | 8.05 | $0.024 | 1 | 3 |
| DeepSeek V3.2 | 8.41 | $0.003 | 0 | 14 |
| Codestral 2508 | 9.21 | $0.005 | 0 | 33 |
| Devstral 2 2512 | 9.32 | $0.0036 | 1 | 5 |
| Qwen3 Coder Flash | 9.35 | $0.012 | 0 | 29 |
| R1 0528 | 11.07 | $0.02 | 0 | 24 |
| Qwen3.6 Plus Preview (free) | 12.73 | $0.00 | 2 | 8 |
| Qwen3 Max Thinking | 12.94 | $0.04 | 3 | 1 |
| Olmo 3.1 32B Instruct | 13.19 | $0.004 | 2 | 13 |
| Gemini 3.1 Pro Preview | 13.53 | $0.12 | 1 | 25 |
| Nova 2 Lite | 17.82 | $0.017 | 2 | 23 |
| Mistral Small 4 | 23.81 | $0.008 | 7 | 15 |
| Grok 4.20 | 29.45 | $0.06 | 6 | 18 |