Claude 3.7 Sonnet
an archive of posts with this tag
| Jul 26, 2025 | The Visual Complexity Penalty in Code Understanding - SWE-bench Multimodal Analysis How visual complexity penalizes SWE-agents on SWE-bench Multimodal — testing SWE-agent, Agentless and OpenHands with Claude 3.7 Sonnet and OpenAI o3 on visually rich GitHub issues. |
|---|