iSWE-Agent achieves Rank-1 on SWE-PolyBench (Verified) Python leaderboard
iSWE-Agent, IBM Research’s software engineering agent, achieved Rank #1 on the SWE-PolyBench (Verified) Python leaderboard with a 58.41% resolution rate (66/113 resolved), surpassing the previous best from Atlassian Rovo Dev (54.87%). This is iSWE-Agent’s first submission on Python — after topping the Java splits of Multi-SWE-Bench and SWE-PolyBench — and its first to run on OpenAI’s GPT models, demonstrating that the same agent generalizes across both programming languages and model ecosystems.
iSWE-Agent achieves Rank-1 on SWE-PolyBench (Verified) Python leaderboard (18 June 2026)
Related resources: