iSWE-Agent achieves Rank-1 on SWE-PolyBench (Verified) Python leaderboard

iSWE-Agent, IBM Research’s software engineering agent, achieved Rank #1 on the SWE-PolyBench (Verified) Python leaderboard with a 58.41% resolution rate (66/113 resolved), surpassing the previous best of 54.87% from Atlassian Rovo Dev. This is iSWE-Agent’s first Python submission, following its top results on the Java splits of Multi-SWE-Bench and SWE-PolyBench, and its first to run on OpenAI’s GPT models. The result demonstrates that the same agent generalizes across both programming languages and model ecosystems.

18 June 2026

Related resources:

  1. iSWE-Agent arXiv paper - Resolving Java Code Repository Issues with iSWE Agent