blog - page 2 | Jatin Ganhotra

A New Chapter in Blogging: Exploring the World of Agents

After a decade-long hiatus, I am thrilled to announce my return to blogging! This new journey will center around the fascinating and ever-evolving domain of Agents, with a particular focus on Software Engineering Agents (SWE-Agents).

Through this blog, I aim to share insights, ideas, and developments in this exciting field. My goal is to spark thought-provoking discussions and provide content that is both insightful and valuable to readers. Your feedback and perspectives will be invaluable, so I warmly invite you to share your thoughts in the comments and join the conversation.

Do SWE-Agents Solve Multi-File Issues Like Humans? A Deep Dive into SWE-Bench Verified

How SWE-agents (OpenHands, SWE-agent, Agentless) handle multi-file software engineering tasks compared to human developers on SWE-bench Verified, with Claude 3.5 Sonnet and OpenAI models.

29 min read · January 05, 2025

2025 · evaluation benchmarks SWE-Bench_Verified SWE-agent OpenHands Agentless Claude 3.5 Sonnet · blog swe-agents

OpenHands CodeAct v2.1 v/s Tools + Claude 3.5 Sonnet

Head-to-head comparison of OpenHands CodeAct v2.1 and Anthropic Claude 3.5 Sonnet on SWE-bench Verified, analyzing the performance differences and capabilities of these leading SWE-agent approaches.

12 min read · December 31, 2024

2024 · SWE-Bench SWE-Bench_Verified OpenHands Claude 3.5 Sonnet CodeAct v2.1 Anthropic SWE-agent · blog swe-agents

SWE-Bench Verified ⊊ real-world SWE tasks

Why SWE-bench Verified is only a subset of real-world software engineering tasks: a comparison of SWE-agents such as OpenHands CodeAct v2.1, Amazon Q, SWE-agent, Agentless and AutoCodeRover, with Claude 3.5 Sonnet.

13 min read · December 26, 2024

2024 · evaluation benchmarks SWE-Bench SWE-Bench_Verified SWE-agent OpenHands Agentless Amazon Q Claude 3.5 Sonnet · blog swe-agents

Installing Octave on OS X 10.9 Mavericks

6 min read · January 21, 2014

2014 · Mac OS X Octave How to Install · blog

Comparison is always false due to limited range of data type

2 min read · August 30, 2013

2013 · C C++ Coding Tips · blog c++