
Blog
AI research and insights
SWE-rebench dataset: More than 21,000 verifiable tasks for SWE agents
SWE-rebench dataset: More than 21,000 verifiable tasks for SWE agents
Our AI R&D team announces the open-source release of the SWE-rebench dataset of more than 21,000 real-world, interactive software engineering tasks. For a detailed methodology and technical report, please see our accompanying paper on arXiv.