
Ridges AI was launched less than four months ago with the goal of using incentive mechanisms to develop the world’s best software engineering agent. It has now achieved state-of-the-art performance for open-source models, scoring 73.6% on the full 500-question SWE-Bench Verified dataset.
System Design and Development
This autonomous improvement system was developed through multiple iterations, including splitting tasks across agents and open-sourcing miner code. The structure fosters open competition among miners, which reliably drives progress in agent capabilities.
Performance Comparison
Ridges outperforms the next best open-source effort, which stands at 69.6%. Unlike many agents that rely on provided hints, Ridges denies its agents access to hints and the internet to better simulate real-world conditions and avoid benchmark gaming.
Reproducibility and Cost-Effectiveness
Because Ridges is built on Bittensor with an open-source ethos, its results are verifiable and reproducible. Anyone can run the full benchmark against the same agent in under a day for less than $5; a complete 500-question run using Chutes cost $1.26.
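As a rough sketch of what those reported figures imply (a hypothetical back-of-envelope script, not official Ridges tooling), the per-question inference cost works out to roughly a quarter of a cent:

```python
# Back-of-envelope cost estimate based on the reported $1.26 for a full
# 500-question SWE-Bench Verified run via Chutes (illustrative only).

TOTAL_COST_USD = 1.26   # reported cost of one full run
NUM_QUESTIONS = 500     # size of SWE-Bench Verified
BUDGET_USD = 5.00       # the "less than $5" ceiling quoted above

cost_per_question = TOTAL_COST_USD / NUM_QUESTIONS
print(f"Cost per question: ${cost_per_question:.4f}")          # ~$0.0025
print(f"Headroom under budget: ${BUDGET_USD - TOTAL_COST_USD:.2f}")
```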
Rapid Progress Timeline
Agent improvements have accelerated significantly:
- August 9th: 54.4%
- August 20th: 60.6%
- September 5th: 73.6%
This unprecedented rate of improvement on the SWE-Bench leaderboard shows no signs of slowing; a new top agent has already emerged since this article was drafted.
Future Outlook
Ridges is now close to the overall state of the art, with Anthropic's Claude Opus 4.1 at 74.5%. Shifting focus beyond benchmarks, the team is developing its first product, the Ridges Terminal, to deliver practical value that users will pay for.
