cto.new

Completely free AI code agent

5.0
1 review

511 followers

Code with the latest frontier models from Anthropic, OpenAI and more. No credit card or API keys required. Get started for free at https://cto.new/product-hunt
This is the 2nd launch from cto.new.
cto bench

The ground-truth code agent benchmark
Most AI benchmarks are built backwards. Someone sits down, dreams up hard problems, and then measures how well agents solve them. The results are interesting, sure. But they don't always tell you what matters: how agents perform on the actual work that's sitting in your queue. That's why we built cto bench. Instead of hypothetical tasks, we're building our benchmark from real work. Every data point on cto bench comes directly from how cto.new users are actually using our platform.
Free

Michael Ludden
I'm excited to share that cto bench is live. It's a benchmarking tool that tests the latest and greatest frontier models against real-world usage by cto.new users. Many benchmarking tools run LLMs through custom suites to assess viability, but cto bench uses actual usage patterns and PR merge rates to verify how well models perform on real tasks. We hope this adds valuable, practical data points to the LLM benchmarking space as it evolves.
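For readers curious what a merge-rate style metric looks like in practice, here is a minimal sketch of how per-model PR merge rates could be aggregated. The field names (model, pr_merged) and the sample records are illustrative assumptions, not cto bench's actual schema or pipeline.

```python
# Hypothetical sketch: aggregate the fraction of agent-authored PRs that were
# merged, grouped by model. Field names and data are illustrative only.
from collections import defaultdict

def merge_rate_by_model(task_records):
    """Return {model_name: fraction of PRs that were merged} per model."""
    merged = defaultdict(int)
    total = defaultdict(int)
    for record in task_records:
        total[record["model"]] += 1
        if record["pr_merged"]:
            merged[record["model"]] += 1
    return {model: merged[model] / total[model] for model in total}

# Example usage with made-up records:
records = [
    {"model": "model-a", "pr_merged": True},
    {"model": "model-a", "pr_merged": False},
    {"model": "model-b", "pr_merged": True},
]
print(merge_rate_by_model(records))  # {'model-a': 0.5, 'model-b': 1.0}
```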
Maklyen May

Finally, a benchmark that measures usefulness instead of academic cleverness. This feels much closer to how teams actually decide whether an agent is worth adopting.

Michael Ludden

@maklyen_may thanks! Interesting that OSS models are so high up the list for practical use, eh?

Anton Loss

Wow, this is amazing! All the best models for free! 🚀

How can this be sustainable for you?

Michael Ludden

@avloss great question! We're still working on that. What would you recommend?

Anton Loss

@michael_ludden 

Some ideas:

  • Provide additional services for a fee, like domains, hosting, monitoring, promotion/ads, and databases.

  • Charge for organisational use and/or for dedicated deployment.

  • Charge for additional features, like a human reviewing and solving a problem when the LLM gets stuck.

  • Use collected data to train proprietary models, then sell those.

Michael Ludden

@avloss love it! 🙏

ElevenApril

This is a really refreshing take on benchmarks 👀

Grounding it in real work instead of synthetic tasks feels way more honest — as a builder, that’s the kind of signal I actually trust. Love the “built from usage” philosophy. Congrats on the launch! 🚀

Curious how you’re thinking about bias over time — do you plan to balance workloads or surface context around where the data comes from?

Michael Ludden

@elevenapril can you expand on the question a bit more? Not sure what you're asking.

Mykyta Semenov 🇺🇦🇳🇱

Awesome! Very useful!