All activity
Terraflareleft a comment
Looks like the new DeepSeek has caught up on claude opus no think, although it's not on the leaderboard yet (Deepseek R1.1 scored the same as claude-opus-4-nothink 70.7% on aider polyglot. Old R1 was 56.9%)

DeepSeek-R1-0528New open-source LLM that rivals o3 in coding & reasoning



