Links · February 6, 2026 · 1 min read

Kimi 2.5 is as good as Sonnet 4.5, ChatGPT O3 and Grok 4

Open source models are almost as good as frontier LLMs

It’s crazy how good good open source LLMs have gotten.

Kimi K2.5 set a new record among open-weight models on the Epoch Capabilities Index (ECI), which combines multiple benchmarks onto a single scale. Its score of 147 is about on par with o3, Grok 4, and Sonnet 4.5. It still lags the overall frontier.

Kimi K2.5 set a new record among open-weight models on the Epoch Capabilities Index (ECI)

Join the Conversation

Share your thoughts and go deeper down the rabbit hole

AI is killing software jobs

Automation is a moral imperative

The future of coding

Join the Conversation