V bersemangat untuk melihat ke mana kita pergi dari sini dengan model OS
pash
pash19 Jul 2025
I'd like to point out that for the real world tasks (not benchmarks), Kimi K2 outperforms Gemini. This is telemetry across all @cline users, showing diff edit failure rate. Notice how Kimi has about a 6% failure rate, which is significantly better than Gemini's ~ 10% error rate. Remarkably, Kimi even surpassed Claude 4 for most of this week, achieving a sub 4% failure rate!
7,58K