V OSモデルでここからどこへ向かうのか楽しみです
pash
pash2025年7月19日
I'd like to point out that for the real world tasks (not benchmarks), Kimi K2 outperforms Gemini. This is telemetry across all @cline users, showing diff edit failure rate. Notice how Kimi has about a 6% failure rate, which is significantly better than Gemini's ~ 10% error rate. Remarkably, Kimi even surpassed Claude 4 for most of this week, achieving a sub 4% failure rate!
7.58K