Grok Rankings Update: October 13
#1 Terminal-Bench Hard (Agentic Coding & Terminal Use)
#1 GPQA Diamond (Scientific Reasoning)
#1 SciCode (Coding)
#1 Artificial Analysis Intelligence Index Token Usage
#1 Token usage across models on OpenRouter Leaderboard
#1 Programming Use Case on OpenRouter
#1 among the most popular LLMs for different languages on OpenRouter
#1 on KiloCode Leaderboard
#1 on Cline Leaderboard