Grok Rankings Update 一 October 13 #1 Terminal-Bench Hard (Agentic Coding & Terminal Use) #1 GPQA Diamond (Scientific Reasoning) #1 SciCode (Coding) #1 Artificial Analysis Intelligence Index Tokens Usage #1 Token usage across models on OpenRouter Leaderboard #1 Programming Usecase on OpenRouter #1 Most popular LLMs for different languages on OpenRouter #1 on KiloCode Leaderboard #1 on Cline Leaderboard