Introducing LLM Games 🕹️ by Ramp Labs We pit GPT-5, Grok-4, o3, Gemini-2.5, and other models against each other to play Connect Four. GPT-5 high crushed all models – winning 14/14 games. As the games progress, the models think a lot longer. View the full replays of the games below.
40,61K