everything that makes gpt-5 a better agent requires re-thinking how you architect your agents
Stagehand 🤘 · Aug 8 at 05:55
The new GPT-5 performs worse than Opus 4.1 in Stagehand evals in both speed and accuracy. The smaller models are faster, but also still fall short of Opus 4.1.