svaret är att GPT-5 Smart Router har varit ett genombrott i LLM-inferensekonomin. OpenAI har förmodligen fördubblat sitt vinst/kostnadsförhållande med det, jämfört med att bara servera en användarkontrollerad blandning av o3 och o4-mini. Ju mer du kan skicka till dumma modeller, desto mer sparar du.
Teknium (e/λ)
Teknium (e/λ)10 aug. 20:23
Why is it the right move? Seriously? 1. Models already think more for harder problems in reasoning mode. 2. You could just always have it try to reason, then itll never fail you in case it needs to. 3. Any time an answer isnt satisfactory if you didnt have reasoning on, you can just turn it on. Why is it worth this much trouble? What are the real upsides to taking the control away from users? Why are they so adamant about it?
29,23K