IMO 是每 4.5 小時 3 個問題<>每個問題 1.5 小時 所以我們是......按計劃進行?
METR
METR2025年3月19日
When will AI systems be able to carry out long projects independently? In new research, we find a kind of “Moore’s Law for AI agents”: the length of tasks that AIs can do is doubling about every 7 months.
5.25K