What does this all say about benchmarking LLMs? What does this say about early access and a wave of positive remarks?
7,19K