the ai skeptic’s most clever device is the score ceiling benchmark performance always feels logarithmic on tests with 0 - 100% scoring but when you look at no-ceiling benchmarks, we see a very different curve…
speaking of which i should run aidanbench
2,03K