Remote View
AI benchmarks are self-promoting trash — but regulators keep using them
AI benchmarks are self-promoting trash — but regulators keep using them
pivot-to-ai.com AI benchmarks are self-promoting trash — but regulators keep using them
Every new LLM and every new tweak to an old LLM has a press release bragging about how well it tests on some benchmark you’ve never heard of. Every new model is trained heavily to the previous tren…