BENCHMARKS Study Finds Hundreds of AI Benchmark Tests Are Fundamentally Flawed 8/10 4 min read 2 months ago