People are using Super Mario to benchmark AI now
techcrunch.com/2025/03/03/people-are-using-super-mario-to-benchmark-ai-now
Thought Pokémon was a tough benchmark for AI? One group of researchers argues that Super Mario Bros. is even tougher.
Hao AI Lab, a research org at the University of California San Diego, on Friday threw AI into live Super Mario Bros. games. Anthropic’s Claude 3.7 performed the best,…
This story appeared on techcrunch.com, 2025-03-03 23:54:19.