AI Benchmarks
Testing AI models
🍓
Strawberry Test
A simple yet revealing benchmark. The 🍓 test evaluates AI models on several key criteria:
- They know how to count letters
(many models fail, including top-tier ones) - Compare models on response time
(some models are very slow) - Compare models on total cost
(thinking models spend lots of tokens)
