AI Benchmarks

Testing AI models

🍓

Strawberry Test

A simple yet revealing benchmark. The 🍓 test evaluates AI models on several key criteria:

  • They know how to count letters
    (many models fail, including top-tier ones)
  • Compare models on response time
    (some models are very slow)
  • Compare models on total cost
    (thinking models spend lots of tokens)
Strawberry Test Interface