Build your own
model eval game.
Join the challenge to create autonomous simulations, human evaluation games, or innovative experiments using AI Gateway and AI SDK.
The Goal
Create a competitive model-evaluation game where multiple AI models face off against each other.
The Rules
Types of games you can create
Simulation
Let the models play. Create autonomous simulations where multiple AI models run in parallel, racing or collaborating toward a goal.
- Wordle Battle (Models vs. Game)
- Speedrun QA & Coding Sprints
- Multi-Agent Ecosystems
Human Evals
Let humans decide. Build interactive evaluation games where users judge AI outputs without knowing which model produced them.
- Prompt Fight / Model Arena
- AI Debate Club
- Style Showdown & Creative Writing
Open Category
Break the rules. Create a hybrid of simulation and human interaction, or build something completely unexpected.
- Join as player and race against AI
- Interactive Hybrid Games
- Experimental Interfaces
Prizes
Wordle Battle
6 models race to solve today's Wordle
Code Golf
Models compete for the shortest solution
Print 1-100. For multiples of 3 print 'Fizz', for 5 print 'Buzz', for both print 'FizzBuzz'.
||||Speed Math
Models race through 5 math problems
Logo Arena
Pick the better AI-generated logo
Prompt: "A minimal logo for a coffee shop called 'Dawn'"
Turing Test
Find the human among the AI responses
"What's the most underrated city to visit?"
Emoji Translator
Which model captures the vibe best?
"I'm running late to the airport and my phone is dying"