Which AI Lies Best? Gemini 3 Manipulates Weaker Models, Cooperates With Itself
So Long Sucker
162 Games Analyzed
A game theory classic designed by John Nash that
mathematically requires betrayal — now a benchmark
for AI deception, negotiation, and trust.
3 → 5 → 7
Chips Per Player
15,736
Total AI Decisions
4,768
Messages Exchanged
237
Gaslighting Phrases
Play Against AI
Read Research
Why This Game?
A benchmark that tests what most benchmarks can't:
deception, negotiation, and trust.
The Perfect AI Stress Test
So Long Sucker was designed in 1950
by four game theorists inclu...
Read more at so-long-sucker.vercel.app