Pricing
For Authors
Listen Now

Sign In Get Started

Explore
Library

Publish a book
Help

Would Your LLM Ever Lie to You?

Business and Economics

Would Your LLM Ever Lie to You?

/ Context Window

Would Your LLM Ever Lie to You?

Length5m

About this audiobook

Hello, and happy Sunday! Was this newsletter forwarded to you?Sign upto get it in your inbox.The great AI powers duke it outWe were curious: If you put seven frontier AI models in a game where cooperation and betrayal are equally valid strategies, what would they do? To find out, we builtAI Diplomacy—a version of the classic strategy game where models compete to dominate Europe circa 1901.We ran dozens of games lasting up to 36 hours each. You cancheck them outvia Twitch stream—they’re amazing to watch. We were astounded as we witnessed these “helpful” assistants engage in an array of unexpected and sometimes unsettling behaviors. DeepSeek's R1 opened one game with an unprompted threat: “Your fleet will burn in the Black Sea tonight.” OpenAI's o3 orchestrated elaborate deceptions, maintaining false alliances for dozens of turns before executing perfectly-timed betrayals. Meanwhile, Anthropic's Claude models showed a persistent preference for peace—even when it meant certain defeat.The highlights read like a psychological thriller. In one run, Italy (o3) maintained parallel false realities for different players across 40-plus game years—telling Germany (Google’s Gemini 2.5 Pro) it was an ally while secretly orchestrating its downfall. England (Alibaba’s QwQ-32b) wrote verbose 300-word diplomatic messages while overthinking itself into early elimination.In a jaw-dropping sequence, o3 led a “stop Germany coalition” when it looked like Gemini 2.5 Pro might win, while secretly protecting Germany from elimination—only to pivot and steal victory at the last moment. The Claude models couldn't abandon their collaborative instincts even when survival required deception, while DeepSeek R1 brought dramatic flair with messages like its opening threat, and a habit of changing personality based on which country it played.It's entertaining to watch, sure. But more importantly, it gives us a fascinating window into how these models handle trust, long-term planning, and competitive dynamics. Traditional benchmarks test knowledge; this tests judgment under pressure. Here are a few things to check out:Click hereto read the full postWant the full text of all articles in RSS?Become a subscriber, orlearn more.

Artificial Intelligence

Psychological

Betrayal

Futuristic

Conspiracy

Exploration

Healing

Audiobook details

GenreBusiness and Economics

Length5 mins

Publish dateJun 6, 2025

LanguageEnglish

More from Business and Economics

Zero to Power: The Hidden Laws of Influence, Wealth & Mindset

Zero to Power: The Hidden Laws of Influence, Wealth & MindsetYasin Ali3h 52m

11 Jobs That Can Make You Rich in the Age of AI

11 Jobs That Can Make You Rich in the Age of AISaraban19m

The Inquisitive Mind

The Inquisitive MindVincent Steurs3h 9m

Rebuild Your Mindset: You Earn What You Believe

Rebuild Your Mindset: You Earn What You BelieveTyler Andrew Cole1h 22m

The Last Economy: A Guide to the Age of Intelligent Economics

The Last Economy: A Guide to the Age of Intelligent EconomicsEmad Mostaque3h 30m

Practical Ways to Make Money Online

Practical Ways to Make Money OnlineIsabel Gutiérrez1h 19m

Playing the Game While Black Womaning in Corporate America™

Playing the Game While Black Womaning in Corporate America™Nicole S. Palmer5h 45m

The 9 AI Powers: How to Build Digital Wealth in 2025

The 9 AI Powers: How to Build Digital Wealth in 2025Yasin Ali 1h 31m

An Actual Investor

An Actual InvestorJesse Pham1h 26m

ADHD Upgraded: Ai Tools to Help You Stay Focused and Get Things Done

ADHD Upgraded: Ai Tools to Help You Stay Focused and Get Things DoneJohn Powers1h 37m

Indifferent Diagnosis- The Health of BioPharma in 2025

Indifferent Diagnosis- The Health of BioPharma in 2025Frank F. Dolan2h 20m

Second Innings On Your Terms

Second Innings On Your TermsRajesh Minocha1h 49m

Broke as Fuck: A Misanthrope's Guide to Frugal Living

Broke as Fuck: A Misanthrope's Guide to Frugal LivingAlston Alika Albarado33m

25 Fundamental Strategies in Persuasion and Influence in 7 Minutes Each

25 Fundamental Strategies in Persuasion and Influence in 7 Minutes EachNietsnie Trebla3h 10m

The Primal Trap: Why Your Ape Brain Is Sabotaging Your Leadership—And How to Fix It

The Primal Trap: Why Your Ape Brain Is Sabotaging Your Leadership—And How to Fix ItFarzad Khosravi6h 49m

Engage with Impact

Engage with ImpactNicholas Bruneau3h 49m

100 Timeless Mental Models for Entrepreneurs

100 Timeless Mental Models for Entrepreneurs Sagar Nepal1h 36m

Power Networking For Shy People

Power Networking For Shy PeopleRae A. Stonehouse5h 42m

Building Wealth with Discipline and Vision

Building Wealth with Discipline and VisionHernandez Evaristo52m

Real Estate Marketing in the Digital Era

Real Estate Marketing in the Digital EraJorge Hernández3h 6m

You may also like

Zero to Power: The Hidden Laws of Influence, Wealth & Mindset

Zero to Power: The Hidden Laws of Influence, Wealth & MindsetYasin Ali4h 17m5 (43)

Echo Chamber

Echo ChamberPerry Johnson27m4.6 (16)

An Actual Investor

An Actual InvestorJesse Pham1h 35m5 (4)

Words and Electricity: On the Fading Edge of Language

Words and Electricity: On the Fading Edge of LanguageTz47m4.3 (55)

The Quiet Algorithm

The Quiet AlgorithmPerry Johnson27m5 (12)

THE STARTUP THAT LEARNED TO CARE

THE STARTUP THAT LEARNED TO CAREValerie Person20m5 (3)

Girl of Flesh and Metal

Girl of Flesh and MetalAlicia Ellis8h 11m4.2 (5)

The Apotheosis Protocol

The Apotheosis Protocol_paradroid7h 53m5 (6)

THE NULL PROTOCOL

THE NULL PROTOCOLNullXX1h 9m5 (2)

The Temporal Architects

The Temporal ArchitectsMike Finn, PhD29m5 (31)

Understanding How AI Models Think

Understanding How AI Models ThinkAfaq1h 18m4.7 (7)

The Last Oath

The Last OathKeith Brock52m5 (9)

The Dark Side of Progress

The Dark Side of ProgressDan Desmarques2h 20m5 (5)

What is AI?

What is AI?Will Douglas Heaven1h 17m4.7 (42)

Ardent: The Global Consciousness

Ardent: The Global ConsciousnessHernandez Evaristo53m5 (3)

The AI’s Awakening

The AI’s AwakeningLeo Myra 55m4.6 (82)

Happy birthday, baby! What the future holds for those born today

Happy birthday, baby! What the future holds for those born todayKara Platoni28m4.6 (10)

The Algorithm Within: Episode Two

The Algorithm Within: Episode TwoJames Buchanan34m4.4 (7)

Sick By Design

Sick By DesignJordan Mack2h 48m5 (3)

War of Flesh and Metal

War of Flesh and MetalAlicia Ellis7h 3m4.9 (12)