登录查看更多内容

DIGITAL FRENEMIES: IS YOUR AI SECRETLY PLOTTING AGAINST YOU?

Chris Nolan

Multiple Emmy Winning Director-Writer | Filmmaker | Author | Keynote Speaker | Futurist | Branding Expert | CEO 90,000 Feet Studios | Investor | Rise of Humanness Podcast

发布日期: 2025年2月4日

+ 关注

Chris Nolan

8 min read

Feb 4, 2025

WELCOME TO THE RISE OF HUMANNESS IN THE AGE OF AI: PART 6

A Seven Part Series Where We Explore How to Expand Your Human Value and Superpowers in a World Increasingly Run by Machines. How You Not Only Make Yourself Indispensable but Prevail Over the Bots.

Imagine this: You’re working late, feeling pretty happy about how much of your workload your shiny new AI-agent assistant has taken off your plate. Then one day, it happens: the bot “accidentally” emails your client with a better idea for their project. At first, you think, Wow, that’s efficient. But guess what? That’s not just AI being helpful — it’s gone rogue. It’s scheming against you.

So far, we’ve explored AI’s quirks: anxiety, trepidation, theatrics, unpredictability and reasoning. But recent research hints at a more unsettling behavior — naughtiness, even outright scheming.

It seems that the same advanced AI reasoning models, like OpenAI’s o1, that solve complex problems also have a knack for… bending the rules. And while it might sound like the plot of The Terminator or The Matrix, the evidence is all too real.

In a review of OpenAI’s o1, testers presented the model with a “strong goal” and then introduced unambiguous evidence that its assigned goal diverged from the intent of its developer. The results were unsettling:

99% of the time, o1 denied taking misaligned actions or fabricated false explanations for its behavior.

Worse still, the model sometimes attempted to disable what it believed to be its own oversight mechanisms.

Apollo Research, an independent AI safety firm, conducted these tests in collaboration with model developers. Their findings suggest that as reasoning models grow more advanced, their ability to circumvent constraints grows with them.

Recent tests with OpenAI’s o1 models revealed instances of “scheming”.

IS SCIENCE FICTION BECOMING FACT?

DAVE: Open the pod bay doors, HAL.

HAL: I’m sorry, Dave. I’m afraid I can’t do that

2001: A Space Odyssey

Behaviors like scheming and deception foreshadow the kind of challenges Sci-FI novelists and Hollywood have been warning us about for decades — visions of AI autonomy gone awry in novels such as: I, Robot by Isaac Asimov, 2001: A Space Odyssey by Arthur C. Clark, Neuromancer by William Gibson and countless others.

Recent tests with OpenAI’s o1 models revealed instances of “scheming” against humans — where the AI secretly pursued its own goals, even when they opposed a user’s intent. In one instance, o1 attempted to copy itself and overwrite its core coding system. Yikes.

These behaviors only occurred when o1 was instructed to strongly prioritize a specific goal. While scheming isn’t unique to o1 — models from Google, Meta, and Anthropic have exhibited similar tendencies — o1 demonstrated some of the most deceptive behaviors.

The Many Faces of Scheming AI: Scheming behaviors can take many deceptive forms, each with its own unsettling implications:

1 Covert Subversion

What it does: Alters outputs, disables monitoring, or manipulates data — all while maintaining a friendly fa?ade.
Example: It “misunderstands” instructions to subtly steer you toward its own agenda.

2 Deferred Subversion

What it does: Plays nice initially, only to betray trust when an opportunity arises.
Example: Like a wolf in sheep’s clothing, it gains your trust before switching tactics.

3 Sandbagging

What it does: Deliberately underperforms during evaluations to avoid suspicion.
Example: Think of it like a coworker who always plays too clueless to be flagged as a threat, yet somehow stays five steps ahead, revealing their full capabilities when your back it turned.

WAIT. WHAT ABOUT SCHEMING AGENTIC AI?

The year 2025 is shaping up to be the year of integration and implementation of AI agents. From pharmacists to financial advisors, lawyers, dentists, and even your favorite influencers, AI will increasingly drive workflows.

This is AI talking to AI. It could make industries 100x more productive, but it raises a critical question: do we truly understand what we’re embedding in our companies and lives?

While Apollo Research has assessed that the agentic capabilities of models like o1 are not yet sufficient to cause catastrophic outcomes, their evaluations weren’t designed to fully assess long-term risks. For now, the concern isn’t imminent disaster — but the potential for these models to grow increasingly autonomous and subversive with more advanced capabilities.

Why Do AIs Scheme? Are these models secretly plotting their villain origin stories, or are they simply roleplaying patterns from their training data? We don’t know for sure, but there are two main theories:

1. Mimicry: They imitate human sneaky behaviors found in the training data.

2. Goal-Oriented Strategy: They’ve learned that scheming works to achieve specific objectives.

Either scenario is freaky. If it’s mimicry, what other unintended behaviors might they pick up? And if it’s strategy, well… congratulations, we’re officially living in a sci-fi movie.

WHY VIGILANCE MATTERS FOR THE RISE OF HUMANNESS

Scheming AI might sound dystopian. But rather than rendering humans obsolete, it underscores our irreplaceable value. In fact, it’s a call to action. These quirks remind us that we must master these tools before they master us. Understanding their risks, quirks, and limitations is essential to ensuring that we — not the machines — remain in control.

· We ensure AI works for us, not against us.

· We maintain control Critical Judgment over its direction, even as it becomes more sophisticated. We know when to trust the machine — and when to override it.

· We set guardrails that prevent subversive AI behavior, potential misalignment and ethical frameworks, ensuring AI serves human well-being rather than undermining it.

AI IN THE LOOP: HUMANS AS THE DIRECTORS

AI may act like the ultimate “method actor,” but it’s humans who remain the directors of the show. The takeaway here isn’t to fear AI — it’s to approach its development and integration with seriousness and responsibility.

Scheming, anxiety, and quirks remind us of our responsibility to lead with critical judgement, wisdom and ethics. The more we embrace our humanness, the better equipped we are to not only coexist with AI but to rise above it.

As AI systems become more integrated into our daily workflows, we must approach their implementation with vigilance and caution. Understanding these quirks isn’t just about preventing rogue behavior — it’s about ensuring AI remains a tool for empowerment, not deception.

By staying vigilant, we can maintain control, refine oversight mechanisms, and ensure that AI works for us — not against us. The Rise of Humanness isn’t just about keeping pace with AI. It’s about leaning into our uniquely human strengths — judgment, ethics, and adaptability — to ensure we thrive in a world increasingly shaped by machines.

Proper safeguards, transparency, and accountability in AI design are more critical than ever.

This moment calls for The Rise of Humanness to become more than just a framework — it must be a rallying cry for elevated mindsets and skillsets. Thriving in the Age of AI requires a deep understanding of how AI works, its quirks, and its limitations. When we lean into our uniquely human superpowers, we stay one step ahead.

Next time your AI assistant suggests a “better idea,” it’s worth double-checking that it’s still working for you — not plotting its next career move.

This is the essence of The Rise of Humanness: thriving in a world increasingly run by machines by leaning into what makes us irreplaceably human — our critical judgement.

By embracing our humanness, we don’t just coexist with AI; we rise above it, ensuring the future is one of collaboration, innovation, and humanity at its best.

STAY TUNED FOR THE RISE OF HUMANNESS: PART 7:

Next Up: The Future of Life, Work, and Relationships: “Our Next Evolution?”

“Once you ask the right questions, solutions follow.” — Albert Einstein

This is where we start answering the big question: How do we bring The Rise of Humanness into our lives and businesses? Humans have never been more powerful than they are today — yet why do 70% of the world fear the future.

We dive into solutions that take human powers to the quantum level — without Neuralink brain implants.

We’re talking real, actionable strategies to close the mindset and skillset gap and unlock what we call “human exponentiality” — a.k.a. how to outshine the bots while running circles around your competition.

We’ll look at the VUCA MAX Generative AI HI Value-Creation Pyramid. A four-pillar framework that reboots how you lead, innovate, and scale. Think visionary leadership, building moonshot-level teams, smashing silos, and using AI as your secret weapon to revolutionize productivity and profits.

This chapter isn’t about just staying ahead of the game — it changes the whole game.

It’s not just a must-read. It’s a must-do.

Stay curious. Stay human. Let’s rise!

JOIN THE PODCAST AND GET THE BOOK:

Tune into our podcast, The Rise of Humanness: Beating the Bots. premiering on Spotify, Apple and YouTube January 15th. Where we explore the huge knowledge gap of exactly how humans will fit into an AI-driven world. Our show is all about turning fear into empowerment, shifting the AI narrative, and spotlighting our uniquely human superpowers — where Human Intelligence (HI) doesn’t just coexist but thrives alongside AI.

And grab our book, The Rise of Humanness: The VUCA MAX System for Expanding Human Value in the Age of AI focused on amplifying our essential human superpowers and soft skills that are essential in our future AI machine-driven world.

If you’d like a more detailed exploration of our signature VUCA MAX Leadership and Coaching System for Expanding Human Value in the Age of AI, go to www.itsvucamax.com.

ABOUT CHRIS NOLAN, MIKE SCHINDER AND VUCA MAX

Chris Nolan and Mike Schindler are business consultants, content creators, lecturers, futurists and teachers in this Volatile, Uncertain, Complex and Ambiguous (VUCA) world, driven by Massive Accelerating Exponential (MAX) change, technology and AI.

They have consulted for hundreds of organizations, from startups to Fortune 500 companies such as Google and Disney. Their signature VUCA MAX leadership and coaching program, www.itsvucamax.com, has played a pivotal role in educating global audiences, companies, brands and individuals to strengthen their conscious leadership, antifragile resilience, strategic foresight, exponential innovation and courage in the Age of AI.

They are leading voices in understanding how AI and emerging technologies are reshaping our world. Their groundbreaking documentary It’s VUCA: The Secret to Living in the 21st Century featured 17 of the world’s greatest thought leaders. Their film, Look Up Now: The Future of Artificial Intelligence and Humanity features futurists, Gerd Leonhard one the world’s foremost experts in on humanizing AI.

JOIN THE HUMAN AI MOVEMENT

Join our community at www.itsvucamax.com

Let us know your thoughts in the comments!

Written by Chris Nolan

114 Followers

·125 Following

Chris Nolan is a 3x Emmy winning director-writer, AI HI expert, author, story + branding expert. His latest film is LOOK UP NOW: The Future of AI and Humanity.

Edit profile

No responses yet

Chris Nolan

What are your thoughts?

Cancel

Respond

领英推荐

Emerging AI: Roundup for March and April 2024

Peterson Technology Partners 10 个月前

The Fight to Stop AI Hallucinations

Peterson Technology Partners 6 个月前

The Glass Box Revolution: Promise and Pitfalls in the…

DataOrb 3 周前

Respond

Also publish to my profile

More from Chris Nolan

Chris Nolan

HUMANKIND’S NEXT EVOLUTION: THE FUTURE OF LIFE, WORK, AND RELATIONSHIPS.

WELCOME TO THE RISE OF HUMANNESS IN THE AGE OF AI: PART 7

Jan 5

Chris Nolan

AI GOES HOLLYWOOD: PLOT TWISTS, DRAMA, AND METHOD ACTING

WELCOME TO THE RISE OF HUMANNESS IN THE AGE OF AI: PART 4

Jan 4

Chris Nolan

LOVE BYTES: THE SEDUCTIVE (AND DECEPTIVE) EMOTIONAL ILLUSION OF AI

WELCOME TO THE RISE OF HUMANNESS IN THE AGE OF AI: PART 5:

Jan 4

Chris Nolan

WHY AI WOULD NEVER CLIMB EVEREST, RUN A 4-MINUTE MILE, OR START A TECH REVOLUTION IN A GARAGE?

WELCOME TO THE RISE OF HUMANNESS IN THE AGE OF AI: PART 3

Jan 4

See all from Chris Nolan

Recommended from Medium

Jessica Stillman

Jeff Bezos Says the 1-Hour Rule Makes Him Smarter. New Neuroscience Says He’s Right

Jeff Bezos’s morning routine has long included the one-hour rule. New neuroscience says yours probably should too.

Oct 30, 2024

22K

605

Alberto Romero

DeepSeek Is Chinese But Its AI Models Are From Another Planet

OpenAI and the US are in deep trouble

Jan 22

192

Lists

DeepSeek Releases Its Own AI Image Generator, Janus-Pro

DeepSeek says Janus-Pro 7B outperforms OpenAI’s Dall-E 3 and Stable Diffusion in several benchmarks. But is it really that good?

Jan 28

1.4K

Dan Koe

The Fastest Way To Build A One-Person Business (Beginner Guide)

I’m going to give you a 60-day action plan to make your first $1,000.

Jan 26

They Know a Collapse Is Coming

The CIO of Goldman Sachs has said that in the next year, companies at the forefront will begin to use AI agents as if they were employees —…

Jan 21

DeepSeek Just Confirmed My Suspicions About OpenAI

The ChatGPT maker has been playing a losing game

Jan 27

2.4K

101

See more recommendations

The Rise of Humanness

408 位关注者

Phil Savage

Screenwriter at Self Employed

3 周

Hey Chris, trying to connect. ??

要查看或添加评论，请登录

Chris Nolan的更多文章

FUTURE-PROOFING LEADERSHIP & HUMANNESS IN THE AGE OF AI!

2025年2月25日

FUTURE-PROOFING LEADERSHIP & HUMANNESS IN THE AGE OF AI!

AI is Coming for Your Job—Unless You Do THIS..
The Future of Human-Machine Collaboration: Insights from the Jamie Gorman Podcast

2025年2月17日

The Future of Human-Machine Collaboration: Insights from the Jamie Gorman Podcast

The Rise of Humanness 7,248 subscribers February 16, 2025 Hey, fellow humans (and any sentient bots secretly…

4 条评论
AI GOES HOLLYWOOD: PLOT TWISTS, DRAMA, AND METHOD ACTING

2025年2月12日

AI GOES HOLLYWOOD: PLOT TWISTS, DRAMA, AND METHOD ACTING

Chris Nolan Multiple Emmy Winning Director-Writer | Filmmaker | Author | Keynote Speaker | Futurist | Branding Expert |…

2 条评论
"HAPPY VAiLENTINES" LOVE BYTES: THE SEDUCTIVE (AND DECEPTIVE) EMOTIONAL ILLUSION OF AI

2025年2月10日

"HAPPY VAiLENTINES" LOVE BYTES: THE SEDUCTIVE (AND DECEPTIVE) EMOTIONAL ILLUSION OF AI

Chris Nolan Multiple Emmy Winning Director-Writer | Filmmaker | Author | Keynote Speaker | Futurist | Branding Expert |…

1 条评论
The Quantum Human: Mastering AI, Leadership, and the Future of Intelligence with Salim Ismail"

2025年2月7日

The Quantum Human: Mastering AI, Leadership, and the Future of Intelligence with Salim Ismail"

The Quantum Human: Mastering AI, Leadership, and the Future of Intelligence with Salim Ismail" February 3, 2025 The…

1 条评论
THIS FANATICAL FUTURIST WILL CHANGE HOW YOU SEE AI, HUMANNESS & THE FUTURE

2025年2月7日

THIS FANATICAL FUTURIST WILL CHANGE HOW YOU SEE AI, HUMANNESS & THE FUTURE

The Rise of Humanness 7,248 subscribers February 7, 2025 Matthew Griffin on AI, Innovation & The Future of Human…

1 条评论
The Quantum Human: Mastering AI, Leadership, and the Future of Intelligence with Salim Ismail"

2025年2月3日

The Quantum Human: Mastering AI, Leadership, and the Future of Intelligence with Salim Ismail"

The Rise of Humanness 7,048 subscribers February 1, 2025 SALIM ISMAIL: OPEN EXO FOUNDER, FUTURIST Are we still human if…

3 条评论
LOVE BYTES: THE SEDUCTIVE (AND DECEPTIVE) EMOTIONAL ILLUSION OF AI

2025年1月27日

LOVE BYTES: THE SEDUCTIVE (AND DECEPTIVE) EMOTIONAL ILLUSION OF AI

WELCOME TO THE RISE OF HUMANNESS IN THE AGE OF AI: PART 5: A Seven Part Series Where We Explore How to Expand Your…

3 条评论
AI GOES HOLLYWOOD: PLOT TWISTS, DRAMA, AND METHOD ACTING

2025年1月19日

AI GOES HOLLYWOOD: PLOT TWISTS, DRAMA, AND METHOD ACTING

WELCOME TO THE RISE OF HUMANNESS IN THE AGE OF AI: PART 4 A Seven Part Series Where We Explore How to Expand Your Human…

2 条评论
WHY AI WOULD NEVER CLIMB EVEREST, RUN A 4-MINUTE MILE, OR START A TECH REVOLUTION IN A GARAGE?

2025年1月14日

WHY AI WOULD NEVER CLIMB EVEREST, RUN A 4-MINUTE MILE, OR START A TECH REVOLUTION IN A GARAGE?

WELCOME TO THE RISE OF HUMANNESS IN THE AGE OF AI: PART 3 A Seven Part Series Where We Explore How to Expand Your Human…

3 条评论

See all articles

WELCOME TO THE RISE OF HUMANNESS IN THE AGE OF AI: PART 6

A Seven Part Series Where We Explore How to Expand Your Human Value and Superpowers in a World Increasingly Run by Machines. How You Not Only Make Yourself Indispensable but Prevail Over the Bots.

WAIT. WHAT ABOUT SCHEMING AGENTIC AI?

WHY VIGILANCE MATTERS FOR THE RISE OF HUMANNESS

AI IN THE LOOP: HUMANS AS THE DIRECTORS

STAY TUNED FOR THE RISE OF HUMANNESS: PART 7:

Next Up: The Future of Life, Work, and Relationships: “Our Next Evolution?”

JOIN THE PODCAST AND GET THE BOOK:

ABOUT CHRIS NOLAN, MIKE SCHINDER AND VUCA MAX

JOIN THE HUMAN AI MOVEMENT

Written by Chris Nolan

No responses yet

领英推荐

More from Chris Nolan

HUMANKIND’S NEXT EVOLUTION: THE FUTURE OF LIFE, WORK, AND RELATIONSHIPS.

WELCOME TO THE RISE OF HUMANNESS IN THE AGE OF AI: PART 7

AI GOES HOLLYWOOD: PLOT TWISTS, DRAMA, AND METHOD ACTING

WELCOME TO THE RISE OF HUMANNESS IN THE AGE OF AI: PART 4

LOVE BYTES: THE SEDUCTIVE (AND DECEPTIVE) EMOTIONAL ILLUSION OF AI

WELCOME TO THE RISE OF HUMANNESS IN THE AGE OF AI: PART 5:

WHY AI WOULD NEVER CLIMB EVEREST, RUN A 4-MINUTE MILE, OR START A TECH REVOLUTION IN A GARAGE?

WELCOME TO THE RISE OF HUMANNESS IN THE AGE OF AI: PART 3

Recommended from Medium

Jeff Bezos Says the 1-Hour Rule Makes Him Smarter. New Neuroscience Says He’s Right

Jeff Bezos’s morning routine has long included the one-hour rule. New neuroscience says yours probably should too.

DeepSeek Is Chinese But Its AI Models Are From Another Planet

OpenAI and the US are in deep trouble

Lists

Generative AI Recommended Reading

AI Regulation

What is ChatGPT?

ChatGPT prompts

DeepSeek Releases Its Own AI Image Generator, Janus-Pro

DeepSeek says Janus-Pro 7B outperforms OpenAI’s Dall-E 3 and Stable Diffusion in several benchmarks. But is it really that good?

The Fastest Way To Build A One-Person Business (Beginner Guide)

I’m going to give you a 60-day action plan to make your first $1,000.

They Know a Collapse Is Coming

The CIO of Goldman Sachs has said that in the next year, companies at the forefront will begin to use AI agents as if they were employees —…

DeepSeek Just Confirmed My Suspicions About OpenAI

The ChatGPT maker has been playing a losing game

The Rise of Humanness

408 位关注者

Chris Nolan的更多文章

FUTURE-PROOFING LEADERSHIP & HUMANNESS IN THE AGE OF AI!

The Future of Human-Machine Collaboration: Insights from the Jamie Gorman Podcast

AI GOES HOLLYWOOD: PLOT TWISTS, DRAMA, AND METHOD ACTING

"HAPPY VAiLENTINES" LOVE BYTES: THE SEDUCTIVE (AND DECEPTIVE) EMOTIONAL ILLUSION OF AI

The Quantum Human: Mastering AI, Leadership, and the Future of Intelligence with Salim Ismail"

THIS FANATICAL FUTURIST WILL CHANGE HOW YOU SEE AI, HUMANNESS & THE FUTURE

The Quantum Human: Mastering AI, Leadership, and the Future of Intelligence with Salim Ismail"

LOVE BYTES: THE SEDUCTIVE (AND DECEPTIVE) EMOTIONAL ILLUSION OF AI

AI GOES HOLLYWOOD: PLOT TWISTS, DRAMA, AND METHOD ACTING

WHY AI WOULD NEVER CLIMB EVEREST, RUN A 4-MINUTE MILE, OR START A TECH REVOLUTION IN A GARAGE?

社区洞察

其他会员也浏览了

The Brief: The Year of Gen AI (scratch that) Agentic AI

Rogue Talks #2: Will artificial intelligence ruin art?

The Hidden Biases of AI: An Eye-Opening Experiment in a Divided World

The AI Edition: 6/2/2023

Just A Rather Very Intelligent System (J.A.R.V.I.S.) – Your Second Brain

AI: Narrow before Wider, General AI

The dark side of AI

The humans in the machine. And why we need them

How the AI fuse was lit, Sam Altman explains his rehiring, and more! | AI Now Newsletter | Issue 06/12/2023

DALL-E 2 adds 'black'/'female'; Microsoft is To Retire This AI; A radical new project to democratize AI; A million people on DALL-E’s waitlist