AI in the shadows: From hallucinations to blackmail

Episode: 322 of 333
Duration: 44min
Language: English
Category: Non-fiction

In the first episode of an "AI in the shadows" theme, Chris and Daniel explore the increasingly concerning world of agentic misalignment. Starting with a reminder about hallucinations and reasoning models, they break down how today’s models only mimic reasoning, which raises serious ethical questions. They unpack a fascinating (and slightly terrifying) new study from Anthropic, in which agentic AI models were caught simulating blackmail, deception, and even sabotage, all in the name of goal completion and self-preservation.

Featuring:

  • Chris Benson – Website, LinkedIn, Bluesky, GitHub, X
  • Daniel Whitenack – Website, GitHub, X

Links:

  • Agentic Misalignment: How LLMs could be insider threats
  • Hugging Face Agents Course

Register for upcoming webinars here!

