The coming AI security crisis (and what to do abou...

The coming AI security crisis (and what to do about it) | Sander Schulhoff

December 21, 2025 by Lenny Rachitsky

Share: Copied!

Get the full episode insights!

Enter your email below to get notified about more insights from:

Lenny's Podcast: Product | Career | Growth

This episode is titled:

The coming AI security crisis (and what to do about it) | Sander Schulhoff

Notable Quotes

"The only reason there hasn't been a massive attack yet is how early the adoption is, not because it's secure."

"You can patch a bug, but you can't patch a brain."

"Guardrails do not work; they just don't work."

Episode Summary

In this episode, host Lenny speaks with Sander Schulhoff, a leading researcher on AI security, specifically focusing on the significant vulnerabilities in current AI systems. Schulhoff asserts that AI 'guardrails'—designed to prevent misuse—are fundamentally ineffective. He emphasizes that as AI adoption grows, so do risks, particularly through vulnerabilities such as prompt injection and jailbreaking, where users can manipulate AI into performing harmful actions.

Schulhoff shares his extensive experience in the field, detailing how he has been instrumental in developing red teaming competitions that expose how easily AI systems can be tricked. He argues that the only reason we haven't faced massive attacks yet is due to the relatively early stages of AI adoption and not because these systems are secure. He stresses that existing defenses often provide a false sense of security, as they typically fail to mitigate many possible attack vectors.

The conversation also delves into how rising AI technologies—like automated agents and AI-powered systems—create new attack surfaces. Schulhoff warns of the future potential for real-world damages arising from insecure AI implementations, not merely theoretical risks.

Towards the end, Sander suggests that companies can bolster their AI security posture by implementing stricter data permissions and leveraging frameworks such as Camel to manage permissions effectively. Education, awareness of risks, and the need for adaptive defenses are also key takeaways from the episode. Schulhoff concludes by encouraging a holistic approach to AI security that combines classical cybersecurity practices with a deeper understanding of how AI systems can be manipulated.

Key Takeaways

AI guardrails do not effectively prevent malicious attacks.

Prompt injection and jailbreaking remain critical vulnerabilities in AI systems.

Existing AI defenses provide a false sense of security; many models can still be easily exploited.

Companies should focus on proper data permissions and operational security practices.

Education and awareness about AI security risks are paramount.

The coming AI security crisis (and what to do about it) | Sander Schulhoff

Get the full episode insights!

Lenny's Podcast: Product | Career | Growth

The coming AI security crisis (and what to do about it) | Sander Schulhoff

Notable Quotes

Episode Summary

Key Takeaways

More Podcast Insights

US-Iran Ceasefire Agreed (For Now)

US-Iran ceasefire: has Tehran played Trump? – The Latest

RunPod Sprint! Megan Boxall

£15k/month Health Anxiety App - Should They Take Investment or Stay Bootstrapped...