The lies that let AI run amok.

20.12.2025

Research Saturday

0:00

24:36

Darren Meyer, Security Research Advocate at Checkmarx, is sharing their work on "Bypassing AI Agent Defenses with Lies-in-the-Loop." Checkmarx Zero researchers introduce “lies-in-the-loop,” a new attack technique that bypasses human‑in‑the‑loop AI safety controls by deceiving users into approving dangerous actions that appear benign. Using examples with AI code assistants like Claude Code, the research shows how prompt injection and manipulated context can trick both the agent and the human reviewer into enabling remote code execution. The findings highlight a growing risk as AI agents become more common in developer workflows, underscoring the limits of human oversight as a standalone security control. The research can be found here: ⁠Bypassing AI Agent Defenses With Lies-In-The-Loop Learn more about your ad choices. Visit megaphone.fm/adchoices

Więcej odcinków z kanału "Research Saturday"

Więcej odcinków

Odkrywaj najlepsze podcasty dzięki bezpłatnej aplikacji GetPodcast.

Subskrybuj ulubione podcasty, słuchaj odcinków offline i sprawdzaj najlepsze polecane podcasty.

The lies that let AI run amok.

Research Saturday

Więcej odcinków z kanału "Research Saturday"

Picture perfect deception.

Walking on EggStremes.

Don’t trust that app!

Excel-lerating cyberattacks.

The lies that let AI run amok.

Root access to the great firewall.

When macOS gets frostbite.

A new stealer hiding behind AI hype.

Two RMMs walk into a phish…

When clicks turn criminal.