The Cyber Archive
Security topic

AI Agent Security

All deep dives for infosec conference talks covering AI agent security, with every talk analyzed in full.

14 deep dives
6 conferences

Latest deep dives

Glass-Box Security: Operationalizing Mechanistic Interpretability | [un]prompted 2026
Unprompted 2026

Learn how activation hooks, cosine similarity, and scalar projection enable behavior-based detection inside LLMs — the glass-box security approach to AI threat detection.

Carl Hurd 25 April 2026
Tenderizing the Target | [un]prompted 2026
Unprompted 2026

Learn how NVIDIA's Project Marinade uses LLM coding agents to inject realistic, tunable vulnerabilities into real codebases, giving you ground-truth benchmarks to evaluate your security tools.

Aaron Grattafiori, Skyler Bingham 22 April 2026
AI Agent Detection Engineering
Unprompted 2026

Learn why AI coding tools break EDR detection rules and how to close the intent attribution gap with process ancestry analysis and agent hooks.

Mika Ayenson 20 April 2026
Vibe Check: Security Failures in AI-Assisted IDEs | [un]prompted 2026
Unprompted 2026

Discover how 37 AI-assisted IDE vulnerabilities across 15+ vendors enable zero-click RCE, prompt injection chains, and silent config poisoning — and how to test your tools.

Piotr Ryciak 15 April 2026
Breaking AI Agents: Exploiting Managed Prompt Templates to Take Over Amazon Bedrock Agents
fwd:cloudsec North America 2025

Learn how attackers exploit Amazon Bedrock agent prompt templates to leak schemas, bypass input validation, and persist malicious instructions across sessions.

Jay Chen, Royce Lu 14 April 2026
Anatomy of an Agentic Personal AI Infrastructure | [un]prompted 2026
Unprompted 2026

Learn how to architect a unified Personal AI Infrastructure (PAI) stack with Council multi-agent debate, the PAI algorithm, and Arbo pipelines to amplify your security engineering practice.

Daniel Miessler 12 April 2026
AI Notetakers: The Most Important Person in the Room | [un]prompted 2026
Unprompted 2026

Discover how AI notetakers introduce prompt injection, viral OAuth expansion, and silent recording into your enterprise — and the controls every security team needs now.

Joe Sullivan 9 April 2026
FENRIR: AI Hunting for AI Zero-Days at Scale | [un]prompted 2026
Unprompted 2026

Discover how Trend Micro's FENRIR engine chains SAST tools, fast LLM triage, and agentic sandboxes to find 60+ CVEs at $8.80 per true positive.

Peter Girnus, Derek Chen 8 April 2026
When Passports Execute: Exploiting AI Driven KYC Pipelines | [un]prompted 2026
Unprompted 2026

Learn how attackers embed prompt injections in passport images to hijack AI KYC agents and exfiltrate customer PII at scale.

Sean Park 7 April 2026
Agents Exploiting Auth-by-One Errors | [un]prompted 2026
Unprompted 2026

Learn how AI agents detect authentication bypasses, MFA bypasses, and authorization bugs using validator reuse and auth transmogrification.

Brendan Dolan-Gavitt, Vincent Olesen 31 March 2026
Guardrails beyond Vibes | [un]prompted 2026
Unprompted 2026

Learn how Stripe built and deployed two production AI security agents with multi-agent architecture, LLM-as-judge eval pipelines, and phased rollout.

Jeffrey Zhang, Siddh Shah 3 April 2026
The Hard Part Isn't Building the Agent: Measuring Effectiveness
Unprompted 2026

Learn why precision and recall fail for autonomous AI security agents — and how rubric-based LLM judge evaluation gives your team a reliable deployment bar.

Joshua Saxe 31 March 2026
Attacking AI
OWASP Global AppSec USA 2025

Learn a proven 7-phase AI red teaming methodology, prompt injection taxonomy, and real enterprise case studies for assessing LLM systems.

Jason Haddix 28 March 2026
Indirect Prompt Injection: Architectural Testing Approaches for Real World AI/ML Systems
OWASP Global AppSec USA 2025

Learn to threat-model AI agents for indirect prompt injection: enumerate tools, map AI-specific attack vectors, and automate dynamic testing with TamperMonkey.

Will Vandevanter 25 March 2026