AI Agent Security

Unprompted 2026

Glass-Box Security: Operationalizing Mechanistic Interpretability | [un]prompted 2026

Learn how activation hooks, cosine similarity, and scalar projection enable behavior-based detection inside LLMs — the glass-box security approach to AI threat detection.

Carl Hurd 25 April 2026

Unprompted 2026

Tenderizing the Target | [un]prompted 2026

Learn how NVIDIA's Project Marinade uses LLM coding agents to inject realistic, tunable vulnerabilities into real codebases, giving you ground-truth benchmarks to evaluate your security tools.

Aaron Grattafiori, Skyler Bingham 22 April 2026

Unprompted 2026

AI Agent Detection Engineering

Learn why AI coding tools break EDR detection rules and how to close the intent attribution gap with process ancestry analysis and agent hooks.

Mika Ayenson 20 April 2026

Unprompted 2026

Vibe Check: Security Failures in AI-Assisted IDEs | [un]prompted 2026

Discover how 37 AI-assisted IDE vulnerabilities across 15+ vendors enable zero-click RCE, prompt injection chains, and silent config poisoning — and how to test your tools.

Piotr Ryciak 15 April 2026

fwd:cloudsec North America 2025

Breaking AI Agents: Exploiting Managed Prompt Templates to Take Over Amazon Bedrock Agents

Learn how attackers exploit Amazon Bedrock agent prompt templates to leak schemas, bypass input validation, and persist malicious instructions across sessions.

Jay Chen, Royce Lu 14 April 2026

Unprompted 2026

Anatomy of an Agentic Personal AI Infrastructure | [un]prompted 2026

Learn how to architect a unified Personal AI Infrastructure (PAI) stack with Council multi-agent debate, the PAI algorithm, and Arbo pipelines to amplify your security engineering practice.

Daniel Miessler 12 April 2026

Unprompted 2026

AI Notetakers: The Most Important Person in the Room | [un]prompted 2026

Discover how AI notetakers introduce prompt injection, viral OAuth expansion, and silent recording into your enterprise — and the controls every security team needs now.

Joe Sullivan 9 April 2026

Unprompted 2026

FENRIR: AI Hunting for AI Zero-Days at Scale | [un]prompted 2026

Discover how Trend Micro's FENRIR engine chains SAST tools, fast LLM triage, and agentic sandboxes to find 60+ CVEs at $8.80 per true positive.

Peter Girnus, Derek Chen 8 April 2026

Unprompted 2026

When Passports Execute: Exploiting AI Driven KYC Pipelines | [un]prompted 2026

Learn how attackers embed prompt injections in passport images to hijack AI KYC agents and exfiltrate customer PII at scale.

Sean Park 7 April 2026

Unprompted 2026

Agents Exploiting Auth-by-One Errors | [un]prompted 2026

Learn how AI agents detect authentication bypasses, MFA bypasses, and authorization bugs using validator reuse and auth transmogrification.

Brendan Dolan-Gavitt, Vincent Olesen 31 March 2026

Unprompted 2026

Guardrails beyond Vibes | [un]prompted 2026

Learn how Stripe built and deployed two production AI security agents using a multi-agent architecture, LLM-as-judge eval pipelines, and a phased rollout.

Jeffrey Zhang, Siddh Shah 3 April 2026

Unprompted 2026

The Hard Part Isn't Building the Agent: Measuring Effectiveness

Learn why precision and recall fail for autonomous AI security agents — and how rubric-based LLM judge evaluation gives your team a reliable deployment bar.

Joshua Saxe 31 March 2026

OWASP Global AppSec USA 2025

Attacking AI

Learn a proven 7-phase AI red teaming methodology, prompt injection taxonomy, and real enterprise case studies for assessing LLM systems.

Jason Haddix 28 March 2026

OWASP Global AppSec USA 2025

Indirect Prompt Injection: Architectural Testing Approaches for Real World AI/ML Systems

Learn to threat-model AI agents for indirect prompt injection: enumerate tools, map AI-specific attack vectors, and automate dynamic testing with Tampermonkey.

Will Vandevanter 25 March 2026
