The Cyber Archive
Security domain
A

AI/ML Security

All Deep Dives For Infosec Conference Talks Covering AI/ML Security. Talks analyzed in full.

32 deep dives
7 conferences

Latest deep dives

Kinetic Risk: Securing and Governing Physical AI in the Wild | [un]prompted 2026
Unprompted 2026

Kinetic Risk: Securing and Governing Physical AI in the Wild | [un]prompted 2026

Learn how physical AI security differs from digital AI risk and why latency is a safety parameter, not a performance metric, in autonomous systems.

Padma Apparao 28 April 2026
Securing Workspace GenAI at Google Speed | [un]prompted 2026
Unprompted 2026

Securing Workspace GenAI at Google Speed | [un]prompted 2026

Learn how Google's Workspace security team built a defense-in-depth architecture against indirect prompt injection and rogue agent actions in production GenAI systems.

Nicolas Lidzborski 27 April 2026
Glass-Box Security: Operationalizing Mechanistic Interpretability | [un]prompted 2026
Unprompted 2026

Glass-Box Security: Operationalizing Mechanistic Interpretability | [un]prompted 2026

Learn how activation hooks, cosine similarity, and scalar projection enable behavior-based detection inside LLMs — the glass-box security approach to AI threat detection.

Carl Hurd 25 April 2026
Hooking Coding Agents with the Cedar Policy Language | [un]prompted 2026
Unprompted 2026

Hooking Coding Agents with the Cedar Policy Language | [un]prompted 2026

Learn how to build a Cedar-based policy harness that hooks into Gemini CLI, Claude Code, and Cursor to enforce ABAC rules, track PII taint, and block AI agent data exfiltration.

Matt Maisel 24 April 2026
Tenderizing the Target | [un]prompted 2026
Unprompted 2026

Tenderizing the Target | [un]prompted 2026

Learn how NVIDIAs Project Marinade uses LLM coding agents to inject realistic, tunable vulnerabilities into real codebases - giving you ground-truth benchmarks to evaluate your security tools.

Aaron Grattafiori Skyler Bingham 22 April 2026
Detecting GenAI Threats at Scale with YARA-Like Semantic Rules
Unprompted 2026

Detecting GenAI Threats at Scale with YARA-Like Semantic Rules

Learn how SuperYARA combines semantic similarity, ML classifiers, and LLM rules to detect prompt injection and GenAI threats at scale — with 99% cost reduction via pre-filtering.

Mohamed Nabeel 21 April 2026
AI Agent Detection Engineering
Unprompted 2026

AI Agent Detection Engineering

Learn why AI coding tools break EDR detection rules and how to close the intent attribution gap with process ancestry analysis and agent hooks.

Mika Ayenson 20 April 2026
SIFT-FIND EVIL I Gave Claude Code R00t on DFIR SIFT Workstation | [un]prompted 2026
Unprompted 2026

SIFT-FIND EVIL I Gave Claude Code R00t on DFIR SIFT Workstation | [un]prompted 2026

Learn how Rob T. Lee gave Claude Code root on the SIFT Workstation and completed a full DFIR investigation — disk image, memory, event logs, MITRE ATT&CK mapping — in under 15 minutes.

Rob T Lee 19 April 2026
Three Phases of AI Adoption | [un]prompted 2026
Unprompted 2026

Three Phases of AI Adoption | [un]prompted 2026

Learn the 3 phases of enterprise AI adoption in cybersecurity — and why access, cost, and culture must be solved in order.

Chase Hasbrouck 18 April 2026
Enterprise AI Governance at Snowflake | [un]prompted 2026
Unprompted 2026

Enterprise AI Governance at Snowflake | [un]prompted 2026

Learn how Snowflake built an enterprise AI governance model that keeps pace with weekly vendor releases and autonomous coding agents — without killing developer productivity.

Ragini Ramalingam 17 April 2026
Establishing AI Governance Without Stifling Innovation | [un]prompted 2026
Unprompted 2026

Establishing AI Governance Without Stifling Innovation | [un]prompted 2026

Learn how to build a tiered AI governance framework that balances enterprise AI security with innovation — from intake scoring to human oversight gates.

Billy Norwood 16 April 2026
Bypassing AI Security Controls with Prompt Formatting
Fwd cloudsec north america 2025

Bypassing AI Security Controls with Prompt Formatting

Learn how prompt formatting attacks bypass AWS Bedrock Guardrails PII filters without injection — and how system prompt engineering fights back.

Nathan Kirk 16 April 2026
Vibe Check: Security Failures in AI-Assisted IDEs | [un]prompted 2026
Unprompted 2026

Vibe Check: Security Failures in AI-Assisted IDEs | [un]prompted 2026

Discover how 37 AI-assisted IDE vulnerabilities across 15+ vendors enable zero-click RCE, prompt injection chains, and silent config poisoning — and how to test your tools.

Piotr Ryciak 15 April 2026
Securing organizations ML & LLMops deployments : A platform architects journey onboarding LLM & MLops tools and securing multi-cloud data access
Fwd cloudsec north america 2025

Securing organizations ML & LLMops deployments : A platform architects journey onboarding LLM & MLops tools and securing multi-cloud data access

Learn to close the real security gaps in AWS Bedrock and Azure AI defaults — IAM, guardrails, private networking, and confused deputy risks in agentic pipelines.

Sai Gunaranjan Kyler Middleton 14 April 2026
Breaking AI Agents: Exploiting Managed Prompt Templates to Take Over Amazon Bedrock Agents
Fwd cloudsec north america 2025

Breaking AI Agents: Exploiting Managed Prompt Templates to Take Over Amazon Bedrock Agents

Learn how attackers exploit Amazon Bedrock agent prompt templates to leak schemas, bypass input validation, and persist malicious instructions across sessions.

Jay Chen Royce Lu 14 April 2026
Black-hat LLMs | [un]prompted 2026
Unprompted 2026

Black-hat LLMs | [un]prompted 2026

Discover how LLMs now autonomously find and exploit zero-day vulnerabilities in the Linux kernel and Ghost CMS — and what the AI capability curve means for defenders right now.

Nicholas Carlini 13 April 2026
Zeal of the Convert: Taming Shai-Hulud with AI | [un]prompted 2026
Unprompted 2026

Zeal of the Convert: Taming Shai-Hulud with AI | [un]prompted 2026

Learn how AI workflows, reasoning models, and feedback loops turned a two-week manual investigation into a two-day operation that identified 2,400 supply chain attack victims.

Rami Mccarthy 11 April 2026
AI Notetakers: The Most Important Person in the Room | [un]prompted 2026
Unprompted 2026

AI Notetakers: The Most Important Person in the Room | [un]prompted 2026

Discover how AI notetakers introduce prompt injection, viral OAuth expansion, and silent recording into your enterprise — and the controls every security team needs now.

Joe Sullivan 9 April 2026
FENRIR: AI Hunting for AI Zero-Days at Scale | [un]prompted 2026
Unprompted 2026

FENRIR: AI Hunting for AI Zero-Days at Scale | [un]prompted 2026

Discover how Trend Micro's FENRIR engine chains SAST tools, fast LLM triage, and agentic sandboxes to find 60+ CVEs at $8.80 per true positive.

Peter Girnus Derek Chen 8 April 2026
When Passports Execute: Exploiting AI Driven KYC Pipelines | [un]prompted 2026
Unprompted 2026

When Passports Execute: Exploiting AI Driven KYC Pipelines | [un]prompted 2026

Learn how attackers embed prompt injections in passport images to hijack AI KYC agents and exfiltrate customer PII at scale.

Sean Park 7 April 2026
Developing & Deploying AI Fingerprints | [un]prompted 2026
Unprompted 2026

Developing & Deploying AI Fingerprints | [un]prompted 2026

Learn how Binary Shield uses AI fingerprinting to detect and share prompt injection threats across all LLM services in your portfolio — privacy-safe and 36x faster.

Natalie Isak Waris Gill 31 March 2026
Guardrails beyond Vibes | [un]prompted 2026
Unprompted 2026

Guardrails beyond Vibes | [un]prompted 2026

Learn how Stripe built and deployed two production AI security agents with multi-agent architecture, LLM-as-judge eval pipelines, and phased rollout.

Jeffrey Zhang Siddh Shah 3 April 2026
Security Guidance as a Service | [un]prompted 2026
Unprompted 2026

Security Guidance as a Service | [un]prompted 2026

Learn how Adobe built a RAG-powered security guidance platform delivering org-specific recommendations across Jira, Slack, and IDE at scale.

Shruti Datta Gupta Chandrani Mukherjee 1 April 2026
The Hard Part Isn't Building the Agent: Measuring Effectiveness
Unprompted 2026

The Hard Part Isn't Building the Agent: Measuring Effectiveness

Learn why precision and recall fail for autonomous AI security agents — and how rubric-based LLM judge evaluation gives your team a reliable deployment bar.

Joshua Saxe 31 March 2026
1 2