Guest segment on evaluating agentic AI in security, and how organizations are cutting through 'AI fatigue' to make real decisions about deployment.
Talks & News
Selected public talks, writing, and news.
Risky Business #837 — Evaluating agentic AI amid AI fatigue
Trust, Then Autonomy: A New Framework for Evaluating Security AI
Talk introducing the five-level autonomy framework, evidence ladder, and three architectural foundations for evaluating earned autonomy in deployed AI systems.
How Sublime's AI Agents Are Secure by Design
Architectural deep-dive on tool scoping, platform-enforced authorization, prompt injection mitigation, and graduated oversight for production LLM agents.
CAMLIS 2025: Evaluating LLM-Generated Detection Rules
Paper accepted at CAMLIS 2025 — an open-source benchmark and three metrics (detection accuracy, economic cost of syntactic correctness, robustness of query) for measuring LLM-generated security rules.
Risky Biz News — The spam/email bombing problem
Sponsor interview on the rising use of spam bombing and email bombing as initial-access techniques in modern intrusion campaigns.