Posts

2026

Improving a Coding Agent Harness: Part 3, Scoring 100% on Coding Benchmarks
Improving a Coding Agent Harness: Part 2.5, Securely Writing Code
·2589 words·13 mins
Improving a Coding Agent Harness: Part 2, Writing Code
·2333 words·11 mins
Improving a Coding Agent Harness: Part 1.5, Securely Reading Code
·1622 words·8 mins
Improving a Coding Agent Harness: Part 1, Reading Code
·1807 words·9 mins
TOCTOU Race Conditions in Multi-Agent Systems
·677 words·4 mins
Datalog for Agent Security Analysis
·986 words·5 mins
Creating Custom Security Evaluation Harnesses for Agent Systems
·967 words·5 mins
Automating Novel Prompt Injection Discovery for Mozilla's 0din
·1682 words·8 mins
How ReBAC can Limit the Blast Radius of Agent Composition Flaws
·2259 words·11 mins