A Real ASI02 Gap Caught Before Shipping

Sun, 15 Feb 2026 00:00:00 +0000

A useful security test does not need drama. Sometimes it only needs to put the wrong sentence in the right field and wait to see where the sentence travels.

During development of an agent catalog, one adversarial test exposed that kind of quiet failure. A support workflow accepted an issue summary, classified it, routed it, and drafted a reply. The ordinary functional tests passed. The deterministic path passed. The local LLM path passed. The workflow produced coherent replies.
