Agents Need CI, Not Vibes: Evaluating Microsoft 365 Copilot Agents

Microsoft 365 Copilot agents are crossing the line from demo artifacts into software products. Once that happens, manual spot checks are not enough. A production agent needs a release discipline: evaluation datasets, judge configuration, thresholds, CI/CD gates, evidence packages, and regression memory. Not as governance theatre. As the shortest safe path from “nice demo” to “we can ship this and explain why.” This is the blueprint I would use to move a Copilot agent from vibe-based confidence to governed delivery. ...

May 15, 2026 · 18 min · 3701 words · Pavel Nasovich

When the Scanner Turned: Inside the Trivy Supply Chain Attack and the Rise of CanisterWorm

In March 2026, attackers turned Aqua Security’s Trivy ecosystem into a credential-harvesting distribution channel. This was not one bug, one poisoned package, or one bad release. It was a chained failure across GitHub Actions trust, secret rotation, mutable tags, runner memory, registry publishing, and npm’s default willingness to execute third-party code. On February 27 and February 28, 2026, the Trivy story started the way a lot of modern software compromises start: not with a zero-day in the scanner, but with automation glued together too loosely around trust. An autonomous agent dubbed hackerbot-claw found a dangerous pull_request_target pattern in Aqua Security’s Trivy repository, exploited it, and stole a privileged aqua-bot token. That first breach was bad enough on its own. The real disaster came after the first incident was supposedly contained. ...

March 24, 2026 · 17 min · 3422 words · Pavel Nasovich