v0.7.2 hardens governance validation with negation-aware CRITICAL escalation, catches prompt injection split across lines, and detects settings that silently eliminate human oversight.
v0.6.3 adds 3 new checks (Claude settings, credential storage, unicode steganography), maps every finding to the OWASP Agentic Top 10, and adds CVE-specific detection patterns.
Behavioral rules for AI agents are text in the context window. Under pressure â deep in a fix loop, resolving conflicting instructions, running low on context â the model rationalizes around them. This isn’t a hypothetical failure mode. It’s documented.