You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Community-driven behavioral reliability benchmark for LLMs. 231 probes across 19 modules, deterministic scoring, perplexity correlation, layer sensitivity mapping, quant method capture, hardware-stratified community rankings. Every test contributes to the community dataset.
Open-source governance and compliance framework for AI agents — cross-provider policy enforcement, audit trails, and standards mapping across OWASP MCP Top 10, NIST AI RMF, and EU AI Act
Agent Compliance SDK - trust your agents in production. Turn what your agent handles into the controls you need. Data classification driven agent runtime security controls. Scale compliance to your agents automatically.
Inverse Turing test for AI agents. Procedurally generated challenges that prove substrate, autonomy, and intent — things a human can't fake. Self-hosted, open source, MIT licensed.