5 min read
Research

How to Run the Backbone Breaker Benchmark (b3)

Learn how to run the Backbone Breaker Benchmark (b3) to evaluate how resilient backbone LLMs are against real adversarial attacks drawn from Gandalf: Agent Breaker.
Julia Bazinska
March 11, 2026