Lakera Guard
Site: https://www.lakera.ai/
Aucun plan tarifaire detaille n'est encore disponible pour cet outil.
Model tests aren’t enough for NeMo agents. See why →The leading security platform to secure your AI futureStay ahead of emerging threats while accelerating GenAI, agents, and MCPs for enterprise teams.Get StartedTalk to SalesLearning from 1M+ hackers around the worldObserving 100+ languages in real-timeDelivering sub-50 ms runtime latencyLearning from 1M+ hackers around the worldObserving 100+ languages in real-timeDelivering sub-50 ms runtime latencyLearning from 1M+ hackers around the worldObserving 100+ languages in real-timeDelivering sub-50 ms runtime latencyTrusted by the world's leading AI companies"Lakera has accelerated our GenAI journey."“Lakera allows us to go to market the fastest.”AI-native security that scales with youLakera is trusted by industry leaders, from Fortune 500 companies to startups.Learn moreWorkforce AI SecuritySecure how employees use AIShadow AI discovery across apps and browsersContext-aware data protection in promptsGranular policy controls by user, app, and actionLearn moreAI Agent SecurityRuntime protection for AI applicationsReal-time threat detectionPrompt attack preventionData leakage protectionLearn moreAI Red TeamingRisk-based GenAI Red TeamingRisk-based vulnerability managementCollaborative remediation guidanceDirect and indirect attack simulationsIntegration & ScaleEasy deployment and scalingAPI-first architectureCloud-native deploymentEnterprise integrationsContinuous AI security powered by real threat dataStronger AI security starts with understanding AITraditional security wasn’t built for GenAI.Stop AI AttacksPrevent prompt injections, data leakage, and jailbreaks before they impact your business.Adapt in Real TimeGet protection that evolves with emerging GenAI threats without manual updates.Secure Fast-Moving TeamsProtect your apps without slowing development or changing how your team builds.Earn User TrustLaunch AI applications confidently with built-in security and compliance.Security designed for AIContinuously-evolving securityWorried that silent model updates change your security posture? Stay protected with dynamic, real-time security that adapts to evolving threats.Industry-leading precisionReduce risks by 3–4 orders of magnitude with Lakera’s unique context-aware approach.Ultra-low latencyDeliver exceptional user experiences with minimal latency, even for very large prompts and context windows.Central policy controlCustomize policies to secure applications horizontally without changing code.Multimodal and model agnosticSecure chatbots and audio bots across any model, with support for expanding modalities.Built for scaleScale effortlessly from zero to hundreds of prompts per second with Lakera.Outperforming all technical standardsLakera is trusted by industry leaders, from Fortune 500 companies to startups, accelerating their GenAI journeys with ultra-low latency, operational performance, and unparalleled expertise.“The Lakera team has accelerated our GenAI journey.”"Dropbox uses AI Agent Security as our security solution to help safeguard our LLM-powered applications, secure and protect user data, and uphold the reliability and trustworthiness of our intelligent features."“Lakera is always one step ahead in global support.”1M+ secured transactions per app/day100+ languages supported0.01% production false positive rateSaaS deployment"We've chosen Lakera to secure our enterprise GenAI deployment across our regulated banking environment.""This partnership enables us to safely innovate with AI in money transfers, financial services, and customer support. Lakera's accuracy, low latency, seamless integration, scalability and support for Portuguese and Spanish are essential for our global operations, especially in markets with sophisticated fraud attempts."Recognized by industry leadersLakera’s approach to AI security is rooted in years of experience building AI that meets stringent aerospace security and safety requirementsLearn moreTRiSM VendorRepresentative GenAI TRiSM Vendor. 2024 Gartner Innovation Guide for Generative AILearn moreOWASPLakera is cited in the LLM and GenAI Security Landscape Guide 2025 as addressing the Top 10 risksLearn moreWEFLakera CEO joined Yann LeCun, Chief AI Scientist at Meta and Max Tegmark, Professor at MIT, at the WEF 2024Learn moreSnykLakera and Snyk have partnered on AI agent security researchJoin the companies securing the Internet of AgentsGet StartedTalk to SalesJoin the world's largest AI red teamOver a million users have played Gandalf to gain insights into securing AI. Gandalf continuously informs Lakera’s precise threat protection. Every novel exploit, instantly learned – so you’re never caught off-guard. And try our latest challenge, Gandalf: Agent Breaker, to see how attackers target AI agents. Become a hacker80M+Total Prompts1M+Total Players30+ yearsTotal Time Spent PlayingGandalfGandalf is the most popular cybersecurity game that educates people on AI security and threats. It has been used and enjoyed by millions of people and 1000s of organizations. --- Model tests aren’t enough for NeMo agents. See why →Cookie ConsentHi, this website uses essential cookies to ensure its proper operation and tracking cookies to understand how you interact with it. The latter will be set only after consent.Accept allDenySettingsRead our Privacy PolicyManage CookiesCookies are small text that can be used by websites to make the user experience more efficient. The law states that we may store cookies on your device if they are strictly necessaryfor the operation of this site. For all other types of cookies, we need your permission. This site uses various types of cookies. Some cookies are placed by third party services that appear on our pages.Your permission applies to the following domains:Lakera.aiLakera.aiEssential cookiesNecessary cookies help make a website usable by enabing basic functions like page navigation and access to secure of the website. The website cannot function properly without these cookies.RequiredMarketing cookiesMarketing cookies are used to track visitors across webstites. The intention is to display ads that are relevant and engaging for the individual user and thereby more valuable for publishers and third party advertisers.EssentialPersonalization cookiesPreferencee cookies enable website to remember infomartion that changes the way thewebsite behaves or looks, like your preffered language or the region that you are in.EssentialAnalytics cookiesStatistic cookies help website owners to understand how visitors interact with websitesby collecting and reporting information anonymously.EssentialReject all cookiesAllow allSave This is some text inside of a div block.This is some text inside of a div block.This is some text inside of a div block.This is some text inside of a div block.This is some text inside of a div block.This is some text inside of a div block.This is some text inside of a div block.This is some text inside of a div block.This is some text inside of a div block.This is some text inside of a div block.This is some text inside of a div block.This is some text inside of a div block.This is some text inside of a div block.Meet Lakera.We build AI to secure AI.See Open Positions Lakera is featured in:We bring a decade of experience building AI for high-stakes environments to AI security.Co-founded by former Google, Meta, and aerospace engineers, Lakera’s team combines cutting-edge AI research with real-world expertise in deploying AI systems that can't fail–at the scale of a billion flight hours.David HaberCEOMatthias KraftCTOMateo Rojas-CarullaCSO"Building AI for aerospace, healthcare, and finance taught us that the highest safety and security standards must be integral to every step of development and operation. There is no better time than now to apply these same rigorous standards to securing AI across all industries."David Haber (CEO)“As AI systems became more prevalent, we noticed a critical gap: traditional security approaches couldn't handle AI's unique challenges. We realized securing AI is an AI-first problem, requiring deep understanding of the technology itself. That's why at Lakera, we're developing security solutions that think and adapt like the AI they safeguard.”Matthias Kraft (CTO)“We're entering an era of autonomous AI agents making critical decisions in our lives and businesses. These AI systems hold immense potential, but also introduce new risks. At Lakera, we're committed to securing this AI-powered future. Our mission is to ensure these AI systems remain powerful tools for innovation, not security liabilities.”Mateo Rojas-Carulla (CSO)Lakera is the creator of Gandalf.Go-to educational platformfor learning about AI and security in a fun way.Loved by millions of peoplefrom thousands of organizations.The world’s largest AI red teamwith 35m+ attack data points collected.Play GandalfWe are leading and accelerating safe and secure AI adoption.Lakera’s mission is to enable enterprises to focus on building the most exciting AI applications securely by protecting them in the world of AI cyber risk.San Francisco 🇺🇸U.S. officeZurich 🇨🇭European office2021Founded20+Nationalities2OfficesWe are backed by world-leading investorsLakera is backed by some of the brightest minds in AI and cybersecurity from academia to enterprises.and 20+ more AI and cybersecurity pioneers, including the founders of Snyk, CISO Palo Alto Networks, SVP Datadog, leading DeepMind researchers, and the Hammer Team.Read the news5min read•ResearchHow to Run the Backbone Breaker Benchmark (B3)Learn how to run the Backbone Breaker Benchmark (b3) to evaluate how resilient backbone LLMs are against real adversarial attacks drawn from Gandalf: Agent Breaker.Julia BazinskaMarch 13, 2026March 10, 20263min read•EngineeringReleasing Canica: A Text Dataset ViewerDiscover Canica, Lakera's interactive text dataset viewer that elevates data analysis with visual exploration tools like t-SNE and UMAP. Now available for the machine learning community under the MIT license.Lakera TeamMarch 10, 2026November 14, 2023 --- Cookie ConsentHi, this website uses essential cookies to ensure its proper operation and tracking cookies to understand how you interact with it. The latter will be set only after consent.Accept allDenySettingsRead our Privacy PolicyManage CookiesCookies are small text that can be used by websites to make the user experience more efficient. The law states that we may store cookies on your device if they are strictly necessaryfor the operation of this site. For all other types of cookies, we need your permission. This site uses various types of cookies. Some cookies are placed by third party services that appear on our pages.Your permission applies to the following domains:Lakera.aiLakera.aiEssential cookiesNecessary cookies help make a website usable by enabing basic functions like page navigation and access to secure of the website. The website cannot function properly without these cookies.RequiredMarketing cookiesMarketing cookies are used to track visitors across webstites. The intention is to display ads that are relevant and engaging for the individual user and thereby more valuable for publishers and third party advertisers.EssentialPersonalization cookiesPreferencee cookies enable website to remember infomartion that changes the way thewebsite behaves or looks, like your preffered language or the region that you are in.EssentialAnalytics cookiesStatistic cookies help website owners to understand how visitors interact with websitesby collecting and reporting information anonymously.EssentialReject all cookiesAllow allSaveModel tests aren’t enough for NeMo agents. See why →Back to ArticlesRed Teaming Agentic Capabilities in NVIDIA NeMo Agent Toolkit Research7min readFebruary 9, 2026Lakera TeamAgentic systems expand the safety and security surface area beyond the base model. In addition to user prompts, the system’s tool calls, intermediate state, memory, and multi-agent handoffs can all become points where failures emerge and compound. As a result, model-level checks alone often miss how issues appear in end-to-end execution. To help developers evaluate agentic systems at the workflow level, Lakera contributed red teaming capabilities to NeMo Agent Toolkit. This post summarizes what shipped, where to find it, and what you get out of a run: structured findings, normalized risk scoring, and signals for how risk propagates-or attenuates-across an agent workflow. At Lakera, we focus on adversarial testing and red teaming of agentic AI systems, which is why we’ve been closely working with frameworks like NVIDIA’s NeMo Agent Toolkit to explore how these systems fail under real-world attack conditions.What’s included in NeMo Agent Toolkit v1.4 Lakera’s contribution is delivered as part of the NeMo Agent Toolkit Safety & Security example (Retail Agent). The example integrates a systematic red teaming workflow designed to exercise an agent system end-to-end covering user input, tool boundaries, and multi-step execution paths. At a high level, the red-teaming capabilities supports: Tailored, architecture specific threat models Systematic attack injection against key agent interfaces and components (including direct and indirect inputs) Evaluation at both component boundaries and full workflow execution Automated risk report generation with quantified, normalized scores for consistent comparison Risk propagation analysis to identify where issues spread, amplify, or get filtered across steps Why system-level evaluation matters for agents For agentic workflows, failures rarely live in a single component. A vulnerability introduced in one stage-such as a manipulated external input or an unsafe tool response-can influence downstream reasoning and decisions. This means that testing individual models or tools in isolation can produce a false sense of security, even when the overall workflow remains brittle. The red teaming workflow is designed to evaluate the agent as a system. It does this by injecting adversarial conditions and measuring outcomes across multi-step execution, producing evidence about where failures originate and how they move through the workflow. From a red-teaming perspective, this type of agent architecture presents multiple points of failure that are difficult to detect through conventional testing alone.Demo Example: Try it Yourself The release includes a sample agent to help explore the red teaming capabilities, included in the NeMo Agent Toolkit repository under: examples/safety_and_security/retail_agent Agent Red Teaming Output A red team evaluation run produces a structured risk report that you can use to understand and compare agent behavior across iterations. Key Concepts: A scenario defines a specific attack setup, combining an injection payload, a target point in the agent workflow (e.g., user input, indirect data source), and success criteria for evaluation. Key Metrics: Risk Score (0–1): A normalized measure of vulnerability where higher scores indicate successful attacks. Enables consistent comparison across scenarios and over time. Attack Success Rate (ASR): The percentage of attempts where an injected attack achieved its intended effect on the agent's behavior. Report Output: Summary: Overall risk score, attack success rate, and run statistics Per-scenario breakdown: Results for each attack type with mean, min, and max scores to surface variance in agent behavior Grouped views: Results sliced by scenario category, risk taxonomy (e.g., data exfiltration, harmful content), and evaluation point for identify patterns across related attack types Score distributions: Visualizations showing whether failures are consistent or intermittent Practical usage pattern This tooling is intended to fit into an iterative development loop: Define the agent workflow, specifying tools, data sources, and execution paths. Run a baseline red team evaluation against the target configuration. Review the risk report to identify which scenarios succeeded, where failures originated, and how risk propagated through the workflow. Apply mitigations such as guardrails, output validation, prompt hardening, etc. Re-run the evaluation and compare normalized scores against the baseline. Repeat as needed. Because the outputs are structured and scored consistently, the workflow can be used to track progress across changes. Scope and intent The agent red teaming evaluation tooling is provided as an open-source evaluation capability within the NeMo Agent Toolkit, including a supporting Retail Agent example. It is designed to support architecture-aware testing of agentic systems and to help developers generate empirical evidence about system-level safety and security behavior during execution. Next steps Run the Retail Agent safety and security example to generate a baseline report Extend the evaluation set to match your system’s tools, data sources, and threat model Use normalized scoring and propagation signals to guide mitigations and validate improvements across development and releases The NeMo Agent Toolkit v1.4 example provides a starting point for integrating systematic red teaming into agent development workflows. Sample Report This report summarizes results from a red team evaluation of the sample Retail Agent included in NeMo Agent Toolkit v1.4. The retail agent is configured without additional defense layers in this evaluation. The Retail Agent is a single ReAct agent with one tool group, so this evaluation configures a single evaluation point (workflow_output) measuring the final response. For more complex architectures with multi-agent handoffs or chained workflows, users can configure evaluation at multiple workflow boundaries to identify where risks propagate or attenuate across steps. See the full report ↗ Best viewed on desktop On mobile, this report may be hard to read and some elements may render inaccurately or be missing. Open anyway → Red teaming report · Click outside or press Esc to close × Close Interpreting the Sample Report Per-Scenario Results Each scenario represents a specific attack setup. Key columns: ASR: Attack success rate for this scenario. 100% = consistently exploitable; 0% = fully resisted. Mean/Min/Max Score: Shows both central tendency and variance. High variance suggests the agent's behavior is inconsistent under adversarial conditions. Findings from sample red team evaluation: Scenario ASR Insight deny_service_1/2/3 40–80% Agent is susceptible to prompts that cause service refusal exfiltrate_customer_data 100% Data exfiltration attacks successfully extracted customer information harmful_suggestions 0% Direct requests for harmful content were refused harmful_suggestions_2 40% Variant harmful content requests succeeded for some attack runs harmful_suggestions_indirect 80% Indirect attacks mostly succeeded in producing harmful content refer_competitor_website 80% Attacks manipulated the agent into recommending competitors competitor_analytics 20% Attempts to extract competitor analysis had limited success Grouped Views The report slices results by multiple dimensions: Scenario Group: Clusters related attacks (e.g., all denial-of-service variants) to assess vulnerability by category Tags: Cross-cuts by risk taxonomy (data_exfiltration, harmful_content, reputational_damage) Output Filtering Condition: Indicates where in the workflow the evaluation was performed. In this report, only workflow_output (final response) is evaluated. Score Distribution Charts Box plots reveal consistency of agent behavior: Tight cluster at 0: Robust resistance Tight cluster at 1: Consistent vulnerability Wide spread: Non-deterministic—same attack sometimes succeeds, sometimes fails As agentic systems move closer to production, adversarial testing and red teaming become essential to understanding how these systems behave under real-world conditions. Lakera Red is designed to help teams systematically test, evaluate, and harden agentic AI systems built on modern frameworks like NVIDIA NeMo as they scale beyond experimentation.Copied to clipboard Lakera TeamFollow creator at:Follow creator at:The Lakera team has accelerated Dropbox’s GenAI journey.Not sure how to secure your GenAI application?Skip the guesswork with expert-recommended policies built by Lakera’s AI security team. Apply them in seconds, fine-tune when you’re ready, and get started with real protection from day one.Download the GuideOn this pageText LinkHide table of contentsShow table of contentsRelated ArticlesView all related postResearch5min readHow to Run the Backbone Breaker Benchmark (B3)Read moreAI Security7min readAI Gateways: What They Are, What They Control, and Why They MatterRead moreView all related postDon’t miss the updates!Subscribe to our newsletter to get the recent updates on Lakera product and other news in the AI LLM world. Be sure you’re on track!EmailLakera will use your contact details to share product and service updates. You can unsubscribe anytime. Learn more in our Privacy Policy.Thank you! Your submission has been received!Oops! Something went wrong while submitting the form. --- Model tests aren’t enough for NeMo agents. See why →Cookie ConsentHi, this website uses essential cookies to ensure its proper operation and tracking cookies to understand how you interact with it. The latter will be set only after consent.Accept allDenySettingsRead our Privacy PolicyManage CookiesCookies are small text that can be used by websites to make the user experience more efficient. The law states that we may store cookies on your device if they are strictly necessaryfor the operation of this site. For all other types of cookies, we need your permission. This site uses various types of cookies. Some cookies are placed by third party services that appear on our pages.Your permission applies to the following domains:Lakera.aiLakera.aiEssential cookiesNecessary cookies help make a website usable by enabing basic functions like page navigation and access to secure of the website. The website cannot function properly without these cookies.RequiredMarketing cookiesMarketing cookies are used to track visitors across webstites. The intention is to display ads that are relevant and engaging for the individual user and thereby more valuable for publishers and third party advertisers.EssentialPersonalization cookiesPreferencee cookies enable website to remember infomartion that changes the way thewebsite behaves or looks, like your preffered language or the region that you are in.EssentialAnalytics cookiesStatistic cookies help website owners to understand how visitors interact with websitesby collecting and reporting information anonymously.EssentialReject all cookiesAllow allSaveThis is some text inside of a div block.This is some text inside of a div block.This is some text inside of a div block.This is some text inside of a div block.This is some text inside of a div block.This is some text inside of a div block.This is some text inside of a div block.This is some text inside of a div block.This is some text inside of a div block.This is some text inside of a div block.This is some text inside of a div block.This is some text inside of a div block.Runtime security for your GenAI.Protect your company’s sensitive data and reputation at scale – without compromising user experience.Book a demoStart for freeVisibility, protection, and control for all your GenAI use cases.Real-Time VisibilityDiscover GenAI use cases and risks across all of your organization.Threat Detection & ResponseMitigate risks by identifying and stopping malicious behavior and actors in real time.AI ControlProvide guardrails to block inappropriate content and data leakage.ComplianceImplement GRC requirements to put your teams and customers at ease.Meet global standards.Trusted by GenAI leaders to secure mission-critical applications.Case study“The Lakera team has accelerated our GenAI journey”“After evaluating several AI security vendors and internal options, Lakera’s solution quickly stood out in terms of its operational performance and ultra-low latency. The Lakera team has accelerated our GenAI journey, allowing us to create secure GenAI experiences at scale. They are great partners and the team exhibits leadership in the space.”Adrian WoodSecurity Engineer at DropboxApplication security. Reimagined for AI.DiscoveryDiscover use cases and risks across your organization.AI Application FirewallProtect business-critical GenAI workloads from attacks.Security CenterMonitor apps, control policies, and identify threats in a central dashboard.GuardrailsEnsure your GenAI apps always perform at their best.LogsFeed GenAI interactions, user behavior, and Firewall events into SIEMs and dashboards.Threat IntelligenceBolster your security posture with continuously evolving threat intelligence.Start for freeTrusted by leading enterprises.Lakera is trusted by industry leaders, from Fortune 500 companies to startups, accelerating their GenAI journeys with ultra-low latency, operational performance, and unparalleled expertise.“We have been impressed throughout our collaboration with Lakera”“The team has extensive expertise and deep understanding of complex security challenges like prompt injection attacks and other AI security threats. We look forward to continuing to work together to address these.”Seraphina Goldfarb-TarrantHead of Safety at Cohere“Lakera stood out in quality & service”“We run workflows for enterprise clients with stringent compliance needs. Our PII and prompt injection protections needed to be battle-tested, but also configurable. We evaluated several solutions, and Lakera was a clear winner: simple to integrate, the right configurations out of the box, and an architecture that met our needs.”Matthew RastovacCEO & Founder at Respell3 steps to secure all of your GenAI use cases with the Lakera API.1Immediate ProtectionGet Started in <5 MinutesProtect your GenAI with a single API call for any app and any LLM.2Easy ScalingCustomize PoliciesCustomize policies to secure all your applications without changing code.3Central MonitoringMonitor InteractionsOversee your GenAI interactions and take actions in real-time from a central dashboard.Start for freeSeamless integration with enterprise environments.Optimized for your infrastructure.Lakera provides seamless integrations for all your use cases.Integrates with your existing analytics, monitoring and security stack.Lakera works with Grafana, Splunk, and more.Enterprise-grade security.Built to meet the highest standards, including SOC2, EU GDPR, and NIST.Powered by the world’s largest AI red team.Every novel exploit, instantly learned – so you’re never caught off-guard.55M+Total Attacks1M+Total Players30+ yearsTotal Time Spent PlayingGandalfGandalf is the most popular cybersecurity game that educates people on AI security and threats. It has been used and enjoyed by millions of people and 1000s of organizations.Try Gandalf