News

OpenAI Rolls Out Lockdown Mode to Mitigate Prompt Injection Data Exfiltration

OpenAI's new Lockdown Mode aims to prevent sensitive data exfiltration from prompt injection attacks by restricting outbound network requests, though it does not eliminate prompt injection vulnerabilities entirely.

News Published 10 June 2026 4 min read Maya Turner

Norwegian Dawn casino 3.JPG | by Captain-tucker | wikimedia_commons | CC BY-SA 3.0

OpenAI has introduced Lockdown Mode for ChatGPT, a new security feature designed to prevent the final stage of data exfiltration in prompt injection attacks. The feature is now rolling out to eligible personal accounts, including Free, Go, Plus, and Pro users, as well as self-serve ChatGPT Business accounts.

Lockdown Mode targets a critical vulnerability in large language models (LLMs) by limiting outbound network requests. This restriction aims to block malicious actors from transferring sensitive data from a user’s session to an attacker-controlled destination.

The Lethal Trifecta

The introduction of Lockdown Mode addresses what is often referred to as the “Lethal Trifecta” in AI security. This occurs when an LLM system simultaneously possesses three key capabilities: access to private data, exposure to untrusted content, and a mechanism to steal and transmit data back to an attacker.

According to analysis, the most practical way to neutralize this trifecta without significantly diminishing the utility of LLM systems is to restrict the data exfiltration vectors. Lockdown Mode appears to directly target this exfiltration leg.

How Lockdown Mode Works

Lockdown Mode functions by limiting the ability of ChatGPT to make outbound network requests. These requests are often exploited in prompt injection attacks to send sensitive information, such as user data or internal system details, to external servers controlled by attackers. By curtailing these requests, OpenAI aims to create a barrier against such data theft.

Crucially, the mechanisms employed by Lockdown Mode are deterministic. This means they rely on fixed rules rather than being evaluated by AI systems themselves, which can be susceptible to sophisticated subversion by prompt injection attacks. This approach is seen as a more robust defense against attacks that aim to manipulate the LLM’s behavior.

Limitations of Lockdown Mode

Despite its potential benefits, OpenAI clarifies that Lockdown Mode does not entirely prevent prompt injections from affecting ChatGPT’s internal processing. A prompt injection attack could still be embedded within cached web content or an uploaded file. While Lockdown Mode may prevent the exfiltrated data from leaving the system, the injection could still influence the chatbot’s responses and accuracy.

The very existence of Lockdown Mode implies that ChatGPT’s default settings do not offer comprehensive protection against determined data exfiltration attempts. Users who require enhanced security, particularly when dealing with sensitive information or interacting with potentially malicious content, may need to actively enable this mode.

Implications for Users

For users of ChatGPT, especially those in business or handling sensitive data, the rollout of Lockdown Mode is a significant development. It signals OpenAI’s ongoing efforts to address the security challenges inherent in powerful AI systems.

The feature provides an additional layer of defense against a specific class of attacks that have become increasingly sophisticated. However, it also underscores the importance of user vigilance. Prompt injection remains a complex problem, and while Lockdown Mode addresses a critical component, users should remain aware of the potential for manipulated outputs even when the mode is active.

The development suggests a path toward more secure AI interactions, where specific attack vectors are progressively hardened. The deterministic nature of Lockdown Mode’s defense mechanism is a noteworthy aspect, offering a more predictable security posture compared to defenses relying solely on AI.

This development matters for ReviewArticle readers by highlighting a crucial security enhancement for one of the most widely used AI tools. Understanding Lockdown Mode’s capabilities and limitations is essential for users seeking to protect sensitive information when interacting with ChatGPT, particularly in professional or data-sensitive contexts. It reflects the ongoing cat-and-mouse game in AI security, where new defenses are developed to counter emerging threats.

Fuente: OpenAI Help: Lockdown Mode, https://simonwillison.net/2026/Jun/6/openai-help-lockdown-mode/