Anthropic Releases New Claude Sandbox, Security Guidance Plugin

What: Anthropic introduces a new security guidance plugin for its AI model
Impact: Helps developers find vulnerabilities during code development

Artificial Intelligence Anthropic Releases New Claude Sandbox, Security Guidance Plugin The AI giant says the new plugin, which helps developers find vulnerabilities as they write code, has been used extensively internally. By Eduard Kovacs | May 27, 2026 (2:43 AM ET) Flipboard Reddit Whatsapp Whatsapp Email Anthropic has announced two new security features for its Claude AI: a self-hosted sandbox and a new security guidance plugin. The sandbox, currently in public beta, was announced at Anthorpic’s Code w/ Claude event in London this week. According to the company, Claude Managed Agents can now operate in a user-controlled sandbox connected to the user’s private MPC servers. “Tool execution moves to an environment you configure—your own infrastructure or a managed provider like Cloudflare, Daytona, Modal, or Vercel—while the agent loop that handles orchestration, context management, and error recovery stays on Anthropic’s infrastructure,” Anthropic explained. It added, “Your network policies, audit logging, and security tooling apply, files and repositories don’t leave your perimeter, and you control compute sizing and the runtime image for compute-heavy work.” Separately, the company unveiled a security guidance plugin for Claude Code, designed to help developers detect and fix vulnerabilities as they write code. Advertisement. Scroll to continue reading. The plugin scans for vulnerabilities on file edits, after AI-generated changes, and at commit time, analyzing risky code patterns, reviewing full diffs, and examining surrounding context. Available through the official Anthropic marketplace, the plugin has been widely used internally by the AI company. “Across our internal rollout and benchmarks, we’ve seen a 30-40% decrease in security-related comments on PRs opened using the plugin,” the company said. “The plugin serves as a lightweight first pass, catching issues before a full code review.” Last week, Anthropic announced 28 new enterprise security and compliance integrations for Claude. Related : Anthropic: Mythos Detected 23,000 Potential Vulnerabilities Across 1,000 OSS Projects Related : Anthropic Silently Patches Claude Code Sandbox Bypass Related : AI-Powered App Attacks Are Faster, More Frequent and Harder to Stop Written By Eduard Kovacs Eduard Kovacs (@EduardKovacs) is senior managing editor at SecurityWeek. He worked as a high school IT teacher before starting a career in journalism in 2011. Eduard holds a bachelor’s degree in industrial informatics and a master’s degree in computer techniques applied in electrical engineering. Daily Briefing Newsletter Subscribe to the SecurityWeek Email Briefing for the latest cybersecurity threats, trends, and expert insights. More from Eduard Kovacs Ghost CMS Vulnerability Exploited to Hack Over 700 Websites Oncology Institute Discloses Data Breach Anthropic: Mythos Detected 23,000 Potential Vulnerabilities Across 1,000 OSS Projects Drupal Vulnerability in Hacker Crosshairs Shortly After Disclosure Canadian Man Arrested for Operating Kimwolf Botnet ‘First VPN’ Cybercrime Service Disrupted, Administrator Arrested TrendAI Patches Apex One Zero-Day Exploited in the Wild Drupal Patches Highly Critical Vulnerability Exposing Websites to Hacking Latest News AppOmni’s Marlin AI Brings Autonomous Investigation to SaaS Security Iranian APT Targets Aviation, Software Companies With Updated Tools 185,000 Likely Impacted by 7-Eleven Data Breach Anthropic Expands Claude’s Enterprise Security Governance With 28 New Integrations Hackers Exploited KnowledgeDeliver Zero-Day for Web Shell Deployment Watch on Demand: Threat Detection & Incident Response Summit – All Sessions Available Open Source DockSec Uses AI to Cut Through Vulnerability Noise in Docker Images Lithuania Suspects Foreign Involvement in Data Leak of Over 600,000 National Register Entries Trending Daily Briefing Newsletter Subscribe to the SecurityWeek Email Briefing to stay informed on the latest threats, trends, and technology, along with insightful columns from industry experts. Virtual Event: Threat Detection and Incident Response Summit On-Demand Delve into big-picture strategies to reduce attack surfaces, improve patch management, conduct post-incident forensics, and tools and tricks needed in a modern organization. Register Webinar: Third-Party Risk in Practice June 4, 2026 Organizations are investing heavily in third-party risk management, but breaches, delays, and blind spots continue to persist. Join this live webinar as we examine the gap between how organizations think their third-party risk programs are performing and what’s actually happening in practice. Register People on the Move Joe Chen has become Chief Technology Officer at Trellix. Usercentrics has named Pawan Hegde as COO and Elena Ignatova as CPTO. SecureAuth has named Mark van Oppen as Chief Revenue Officer. More People On The Move Expert Insights Caught Off Guard: Securing AI After It Hits Production As enterprises rush AI projects into production, security teams are increasingly being forced into reactive mode. (Joshua Goldfarb) Cyber Resilience is the New Business Continuity Plan The organizations best prepared to face disruption are those that align security, continuity and risk management around what the business cannot afford to lose. (Steve Durbin) Enhancing Data Center Security Without Sacrificing Performance For AI data centers, where the stakes are the highest and performance constraints are the tightest, security and performance are no longer a zero-sum game. (Nadir Izrael) Is the SOC Obsolete, and We Just Haven’t Admitted It Yet? Many AI-first enterprises have already embraced sovereign architectures for general AI initiatives; cybersecurity—and the SOC—should be next. (Danelle Au) The Mythos Moment: Enterprises Must Fight Agents with Agents Only with the right platform and an agentic, AI-driven defense, will enterprises be able to protect themselves in the agentic era. (Etay Maor) Flipboard Reddit Whatsapp Whatsapp Email

Read Full Article → ← Back to News

Anthropic Releases New Claude Sandbox, Security Guidance Plugin

Related Articles

Share this article