- What: Anthropic introduces a self-hosted sandbox for Claude Managed Agents and a new security guidance plugin for Claude Code
- Impact: Helps developers find vulnerabilities during code development
Anthropic Releases New Claude Sandbox, Security Guidance Plugin

The AI giant says the new plugin, which helps developers find vulnerabilities as they write code, has been used extensively internally.

By Eduard Kovacs | May 27, 2026 (2:43 AM ET)

Anthropic has announced two new security features for its Claude AI: a self-hosted sandbox and a new security guidance plugin.

The sandbox, currently in public beta, was announced at Anthropic's Code w/ Claude event in London this week. According to the company, Claude Managed Agents can now operate in a user-controlled sandbox connected to the user's private MCP servers.

"Tool execution moves to an environment you configure – your own infrastructure or a managed provider like Cloudflare, Daytona, Modal, or Vercel – while the agent loop that handles orchestration, context management, and error recovery stays on Anthropic's infrastructure," Anthropic explained.

It added, "Your network policies, audit logging, and security tooling apply, files and repositories don't leave your perimeter, and you control compute sizing and the runtime image for compute-heavy work."

Separately, the company unveiled a security guidance plugin for Claude Code, designed to help developers detect and fix vulnerabilities as they write code.

The plugin scans for vulnerabilities on file edits, after AI-generated changes, and at commit time, analyzing risky code patterns, reviewing full diffs, and examining surrounding context.

Available through the official Anthropic marketplace, the plugin has been widely used internally by the AI company.

"Across our internal rollout and benchmarks, we've seen a 30-40% decrease in security-related comments on PRs opened using the plugin," the company said.
"The plugin serves as a lightweight first pass, catching issues before a full code review."

Last week, Anthropic announced 28 new enterprise security and compliance integrations for Claude.

Written by Eduard Kovacs. Eduard Kovacs (@EduardKovacs) is senior managing editor at SecurityWeek. He worked as a high school IT teacher before starting a career in journalism in 2011. Eduard holds a bachelor's degree in industrial informatics and a master's degree in computer techniques applied in electrical engineering.