Anthropic Relaxes AI Safety Guardrails as Rivalry Intensifies and Pentagon Tensions Escalate

Feb 26, 2026

—

Anthropic PBC, one of the most closely watched AI developers, has published the third edition of its Responsible Scaling Policy (RSP), a voluntary set of rules meant to reduce the chances of catastrophic harm from advanced AI systems. The update keeps the company’s core promise—test powerful models, set safety thresholds, and add safeguards as capabilities increase—but it also reflects a noticeable shift in tone as competition intensifies across the AI industry and government demand for cutting-edge systems grows.

At its heart, the Responsible Scaling Policy is designed to answer a simple question: what should an AI company do as its models become more capable—and potentially more dangerous? The policy outlines how Anthropic evaluates risks that could emerge from frontier AI, such as misuse that could cause large-scale damage or scenarios where a system’s capabilities outpace the controls meant to keep it safe. The new version continues to emphasize pre-deployment evaluation, internal governance, and triggers for stronger safety measures when risk levels rise.

What has changed in RSP version 3 is the balance between safety caution and the realities of a fast-moving market. Anthropic’s update signals that the company is trying to stay competitive while still presenting its safety program as a meaningful guardrail. In practice, that can mean adjusting how and when certain safety requirements apply, refining what counts as a red-line capability, and clarifying how decisions get made under pressure to ship new models quickly.

The timing of the update is also telling. The AI race is no longer just about consumer chatbots and office tools. It’s increasingly tied to national priorities, defense interest, and broader economic competitiveness. As government agencies—particularly in defense—look for advanced AI capabilities, companies face incentives to move faster, scale up deployment, and demonstrate they can deliver reliable systems at frontier performance levels. That environment can create tension between strict safety commitments and the demand to keep pace with rivals.

Anthropic’s latest policy revision appears to acknowledge that tension more openly than before. The company is still describing catastrophic-risk mitigation as a top priority, but the updated framework signals a more pragmatic approach—one shaped by the competitive landscape and by the fact that leading AI labs now operate in an ecosystem where commercial and geopolitical pressures are impossible to ignore.

For readers tracking AI safety, the key takeaway is that Responsible Scaling Policies remain voluntary, and their real-world value depends on how strict they are, how transparently they are enforced, and how willing labs are to slow down when risk signals flash red. Anthropic’s RSP v3 shows the company continuing to invest in structured safety processes, while also adapting to an industry moment where speed, scale, and strategic partnerships increasingly influence how “responsible” AI development is defined.

As frontier models evolve, updates like this will matter beyond a single company. They help shape emerging norms for AI governance—especially in the absence of comprehensive regulation—and they influence what other labs may adopt as baseline practices. Whether these internal policies ultimately reduce systemic AI risk will depend on consistent, measurable enforcement and the willingness to prioritize safeguards even when the market rewards moving first.

Anthropic Relaxes AI Safety Guardrails as Rivalry Intensifies and Pentagon Tensions Escalate

Share this:

Related Posts: