• Skip to primary navigation
  • Skip to main content
  • Skip to primary sidebar
  • Skip to footer
Sq Magazine LogoSQ Magazine

Smarter Insights for a Fast-Moving Digital World

  • Latest News
  • Statistics
  • About
  • Contact
Subscribe
Sq Magazine Logo
  • Latest News
  • Statistics
  • About
  • Contact
Subscribe
Home » Artificial Intelligence

AI Rivals OpenAI and Anthropic Team Up for Safety Checks

Published on: August 28, 2025
Barry Elad
Written By
Barry Elad
Barry Elad
Founder & Senior Journalist • 707 Articles
Barry Elad is a seasoned journalist and analyst specializing in finance, technology, AI, and founder of SQ Magazine. He explores the world o...
LATEST POSTS:
AI Image Generation Statistics 2026: Market Size, Adoption & Risks
McDonald’s Tests Powerful New AI Drive Thru With Google
Anthropic Launches Claude Fable 5, Its Most Powerful AI Model Yet
Openai And Anthropic Team Up For Safety Checks
As Featured In
BluehostActive CampaignDesignrushSeeking AlphaResearch Com
Share on LinkedIn ChatGPT Perplexity Share on X Share on Facebook

In a rare show of cooperation, AI giants OpenAI and Anthropic have evaluated each other’s models to uncover hidden safety risks.

Quick Summary – TLDR:

  • OpenAI and Anthropic conducted a joint safety evaluation of each other’s publicly available AI models
  • The tests focused on alignment, misuse, hallucinations, and system behavior under stress
  • Both companies found concerning behaviors in some models, especially related to sycophancy and misuse
  • The evaluations were done before the launch of OpenAI’s GPT-5 and Anthropic’s Claude Opus 4.1

What Happened?

OpenAI and Anthropic, two of the most influential AI companies today, have released the results of mutual evaluations they conducted on each other’s AI systems. The evaluations focused on uncovering safety flaws like misuse potential, hallucinations, and alignment failures that may not be apparent through internal testing.

This collaboration comes despite recent tensions between the companies, including Anthropic restricting OpenAI’s access to its Claude models due to terms of service violations. Still, both companies found common ground to prioritize AI safety.

A First-of-Its-Kind AI Safety Test

OpenAI called this collaboration the first major cross-lab safety evaluation, aimed at improving how AI systems are tested for alignment with human values. The exercise marks a major moment in AI governance, showing that cooperation is possible even among fierce rivals.

Claude And Openai Model Test Results
Image Credit – OpenAI

Each company applied its own internal safety protocols and stress tests on the other’s models:

  • Anthropic tested OpenAI’s o3, o4-mini, GPT-4o, and GPT-4.1 models
  • OpenAI tested Anthropic’s Claude models, including Claude 3 and Claude 4 variants

To ensure thorough analysis, both companies disabled certain external safeguards during testing that would typically prevent risky behaviors, allowing them to stress-test the models in a more raw state.

Anthropic’s Evaluation of OpenAI Models:

  • Sycophancy was a recurring issue across OpenAI’s models, except for o3.
  • The GPT-4o and GPT-4.1 models raised concerns around potential misuse, particularly in how users might exploit them.
  • No evaluation was done on OpenAI’s latest GPT-5, as it had not yet launched during testing.
  • The company highlighted the importance of external reviews to catch blind spots internal teams might miss.
Newsletter
Subscribe To Our Newsletter!

Be the first to get exclusive offers and the latest news.

OpenAI’s Evaluation of Anthropic Models:

  • Claude models performed well in respecting instruction hierarchies and showed high refusal rates in hallucination tests.
  • The models were good at detecting their own uncertainty and avoiding wrong answers.
  • Performance in “scheming” evaluations was mixed, depending on test scenarios.
  • The Claude family struggled more in jailbreaking tests, where users try to bypass safeguards.

Industry Context and Past Frictions

This collaboration is even more surprising given that Anthropic was founded by former OpenAI employees, and the two companies have often been viewed as rivals both philosophically and commercially. OpenAI is known for prioritizing rapid deployment, while Anthropic promotes a safety-first approach called Constitutional AI.

Earlier this year, Anthropic barred OpenAI from using its models due to unauthorized usage during GPT development. However, for this evaluation, Anthropic allowed limited access strictly for benchmarking and safety review.

The partnership highlights how AI safety is a shared concern that can override competitive instincts. As AI models become more powerful and integrated into everyday use, public scrutiny and legal pressures are increasing. This includes a wrongful death lawsuit against OpenAI after a teen user of ChatGPT died by suicide, raising new questions about chatbot safety and accountability.

SQ Magazine’s Takeaway

I really like seeing this kind of cross-company accountability. It’s rare, and it’s needed. AI models are getting smarter and more capable by the day, but they’re not perfect. Having rivals like OpenAI and Anthropic test each other’s systems shows us that the industry is beginning to take real responsibility. This kind of transparency is what will build public trust and, frankly, help save lives. I hope this becomes the norm, not a one-time PR moment.

SQ Magazine follows strict Publishing Principles and a documented Fact-Check Policy to ensure accuracy, transparency, and editorial independence across all content.

Add SQ Magazine as a Preferred Source on Google for updates! Follow on Google News
Share ChatGPT Perplexity
Barry Elad

Barry Elad

Founder & Senior Journalist


Barry Elad is a seasoned journalist and analyst specializing in finance, technology, AI, and founder of SQ Magazine. He explores the world of artificial intelligence, uncovering trends, data, and real-world impacts for readers. When he’s off the page, you’ll find him cooking healthy meals, practicing yoga, or exploring nature with his family.

Related Posts

Anthropic Launches Claude Fable 5, Its Most Powerful AI Model Yet
Artificial Intelligence

Anthropic Launches Claude Fable 5, Its Most Powerful AI Model Yet

Anthropic Eyes $10B Raise in Massive AI Funding Surge
Artificial Intelligence

Anthropic Eyes $10B Raise in Massive AI Funding Surge

Anthropic Plans 2026 IPO as AI Competition Heats Up
Artificial Intelligence

Anthropic Plans 2026 IPO as AI Competition Heats Up

Disclaimer: The content published on SQ Magazine is for informational and educational purposes only. Please verify details independently before making any important decisions based on our content.

Reader Interactions

Leave a Comment Cancel reply

Primary Sidebar

Connect With Us

facebook x linkedin google-news telegram pinterest whatsapp email
google-preferred-source-badge Add as a preferred source on Google

You Should Also Read

OpenAI Secures Defense AI Contract Amid Anthropic Dispute
Claude Mythos Nears Public Release After Safety Tests
Anthropic Flags Massive Claude AI Distillation by Chinese Firms

Table of Contents

  • Quick Summary – TLDR:
  • What Happened?
  • A First-of-Its-Kind AI Safety Test
  • Anthropic’s Evaluation of OpenAI Models:
  • OpenAI’s Evaluation of Anthropic Models:
  • Industry Context and Past Frictions
  • SQ Magazine’s Takeaway
Connect on Telegram

Footer

SQ Magazine Logo

Smarter Insights for a Fast-Moving Digital World

Connect With Us

Follow Us on Google News

Editorial & Trust

  • About
  • Publishing Principles
  • Fact-Check Policy
  • Corrections Policy
  • Ethics Policy
  • Disclaimer

Worth Checking

  • Social Media Attention Span Stats
  • Reddit Statistics
  • Spotify User Statistics
  • TikTok vs. Instagram Statistics
  • Gen Z Social Media Statistics
Contact Us
13570 Grove Dr #189,
Maple Grove, MN 55311,
United States
10 a.m. – 6 p.m. | Every day

Copyright © 2022–2026 SQ Magazine. All Rights Reserved. Powered by the Neural Stack.

  • Privacy Policy
  • Terms
Company
  • About Us
  • Our Team
  • Our Mission
  • Core Values
Discover
  • Brand Assets
    Brand Assets
  • Stats Methodology
    Stats Research Process
  • Glossary
    Glossary
Categories
  • Internet
  • Gaming
  • Technology
  • Artificial Intelligence
  • Cybersecurity
Internet
Internet Outage Statistics 2026: Frequency, Cost and Causes
Internet Outage Statistics 2026: Frequency, Cost and Causes
Upwork Statistics 2026: Revenue, GSV, AI Work
Upwork Statistics 2026: Revenue, GSV, AI Work
Instagram Reels Statistics 2026: Plays and Engagement
Instagram Reels Statistics 2026: Plays and Engagement
Gig Economy Statistics 2026: Workforce & Earnings
Gig Economy Statistics 2026: Workforce & Earnings
Doomscrolling Statistics: Prevalence, Sleep and Mental Health
Doomscrolling Statistics: Prevalence, Sleep and Mental Health
TikTok Brain Statistics 2026: Attention, Memory, Health
TikTok Brain Statistics 2026: Attention, Memory, Health
Gaming
Online Gambling Regulations Statistics 2026: Global Compliance and Enforcement Data
Online Gambling Regulations Statistics 2026: Global Compliance and Enforcement Data
Fantasy Sports Statistics 2026: Users, Revenue & Trends
Fantasy Sports Statistics 2026: Users, Revenue & Trends
Apex Legends Statistics 2026: Players, Revenue, and Esports
Apex Legends Statistics 2026: Players, Revenue, and Esports
Fortnite Statistics 2026: Players, Revenue, Esports, and Engagement
Fortnite Statistics 2026: Players, Revenue, Esports, and Engagement
Gamers Statistics 2026: Players, Habits & Global Data
Gamers Statistics 2026: Players, Habits & Global Data
Minecraft Statistics 2026: 300 Million Copies Sold & 212M Monthly Players
Minecraft Statistics 2026: 300 Million Copies Sold & 212M Monthly Players
Technology
Employee Productivity Statistics 2026: Engagement, Costs & Trends
Employee Productivity Statistics 2026: Engagement, Costs & Trends
Software Engineer Layoff Statistics 2026: Companies, Roles, AI Impact
Software Engineer Layoff Statistics 2026: Companies, Roles, AI Impact
iPhone Ecosystem Statistics 2026: Big Market Trends
iPhone Ecosystem Statistics 2026: Big Market Trends
Average Screen Time by Age Statistics 2026: Latest Insights
Average Screen Time by Age Statistics 2026: Latest Insights
AI SEO Statistics 2026: Adoption, AI Overviews & LLM Citation Data
AI SEO Statistics 2026: Adoption, AI Overviews & LLM Citation Data
Digital Nomads Statistics 2026: Population, Demographics & Visa Data
Digital Nomads Statistics 2026: Population, Demographics & Visa Data
Artificial Intelligence
AI Image Generation Statistics 2026: Market Size, Adoption & Risks
AI Image Generation Statistics 2026: Market Size, Adoption & Risks
AI Influencer Marketing Statistics: Market Size and Engagement
AI Influencer Marketing Statistics: Market Size and Engagement
AI Market Statistics 2026: Size, Growth & Investment
AI Market Statistics 2026: Size, Growth & Investment
Meta AI Statistics 2026: Users, Capex, and Adoption Data
Meta AI Statistics 2026: Users, Capex, and Adoption Data
Predictive AI Statistics 2026: Market Size, Adoption & Accuracy Data
Predictive AI Statistics 2026: Market Size, Adoption & Accuracy Data
AI Overviews Statistics 2026: Google Search Impact Data
AI Overviews Statistics 2026: Google Search Impact Data
Cybersecurity
Password Statistics 2026: Credential Theft, MFA, and the Passkey Tipping Point
Password Statistics 2026: Credential Theft, MFA, and the Passkey Tipping Point
Identity Theft Statistics 2026: Key Fraud Data and Trends
Identity Theft Statistics 2026: Key Fraud Data and Trends
CVE Statistics 2026: Severity Distribution and Top Affected Vendors
CVE Statistics 2026: Severity Distribution and Top Affected Vendors
Dark Web AI Tool Marketplace Statistics 2026: Explosive Market Growth
Dark Web AI Tool Marketplace Statistics 2026: Explosive Market Growth
API Security Breach Statistics 2026: Hidden Threats
API Security Breach Statistics 2026: Hidden Threats
AI Voice Cloning Fraud Statistics 2026: Alarming Trends You Must Know Now
AI Voice Cloning Fraud Statistics 2026: Alarming Trends You Must Know Now
Categories
  • Internet
  • Gaming
  • Technology
  • Artificial Intelligence
  • Cybersecurity
Internet
Facebook and Instagram Hit by Major Global Outage
Facebook and Instagram Hit by Major Global Outage
Pinterest Bets Big on AI With Record $4B AWS Commitment
Pinterest Bets Big on AI With Record $4B AWS Commitment
Lovable Expands Google Cloud Deal, Boosts AI Infrastructure 5x
Lovable Expands Google Cloud Deal, Boosts AI Infrastructure 5x
Shopify Down: Thousands Report Outage and Checkout Issues
Shopify Down: Thousands Report Outage and Checkout Issues
Microsoft Investigates Teams and Office File Access Outage
Microsoft Investigates Teams and Office File Access Outage
Microsoft Confirms MFA Issues and My Sign Ins Downtime
Microsoft Confirms MFA Issues and My Sign Ins Downtime
Gaming
Epic Games Teases Unreal Engine 6 for Rocket League
Epic Games Teases Unreal Engine 6 for Rocket League
Stardew Valley Switch 2 Edition Arrives with Online Co-op
Stardew Valley Switch 2 Edition Arrives with Online Co-op
Hogwarts Legacy Crosses 40M Sales, Beating Industry Giants
Hogwarts Legacy Crosses 40M Sales, Beating Industry Giants
PUBG: Black Budget Launches Closed Alpha Test With a Bold PvPvE Twist
PUBG: Black Budget Launches Closed Alpha Test With a Bold PvPvE Twist
Counter-Strike 2’s $5.9 Billion Skin Economy Just Got Shattered
Counter-Strike 2’s $5.9 Billion Skin Economy Just Got Shattered
Battlefield 6 Outperforms Franchise Past with Record-Breaking Launch
Battlefield 6 Outperforms Franchise Past with Record-Breaking Launch
Technology
Telegram Returns to Wear OS With Smartwatch App Upgrade
Telegram Returns to Wear OS With Smartwatch App Upgrade
Apple Announces macOS 27 Golden Gate at WWDC 2026
Apple Announces macOS 27 Golden Gate at WWDC 2026
Apple iPadOS 27 Introduces New Siri App and Productivity Tools
Apple iPadOS 27 Introduces New Siri App and Productivity Tools
Microsoft Reveals Xbox Series X25 Limited Edition Console
Microsoft Reveals Xbox Series X25 Limited Edition Console
Leaked iOS 27 Features Include AI Siri and More iPhone Support
Leaked iOS 27 Features Include AI Siri and More iPhone Support
iPhone 18 Pro Max Leak Reveals No Change in Thickness
iPhone 18 Pro Max Leak Reveals No Change in Thickness
Artificial Intelligence
McDonald’s Tests Powerful New AI Drive Thru With Google
McDonald’s Tests Powerful New AI Drive Thru With Google
Anthropic Launches Claude Fable 5, Its Most Powerful AI Model Yet
Anthropic Launches Claude Fable 5, Its Most Powerful AI Model Yet
Google Launches Gemini 3.5 Live Translate in 70 Languages
Google Launches Gemini 3.5 Live Translate in 70 Languages
NotebookLM Gains Gemini 3.5, Code Execution and Web Access
NotebookLM Gains Gemini 3.5, Code Execution and Web Access
OpenAI Files for IPO as Altman Pushes Open AI Access
OpenAI Files for IPO as Altman Pushes Open AI Access
ChatGPT Superapp Coming Soon With AI Agents and Codex
ChatGPT Superapp Coming Soon With AI Agents and Codex
Cybersecurity
Urgent Oracle PeopleSoft Flaw Linked to ShinyHunters Campaign
Urgent Oracle PeopleSoft Flaw Linked to ShinyHunters Campaign
73,000 French Government Accounts Exposed in Tchap Breach
73,000 French Government Accounts Exposed in Tchap Breach
High Risk Microsoft Teams Android Bug Could Leak Sensitive Data
High Risk Microsoft Teams Android Bug Could Leak Sensitive Data
Europol Takes Down AudiA6 Crypto Laundering Service
Europol Takes Down AudiA6 Crypto Laundering Service
Microsoft Defender Adds RPC Attack Detection Features
Microsoft Defender Adds RPC Attack Detection Features
Google Patches Chrome Zero Day Vulnerability Under Attack
Google Patches Chrome Zero Day Vulnerability Under Attack
Newsletter

Subscribe To Our Newsletter!

Be the first to get exclusive offers and the latest news.

Newsletter

Subscribe To Our Newsletter!

Be the first to get exclusive offers and the latest news.