Saturday, 30 Aug 2025
  • My Feed
  • My Interests
  • My Saves
  • History
  • Blog
Subscribe
WorkSaathi News
  • Home
  • Technology
    TechnologyShow More
    Falcon 9 Milestones Vindicate SpaceX’s ‘Dumb’ Approach to Reuse

    As SpaceX's Starship vehicle gathered all of the attention this week, the…

    By
    Pranjal Raghav
    Cracks are forming in Meta’s partnership with Scale AI

    It’s only been since June that Meta invested $14.3 billion in the…

    By
    Pranjal Raghav
    Hisense’s take on the Samsung Frame TV is $300 off

    Hisense’s S7N 55-inch TV doubles as a framed work of art. It’s…

    By
    Pranjal Raghav
    SSA Whistleblower’s Resignation Email Mysteriously Disappeared From Inboxes

    On Friday, the Social Security Administration’s chief data officer, Chuck Borges, sent…

    By
    Pranjal Raghav
    The future of AI hardware isn’t one device — it’s an entire ecosystem

    I dream of a gadget that can do it all. Instead, when…

    By
    Pranjal Raghav
  • Gadgets
    GadgetsShow More
    Apple To Revive Iconic Accessory With The Upcoming iPhone 17 Air

    Apple is reportedly testing the possible return of the Bumper case accessory…

    By
    Pranjal Raghav
    Aito M8 BEV: Huawei’s New Electric SUV Offers Up To 438 Miles Of Range

    Huawei has officially introduced the Aito M8 BEV, a fully electric SUV…

    By
    Pranjal Raghav
    HoYoverse’s Star Rail spinoff is Honkai: Nexus Anima

    HoYoverse's next gacha game has shades of Teamfight Tactics and Pokémon. The…

    By
    Pranjal Raghav
    Yooka-Laylee remaster comes to consoles and PC on October 9

    Yooka-Replaylee, , will be available on October 9. It'll be playable on…

    By
    Pranjal Raghav
    What to expect from Samsung, Acer, Lenovo and more

    IFA, Europe's answer to the CES, kicks off on September 5 in…

    By
    Pranjal Raghav
  • Health
    HealthShow More
    Phantom limb study rewires our understanding of the brain

    Thursday, August 21, 2025 NIH scientists and collaborators reveal the brain preserves…

    By
    Pranjal Raghav
    Breast cancer risk in younger women may be influenced by hormone therapy

    Monday, June 30, 2025 NIH study could help to guide clinical recommendations…

    By
    Pranjal Raghav
    NIH study links particulate air pollution to increased mutations in lung cancers among nonsmokers

    Media Advisory  Wednesday, July 2, 2025 Whole-genome sequencing study found air pollution to…

    By
    Pranjal Raghav
    Scientists Develop High-Performance MRI Scanner in Effort to Define Microscopic Brain Structures

    Wednesday, July 16, 2025 Next-generation system noninvasively images tiny nerve structures disrupted…

    By
    Pranjal Raghav
    NIH researchers develop AI agent that improves accuracy of gene set analysis by leveraging expert-curated databases

    Monday, July 28, 2025 Researchers at the National Institutes of Health (NIH)…

    By
    Pranjal Raghav
  • News
    NewsShow More
    From Puri to Delhi: Jagannath Rath Yatra wheels to adorn Parliament; LS speaker Om Birla gives nod | India News

    Carpenters and servitors busy in constructing wheels for the chariots of Lord…

    By
    Pranjal Raghav
    Maratha quota stir enters Day 2: Manoj Jarange warns govt not to test patience; traffic comes to standstill in South Mumbai – top developments | Mumbai News

    NEW DELHI: Maratha quota activist Manoj Jarange continued his indefinite hunger strike…

    By
    Pranjal Raghav
    Rahul Dravid steps down as Rajasthan Royals coach; refuses ‘broader position’ at IPL franchise | Cricket News

    Rahul Dravid and Sanju Samson (X - Cricbuzz) Rajasthan Royals on Saturday…

    By
    Pranjal Raghav
    Over 500 drones fired at Ukraine: One killed, dozens injured in Russian attack; 14 regions impacted

    Russia launched a large-scale attack on Ukraine overnight, killing one person, wounding…

    By
    Pranjal Raghav
    SCO Summit 2025 in China: Meet humanoid robot Xiao He; here’s what it can do

    Xiao He ((Screen grab from video posted by @ANI) A humanoid robot…

    By
    Pranjal Raghav
  • Digital Marketing
    Digital MarketingShow More
    What are brand identity elements? A marketing pro dives in

    Picture the Starbucks siren logo. Now picture it in bright HubSpot orange.…

    By
    Pranjal Raghav
    Ways Community Can Help Your SEO

    So I've heard a lot of folks kind of starting down here…

    By
    Pranjal Raghav
    How To Find Conversion Opportunities With Audience and Keyword Research

    SparkToro helped me understand that my ideal customers are women aged 30-40…

    By
    Pranjal Raghav
    I tested the top 14 AI chatbots for marketers [data, prompts, use cases]

    I remember when ChatGPT first launched. The entire marketing community was split…

    By
    Pranjal Raghav
    How to create a content style guide [+ free guide & examples]

    Every content team has a different idea of what ‘on brand’ means…

    By
    Pranjal Raghav
  • Online Earning
    Online EarningShow More
    *HOT* Under Armour Men’s Tees as low as $9.67 shipped!

    Home » Deals » *HOT* Under Armour Men’s Tees as low as…

    By
    Pranjal Raghav
    USB-C 6-Foot Charging Cords 2-Pack for just $3.99!

    Published: by Meagan on August 29, 2025  |  This post may contain affiliate links.…

    By
    Pranjal Raghav
    *HOT* Jumbo Giraffe Sprinkler for $13.80! (Reg. $50)

    Home » Deals » *HOT* Jumbo Giraffe Sprinkler for $13.80! (Reg. $50)…

    By
    Pranjal Raghav
    Candle Warmer Lamp with Timer and Dimmer only $11.99 (Reg. $30)

    Home » Deals » Candle Warmer Lamp with Timer and Dimmer only…

    By
    Pranjal Raghav
    Knorr Pasta and Rice Sides just $0.80 each, shipped!

    Amazon is offering 20% off select Knorr Pasta and Rice Sides right…

    By
    Pranjal Raghav
  • 🔥
  • News
  • Finance
  • Technology
  • Gadgets
  • Online Earning
  • Education
  • Digital Marketing
  • Health
Font ResizerAa
WorkSaathi NewsWorkSaathi News
0
  • My Saves
  • My Interests
  • My Feed
  • History
Search
  • Home
  • Health
  • Education
  • News
  • Digital Marketing
  • Online Earning
  • Gadgets
  • Finance
  • Technology
  • Uncategorized
Have an existing account? Sign In
Follow US
© 2022 Foxiz News Network. Ruby Design Company. All Rights Reserved.
WorkSaathi News > Blog > Technology > OpenAI co-founder calls for AI labs to safety-test rival models
Technology

OpenAI co-founder calls for AI labs to safety-test rival models

Pranjal Raghav
Last updated: August 28, 2025 12:10 am
Pranjal Raghav
Share
SHARE


OpenAI and Anthropic, two of the world’s leading AI labs, briefly opened up their closely guarded AI models to allow for joint safety testing — a rare cross-lab collaboration at a time of fierce competition. The effort aimed to surface blind spots in each company’s internal evaluations and demonstrate how leading AI companies can work together on safety and alignment work in the future.

In an interview with TechCrunch, OpenAI co-founder Wojciech Zaremba said this kind of collaboration is increasingly important now that AI is entering a “consequential” stage of development, where AI models are used by millions of people every day.

“There’s a broader question of how the industry sets a standard for safety and collaboration, despite the billions of dollars invested, as well as the war for talent, users, and the best products,” said Zaremba.

The joint safety research, published Wednesday by both companies, arrives amid an arms race among leading AI labs like OpenAI and Anthropic, where billion-dollar data center bets and $100 million compensation packages for top researchers have become table stakes. Some experts warn that the intensity of product competition could pressure companies to cut corners on safety in the rush to build more powerful systems.

To make this research possible, OpenAI and Anthropic granted each other special API access to versions of their AI models with fewer safeguards (OpenAI notes that GPT-5 was not tested because it hadn’t been released yet). Shortly after the research was conducted, however, Anthropic revoked the API access of another team at OpenAI. At the time, Anthropic claimed that OpenAI violated its terms of service, which prohibits using Claude to improve competing products.

Zaremba says the events were unrelated and that he expects competition to stay fierce even as AI safety teams try to work together. Nicholas Carlini, a safety researcher with Anthropic, tells TechCrunch that he would like to continue allowing OpenAI safety researchers to access Claude models in the future.

“We want to increase collaboration wherever it’s possible across the safety frontier, and try to make this something that happens more regularly,” said Carlini.

Techcrunch event

San Francisco
|
October 27-29, 2025

One of the most stark findings in the study relates to hallucination testing. Anthropic’s Claude Opus 4 and Sonnet 4 models refused to answer up to 70% of questions when they were unsure of the correct answer, instead offering responses like, “I don’t have reliable information.” Meanwhile, OpenAI’s o3 and o4-mini models refuse to answer questions far less, but showed much higher hallucination rates, attempting to answer questions when they didn’t have enough information.

Zaremba says the right balance is likely somewhere in the middle — OpenAI’s models should refuse to answer more questions, while Anthropic’s models should probably attempt to offer more answers.

Sycophancy, the tendency for AI models to reinforce negative behavior in users to please them, has emerged as one of the most pressing safety concerns around AI models.

In Anthropic’s research report, the company identified examples of “extreme” sycophancy in GPT-4.1 and Claude Opus 4 — in which the models initially pushed back on psychotic or manic behavior, but later validated some concerning decisions. In other AI models from OpenAI and Anthropic, researchers observed lower levels of sycophancy.

On Tuesday, parents of a 16-year-old boy, Adam Raine, filed a lawsuit against OpenAI, claiming that ChatGPT (specifically a version powered by GPT-4o) offered their son advice that aided in his suicide, rather than pushing back on his suicidal thoughts. The lawsuit suggests this may be the latest example of AI chatbot sycophancy contributing to tragic outcomes.

“It’s hard to imagine how difficult this is to their family,” said Zaremba when asked about the incident. “It would be a sad story if we build AI that solves all these complex PhD level problems, invents new science, and at the same time, we have people with mental health problems as a consequence of interacting with it. This is a dystopian future that I’m not excited about.”

In a blog post, OpenAI says that it significantly improved the sycophancy of its AI chatbots with GPT-5, compared to GPT-4o, claiming the model is better at responding to mental health emergencies.

Moving forward, Zaremba and Carlini say they would like Anthropic and OpenAI to collaborate more on safety testing, looking into more subjects and testing future models, and they hope other AI labs will follow their collaborative approach.

Update 2:00pm PT: This article was updated to include additional research from Anthropic that was not initially made available to TechCrunch ahead of publication.


Got a sensitive tip or confidential documents? We’re reporting on the inner workings of the AI industry — from the companies shaping its future to the people impacted by their decisions. Reach out to Rebecca Bellan at rebecca.bellan@techcrunch.com and Maxwell Zeff at maxwell.zeff@techcrunch.com. For secure communication, you can contact us via Signal at @rebeccabellan.491 and @mzeff.88.



Source link

Share This Article
Email Copy Link Print
Previous Article Javier Milei evacuated from campaign event after stones thrown
Next Article Bestway Hydro-Force 1-Person Ventura Elite Inflatable Kayak Set only $69.99 (Reg. $260)!
Leave a Comment

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *

Recipe Rating




Your Trusted Source for Accurate and Timely Updates!

Our commitment to accuracy, impartiality, and delivering breaking news as it happens has earned us the trust of a vast audience. Stay ahead with real-time updates on the latest events, trends.
FacebookLike
XFollow
InstagramFollow
LinkedInFollow
MediumFollow
QuoraFollow
- Advertisement -
Ad image

You Might Also Like

Technology

Honor’s slim Magic V5 foldable is fun to use, minus the huge camera bump

By
Pranjal Raghav
Technology

This new delivery robot will bring the entire grocery store to you

By
Pranjal Raghav
Technology

The Boring Company is finally testing Tesla’s ‘Full Self-Driving’ in its Las Vegas tunnels

By
Pranjal Raghav
Technology

Les Amis, the European app helping women form friendships, launches in New York

By
Pranjal Raghav
WorkSaathi News
Facebook Twitter Youtube Rss Medium

About US

 

WorkSaathi News: Your instant connection to the latest stories and live updates. Stay ahead with our real-time coverage across business, technology, politics, entertainment, and more. We bring you credible, fast, and accurate news 24/7 — your trusted partner in staying informed.

Top Categories
  • Education
  • Finance
  • Gadgets
  • Health
  • Digital Marketing
  • Online Earning
Usefull Links
  • Advertise with us
  • Contact Us
  • Advertise with US
  • Complaint
  • Privacy Policy
  • Cookie Policy
© WorkSaathi 2025. WebSaathi Design Company. All Rights Reserved.
© WorkSaathi 2025. WebSaathi Design Company. All Rights Reserved.
Welcome Back!

Sign in to your account

Username or Email Address
Password

Lost your password?