Saturday, 30 Aug 2025
  • My Feed
  • My Interests
  • My Saves
  • History
  • Blog
Subscribe
WorkSaathi News
  • Home
  • Technology
    TechnologyShow More
    Cracks are forming in Meta’s partnership with Scale AI

    It’s only been since June that Meta invested $14.3 billion in the…

    By
    Pranjal Raghav
    Hisense’s take on the Samsung Frame TV is $300 off

    Hisense’s S7N 55-inch TV doubles as a framed work of art. It’s…

    By
    Pranjal Raghav
    SSA Whistleblower’s Resignation Email Mysteriously Disappeared From Inboxes

    On Friday, the Social Security Administration’s chief data officer, Chuck Borges, sent…

    By
    Pranjal Raghav
    The future of AI hardware isn’t one device — it’s an entire ecosystem

    I dream of a gadget that can do it all. Instead, when…

    By
    Pranjal Raghav
    Tesla asks court to toss wrongful death verdict that cost it $243 million

    Earlier this month, a jury found Tesla partially responsible for the death…

    By
    Pranjal Raghav
  • Gadgets
    GadgetsShow More
    Apple To Revive Iconic Accessory With The Upcoming iPhone 17 Air

    Apple is reportedly testing the possible return of the Bumper case accessory…

    By
    Pranjal Raghav
    Aito M8 BEV: Huawei’s New Electric SUV Offers Up To 438 Miles Of Range

    Huawei has officially introduced the Aito M8 BEV, a fully electric SUV…

    By
    Pranjal Raghav
    HoYoverse’s Star Rail spinoff is Honkai: Nexus Anima

    HoYoverse's next gacha game has shades of Teamfight Tactics and Pokémon. The…

    By
    Pranjal Raghav
    Yooka-Laylee remaster comes to consoles and PC on October 9

    Yooka-Replaylee, , will be available on October 9. It'll be playable on…

    By
    Pranjal Raghav
    What to expect from Samsung, Acer, Lenovo and more

    IFA, Europe's answer to the CES, kicks off on September 5 in…

    By
    Pranjal Raghav
  • Health
    HealthShow More
    Phantom limb study rewires our understanding of the brain

    Thursday, August 21, 2025 NIH scientists and collaborators reveal the brain preserves…

    By
    Pranjal Raghav
    Breast cancer risk in younger women may be influenced by hormone therapy

    Monday, June 30, 2025 NIH study could help to guide clinical recommendations…

    By
    Pranjal Raghav
    NIH study links particulate air pollution to increased mutations in lung cancers among nonsmokers

    Media Advisory  Wednesday, July 2, 2025 Whole-genome sequencing study found air pollution to…

    By
    Pranjal Raghav
    Scientists Develop High-Performance MRI Scanner in Effort to Define Microscopic Brain Structures

    Wednesday, July 16, 2025 Next-generation system noninvasively images tiny nerve structures disrupted…

    By
    Pranjal Raghav
    NIH researchers develop AI agent that improves accuracy of gene set analysis by leveraging expert-curated databases

    Monday, July 28, 2025 Researchers at the National Institutes of Health (NIH)…

    By
    Pranjal Raghav
  • News
    NewsShow More
    Sewadar who served 15 years at Delhi’s Kalkaji Temple beaten to death over ‘Chunni Prasad’ dispute | Delhi News

    NEW DELHI: A sewadar at Kalkaji Temple in southeast Delhi has died…

    By
    Pranjal Raghav
    J&K: Seven feared dead as landslide flattens residential house in Reasi; cloudburst in Ramban | India News

    NEW DELHI: Seven members of a family are feared dead as a…

    By
    Pranjal Raghav
    Bank holiday on Saturday: Are banks closed on August 30? Check state-wise full list of upcoming holidays

    Bank holidays in India can be cause of confusion, especially when it…

    By
    Pranjal Raghav
    ‘Jitna chahe jor lagale’: Hasin Jahan shares cryptic post after Mohammed Shami’s ‘I never regret the past’ comment | Cricket News

    Mohammed Shami and Hasin Jahan NEW DELHI: Indian pacer Mohammed Shami’s estranged…

    By
    Pranjal Raghav
    Crew member of Ayushmann-Sara Ali Khan film assaulted in UP’s Prayagraj; accused arrested | Prayagraj News

    NEW DELHI: Police in Uttar Pradesh have arrested a man for allegedly…

    By
    Pranjal Raghav
  • Digital Marketing
    Digital MarketingShow More
    What are brand identity elements? A marketing pro dives in

    Picture the Starbucks siren logo. Now picture it in bright HubSpot orange.…

    By
    Pranjal Raghav
    Ways Community Can Help Your SEO

    So I've heard a lot of folks kind of starting down here…

    By
    Pranjal Raghav
    How To Find Conversion Opportunities With Audience and Keyword Research

    SparkToro helped me understand that my ideal customers are women aged 30-40…

    By
    Pranjal Raghav
    I tested the top 14 AI chatbots for marketers [data, prompts, use cases]

    I remember when ChatGPT first launched. The entire marketing community was split…

    By
    Pranjal Raghav
    How to create a content style guide [+ free guide & examples]

    Every content team has a different idea of what ‘on brand’ means…

    By
    Pranjal Raghav
  • Online Earning
    Online EarningShow More
    *HOT* Under Armour Men’s Tees as low as $9.67 shipped!

    Home » Deals » *HOT* Under Armour Men’s Tees as low as…

    By
    Pranjal Raghav
    USB-C 6-Foot Charging Cords 2-Pack for just $3.99!

    Published: by Meagan on August 29, 2025  |  This post may contain affiliate links.…

    By
    Pranjal Raghav
    *HOT* Jumbo Giraffe Sprinkler for $13.80! (Reg. $50)

    Home » Deals » *HOT* Jumbo Giraffe Sprinkler for $13.80! (Reg. $50)…

    By
    Pranjal Raghav
    Candle Warmer Lamp with Timer and Dimmer only $11.99 (Reg. $30)

    Home » Deals » Candle Warmer Lamp with Timer and Dimmer only…

    By
    Pranjal Raghav
    Knorr Pasta and Rice Sides just $0.80 each, shipped!

    Amazon is offering 20% off select Knorr Pasta and Rice Sides right…

    By
    Pranjal Raghav
  • 🔥
  • News
  • Finance
  • Technology
  • Gadgets
  • Online Earning
  • Education
  • Digital Marketing
  • Health
Font ResizerAa
WorkSaathi NewsWorkSaathi News
0
  • My Saves
  • My Interests
  • My Feed
  • History
Search
  • Home
  • Health
  • Education
  • News
  • Digital Marketing
  • Online Earning
  • Gadgets
  • Finance
  • Technology
  • Uncategorized
Have an existing account? Sign In
Follow US
© 2022 Foxiz News Network. Ruby Design Company. All Rights Reserved.
WorkSaathi News > Blog > Technology > OpenAI co-founder calls for AI labs to safety-test rival models
Technology

OpenAI co-founder calls for AI labs to safety-test rival models

Pranjal Raghav
Last updated: August 28, 2025 12:10 am
Pranjal Raghav
Share
SHARE


OpenAI and Anthropic, two of the world’s leading AI labs, briefly opened up their closely guarded AI models to allow for joint safety testing — a rare cross-lab collaboration at a time of fierce competition. The effort aimed to surface blind spots in each company’s internal evaluations and demonstrate how leading AI companies can work together on safety and alignment work in the future.

In an interview with TechCrunch, OpenAI co-founder Wojciech Zaremba said this kind of collaboration is increasingly important now that AI is entering a “consequential” stage of development, where AI models are used by millions of people every day.

“There’s a broader question of how the industry sets a standard for safety and collaboration, despite the billions of dollars invested, as well as the war for talent, users, and the best products,” said Zaremba.

The joint safety research, published Wednesday by both companies, arrives amid an arms race among leading AI labs like OpenAI and Anthropic, where billion-dollar data center bets and $100 million compensation packages for top researchers have become table stakes. Some experts warn that the intensity of product competition could pressure companies to cut corners on safety in the rush to build more powerful systems.

To make this research possible, OpenAI and Anthropic granted each other special API access to versions of their AI models with fewer safeguards (OpenAI notes that GPT-5 was not tested because it hadn’t been released yet). Shortly after the research was conducted, however, Anthropic revoked the API access of another team at OpenAI. At the time, Anthropic claimed that OpenAI violated its terms of service, which prohibits using Claude to improve competing products.

Zaremba says the events were unrelated and that he expects competition to stay fierce even as AI safety teams try to work together. Nicholas Carlini, a safety researcher with Anthropic, tells TechCrunch that he would like to continue allowing OpenAI safety researchers to access Claude models in the future.

“We want to increase collaboration wherever it’s possible across the safety frontier, and try to make this something that happens more regularly,” said Carlini.

Techcrunch event

San Francisco
|
October 27-29, 2025

One of the most stark findings in the study relates to hallucination testing. Anthropic’s Claude Opus 4 and Sonnet 4 models refused to answer up to 70% of questions when they were unsure of the correct answer, instead offering responses like, “I don’t have reliable information.” Meanwhile, OpenAI’s o3 and o4-mini models refuse to answer questions far less, but showed much higher hallucination rates, attempting to answer questions when they didn’t have enough information.

Zaremba says the right balance is likely somewhere in the middle — OpenAI’s models should refuse to answer more questions, while Anthropic’s models should probably attempt to offer more answers.

Sycophancy, the tendency for AI models to reinforce negative behavior in users to please them, has emerged as one of the most pressing safety concerns around AI models.

In Anthropic’s research report, the company identified examples of “extreme” sycophancy in GPT-4.1 and Claude Opus 4 — in which the models initially pushed back on psychotic or manic behavior, but later validated some concerning decisions. In other AI models from OpenAI and Anthropic, researchers observed lower levels of sycophancy.

On Tuesday, parents of a 16-year-old boy, Adam Raine, filed a lawsuit against OpenAI, claiming that ChatGPT (specifically a version powered by GPT-4o) offered their son advice that aided in his suicide, rather than pushing back on his suicidal thoughts. The lawsuit suggests this may be the latest example of AI chatbot sycophancy contributing to tragic outcomes.

“It’s hard to imagine how difficult this is to their family,” said Zaremba when asked about the incident. “It would be a sad story if we build AI that solves all these complex PhD level problems, invents new science, and at the same time, we have people with mental health problems as a consequence of interacting with it. This is a dystopian future that I’m not excited about.”

In a blog post, OpenAI says that it significantly improved the sycophancy of its AI chatbots with GPT-5, compared to GPT-4o, claiming the model is better at responding to mental health emergencies.

Moving forward, Zaremba and Carlini say they would like Anthropic and OpenAI to collaborate more on safety testing, looking into more subjects and testing future models, and they hope other AI labs will follow their collaborative approach.

Update 2:00pm PT: This article was updated to include additional research from Anthropic that was not initially made available to TechCrunch ahead of publication.


Got a sensitive tip or confidential documents? We’re reporting on the inner workings of the AI industry — from the companies shaping its future to the people impacted by their decisions. Reach out to Rebecca Bellan at rebecca.bellan@techcrunch.com and Maxwell Zeff at maxwell.zeff@techcrunch.com. For secure communication, you can contact us via Signal at @rebeccabellan.491 and @mzeff.88.



Source link

Share This Article
Email Copy Link Print
Previous Article Javier Milei evacuated from campaign event after stones thrown
Next Article Bestway Hydro-Force 1-Person Ventura Elite Inflatable Kayak Set only $69.99 (Reg. $260)!
Leave a Comment

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *

Recipe Rating




Your Trusted Source for Accurate and Timely Updates!

Our commitment to accuracy, impartiality, and delivering breaking news as it happens has earned us the trust of a vast audience. Stay ahead with real-time updates on the latest events, trends.
FacebookLike
XFollow
InstagramFollow
LinkedInFollow
MediumFollow
QuoraFollow
- Advertisement -
Ad image

You Might Also Like

Technology

Vocal Image is using AI to help people communicate better

By
Pranjal Raghav
Technology

Meta is going to stuff Midjourney AI images into your feed

By
Pranjal Raghav
Technology

Malaysia’s SkyeChip unveils the country’s first edge AI processor

By
Pranjal Raghav
Technology

21 Best Early Labor Day Sales on WIRED-Tested Gear (2025)

By
Pranjal Raghav
WorkSaathi News
Facebook Twitter Youtube Rss Medium

About US

 

WorkSaathi News: Your instant connection to the latest stories and live updates. Stay ahead with our real-time coverage across business, technology, politics, entertainment, and more. We bring you credible, fast, and accurate news 24/7 — your trusted partner in staying informed.

Top Categories
  • Education
  • Finance
  • Gadgets
  • Health
  • Digital Marketing
  • Online Earning
Usefull Links
  • Advertise with us
  • Contact Us
  • Advertise with US
  • Complaint
  • Privacy Policy
  • Cookie Policy
© WorkSaathi 2025. WebSaathi Design Company. All Rights Reserved.
© WorkSaathi 2025. WebSaathi Design Company. All Rights Reserved.
Welcome Back!

Sign in to your account

Username or Email Address
Password

Lost your password?