WorkSaathi News

OpenAI and Anthropic conducted safety evaluations of each other’s AI systems

Pranjal Raghav
Last updated: August 27, 2025 11:52 pm


Most of the time, AI companies are locked in a race to the top, treating each other as rivals and competitors. Today, OpenAI and Anthropic revealed that they had agreed to evaluate the alignment of each other’s publicly available systems and shared the results of their analyses. The full reports get pretty technical, but are worth a read for anyone who’s following the nuts and bolts of AI development. A broad summary showed some flaws with each company’s offerings and offered pointers for how to improve future safety tests.

Anthropic said it evaluated OpenAI’s models for “sycophancy, whistleblowing, self-preservation, and supporting human misuse, as well as capabilities related to undermining AI safety evaluations and oversight.” Its review found that OpenAI’s o3 and o4-mini models fell in line with results for its own models, but it raised concerns about possible misuse with the GPT-4o and GPT-4.1 general-purpose models. The company also said sycophancy was an issue to some degree with all tested models except for o3.

Anthropic’s tests did not include OpenAI’s most recent release, which has a feature called Safe Completions that is meant to protect users and the public against potentially dangerous queries. OpenAI recently faced a lawsuit after a tragic case in which a teenager discussed attempts and plans for suicide with ChatGPT for months before taking his own life.

On the flip side, OpenAI tested Anthropic’s models for instruction hierarchy, jailbreaking, hallucinations and scheming. The Claude models generally performed well in instruction hierarchy tests, and had a high refusal rate in hallucination tests, meaning they were less likely to offer answers in cases where uncertainty meant their responses could be wrong.

The decision by these companies to conduct a joint assessment is intriguing, particularly since OpenAI allegedly violated Anthropic’s terms of service by having programmers use Claude in the process of building new GPT models, which led to Anthropic revoking OpenAI’s access to its tools earlier this month. But safety with AI tools has become a bigger issue as more critics and legal experts seek guidelines to protect users, particularly minors.



