AT2k Design BBS Message Area
Casually read the BBS message area using an easy to use interface. Messages are categorized exactly like they are on the BBS. You may post new messages or reply to existing messages!

You are not logged in. Login here for full access privileges.

Previous Message | Next Message | Back to Slashdot  <--  <--- Return to Home Page
   Local Database  Slashdot   [237 / 287] RSS
 From   To   Subject   Date/Time 
Message   VRSS    All   OpenAI Puzzled as New Models Show Rising Hallucination Rates   April 18, 2025
 9:00 PM  

Feed: Slashdot
Feed Link: https://slashdot.org/
---

Title: OpenAI Puzzled as New Models Show Rising Hallucination Rates

Link: https://slashdot.org/story/25/04/18/2323216/o...

OpenAI's latest reasoning models, o3 and o4-mini, hallucinate more frequently
than the company's previous AI systems, according to both internal testing
and third-party research. On OpenAI's PersonQA benchmark, o3 hallucinated 33%
of the time -- double the rate of older models o1 (16%) and o3-mini (14.8%).
The o4-mini performed even worse, hallucinating 48% of the time. Nonprofit AI
lab Transluce discovered o3 fabricating processes it claimed to use,
including running code on a 2021 MacBook Pro "outside of ChatGPT." Stanford
adjunct professor Kian Katanforoosh noted his team found o3 frequently
generates broken website links. OpenAI says in its technical report that
"more research is needed" to understand why hallucinations worsen as
reasoning models scale up.

Read more of this story at Slashdot.

---
VRSS v2.1.180528
  Show ANSI Codes | Hide BBCodes | Show Color Codes | Hide Encoding | Hide HTML Tags | Show Routing
Previous Message | Next Message | Back to Slashdot  <--  <--- Return to Home Page

VADV-PHP
Execution Time: 0.0145 seconds

If you experience any problems with this website or need help, contact the webmaster.
VADV-PHP Copyright © 2002-2025 Steve Winn, Aspect Technologies. All Rights Reserved.
Virtual Advanced Copyright © 1995-1997 Roland De Graaf.
v2.1.250224