AT2k Design BBS's VADV-PHP Home

AT2k Design BBS Message Area

Casually read the BBS message area using an easy to use interface. Messages are categorized exactly like they are on the BBS. You may post new messages or reply to existing messages!

You are not logged in. Login here for full access privileges.

Previous Message | Next Message | Back to Slashdot <-- <---

Return to Home Page

Slashdot [237 / 287]

From

Subject

Date/Time

VRSS

All

OpenAI Puzzled as New Models Show Rising Hallucination Rates

April 18, 2025
9:00 PM

Feed: Slashdot
Feed Link: https://slashdot.org/
---

Title: OpenAI Puzzled as New Models Show Rising Hallucination Rates

Link: https://slashdot.org/story/25/04/18/2323216/o...

OpenAI's latest reasoning models, o3 and o4-mini, hallucinate more frequently
than the company's previous AI systems, according to both internal testing
and third-party research. On OpenAI's PersonQA benchmark, o3 hallucinated 33%
of the time -- double the rate of older models o1 (16%) and o3-mini (14.8%).
The o4-mini performed even worse, hallucinating 48% of the time. Nonprofit AI
lab Transluce discovered o3 fabricating processes it claimed to use,
including running code on a 2021 MacBook Pro "outside of ChatGPT." Stanford
adjunct professor Kian Katanforoosh noted his team found o3 frequently
generates broken website links. OpenAI says in its technical report that
"more research is needed" to understand why hallucinations worsen as
reasoning models scale up.

Read more of this story at Slashdot.

---
VRSS v2.1.180528

Previous Message | Next Message | Back to Slashdot <-- <---

Return to Home Page

Execution Time: 0.0145 seconds

If you experience any problems with this website or need help, contact the webmaster.
VADV-PHP Copyright © 2002-2025 Steve Winn, Aspect Technologies. All Rights Reserved.
Virtual Advanced Copyright © 1995-1997 Roland De Graaf.
v2.1.250224