OpenAI gets caught vibe graphing

Something’s off with that chart on the left.

During its big GPT-5 livestream on Thursday, OpenAI showed off a few charts that made the model seem quite impressive, but if you look closely, some graphs were a little bit off.

In one, ironically showing how well GPT-5 does in “deception evals across models,” the scale is all over the place. For “coding deception,” for example, GPT-5 apparently gets a 50.0 percent deception rate, yet o3’s lower score of 47.4 percent somehow has a larger bar.

who’s making these graphs pic.twitter.com/Zt6yhZuUoo

— Shrey Kothari (@shreyk0) August 7, 2025

Or this one, where one of GPT-5’s scores is lower than o3’s but is shown with a bigger bar. In this same chart, o3 and GPT-4o’s scores are different but shown with equally sized bars. That chart was bad enough that CEO Sam Altman commented on it, calling it a “mega chart screwup.” An OpenAI marketing staffer also apologized for the “unintentional chart crime.”

this screenshot from GPT-5 livestream has to be among the worst chart crimes of the century pic.twitter.com/HXsK2CWCon

— Ege Erdil (@EgeErdil2) August 7, 2025

OpenAI didn’t immediately respond to a request for comment. And while it’s unclear if OpenAI used GPT-5 to actually make the charts, it’s still not a great look for the company on its big launch day, especially when it is touting the “significant advances in reducing hallucinations” with its new model.

Original Source: https://www.theverge.com/news/756444/openai-gpt-5-vibe-graphing-chart-crime

Disclaimer: This article is a reblogged/syndicated piece from a third-party news source. Content is provided for informational purposes only. For the most up-to-date and complete information, please visit the original source. Digital Ground Media does not claim ownership of third-party content and is not responsible for its accuracy or completeness.
