OpenAI's Deep Research has more fact-finding stamina than you, but it's still wrong half the time

2025-04-29 13:45:04 | Spiritual Career Counseling

The latest in generative artificial intelligence includes AI agents that can access the web to find answers to questions. While promising, agentic technology is very much a work in progress.

In a paper published last week, OpenAI researchers relate how the company's Deep Research technology, which was built to use the Web, does far better than OpenAI's other models when answering web questions. It also does far better than humans on tasks requiring hours of searching.

But Deep Research still stumbles almost half the time.

OpenAI's new test suggests Deep Research can be more tenacious and dogged in pursuit of an answer than human researchers for some tasks, but it still fails to come up with an answer often.

Called BrowseComp, the test is described by authors Jason Wei and team as "a simple yet challenging benchmark for measuring the ability of agents to browse the web."

The premise is that AI agents -- meaning, AI models...

Category: Employment
