News from
OpenAI's Deep Research has more fact-finding stamina than you, but it's still wrong half the time
2025-04-29 13:45:04| Spiritual Career Counseling
The latest in generative artificial intelligence includes AI agents that can access the web to find answers to questions. While promising, agentic technology is very much a work in progress.In a paper published last week, OpenAI researchers relate how the company'sDeep Researchtechnology, which was built to use the Web, does far better than OpenAI's other models when answering web questions. It also does far better than humans on tasks requiring hours of searching.Also: What are AI agents? How to access a team of personalized assistantsBut Deep Research still stumbles almost half the time.OpenAI's new test suggests Deep Research can be more tenacious and dogged in pursuit of an answer than human researchers for some tasks, but it still fails to come up with an answer often.Called BrowseComp, the test is described by authors Jason Wei and team as "a simple yet challenging benchmark for measuring the ability of agents to browse the web."The premise is that AI agents -- meaning, AI models...
Category:
Employment