
Deep research is a retrieval problem
Give the agent the relevant documents and it answers 93% of questions correctly. Make it find those documents with a weak retriever and it scores 14%. In BrowseComp-Plus, that gap makes retrieval impossible to ignore.




