Naar het ophalen van interactieruimtes voor agentisch zoeken

Samenvatting

Retrieval voor zoekagenten wordt nog steeds overgenomen van niet-agentische informatieopvraging: een retriever rangschikt het corpus en de agent leest een kleine set geretourneerde documenten. Recent werk aan directe corpusinteractie (DCI) laat zien dat agenten in plaats daarvan kunnen interacteren met het ruwe corpus via shell-hulpmiddelen zoals grep en bestandsleesacties. Maar onbegrensde interactie schaalt niet: elke brede shell-opdracht is een scan van het gehele corpus, en de latentie verslechtert sterk naarmate het corpus groeit. Wij stellen dat de rol van retrieval voor agentisch zoeken niet alleen is om documenten te selecteren die in het LLM-contextvenster passen, maar om een interactieruimte te construeren: een begrensde deelverzameling van het corpus die de agent kan verkennen met bijbehorende tools. Twee ontwerpconsequenties volgen hieruit. De ruimte heeft een grens nodig die door retrieval wordt geleverd, en de objecten erbinnen moeten worden verwerkt voor interactie. Als proof of concept stellen we RISE (Retrieving Interaction SpacE) voor: we gebruiken BM25 om de interactieruimte te construeren; tegelijkertijd worden de documenten tijdens het indexeren verwerkt voor shell-achtige navigatie. Op BrowseComp-Plus evenaart RISE de zuivere shell DCI-baseline met 78% nauwkeurigheid met gpt-5.4-mini tegen ongeveer een kwart van de kosten per query. Bij 1 miljoen documenten bereikt RISE-BM25 81% op gpt-5.4-mini, terwijl DCI op gpt-5.4-nano degradeert tot 60% met 33 van de 100 kloktijd-mislukkingen.

English

Retrieval for search agents is still inherited from non-agentic information retrieval: a retriever ranks the corpus and the agent reads a small set of returned documents. Recent direct corpus interaction (DCI) work shows that agents can instead interact with the raw corpus through shell tools such as grep and file reads. But unbounded interaction does not scale: every broad shell command is a scan over the whole corpus, and latency degrades sharply as the corpus grows. We argue that the role of retrieval for agentic search is not just to select documents that fit in the LLM context window, but to construct an interaction space: a bounded subset of the corpus the agent can explore with associated tools. Two design consequences follow. The space needs a boundary supplied by retrieval, and the objects within it should be processed for interaction. As a proof of concept, we propose RISE (Retrieving Interaction SpacE): we use BM25 to construct the interaction space; meanwhile, its documents are processed during indexing for shell-style navigation. On BrowseComp-Plus, RISE matches the pure-shell DCI baseline at 78% accuracy with gpt-5.4-mini at roughly one quarter of the per-query cost. At 1M documents, RISE-BM25 reaches 81% on gpt-5.4-mini, whereas DCI on gpt-5.4-nano degrades to 60% with 33 of 100 wall-clock failures.