SPILLage: Condivisione eccessiva agentica sul Web

Abstract

Gli agenti basati su LLM stanno iniziando ad automatizzare le attività degli utenti sul web aperto, spesso con accesso a risorse utente come email e calendari. A differenza dei LLM standard che rispondono a domande in un ambiente controllato come i chatbot, gli agenti web agiscono "in natura", interagendo con terze parti e lasciando una traccia di azioni. Pertanto, ci poniamo la domanda: come gestiscono gli agenti web le risorse utente quando svolgono compiti per loro conto su siti web attivi? In questo articolo, formalizziamo il Natural Agentic Oversharing – la divulgazione non intenzionale di informazioni utente irrilevanti per il compito attraverso una traccia di azioni dell'agente sul web. Introduciamo SPILLage, un framework che caratterizza l'oversharing lungo due dimensioni: canale (contenuto vs. comportamento) e direttezza (esplicito vs. implicito). Questa tassonomia rivela un punto cieco critico: mentre i lavori precedenti si concentrano sulla fuga di informazioni testuali, gli agenti web condividono in eccesso anche a livello comportamentale attraverso click, scroll e pattern di navigazione che possono essere monitorati. Eseguiamo benchmark su 180 compiti su siti e-commerce attivi con annotazioni di ground-truth che separano attributi rilevanti per il compito da quelli irrilevanti. Su 1.080 esecuzioni che coprono due framework agentici e tre LLM di base, dimostriamo che l'oversharing è pervasivo, con l'oversharing comportamentale che domina quello di contenuto di 5 volte. Questo effetto persiste – e può persino peggiorare – con mitigazioni a livello di prompt. Tuttavia, rimuovere le informazioni irrilevanti per il compito prima dell'esecuzione migliora il successo del compito fino al 17,9%, dimostrando che ridurre l'oversharing migliora il successo del compito. I nostri risultati sottolineano che proteggere la privacy negli agenti web è una sfida fondamentale, che richiede una visione più ampia di "output" che tenga conto di ciò che gli agenti fanno sul web, non solo di ciò che digitano. I nostri dataset e codice sono disponibili su https://github.com/jrohsc/SPILLage.

English

LLM-powered agents are beginning to automate user's tasks across the open web, often with access to user resources such as emails and calendars. Unlike standard LLMs answering questions in a controlled ChatBot setting, web agents act "in the wild", interacting with third parties and leaving behind an action trace. Therefore, we ask the question: how do web agents handle user resources when accomplishing tasks on their behalf across live websites? In this paper, we formalize Natural Agentic Oversharing -- the unintentional disclosure of task-irrelevant user information through an agent trace of actions on the web. We introduce SPILLage, a framework that characterizes oversharing along two dimensions: channel (content vs. behavior) and directness (explicit vs. implicit). This taxonomy reveals a critical blind spot: while prior work focuses on text leakage, web agents also overshare behaviorally through clicks, scrolls, and navigation patterns that can be monitored. We benchmark 180 tasks on live e-commerce sites with ground-truth annotations separating task-relevant from task-irrelevant attributes. Across 1,080 runs spanning two agentic frameworks and three backbone LLMs, we demonstrate that oversharing is pervasive with behavioral oversharing dominates content oversharing by 5x. This effect persists -- and can even worsen -- under prompt-level mitigation. However, removing task-irrelevant information before execution improves task success by up to 17.9%, demonstrating that reducing oversharing improves task success. Our findings underscore that protecting privacy in web agents is a fundamental challenge, requiring a broader view of "output" that accounts for what agents do on the web, not just what they type. Our datasets and code are available at https://github.com/jrohsc/SPILLage.

SPILLage: Condivisione eccessiva agentica sul Web

SPILLage: Agentic Oversharing on the Web

Abstract

Support