Monopoly Deal: Een Benchmarkomgeving voor Beperkte Eenzijdige Responsspelen

Samenvatting

Kaartspellen worden veelvuldig gebruikt om sequentieel besluitvormingsgedrag onder onzekerheid te bestuderen, met realistische analogieën in onderhandelingen, financiën en cybersecurity. Deze spellen zijn doorgaans in drie categorieën in te delen op basis van de controleflow: strikt sequentieel (spelers wisselen af met individuele acties), deterministische respons (bepaalde acties leiden tot een vast resultaat) en onbegrensde wederkerige respons (afwisselende tegenzetten zijn toegestaan). Een minder onderzochte maar strategisch rijke structuur is de begrensde eenzijdige respons, waarbij een actie van een speler de controle tijdelijk overdraagt aan de tegenstander, die aan een vaste voorwaarde moet voldoen via één of meer zetten voordat de beurt wordt afgesloten. Wij noemen spellen met dit mechanisme Bounded One-Sided Response Games (BORGs). Wij introduceren een aangepaste versie van Monopoly Deal als een benchmarkomgeving die deze dynamiek isoleert, waarbij een Huur-actie de tegenstander dwingt tot het kiezen van betaalmiddelen. De gouden standaard-algoritme, Counterfactual Regret Minimization (CFR), convergeert naar effectieve strategieën zonder nieuwe algoritmische uitbreidingen. Een lichtgewicht full-stack onderzoeksplatform integreert de omgeving, een geparallelleerde CFR-runtime en een door mensen bespeelbare webinterface. De getrainde CFR-agent en broncode zijn beschikbaar op https://monopolydeal.ai.

English

Card games are widely used to study sequential decision-making under uncertainty, with real-world analogues in negotiation, finance, and cybersecurity. These games typically fall into three categories based on the flow of control: strictly sequential (players alternate single actions), deterministic response (some actions trigger a fixed outcome), and unbounded reciprocal response (alternating counterplays are permitted). A less-explored but strategically rich structure is the bounded one-sided response, where a player's action briefly transfers control to the opponent, who must satisfy a fixed condition through one or more moves before the turn resolves. We term games featuring this mechanism Bounded One-Sided Response Games (BORGs). We introduce a modified version of Monopoly Deal as a benchmark environment that isolates this dynamic, where a Rent action forces the opponent to choose payment assets. The gold-standard algorithm, Counterfactual Regret Minimization (CFR), converges on effective strategies without novel algorithmic extensions. A lightweight full-stack research platform unifies the environment, a parallelized CFR runtime, and a human-playable web interface. The trained CFR agent and source code are available at https://monopolydeal.ai.

Monopoly Deal: Een Benchmarkomgeving voor Beperkte Eenzijdige Responsspelen

Monopoly Deal: A Benchmark Environment for Bounded One-Sided Response Games

Samenvatting

Support