paper

Which Nash Equilibrium? Solver-Dependent Selection on Zero-Sum Nash Polytopes

paper · active · provisional · which-nash-equilibrium-solver-dependent-selection-on-zero-sum-nash-polytopes-6d55ddb0 · 1 event · first seen 21h ago

Co-occurring entities

CFR · Kuhn poker · R-NaD · magnetic mirror descent

More like this (12)

Game-Theoretic Equilibria · DNQ: Deep Nash Q-Network for Partially Observable n-Player Games · Gradient Equilibrium · Blackwell Approachability and Gradient Equilibrium are Equivalent · Regret Minimization with Adaptive Opponents in Repeated Games · Pareto Optimal Policy Optimization · Greedy Ensemble Selection · Stable Menus of Public Goods · Algorithmic and Minimax Complexities in Kernel Bandits · Equilibrium Reasoners (EqR) · Error-Conditioned Neural Solvers · Exact Posterior Score Estimation for Solving Linear Inverse Problems

Recent events (1)

5 · arXiv · cs.AI · 21h ago

Solver-dependent Nash equilibrium selection on zero-sum polytopes: regularized methods select max-entropy members

A new arXiv preprint investigates whether different Nash equilibrium solvers systematically select different members of the Nash polytope in two-player zero-sum games. Using six analytically tractable games, including Kuhn poker, the authors find that regularized last-iterate methods (R-NaD, magnetic mirror descent) converge to the maximum-entropy Nash equilibrium, which can be interpreted as an information projection, while regret-averaging methods (CFR, CFR+, fictitious play) drift to lower-entropy solutions on the boundary of the polytope. Which equilibrium a solver selects affects performance against sub-optimal opponents in games with sequential or hidden-information structure, which has implications for multi-agent AI training and game-solving pipelines.
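The selection effect described above can be illustrated on a toy game. The sketch below is not taken from the preprint: the game (matching pennies with a duplicated row), the step sizes, the iteration counts, and the function names are all assumptions chosen for illustration. In this game the row player's Nash polytope is every strategy with probability 1/2 on the first row and the remaining 1/2 split arbitrarily between the two identical rows, so the maximum-entropy member is (0.5, 0.25, 0.25). A simplified magnetic-mirror-descent-style update with a uniform magnet is compared against fictitious play with lowest-index tie-breaking.

```python
import numpy as np

# Hypothetical example game (not one of the preprint's six games):
# matching pennies with the second row duplicated.
# Entries are payoffs to the row player; the column player receives the negative.
A = np.array([[1.0, -1.0],
              [-1.0, 1.0],
              [-1.0, 1.0]])


def entropy(p):
    """Shannon entropy in nats, treating 0 * log(0) as 0."""
    p = np.asarray(p, dtype=float)
    nz = p[p > 0]
    return float(-np.sum(nz * np.log(nz)))


def _log_normalize(v):
    """Shift log-weights so that exp(v) sums to 1."""
    m = np.max(v)
    return v - (m + np.log(np.sum(np.exp(v - m))))


def mmd_row_strategy(A, alpha=0.1, eta=0.01, steps=20000):
    """Last iterate of a simultaneous MMD-style update with uniform magnets.

    alpha (regularization weight) and eta (step size) are illustrative
    values, not settings reported in the preprint.
    """
    n, m = A.shape
    log_rho_x = np.full(n, -np.log(n))  # uniform magnet, row player
    log_rho_y = np.full(m, -np.log(m))  # uniform magnet, column player
    log_x, log_y = log_rho_x.copy(), log_rho_y.copy()
    scale = 1.0 + eta * alpha
    for _ in range(steps):
        x, y = np.exp(log_x), np.exp(log_y)
        grad_x = A @ y        # row player maximizes x^T A y
        grad_y = -(A.T @ x)   # column player minimizes x^T A y
        # Closed-form step: new ∝ (old * magnet^(eta*alpha) * exp(eta*grad))^(1/scale)
        log_x = _log_normalize((log_x + eta * grad_x + eta * alpha * log_rho_x) / scale)
        log_y = _log_normalize((log_y + eta * grad_y + eta * alpha * log_rho_y) / scale)
    return np.exp(log_x)


def fictitious_play_row_strategy(A, steps=20000):
    """Empirical average of the row player's best responses.

    Ties are broken toward the lowest index (the behaviour of np.argmax
    and np.argmin), so the duplicated third row is never selected.
    """
    n, m = A.shape
    row_counts = np.zeros(n)
    col_counts = np.zeros(m)
    for _ in range(steps):
        br_row = int(np.argmax(A @ col_counts))   # best response to column history
        br_col = int(np.argmin(row_counts @ A))   # best response to row history
        row_counts[br_row] += 1
        col_counts[br_col] += 1
    return row_counts / steps


if __name__ == "__main__":
    x_mmd = mmd_row_strategy(A)
    x_fp = fictitious_play_row_strategy(A)
    print("MMD-style last iterate:", np.round(x_mmd, 3), "entropy:", round(entropy(x_mmd), 3))
    print("Fictitious play average:", np.round(x_fp, 3), "entropy:", round(entropy(x_fp), 3))
```

With a fixed positive `alpha` the regularized iterate settles close to, not exactly at, the maximum-entropy equilibrium. The fictitious-play outcome here is driven entirely by the tie-breaking rule, which is one simple way an averaging method can end up on the boundary of the polytope; it is not a claim about the mechanism the preprint identifies.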

Evaluation and Benchmarking · CFR · Kuhn poker · R-NaD · +2 more