Almanac
technique

DNQ

techniqueactiveprovisionaldnq-06d235d0·1 events·first seen 12d ago

Aliases: DNQ

Co-occurring entities

More like this (12)

Recent events (1)

4arXiv · cs.LG·12d ago·source ↗

DNQ: Deep Nash Q-Network framework for equilibrium learning in multi-agent bidding games

Researchers propose DNQ (Deep Nash Q-Network), a solver-in-the-loop framework for training agents to reach Nash equilibria in partially observable n-player simultaneous bidding games. The method alternates between trajectory collection, critic-based payoff estimation, external equilibrium computation, and policy imitation via KL divergence minimization. A scalable pairwise payoff formulation is shown to outperform the exact N-player tensor approach in computational cost while maintaining strategic quality, with experiments demonstrating the trade-off between fidelity and scalability as agent count grows.