MARS: Reinforcing Multi-Agent Reasoning of LLMs through Self-Play in Strategic Games
Paper
⢠2510.15414 ⢠Published
⢠1
MARSHAL: Incentivizing Multi-Agent Reasoning via Self-Play with Strategic LLMs š Accepted by ICLR 2026
Note Note: This paper has been updated to v3 on arXiv. MARSHAL: Incentivizing Multi-Agent Reasoning via Self-Play with Strategic LLMs