LudoBench evaluates large language models' strategic reasoning using 480 spot-based Ludo scenarios, revealing key insights into AI decision-making behavior...
Discover how SkyNet's belief-aware planning improves AI performance in partially observable stochastic games with enhanced reinforcement learning techniques.