talks | works
Parabellum

Parabellum

May 6, 2025
Thun, Switzerland
Press D to download as a PDF and F to toggle presentation mode. Navigate up and down with K and J when presenting
PARABELLUMNoah SyrkisMay 6, 20251 |Introduction2 |Vectorization3 |Acceleration4 |Differentiation5 |SimulationThese slides presents a broad overview of the Parabellum environment shown in [1] T. Anne etal., β€œHarnessing Language for Coordination: A Framework and Benchmark for LLM-Driven Multi-Agent Control,” IEEE Transactions on Games, pp. 1–25, 2025, doi: 10.1109/TG.2025.3564042. Thisresearch is funded by Armasuisse.1 of 91 |Introductionβ–ΆSandbox for large-scale, team-based wargames on real terrainβ–ΆDifferentiable JAX environment grounded in OpenStreetMapβ–ΆBuilt for fast iteration on autonomous tactics and analysis2 of 91 |Introductionβ–ΆTraditional wargaming is not ready for ML, real geography, or beyond real time simulationβ–ΆManual setup slows sensitivity analysisβ–ΆGradient-free simulators block learning-based planning and seamless integration with deep learning pipelines (including LLMs and RL agents)3 of 92 |Vectorizationβ–ΆProcedurally load maps + buildings for any geocoded area in a JAX [2] arrayβ–ΆUnits, teams, sensors as YAML config-files to specify game rules and team compositionsβ–ΆEntirely in JAX: batching, autodiff, vectorisationβ–ΆParallel rollouts across seeds + scenarios on accelerators4 of 93 |Accelerationβ–ΆReal war: high fidelity but slow, costly, unparallelizableβ–ΆParabellum: an RTS-like simulator where:β–ΆArbitrary numbers of sims can run in parallelβ–ΆFaster than real-timeβ–ΆTens of thousands of units per scenario5 of 94 |Differentiationβ–ΆFully written in JAX [2]β–ΆVectorized via vmap, parallelized with pmapβ–ΆDirect integration into deep learning pipelinesβ–ΆBoosts model capacity for long-horizon strategy6 of 95 |Simulationβ–ΆTrajectories as (𝑠𝑑, π‘Žπ‘‘)–tuplesβ–ΆNo rewards Figure 1 β€” only flows of state andactionState 𝑠𝑑+1State 𝑠𝑑Observation π‘œπ‘‘Action π‘Žπ‘‘Step 𝑑Figure 1: Rewardless partially observable MDPdiagram7 of 95 |Simulationβ–ΆState = (position, health, cooldown)β–ΆScene encodes terrain, ranges, unit typesβ–ΆAny Earth location loadable via OSM1β–ΆObservation = visible units’ location, health, type, team1OpenStreetMap data8 of 9References[1]T. Anne et al., β€œHarnessing Language for Coordination: A Framework and Benchmark forLLM-Driven Multi-Agent Control,” IEEE Transactions on Games, pp. 1–25, 2025, doi: 10.1109/TG.2025.3564042.[2]J. Bradbury et al., β€œJAX: Composable Transformations of Python+NumPy Programs.” 2018.9 of 9