HaM-World: Advanced Soft-Hamiltonian Models for Planning

Date:

HaM-World: Soft-Hamiltonian World Models with Selective Memory for Planning

A groundbreaking study has been released on arXiv with the title “HaM-World: Soft-Hamiltonian World Models with Selective Memory for Planning” (arXiv:2605.05951v1). This research introduces an innovative approach to world models, which are essential for model-based planning through learned latent dynamics. The study highlights the challenges faced when utilizing imagined rollouts, particularly the instability that arises as the planning horizon extends or as the dynamics distribution shifts.

The authors contend that this instability stems from two significant missing structures in the planner-facing latents: the absence of history-conditioned memory, which is crucial for achieving approximate Markov completeness, and the lack of geometric organization that effectively distinguishes between configuration, momentum, and task semantics.

Introducing HaM-World (HMW)

To address these issues, the researchers propose a structured world model known as HaM-World (HMW). This model decomposes the latent state into two fundamental components: a canonical subspace represented by (q, p) and a context subspace denoted as c. HMW utilizes Mamba selective state-space memory as a history-conditioned input that feeds into the same latent dynamics.

The evolution of the (q, p) subspace occurs through an energy-derived Hamiltonian vector field combined with learnable residual and control dynamics. Meanwhile, the context subspace c encapsulates semantic, dissipative, and non-conservative factors. This architectural design allows the planner to maintain a unified latent state that serves multiple purposes, including dynamics prediction, reward/value estimation, imagined rollouts, and CEM action search.

Performance and Results

The performance of HaM-World has been rigorously evaluated across four tasks in the DeepMind Control Suite. The results are impressive, with HaM-World achieving the highest average area under the curve (Avg. AUC) of 117.9, representing a significant improvement of 9.5%. Additionally, the model successfully reduces long-horizon rollout error to just 45% of that observed in a robust baseline model, and it excels in competitive settings, winning 11 out of 12 key performance metrics in various mean-squared error (MSE) cells.

  • Out-of-Distribution (OOD) Performance: HaM-World demonstrated remarkable resilience under 12 OOD perturbations, which included dynamics shifts, action delays, and observation masking.
  • Consistent Returns: The model achieved the highest return in every condition tested, with average OOD-return gains of 10.2% on the Finger Spin task and 13.6% on the Reacher Easy task.

Diagnostic Insights

Further diagnostics of the mechanisms underlying HaM-World reveal several key findings:

  • Bounded Action-Free Hamiltonian-Energy Drift: The model maintains stability even in the absence of actions.
  • Structured Energy Variation: Energy varies in a coherent manner under policy rollouts, suggesting effective control dynamics.
  • Coherent Control-Induced Energy Transfer: The design supports the intended Soft-Hamiltonian dynamics, facilitating enhanced planning capabilities.

In conclusion, HaM-World represents a significant advancement in the field of model-based planning, combining innovative structural elements with empirical performance improvements. As AI continues to evolve, models like HMW may pave the way for more robust and flexible planning systems.

Related AI Insights

Lazarus Omolua
Lazarus Omoluahttps://richlyai.com/blog
My mission is to make sure that people in Africa are not left behind in the global AI revolution. RichlyAI exists to give everyone — students, founders, creators, and businesses — the tools to compete globally.

Subscribe

Popular

More like this
Related

How Business Ops Teams Boost Productivity with Codex

Discover how business operations teams use Codex to streamline documentation, enhance collaboration, and improve decision-making with AI-powered automation...

OpenAI Partners with Malta to Offer ChatGPT Plus Nationwide

OpenAI and Malta team up to provide free ChatGPT Plus access and AI training to all citizens, promoting digital literacy and responsible AI use.

Critical Linux Kernel Flaw Risks SSH Host Key Theft

A critical Linux kernel flaw risks stolen SSH host keys. Learn how to protect your systems and stay secure until patches are widely available.

Top External Hard Drives 2026: Expert Reviews & Buying Guide

Discover the best external hard drives of 2026 with expert reviews. Find top picks for speed, durability, and security to suit all storage needs.