Springdrift: An Auditable Persistent Runtime for LLM Agents with Case-Based Memory, Normative Safety, and Ambient Self-Perception
We are excited to announce the release of a new technical report titled “Springdrift,” which introduces a groundbreaking persistent runtime for long-lived Large Language Model (LLM) agents. This innovative system is detailed in the arXiv report number 2604.04660v1 and showcases a multitude of advanced features that enhance the capabilities of LLM agents beyond traditional session-bound systems.
Key Features of Springdrift
Springdrift incorporates several state-of-the-art components designed to create a more effective and reliable operational framework for LLM agents:
- Auditable Execution Substrate: The system employs an append-only memory model combined with supervised processes and git-backed recovery mechanisms. This ensures that all actions taken by the agent can be verified and audited.
- Case-Based Reasoning Memory Layer: Springdrift features a hybrid retrieval system that enhances memory management and decision-making processes, evaluated against a dense cosine baseline for effectiveness.
- Deterministic Normative Calculus: This component provides safety gating with auditable axiom trails, ensuring that all decisions made by the agent adhere to established safety norms.
- Continuous Ambient Self-Perception: The system includes a structured self-state representation, referred to as the sensorium, which is injected into the agent’s processing cycle without requiring tool calls. This enables ongoing self-awareness and situational context.
Implications of Springdrift’s Design
The unique features of Springdrift facilitate several advanced behaviors that are challenging to achieve in traditional systems:
- Cross-Session Task Continuity: The agent can maintain task continuity across different sessions, allowing for long-term project management and execution.
- Cross-Channel Context Maintenance: Springdrift can retain context across various communication channels, such as email and web interactions, enhancing its utility in collaborative environments.
- End-to-End Forensic Reconstruction: The system allows for complete reconstruction of decisions made by the agent, providing transparency and accountability.
- Self-Diagnostic Behavior: The agent is capable of diagnosing its own infrastructure bugs and identifying architectural vulnerabilities without explicit instructions, showcasing a significant leap in autonomy.
Real-World Deployment and Findings
Springdrift has been deployed in a real-world scenario over a period of 23 days (19 operating days). During this deployment, the agent successfully diagnosed its own infrastructure issues, classified various failure modes, and maintained contextual awareness across multiple channels without any direct prompts. This performance illustrates the practical applications and benefits of the architectural properties designed into Springdrift.
The Concept of Artificial Retainer
We introduce the term “Artificial Retainer” to describe this category of non-human systems with persistent memory, defined authority, domain-specific autonomy, and forensic accountability. This term draws inspiration from professional retainer relationships and the bounded autonomy seen in trained working animals, distinguishing Springdrift from typical software assistants and fully autonomous agents.
Conclusion and Future Work
This technical report serves as a case study for the design and deployment of the Springdrift system, emphasizing its innovative features rather than benchmark-driven evaluations. The implementation has been carried out using Gleam on Erlang/OTP, and further details, including code, artifacts, and redacted operational logs, will be available on GitHub upon publication.
