Agent-BOM: Unified Security Auditing for LLM Agents

Towards Security-Auditable LLM Agents: A Unified Graph Representation

In a recent paper published on arXiv, researchers are addressing the challenges associated with security auditing in large language model (LLM)-based agentic systems. The rapid evolution of these systems, which perform complex autonomous tasks, has outpaced existing auditing mechanisms, highlighting a critical need for improved security frameworks.

LLM-based agentic systems leverage dynamic tool invocation, stateful memory management, and multi-agent collaboration to execute tasks. However, these advancements have introduced a significant semantic gap between low-level physical events and high-level execution intent. This gap complicates the process of post-hoc security auditing, making it difficult to ensure that these systems operate securely and as intended.

Current representation mechanisms, such as static Software Bill of Materials (SBOM) and runtime logs, provide only fragmented evidence. They fail to comprehensively capture key factors such as:

Cognitive-state evolution
Capability bindings
Persistent memory contamination
Cascading risk propagation across interacting agents

To address these challenges, the authors propose a novel solution called Agent-BOM, a unified structural representation designed specifically for agent security auditing. Agent-BOM models an agentic system as a hierarchical attributed directed graph, effectively separating static capability bases—including models, tools, and long-term memory—from dynamic runtime semantic states, which encompass goals, reasoning trajectories, and actions.

This innovative approach connects these layers through semantic edges and security attributes, transforming previously fragmented execution traces into queryable audit paths. By employing Agent-BOM, organizations can enhance their ability to perform security audits and ensure compliance with operational standards.

Building on the foundations of Agent-BOM, the researchers have developed a graph-query-based paradigm for path-level risk assessment. This assessment framework is instantiated with the OWASP Agentic Top 10, a set of identified risks unique to agentic systems. The integration of this framework allows for more nuanced risk evaluations and helps to identify vulnerabilities within agentic ecosystems.

Furthermore, an auditing plugin has been implemented in the OpenClaw environment, enabling the construction of Agent-BOM from live executions. This plugin facilitates real-time monitoring and analysis, providing valuable insights into the operation of agentic systems.

The evaluation of Agent-BOM has demonstrated its efficacy in reconstructing stealthy attack chains in various real-world scenarios, which include:

Cross-session memory poisoning and tool misuse
Capability supply-chain hijacking and unexpected code execution
Multi-agent ecosystem hijacking
Privilege and trust abuse

These findings indicate that Agent-BOM offers a unified and auditable foundation for root-cause analysis and security adjudication in complex agentic ecosystems. By bridging the semantic gap between low-level events and high-level intents, Agent-BOM significantly enhances the ability to conduct meaningful security audits.

As LLM-based agentic systems continue to evolve, the introduction of frameworks like Agent-BOM will be crucial in ensuring their safe and secure deployment across various applications, paving the way for more robust security measures in the field of artificial intelligence.

RichlyAI Blog AI Guide, Tutorials, Industrial Insights, & more!

Company

Agent-BOM: Unified Security Auditing for LLM Agents

Towards Security-Auditable LLM Agents: A Unified Graph Representation

Related AI Insights

Subscribe

More like thisRelated

About us

Company

The latest

Subscribe

RichlyAI Blog
AI Guide, Tutorials, Industrial Insights, & more!

More like this
Related