AgentForesight: Real-Time Failure Prediction in Multi-Agent AI

AgentForesight: Online Auditing for Early Failure Prediction in Multi-Agent Systems

In the rapidly evolving domain of artificial intelligence, particularly in the realm of large language model (LLM)-based multi-agent systems, the stakes are high. These systems are often tasked with long-horizon goals, where a single misstep can lead to a cascade of failures affecting the entire trajectory of the operation. Traditionally, the approach to managing such failures has revolved around what is known as post-hoc failure attribution. This method involves diagnosing which agent failed and at what point in the process, but it lacks the capacity for real-time intervention. To address this critical gap, researchers have introduced a groundbreaking framework called AgentForesight.

The Challenge of Trajectory-Level Failures

The existing paradigm of diagnosing failures after they occur limits the ability to take proactive measures during the execution of tasks. This is particularly problematic in multi-agent systems where coordination and timely responses are essential. The cascading effects of a single decisive error can lead to significant operational inefficiencies and failures. AgentForesight aims to transform this reactive approach into a proactive online auditing mechanism.

What is AgentForesight?

AgentForesight is designed to conduct online audits of multi-agent trajectories in real time. It operates under the premise that at each step of an unfolding trajectory, an auditor—powered by an AI model—observes only the current sequence of actions. The auditor must then make a critical decision: either allow the trajectory to continue or raise an alarm at the earliest indication of a decisive error. This approach empowers the system to intervene before a failure cascades through the rest of the operational process.

AFTraj-2K: The Corpus Behind the Framework

To support the development of AgentForesight, researchers curated a substantial dataset known as AFTraj-2K. This corpus comprises agentic trajectories across various domains, including Coding, Math, and Agentic tasks. Each trajectory has been meticulously vetted through a rigorous curation pipeline, ensuring that safe trajectories are preserved while unsafe ones are annotated at the point of failure. This careful annotation process relies on consensus from multiple LLM judges, enhancing the reliability of the dataset.

AgentForesight-7B: The Auditor Model

At the heart of the AgentForesight framework is the AgentForesight-7B model. This compact online auditor employs a unique reinforcement learning approach that is both coarse and fine. Initially, the model is equipped with a risk-anticipation prior based on adjacent safe and unsafe prefix pairs. Subsequently, it refines this prior to achieve precise step-level localization, guided by a three-axis reward system that targets the essential aspects of an audit verdict: the what, where, and who of any potential failure.

Performance and Impact

In rigorous testing against the AFTraj-2K and an external benchmark known as Who&When, AgentForesight-7B demonstrated remarkable performance improvements. It outperformed leading proprietary models, including GPT-4.1 and DeepSeek-V4-Pro, achieving performance gains of up to +19.9% and reducing step localization errors by a factor of three. These advancements represent a significant shift from merely detecting post-hoc failures to enabling timely interventions during deployment.

Conclusion

The introduction of AgentForesight marks a significant advancement in the field of multi-agent systems. By shifting the focus from retrospective failure analysis to real-time auditing, the framework paves the way for more robust and resilient AI systems capable of navigating complex tasks with greater efficiency and safety. For further details, visit the project page at AgentForesight Project Page.

RichlyAI Blog AI Guide, Tutorials, Industrial Insights, & more!

Company

AgentForesight: Real-Time Failure Prediction in Multi-Agent AI

AgentForesight: Online Auditing for Early Failure Prediction in Multi-Agent Systems

The Challenge of Trajectory-Level Failures

What is AgentForesight?

AFTraj-2K: The Corpus Behind the Framework

AgentForesight-7B: The Auditor Model

Performance and Impact

Conclusion

Related AI Insights

Subscribe

More like thisRelated

About us

Company

The latest

Subscribe

RichlyAI Blog
AI Guide, Tutorials, Industrial Insights, & more!

More like this
Related