Genesis: Advanced Evolving Attacks for LLM Web Agents

Date:

Genesis: Evolving Attack Strategies for LLM Web Agent Red-Teaming

Summary: arXiv:2510.18314v2 Announce Type: replace

As large language model (LLM) agents increasingly automate complex web tasks, they boost productivity while simultaneously introducing new security risks. However, relevant studies on web agent attacks remain limited. Existing red-teaming approaches mainly rely on manually crafted attack strategies or static models trained offline. Such methods fail to capture the underlying behavioral patterns of web agents, making it difficult to generalize across diverse environments.

In web agent attacks, success requires the continuous discovery and evolution of attack strategies. To this end, we propose Genesis, a novel agentic framework composed of three modules: Attacker, Scorer, and Strategist.

Framework Overview

The Genesis framework is designed to enhance the effectiveness of attack strategies against LLM web agents through a dynamic and iterative process. Each module plays a critical role in this ecosystem:

  • Attacker: This module generates adversarial injections by integrating a genetic algorithm with a hybrid strategy representation. This allows for the creation of diverse and unpredictable attack vectors that can adapt to the changing defenses of web agents.
  • Scorer: The Scorer evaluates the responses generated by the target web agent in reaction to the attacks. It provides critical feedback that informs the Attacker about the effectiveness of different strategies and helps refine future attacks.
  • Strategist: The Strategist dynamically uncovers effective strategies from interaction logs. It compiles these strategies into a continuously growing library, which is then used to enhance the Attacker’s capabilities, ensuring that the framework evolves over time.

Methodology

Genesis employs a unique methodology that focuses on the iterative improvement of attack strategies. The integration of a genetic algorithm allows for the combination and mutation of different attack techniques, leading to the emergence of novel strategies. This approach not only enhances the adaptability of the Attacker but also ensures that the framework can respond effectively to new challenges posed by LLM web agents.

Experimental Results

Extensive experiments conducted across various web tasks demonstrate the efficacy of the Genesis framework. The results indicate that Genesis consistently outperforms existing attack baselines, discovering novel strategies that significantly increase the success rate of attacks. The continuous feedback loop between the Scorer and the Attacker allows for rapid adaptation and improvement, setting Genesis apart from traditional red-teaming methods.

Conclusion

The Genesis framework represents a significant advancement in the field of web agent red-teaming. By focusing on the continuous evolution of attack strategies, it addresses the limitations of existing approaches and provides a robust solution for navigating the complexities of LLM agent security. Our code is available for public use at GitHub, encouraging further research and development in this critical area.


Related AI Insights

Lazarus Omolua
Lazarus Omoluahttps://richlyai.com/blog
My mission is to make sure that people in Africa are not left behind in the global AI revolution. RichlyAI exists to give everyone — students, founders, creators, and businesses — the tools to compete globally.

Subscribe

Popular

More like this
Related

How Business Ops Teams Boost Productivity with Codex

Discover how business operations teams use Codex to streamline documentation, enhance collaboration, and improve decision-making with AI-powered automation...

OpenAI Partners with Malta to Offer ChatGPT Plus Nationwide

OpenAI and Malta team up to provide free ChatGPT Plus access and AI training to all citizens, promoting digital literacy and responsible AI use.

Critical Linux Kernel Flaw Risks SSH Host Key Theft

A critical Linux kernel flaw risks stolen SSH host keys. Learn how to protect your systems and stay secure until patches are widely available.

Top External Hard Drives 2026: Expert Reviews & Buying Guide

Discover the best external hard drives of 2026 with expert reviews. Find top picks for speed, durability, and security to suit all storage needs.