ARES-LSHADE: Advanced Evolutionary Algorithm for GNBG

ARES-LSHADE: Autoresearch-Enhanced LSHADE with Memetic Polish for the GNBG Benchmark

In a significant development in the field of evolutionary algorithms, researchers have introduced ARES-LSHADE, a novel memetic differential-evolution variant. This algorithm was submitted to the GECCO 2026 competition, focusing on LLM-designed evolutionary algorithms for the Generalized Numerical Benchmark Generator (GNBG). The introduction of ARES-LSHADE builds upon the success of the LLM-LSHADE algorithm, which won the 2025 competition.

Key Contributions of ARES-LSHADE

ARES-LSHADE incorporates two innovative components that enhance its performance in tackling complex optimization problems:

Scout-augmented Mutation Operator: This feature integrates adaptive Covariance Matrix Adaptation Evolution Strategy (CMA-ES) into the mutation process. It was developed through an autonomous research loop that conducted approximately thirty LLM-driven design experiments, allowing for a more adaptive and effective mutation strategy.
Multi-start L-BFGS-B Polish Phase: This phase is designed to maintain strict blackbox treatment of the benchmark while refining solutions. It effectively polishes the results obtained during the initial optimization stages, ensuring higher precision without compromising the integrity of the evaluation process.

Impressive Performance Metrics

During the official evaluation consisting of 31 runs per function, ARES-LSHADE achieved remarkable results. It secured victories in 510 out of 744 instances, with the per-function gap remaining below 1e-8. Notably, the algorithm reached machine precision on 18 out of 24 functions tested. However, the remaining six functions displayed characteristic plateau signatures, a behavior consistent with the compositional structure of the GNBG. These challenging functions were identified by the autoresearch loop as the most difficult in the suite.

Methodological Observations

Beyond the algorithm’s performance, the research also yielded two critical methodological insights:

LLM-driven Research Loop Observations: The study found that an LLM-driven research loop, when restricted to an operator-only edit surface and a fitness-only observation space, tends to converge to a characteristic plateau on the GNBG benchmark. This suggests that certain constraints can lead to predictable outcomes in evolutionary algorithm design.
Observation Space Expansion Impact: Initially, researchers expanded the observation space to include the benchmark’s compositional metadata. This adjustment led to the algorithm trivially solving all 24 functions, but it also violated the competition’s blackbox rule. The team recognized this issue before submission, highlighting the importance of adhering to competition guidelines while harnessing LLM capabilities.

Future Considerations in LLM-driven Optimization

The tension observed between the capabilities of LLMs and the integrity of benchmark evaluations raises important questions for future research in LLM-driven optimization algorithms. As the field evolves, it is crucial to balance the potential of advanced AI models with the foundational principles of fair and rigorous testing standards.

For those interested in exploring ARES-LSHADE further, code and reproducibility artifacts can be accessed at this GitHub repository.

RichlyAI Blog AI Guide, Tutorials, Industrial Insights, & more!

Company

ARES-LSHADE: Advanced Evolutionary Algorithm for GNBG

ARES-LSHADE: Autoresearch-Enhanced LSHADE with Memetic Polish for the GNBG Benchmark

Key Contributions of ARES-LSHADE

Impressive Performance Metrics

Methodological Observations

Future Considerations in LLM-driven Optimization

Related AI Insights

Subscribe

More like thisRelated

About us

Company

The latest

Subscribe

RichlyAI Blog
AI Guide, Tutorials, Industrial Insights, & more!

More like this
Related