Orchard: Open-Source Framework for Agentic AI Modeling

Orchard: An Open-Source Agentic Modeling Framework

In the rapidly evolving landscape of artificial intelligence, the emergence of autonomous agents capable of complex task execution is gaining significant attention. The recently released paper on Orchard, an open-source agentic modeling framework, marks a pivotal advancement in this domain. The paper, titled “Orchard: An Open-Source Agentic Modeling Framework” (arXiv:2605.15040v1), outlines the framework’s capabilities and its potential to reshape how large language models (LLMs) are utilized in autonomous applications.

Understanding Agentic Modeling

Agentic modeling is a transformative approach that seeks to enhance LLMs into self-sufficient agents capable of:

Planning
Reasoning
Tool use
Multi-turn interaction with various environments

Despite extensive investment in this area, open research has faced constraints related to infrastructure and training. Most high-performance systems are built on proprietary codebases, models, or services, while prevalent open-source frameworks primarily focus on orchestration and evaluation rather than on scalable agent training.

Introducing Orchard

Orchard seeks to bridge this gap by providing a robust, open-source framework for scalable agentic modeling. At the heart of Orchard is the Orchard Env, a lightweight environment service designed to facilitate sandbox lifecycle management across different task domains, agent harnesses, and pipeline stages. This innovative approach not only enhances the flexibility of agentic modeling but also promotes the reuse of agentic data and training methodologies.

Key Features and Components

Orchard incorporates three distinct agentic modeling recipes, which are tailored for specific applications:

Orchard-SWE: This recipe is designed for coding agents. It leverages 107,000 trajectories distilled from MiniMax-M2.5 and Qwen3.5-397B. By introducing a credit-assignment supervised fine-tuning (SFT) method, Orchard-SWE learns from productive segments of unresolved trajectories. It further applies Balanced Adaptive Rollout for reinforcement learning (RL). The results have been impressive, achieving:

64.3% on SWE-bench Verified post-SFT
67.5% after SFT+RL

Setting a new state of the art among open-source models of comparable size.
Orchard-GUI: This component focuses on training a 4B vision-language computer-use agent, utilizing only 400 distilled trajectories and 2,200 open-ended tasks. Its success rates are noteworthy, achieving:

74.1% on WebVoyager
67.0% on Online-Mind2Web
64.0% on DeepShop

Establishing it as the strongest open-source model while remaining competitive with proprietary systems.
Orchard-Claw: Targeting personal assistant agents, Orchard-Claw is trained with just 200 synthetic tasks, yet it achieves:

59.6% pass@3 on Claw-Eval
73.9% when paired with a stronger ZeroClaw harness.

Conclusion

The results from Orchard demonstrate that a lightweight, open, harness-agnostic environment layer can effectively enable reusable agentic data, training recipes, and evaluation across various domains. As the field of autonomous agents continues to advance, frameworks like Orchard are poised to play a crucial role in democratizing access to powerful agentic modeling capabilities, fostering innovation, and enhancing collaboration within the AI research community.

RichlyAI Blog AI Guide, Tutorials, Industrial Insights, & more!

Company

Orchard: Open-Source Framework for Agentic AI Modeling

Orchard: An Open-Source Agentic Modeling Framework

Understanding Agentic Modeling

Introducing Orchard

Key Features and Components

Conclusion

Related AI Insights

Subscribe

More like thisRelated

About us

Company

The latest

Subscribe

RichlyAI Blog
AI Guide, Tutorials, Industrial Insights, & more!

More like this
Related