Agri-CPJ: Explainable Pest Diagnosis Without Training

Agri-CPJ: A Training-Free Explainable Framework for Agricultural Pest Diagnosis

In the realm of agricultural technology, accurate pest diagnosis is critical for ensuring crop health and maximizing yield. Recent advancements have introduced a novel framework known as Agri-CPJ (Caption-Prompt-Judge), which addresses significant challenges in crop disease diagnosis through a training-free, few-shot approach. This innovative system leverages large vision-language models to enhance diagnostic accuracy while providing explainability for practitioners.

Understanding the Challenges

The agricultural sector faces substantial hurdles in diagnosing crop diseases, primarily due to two recurring issues:

Model Hallucination: Many models that perform well on benchmark tests often produce inaccurate species names, leading to confusion and misdiagnosis.
Lack of Explainability: When models do provide correct diagnoses, the reasoning behind these predictions is frequently opaque, making it difficult for practitioners to trust or understand the results.

Introducing Agri-CPJ

The Agri-CPJ framework is designed to overcome these challenges by employing a structured, systematic approach to pest diagnosis. The process begins with a large vision-language model that generates a detailed morphological caption based on field photographs. This caption undergoes iterative refinement through multi-dimensional quality gating before any diagnostic questions are addressed.

Once the caption is finalized, the framework generates two potential responses from differing perspectives. An LLM (Large Language Model) acts as a judge to evaluate these responses and selects the most robust one based on domain-specific criteria. This dual-response mechanism ensures that practitioners receive the most reliable and contextually relevant information.

Key Findings and Impact

Research and experimentation with Agri-CPJ have yielded impressive results:

The caption refinement process is crucial, with ablation studies demonstrating that skipping this step consistently degrades accuracy across various models.
When pairing GPT-5-Nano with captions generated by GPT-5-mini, the framework achieved a remarkable +22.7 percentage points improvement in disease classification and a +19.5 points increase in question-and-answer scores compared to no-caption baselines on CDDMBench.
Evaluated on the AgMMU-MCQs without any modifications, GPT-5-Nano reached an accuracy of 77.84%, while Qwen-VL-Chat achieved 64.54%, positioning them among the top-performing open-source models of similar scale.

Enhancing Trust Through Explainability

A standout feature of the Agri-CPJ framework is its commitment to explainability. The structured captions and the rationale provided by the judge create a transparent audit trail. This transparency empowers practitioners; if a diagnosis is disputed, they can refer back to the specific observations that led to the diagnosis, facilitating a better understanding of the decision-making process.

Conclusion

The Agri-CPJ framework represents a significant advancement in agricultural pest diagnosis, combining cutting-edge technology with a focus on explainability. As the agricultural sector continues to embrace digital solutions, tools like Agri-CPJ will play a pivotal role in enhancing the accuracy and reliability of crop disease diagnosis. For those interested in further exploration, the code and data associated with this framework are publicly available at GitHub.

RichlyAI Blog AI Guide, Tutorials, Industrial Insights, & more!

Company

Agri-CPJ: Explainable Pest Diagnosis Without Training

Agri-CPJ: A Training-Free Explainable Framework for Agricultural Pest Diagnosis

Understanding the Challenges

Introducing Agri-CPJ

Key Findings and Impact

Enhancing Trust Through Explainability

Conclusion

Related AI Insights

Subscribe

More like thisRelated

About us

Company

The latest

Subscribe

RichlyAI Blog
AI Guide, Tutorials, Industrial Insights, & more!

More like this
Related