UniDoc-RL improves visual retrieval-augmented generation in vision-language models, using hierarchical actions and dense rewards to strengthen reasoning.
Explore how prompt-induced hallucinations affect vision-language models and learn strategies to improve accuracy and reduce errors in AI visual understandi...