Discover UHR-BAT, a budget-aware token compression model enhancing ultra-high-resolution remote sensing with efficient, accurate vision-language processing...
Enhance generation in Unified Multimodal Models using UniRect-CoT, a reflective rectification framework inspired by human cognition for better AI outputs.
Explore self-supervised learning and predictive representation techniques to enhance AI models with robust, predictive capabilities from unlabeled data.