Jiuxiang Gu
Home
News
Publications
CV
Yanzhe Zhang
Latest
TRINS: Towards Multimodal Language Models that Can Read (Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2024)
Llavar: Enhanced visual instruction tuning for text-rich image understanding (arXiv 2023)
Cite
×