Search

Jiuxiang Gu

Home
News
Publications
CV

Yanzhe Zhang

Latest

TRINS: Towards Multimodal Language Models that Can Read (Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition 2024)
Llavar: Enhanced visual instruction tuning for text-rich image understanding (arXiv 2023)

Powered by the Academic theme for Hugo.

Cite