Jiuxiang Gu
Home
News
Publications
CV
TRINS: Towards Multimodal Language Models that Can Read (Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition 2024)
Ruiyi Zhang
,
Yanzhe Zhang
,
Jian Chen
,
Yufan Zhou
,
Jiuxiang Gu
,
Changyou Chen
,
Tong Sun
January 2024
Type
Journal article
Publication
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition
Multimodal Learning
Natural Language Processing
Language Models
Machine Learning
Deep Learning
Cite
×