Jiuxiang Gu
Home
News
Publications
CV
Changyou Chen
Latest
LLaVA-Read: Enhancing Reading Ability of Multimodal Language Models (arXiv 2024)
MMR: Evaluating Reading Ability of Large Multimodal Models (arXiv 2024)
TRINS: Towards Multimodal Language Models that Can Read (Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2024)
Tigan: Text-based interactive image generation and manipulation (Proceedings of the AAAI Conference on Artificial Intelligence (AAAI) 2022)
Towards language-free training for text-to-image generation (Proceedings of the IEEE conference on computer vision and pattern recognition (CVPR 2022)
Cite
×