Jiuxiang Gu
Home
News
Publications
CV
Natural Language Processing
A Multi-LLM Debiasing Framework (arXiv 2024)
Commit: Coordinated instruction tuning for multimodal large language models (arXiv 2024)
Customization assistant for text-to-image generation (Proceedings of the IEEE onference on Computer Vision and Pattern Recognition 2024)
DocScript: Document-level Script Event Prediction (Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING) 2024)
DocSynthv2: A Practical Autoregressive Modeling for Document Generation (arXiv 2024)
LLaVA-Read: Enhancing Reading Ability of Multimodal Language Models (arXiv 2024)
MMR: Evaluating Reading Ability of Large Multimodal Models (arXiv 2024)
Self-Cleaning: Improving a Named Entity Recognizer Trained on Noisy Data with a Few Clean Instances (Findings of the Association for Computational Linguistics (NAACL) 2024)
TRINS: Towards Multimodal Language Models that Can Read (Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition 2024)
Toward Infinite-Long Prefix in Transformer (arXiv 2024)
»
Cite
×