Natural Language Processing

A Multi-LLM Debiasing Framework (arXiv 2024)

Commit: Coordinated instruction tuning for multimodal large language models (arXiv 2024)

Customization assistant for text-to-image generation (Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition 2024)

DocScript: Document-level Script Event Prediction (Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING) 2024)

DocSynthv2: A Practical Autoregressive Modeling for Document Generation (arXiv 2024)

LLaVA-Read: Enhancing Reading Ability of Multimodal Language Models (arXiv 2024)

MMR: Evaluating Reading Ability of Large Multimodal Models (arXiv 2024)

Self-Cleaning: Improving a Named Entity Recognizer Trained on Noisy Data with a Few Clean Instances (Findings of the Association for Computational Linguistics: NAACL 2024)

TRINS: Towards Multimodal Language Models that Can Read (Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition 2024)

Toward Infinite-Long Prefix in Transformer (arXiv 2024)