Language Models

TRINS: Towards Multimodal Language Models that Can Read (Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition 2024)

DocEdit: language-guided document editing (Proceedings of the AAAI Conference on Artificial Intelligence (AAAI) 2023)