Sequence Modeling

Toward Infinite-Long Prefix in Transformer (arXiv 2024)

Watch It Twice: Video Captioning with a Refocused Video Encoder (Proceedings of the ACM International Conference on Multimedia (MM) 2019)