Computer Vision

Meta spatio-temporal debiasing for video scene graph generation (European Conference on Computer Vision (ECCV) 2022)

Open world entity segmentation (IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI) 2022)

Open-vocabulary instance segmentation via robust cross-modal pseudo-labeling (Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2022)

UNISON: Unpaired cross-lingual image captioning (Proceedings of the AAAI Conference on Artificial Intelligence (AAAI) 2022)

Exploiting semantic embedding and visual feature for facial action unit detection (Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2021)

Multi-scale aligned distillation for low-resolution detection (Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2021)

Scene graph generation with external knowledge and image reconstruction (Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2019)

Unpaired Image Captioning via Scene Graph Alignments (Proceedings of the IEEE International Conference on Computer Vision (ICCV) 2019)

Watch It Twice: Video Captioning with a Refocused Video Encoder (Proceedings of the ACM International Conference on Multimedia (MM) 2019)

Look, Imagine and Match: Improving Textual-Visual Cross-Modal Retrieval with Generative Models (Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2018)