Jiuxiang Gu
Jason Kuen
ImageFolder: Autoregressive Image Generation with Folded Tokens (ICLR 2025)
ADOPD: A Large-Scale Document Page Decomposition Dataset (International Conference on Learning Representations (ICLR) 2024)
SOHES: Self-supervised Open-world Hierarchical Entity Segmentation (International Conference on Learning Representations (ICLR) 2024)
A Critical Analysis of Document Out-of-Distribution Detection (Findings of the Association for Computational Linguistics: EMNLP 2023)
AIMS: All-inclusive multi-level segmentation for anything (Advances in Neural Information Processing Systems (NeurIPS) 2023)
CA-SSL: Class-agnostic semi-supervised learning for detection and segmentation (European Conference on Computer Vision (ECCV) 2022)
Improving the reliability for confidence estimation (European Conference on Computer Vision (ECCV) 2022)
Learning adaptive axis attentions in fine-tuning: Beyond fixed sparse attention patterns (Findings of the Association for Computational Linguistics: ACL 2022)
Meta spatio-temporal debiasing for video scene graph generation (European Conference on Computer Vision (ECCV) 2022)
Open world entity segmentation (IEEE Transactions on Pattern Analysis and Machine Intelligence 2022)
Open-vocabulary instance segmentation via robust cross-modal pseudo-labeling (Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition 2022)
Multi-scale aligned distillation for low-resolution detection (Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition 2021)
SelfDoc: Self-supervised document representation learning (Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition 2021)
UniDoc: Unified pretraining framework for document understanding (Advances in Neural Information Processing Systems (NeurIPS) 2021)
Self-supervised relationship probing (Advances in Neural Information Processing Systems (NeurIPS) 2020)
Recent advances in convolutional neural networks (Pattern Recognition 2018)