Publications

Mobile energy storage systems (MESSs) provide mobility and flexibility to enhance distribution system resilience. The paper proposes a …

We, as humans, can easily use our vision and language capabilities to accomplish a wide variety of tasks that combine the image and the …

With the rapid growth of video data and the increasing demands of various applications such as intelligent video search and assistance …

The explosion of video data on the internet requires effective and efficient technology to generate captions automatically for people …

Most existing deep-learning-based image captioning methods are fully supervised models, which require large-scale paired …

Scene graph generation has received growing attention with advances in image understanding tasks such as object detection, attributes …

Image captioning is a multimodal task involving computer vision and natural language processing, where the goal is to learn a mapping …

Textual-visual cross-modal retrieval has been a hot research topic in both computer vision and natural language processing communities. …

Existing image captioning approaches typically train a one-stage sentence decoder, which struggles to generate rich fine-grained …

Language models based on recurrent neural networks have dominated recent image caption generation tasks. In this paper, we introduce a …

In this paper, we provide a broad survey of the recent advances in convolutional neural networks. We detail the improvements of CNN …