Publications

NeurIPS 2025

INST-IT: Boosting Multimodal Instance Understanding via Explicit Visual Prompt Instruction Tuning.

Wujian Peng, Lingchen Meng, Yitong Chen, Yiweng Xie, Yang Liu, Tao Gui, Hang Xu, Xipeng Qiu, Zuxuan Wu, Yu-Gang Jiang

Advances in Neural Information Processing Systems (NeurIPS), San Diego, USA, 2025

NeurIPS 2025

Seg2Any: Open-set Segmentation-Mask-to-Image Generation with Precise Shape and Semantic Control.

Danfeng Li, Hui Zhang, Sheng Wang, Jiacheng Li, Zuxuan Wu

Advances in Neural Information Processing Systems (NeurIPS), San Diego, USA, 2025

NeurIPS 2025

OmniGen-AR: AutoRegressive Any-to-Image Generation.

Junke Wang, Xun Wang, Qiushan Guo, Peize Sun, Weilin Huang, Zuxuan Wu, Yu-Gang Jiang

Advances in Neural Information Processing Systems (NeurIPS), San Diego, USA, 2025

NeurIPS 2025

ForgerySleuth: Empowering Multimodal Large Language Models for Image Manipulation Detection.

Zhihao Sun, Haoran Jiang, Haoran Chen, Yixin Cao, Xipeng Qiu, Zuxuan Wu, Yu-Gang Jiang

Advances in Neural Information Processing Systems (NeurIPS), San Diego, USA, 2025

NeurIPS 2025

UniGen: Enhanced Training & Test-Time Strategies for Unified Multimodal Understanding and Generation.

Rui Tian, Mingfei Gao, Mingze Xu, Jiaming Hu, Jiasen Lu, Zuxuan Wu, Yinfei Yang, Afshin Dehghan

Advances in Neural Information Processing Systems (NeurIPS), San Diego, USA, 2025

NeurIPS 2025

OmniSVG: A Unified Scalable Vector Graphics Generation Model.

Yiying Yang, Wei Cheng, Sijin Chen, Xianfang Zeng, Fukun Yin, Jiaxu Zhang, Liao Wang, Gang Yu, Xingjun Ma, Yu-Gang Jiang

Advances in Neural Information Processing Systems (NeurIPS), San Diego, USA, 2025

NeurIPS 2025

Domain-RAG: Retrieval-Guided Compositional Image Generation for Cross-Domain Few-Shot Object Detection.

Yu Li, Xingyu Qiu, Yuqian Fu, Jie Chen, Tianwen Qian, Xu Zheng, Danda Pani Paudel, Yanwei Fu, Xuanjing Huang, Luc Van Gool, Yu-Gang Jiang

Advances in Neural Information Processing Systems (NeurIPS), San Diego, USA, 2025

NeurIPS 2025

SAMA: Towards Multi-Turn Referential Grounded Video Chat with Large Language Models.

Ye Sun, Hao Zhang, Henghui Ding, Tiehua Zhang, Xingjun Ma, Yu-Gang Jiang

Advances in Neural Information Processing Systems (NeurIPS), San Diego, USA, 2025

NeurIPS 2025

Learning 3D Anisotropic Noise Distributions Improves Molecular Force Fields.

Xixian Liu, Rui Jiao, Zhiyuan Liu, Yurou Liu, Yang Liu, Ziheng Lu, Wenbing Huang, Yang Zhang, Yixin Cao

Advances in Neural Information Processing Systems (NeurIPS), San Diego, USA, 2025

NeurIPS 2025

JailBound: Jailbreaking Internal Safety Boundaries of Vision-Language Models.

Jiaxin Song, Yixu Wang, Jie Li, Xuan Tong, Rui Yu, Yan Teng, Xingjun Ma, Yingchun Wang

Advances in Neural Information Processing Systems (NeurIPS), San Diego, USA, 2025

NeurIPS 2025

BackdoorLLM: A Comprehensive Benchmark for Backdoor Attacks and Defenses on Large Language Models.

Yige Li, Hanxun Huang, Yunhan Zhao, Xingjun Ma, Jun Sun

Advances in Neural Information Processing Systems (NeurIPS), San Diego, USA, 2025

NeurIPS 2025

SafeVid: Toward Safety Aligned Video Large Multimodal Models.

Yixu Wang, Jiaxin Song, Yifeng Gao, Xin Wang, Yang Yao, Yan Teng, Xingjun Ma, Yingchun Wang, Yu-Gang Jiang

Advances in Neural Information Processing Systems (NeurIPS), San Diego, USA, 2025

NeurIPS 2025

TP-MDDN: Task-Preferenced Multi-Demand-Driven Navigation with Autonomous Decision-Making.

Shanshan Li, Da Huang, Yu He, Yanwei Fu, Yu-Gang Jiang, Xiangyang Xue

Advances in Neural Information Processing Systems (NeurIPS), San Diego, USA, 2025

NeurIPS 2025

HumanoidGen: Data Generation for Bimanual Dexterous Manipulation via LLM Reasoning.

Zhi Jing, Siyuan Yang, Jicong Ao, Ting Xiao, Yu-Gang Jiang, Chenjia Bai

Advances in Neural Information Processing Systems (NeurIPS), San Diego, USA, 2025

ICCV 2025

From Holistic to Localized: Local Enhanced Adapters for Efficient Visual Instruction Fine-Tuning.

Pengkun Jiao, Bin Zhu, Jingjing Chen, Chong-Wah Ngo, Yu-Gang Jiang

International Conference on Computer Vision (ICCV), 2025.

ICCV 2025

Towards Omnimodal Expressions and Reasoning in Referring Audio-Visual Segmentation.

Kaining Ying, Henghui Ding, Guangquan Jie, Yu-Gang Jiang

International Conference on Computer Vision (ICCV), 2025.

ICCV 2025

VLABench: A Large-Scale Benchmark for Language-Conditioned Robotics Manipulation with Long-Horizon Reasoning Tasks.

Shiduo Zhang, Zhe Xu, Peiju Liu, Xiaopeng Yu, Qinghui Gao, Yuan Li, Zhaoye Fei, Zhangyue Yin, Zuxuan Wu, Yu-Gang Jiang, Xipeng Qiu

International Conference on Computer Vision (ICCV), 2025.

ICCV 2025

SVTRv2: CTC Beats Encoder-Decoder Models in Scene Text Recognition.

Yongkun Du, Zhineng Chen, Hongtao Xie, Caiyan Jia, Yu-Gang Jiang

International Conference on Computer Vision (ICCV), 2025.

ICCV 2025

MotionFollower: Editing Video Motion via Score-Guided Diffusion.

Shuyuan Tu, Qi Dai, Zihao Zhang, Sicheng Xie, Zhi-Qi Cheng, Chong Luo, Xintong Han, Zuxuan Wu, Yu-Gang Jiang

International Conference on Computer Vision (ICCV), 2025.

ICCV 2025

IDEATOR: Jailbreaking and Benchmarking Large Vision-Language Models Using Themselves.

Ruofan Wang, Juncheng Li, Yixu Wang, Bo Wang, Xiaosen Wang, Yan Teng, Yingchun Wang, Xingjun Ma, Yu-Gang Jiang

International Conference on Computer Vision (ICCV), 2025.

ICCV 2025

Achieving More with Less: Additive Prompt Tuning for Rehearsal-Free Class-Incremental Learning.

Haoran Chen, Ping Wang, Zihan Zhou, Xu Zhang, Zuxuan Wu, Yu-Gang Jiang

International Conference on Computer Vision (ICCV), 2025.

ICCV 2025

AID: Adapting Image2Video Diffusion Models for Instruction-guided Video Prediction.

Zhen Xing, Qi Dai, Zejia Weng, Zuxuan Wu, Yu-Gang Jiang

International Conference on Computer Vision (ICCV), 2025.

ICCV 2025

REDUCIO! Generating 1K Video within 16 Seconds using Extremely Compressed Motion Latents.

Rui Tian, Qi Dai, Jianmin Bao, Kai Qiu, Yifan Yang, Chong Luo, Zuxuan Wu, Yu-Gang Jiang

International Conference on Computer Vision (ICCV), 2025.

ICCV 2025

CreatiLayout: Siamese Multimodal Diffusion Transformer for Creative Layout-to-Image Generation.

Hui Zhang, Dexiang Hong, Yitong Wang, Jie Shao, Xinglong Wu, Zuxuan Wu, Yu-Gang Jiang

International Conference on Computer Vision (ICCV), 2025.

CSUR 2025

A Survey on Video Diffusion Models.

Zhen Xing, Qijun Feng, Haoran Chen, Qi Dai, Han Hu, Hang Xu, Zuxuan Wu, Yu-Gang Jiang

ACM Computing Surveys (CSUR), 2025.

TPAMI 2025

Dynamic Routing and Knowledge Re-Learning for Data-Free Black-Box Attack.

Xuelin Qian, Wenxuan Wang, Yu-Gang Jiang, Xiangyang Xue, Yanwei Fu

IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2025.

ICLR 2025

Adaptive Retention & Correction for Continual Learning.

Haoran Chen, Micah Goldblum, Zuxuan Wu, Yu-Gang Jiang

International Conference on Learning Representations (ICLR), Singapore, 2025

ICLR 2025

BlueSuffix: Reinforced Blue Teaming for Vision-Language Models Against Jailbreak Attacks.

Yunhan Zhao, Xiang Zheng, Lin Luo, Yige Li, Xingjun Ma, Yu-Gang Jiang

International Conference on Learning Representations (ICLR), Singapore, 2025

AAAI 2025

Comprehensive Multi-Modal Prototypes are Simple and Effective Classifiers for Vast-Vocabulary Object Detection.

Yitong Chen, Wenhao Yao, Lingchen Meng, Sihong Wu, Zuxuan Wu, Yu-Gang Jiang

The 39th AAAI Conference on Artificial Intelligence (AAAI), Philadelphia, USA, 2025.

AAAI 2025

Explicit Relational Reasoning Network for Scene Text Detection.

Yuchen Su, Zhineng Chen, Yongkun Du, Zhilong Ji, Kai Hu, Jinfeng Bai, Xieping Gao

The 39th AAAI Conference on Artificial Intelligence (AAAI), Philadelphia, USA, 2025.

AAAI 2025

FOCUS: Towards Universal Foreground Segmentation.

Zuyao You, Lingyu Kong, Lingchen Meng, Zuxuan Wu

The 39th AAAI Conference on Artificial Intelligence (AAAI), Philadelphia, USA, 2025.

AAAI 2025

Out of Length Text Recognition with Sub-String Matching.

Yongkun Du, Zhineng Chen, Caiyan Jia, Xieping Gao, Yu-Gang Jiang

The 39th AAAI Conference on Artificial Intelligence (AAAI), Philadelphia, USA, 2025.

AAAI 2025

FaceA-Net: Facial Attribute-Driven ID Preserving Image Generation Network.

Jiayu Wang, Yue Yu, Jingjing Chen, Qi Dai, Yu-Gang Jiang

The 39th AAAI Conference on Artificial Intelligence (AAAI), Philadelphia, USA, 2025.

AAAI 2025

AdaDiff: Adaptive Step Selection for Fast Diffusion Models.

Hui Zhang, Zuxuan Wu, Zhen Xing, Jie Shao, Yu-Gang Jiang

The 39th AAAI Conference on Artificial Intelligence (AAAI), Philadelphia, USA, 2025.

AAAI 2025

S^3cMath: Spontaneous Step-level Self-correction Makes Large Language Models Better Mathematical Reasoners.

Yuchen Yan, Jin Jiang, Yang Liu, Yixin Cao, Xin Xu, Mengdi Zhang, Xunliang Cai, Jian Shao

The 39th AAAI Conference on Artificial Intelligence (AAAI), Philadelphia, USA, 2025.

AAAI 2025

Simulation-Free Hierarchical Latent Policy Planning for Proactive Dialogues.

Tao He, Lizi Liao, Yixin Cao, Yuanxing Liu, Yiheng Sun, Zerui Chen, Ming Liu, Bing Qin

The 39th AAAI Conference on Artificial Intelligence (AAAI), Philadelphia, USA, 2025.

AAAI 2025

DuMo: Dual Encoder Modulation Network for Precise Concept Erasure.

Feng Han, Kai Chen, Chao Gong, Zhipeng Wei, Jingjing Chen, Yu-Gang Jiang

The 39th AAAI Conference on Artificial Intelligence (AAAI), Philadelphia, USA, 2025.

AAAI 2025

AIM: Additional Image Guided Generation of Transferable Adversarial Attacks.

Teng Li, Xingjun Ma, Yu-Gang Jiang

The 39th AAAI Conference on Artificial Intelligence (AAAI), Philadelphia, USA, 2025.

AAAI 2025

HoneypotNet: Backdoor Attacks Against Model Extraction.

Yixu Wang, Tianle Gu, Yan Teng, Yingchun Wang, Xingjun Ma

The 39th AAAI Conference on Artificial Intelligence (AAAI), Philadelphia, USA, 2025.

AAAI 2025

CALM: Curiosity-Driven Auditing for Large Language Models.

Xiang Zheng, Longxiang Wang, Yi Liu, Xingjun Ma, Chao Shen, Cong Wang

The 39th AAAI Conference on Artificial Intelligence (AAAI), Philadelphia, USA, 2025.

Springer 2024

Deep Learning for Video Understanding.

Zuxuan Wu, Yu-Gang Jiang

Springer, 2024.

ACL 2024

AnyGPT: Unified Multimodal LLM with Discrete Sequence Modeling.

Jun Zhan, Junqi Dai, Jiasheng Ye, Yunhua Zhou, Dong Zhang, Zhigeng Liu, Xin Zhang, Ruibin Yuan, Ge Zhang, Linyang Li, Hang Yan, Jie Fu, Tao Gui, Tianxiang Sun, Yu-Gang Jiang, Xipeng Qiu

Association for Computational Linguistics (ACL), Bangkok, Thailand, 2024.

ACM MM 2024

Navigating Weight Prediction with Diet Diary.

Yinxuan Gui, Bin Zhu, Jingjing Chen, Chong-Wah Ngo, Yu-Gang Jiang

ACM International Conference on Multimedia (ACM MM), 2024.

ACM MM 2024

Highly Transferable Diffusion-based Unrestricted Adversarial Attack on Pre-trained Vision-Language Models.

Wenzhuo Xu, Kai Chen, Ziyi Gao, Zhipeng Wei, Jingjing Chen, Yu-Gang Jiang

ACM International Conference on Multimedia (ACM MM), 2024.

ACM MM 2024

Identity-Driven Multimedia Forgery Detection via Reference Assistance.

Junhao Xu, Jingjing Chen, Xue Song, Feng Han, Haijun Shan, Yu-Gang Jiang

ACM International Conference on Multimedia (ACM MM), 2024.

ACM MM 2024

ReToMe-VA: Recursive Token Merging for Video Diffusion-based Unrestricted Adversarial Attack.

Ziyi Gao, Kai Chen, Zhipeng Wei, Tingshu Mou, Jingjing Chen, Zhiyu Tan, Hao Li, Yu-Gang Jiang

ACM International Conference on Multimedia (ACM MM), 2024.

ACM MM 2024

Decoder Pre-Training with only Text for Scene Text Recognition.

Shuai Zhao, Yongkun Du, Zhineng Chen, Yu-Gang Jiang

ACM International Conference on Multimedia (ACM MM), 2024.

ACM MM 2024

AdvQDet: Detecting Query-Based Adversarial Attacks with Adversarial Contrastive Prompt Tuning.

Xin Wang, Kai Chen, Xingjun Ma, Zhineng Chen, Jingjing Chen, Yu-Gang Jiang

ACM International Conference on Multimedia (ACM MM), 2024.

ACM MM 2024

White-box Multimodal Jailbreaks Against Large Vision-Language Models.

Ruofan Wang, Xingjun Ma, Hanxu Zhou, Chuanjun Ji, Guangnan Ye, Yu-Gang Jiang

ACM International Conference on Multimedia (ACM MM), 2024.

ACM MM 2024

FreeEnhance: Tuning-Free Image Enhancement via Content-Consistent Noising-and-Denoising Process.

Yang Luo, Yiheng Zhang, Zhaofan Qiu, Ting Yao, Zhineng Chen, Yu-Gang Jiang, Tao Mei

ACM International Conference on Multimedia (ACM MM), 2024.

ACM MM 2024

ModelLock: Locking Your Model With a Spell.

Yifeng Gao, Yuhua Sun, Xingjun Ma, Zuxuan Wu, Yu-Gang Jiang

ACM International Conference on Multimedia (ACM MM), 2024.

NAACL 2024

Fake Alignment: Are LLMs Really Aligned Well?

Yixu Wang, Yan Teng, Kexin Huang, Chengqi Lyu, Songyang Zhang, Wenwei Zhang, Xingjun Ma, Yu-Gang Jiang, Yu Qiao, Yingchun Wang

North American Chapter of the Association for Computational Linguistics (NAACL), Mexico City, 2024.

NeurIPS 2024

DeepStack: Deeply Stacking Visual Tokens is Surprisingly Simple and Effective for LMMs.

Lingchen Meng, Jianwei Yang, Rui Tian, Xiyang Dai, Zuxuan Wu, Jianfeng Gao, Yu-Gang Jiang

Advances in Neural Information Processing Systems (NeurIPS), Vancouver, Canada, 2024.

NeurIPS 2024

Lumen: Unleashing Versatile Vision-Centric Capabilities of Large Multimodal Models.

Yang Jiao, Shaoxiang Chen, Zequn Jie, Jingjing Chen, Lin Ma, Yu-Gang Jiang

Advances in Neural Information Processing Systems (NeurIPS), Vancouver, Canada, 2024.

NeurIPS 2024

MMLongBench-Doc: Benchmarking Long-context Document Understanding with Visualizations.

Yubo Ma, Yuhang Zang, Liangyu Chen, Meiqi Chen, Yizhu Jiao, Xinze Li, Xinyuan Lu, Ziyu Liu, Yan Ma, Xiaoyi Dong, Pan Zhang, Liangming Pan, Yu-Gang Jiang, Jiaqi Wang, Yixin Cao, Aixin Sun

Advances in Neural Information Processing Systems (NeurIPS), Vancouver, Canada, 2024.

NeurIPS 2024

Automating Dataset Updates Towards Reliable and Timely Evaluation of Large Language Models.

Jiahao Ying, Yixin Cao, Yushi Bai, Qianru Sun, Bo Wang, Wei Tang, Zhaojun Ding, Yizhe Yang, Xuanjing Huang, Shuicheng Yan

Advances in Neural Information Processing Systems (NeurIPS), Vancouver, Canada, 2024.

NeurIPS 2024

GenRec: Unifying Video Generation and Recognition with Diffusion Models.

Zejia Weng, Xitong Yang, Zhen Xing, Zuxuan Wu, Yu-Gang Jiang

Advances in Neural Information Processing Systems (NeurIPS), Vancouver, Canada, 2024.

NeurIPS 2024

OmniTokenizer: A Joint Image-Video Tokenizer for Visual Generation.

Junke Wang, Yi Jiang, Zehuan Yuan, Binyue Peng, Zuxuan Wu, Yu-Gang Jiang

Advances in Neural Information Processing Systems (NeurIPS), Vancouver, Canada, 2024.

NeurIPS 2024

Knowledge Graph Completion by Intermediate Variables Regularization.

Changyi Xiao, Yixin Cao

Advances in Neural Information Processing Systems (NeurIPS), Vancouver, Canada, 2024.

NeurIPS 2024

UnSeg: One Universal Unlearnable Example Generator is Enough against All Image Segmentation.

Ye Sun, Hao Zhang, Tiehua Zhang, Xingjun Ma, Yu-Gang Jiang

Advances in Neural Information Processing Systems (NeurIPS), Vancouver, Canada, 2024.

ECCV 2024

MagDiff: Multi-Alignment Diffusion for High-Fidelity Video Generation and Editing.

Haoyu Zhao, Tianyi Lu, Jiaxi Gu, Xing Zhang, Qingping Zheng, Zuxuan Wu, Hang Xu, Yu-Gang Jiang

European Conference on Computer Vision (ECCV), Milano, Italy, 2024.

ECCV 2024

DreamMesh: Jointly Manipulating and Texturing Triangle Meshes for Text-to-3D Generation.

Haibo Yang, Yang Chen, Yingwei Pan, Ting Yao, Zhineng Chen, Zuxuan Wu, Yu-Gang Jiang, Tao Mei

European Conference on Computer Vision (ECCV), Milano, Italy, 2024.

ECCV 2024

Reliable and Efficient Concept Erasure of Text-to-Image Diffusion Models.

Chao Gong*, Kai Chen*, Zhipeng Wei, Jingjing Chen, Yu-Gang Jiang

European Conference on Computer Vision (ECCV), Milano, Italy, 2024.

ECCV 2024

Unlocking Textual and Visual Wisdom: Open-Vocabulary 3D Object Detection Enhanced by Comprehensive Guidance from Text and Image.

Pengkun Jiao, Na Zhao, Jingjing Chen, Yu-Gang Jiang

European Conference on Computer Vision (ECCV), Milano, Italy, 2024.

ECCV 2024

PromptFusion: Decoupling Stability and Plasticity for Continual Learning.

Haoran Chen, Zuxuan Wu, Xintong Han, Menglin Jia, Yu-Gang Jiang

European Conference on Computer Vision (ECCV), Milano, Italy, 2024.

ECCV 2024

SegIC: Unleashing the Emergent Correspondence for In-Context Segmentation.

Lingchen Meng, Shiyi Lan, Hengduo Li, Jose M. Alvarez, Zuxuan Wu, Yu-Gang Jiang

European Conference on Computer Vision (ECCV), Milano, Italy, 2024.

ECCV 2024

Adversarial Prompt Tuning for Vision-Language Models.

Jiaming Zhang, Xingjun Ma, Xin Wang, Lingyu Qiu, Jiaqi Wang, Yu-Gang Jiang, Jitao Sang

European Conference on Computer Vision (ECCV), Milano, Italy, 2024.

ECCV 2024

Improving Text-guided Object Inpainting with Semantic Pre-inpainting.

Yifu Chen, Jingwen Chen, Yingwei Pan, Yehao Li, Ting Yao, Zhineng Chen, Tao Mei

European Conference on Computer Vision (ECCV), Milano, Italy, 2024.

CVPR 2024

Doubly Abductive Counterfactual Inference for Text-based Image Editing.

Xue Song, Jiequan Cui, Hanwang Zhang, Jingjing Chen, Richang Hong, Yu-Gang Jiang

IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Seattle WA, USA, 2024.

CVPR 2024

OmniVid: A Generative Framework for Universal Video Understanding.

Junke Wang, Dongdong Chen, Chong Luo, Bo He, Lu Yuan, Zuxuan Wu, Yu-Gang Jiang

IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Seattle WA, USA, 2024.

CVPR 2024

MotionEditor: Editing Video Motion via Content-Aware Diffusion.

Shuyuan Tu, Qi Dai, Zhi-Qi Cheng, Han Hu, Xintong Han, Zuxuan Wu, Yu-Gang Jiang

IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Seattle WA, USA, 2024.

CVPR 2024

Learning to Rank Patches for Unbiased Image Redundancy Reduction.

Yang Luo, Zhineng Chen, Peng Zhou, Zuxuan Wu, Xieping Gao, Yu-Gang Jiang

IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Seattle WA, USA, 2024.

CVPR 2024

SimDA: Simple Diffusion Adapter for Efficient Video Generation.

Zhen Xing, Qi Dai, Han Hu, Zuxuan Wu, Yu-Gang Jiang

IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Seattle WA, USA, 2024.

AAAI 2024

Instance-Aware Multi-Camera 3D Object Detection with Structural Priors Mining and Self-Boosting Learning.

Yang Jiao, Zequn Jie, Shaoxiang Chen, Lechao Cheng, Jingjing Chen, Lin Ma, Yu-Gang Jiang

The 38th AAAI Conference on Artificial Intelligence (AAAI), Vancouver, Canada, 2024.

AAAI 2024

NuScenes-QA: A Multi-Modal Visual Question Answering Benchmark for Autonomous Driving Scenario.

Tianwen Qian, Jingjing Chen, Linhai Zhuo, Yang Jiao, Yu-Gang Jiang

The 38th AAAI Conference on Artificial Intelligence (AAAI), Vancouver, Canada, 2024.

AAAI 2024

LRANet: Towards Accurate and Efficient Scene Text Detection with Low-Rank Approximation Network.

Yuchen Su, Zhineng Chen, Zhiwen Shao, Yuning Du, Zhilong Ji, Jinfeng Bai, Yong Zhou, Yu-Gang Jiang

The 38th AAAI Conference on Artificial Intelligence (AAAI), Vancouver, Canada, 2024.

Machine Intelligence Research 2024

MOSS: An Open Conversational Large Language Model.

Tianxiang Sun, Xiaotian Zhang, Zhengfu He, Peng Li, Qinyuan Cheng, Xiangyang Liu, Hang Yan, Yunfan Shao, Qiong Tang, Shiduo Zhang, Xingjian Zhao, Ke Chen, Yining Zheng, Zhejian Zhou, Ruixiao Li, Jun Zhan, Yunhua Zhou, Linyang Li, Xiaogui Yang, Lingling Wu, Zhangyue Yin, Xuanjing Huang, Yu-Gang Jiang, Xipeng Qiu

Machine Intelligence Research, 2024.

TPAMI 2024

Building an Open-Vocabulary Video CLIP Model with Better Architectures, Optimization and Data.

Zuxuan Wu, Zejia Weng, Wujian Peng, Xitong Yang, Ang Li, Larry S. Davis, Yu-Gang Jiang

IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2024.

TPAMI 2024

Adaptive Cross-Modal Transferable Adversarial Attacks From Images to Videos.

Zhipeng Wei, Jingjing Chen, Zuxuan Wu, Yu-Gang Jiang

IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2024.

TMM 2024

Locate Before Answering: Answer Guided Question Localization for Video Question Answering.

Tianwen Qian, Ran Cui, Jingjing Chen, Pai Peng, Xiaowei Guo, Yu-Gang Jiang

IEEE Transactions on Multimedia (TMM), vol. 26, pp. 4554-4563, 2024.

TMM 2024

From Canteen Food to Daily Meals: Generalizing Food Recognition to More Practical Scenarios.

Guoshan Liu, Yang Jiao, Jingjing Chen, Bin Zhu, Yu-Gang Jiang

IEEE Transactions on Multimedia (TMM), pp. 1-10, 2024.

Machine Learning 2024

Imbalanced gradients: a subtle cause of overestimated adversarial robustness.

Xingjun Ma, Linxi Jiang, Hanxun Huang, Zejia Weng, James Bailey, Yu-Gang Jiang

Machine Learning, 113, 2301-2326 (2024).

TOMM 2024

HCMS: Hierarchical and Conditional Modality Selection for Efficient Video Recognition.

Zejia Weng, Zuxuan Wu, Hengduo Li, Jingjing Chen, Yu-Gang Jiang

ACM Transactions on Multimedia Computing, Communications and Applications (TOMM), vol. 20, issue 2, pp. 35:1-35:18, 2024.

IJCV 2024

CDistNet: Perceiving Multi-domain Character Distance for Robust Text Recognition.

Tianlun Zheng, Zhineng Chen, Shancheng Fang, Hongtao Xie, Yu-Gang Jiang

International Journal of Computer Vision (IJCV), vol. 132, issue 2, pp. 300-318, 2024.

CVPR 2023

Look Before You Match: Instance Understanding Matters in Video Object Segmentation.

Junke Wang, Dongdong Chen, Zuxuan Wu, Chong Luo, Chuanxin Tang, Xiyang Dai, Yucheng Zhao, Yujia Xie, Lu Yuan, Yu-Gang Jiang

IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Vancouver, Canada, 2023.

CVPR 2023

Masked Video Distillation: Rethinking Masked Feature Modeling for Self-supervised Video Representation Learning.

Rui Wang, Dongdong Chen, Zuxuan Wu, Yinpeng Chen, Xiyang Dai, Mengchen Liu, Lu Yuan, Yu-Gang Jiang

IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Vancouver, Canada, 2023.

CVPR 2023

SVFormer: Semi-Supervised Video Transformer for Action Recognition.

Zhen Xing, Qi Dai, Han Hu, Jingjing Chen, Zuxuan Wu, Yu-Gang Jiang

IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Vancouver, Canada, 2023.

CVPR 2023

MSMDFusion: Fusing LiDAR and Camera at Multiple Scales with Multi-Depth Seeds for 3D Object Detection.

Yang Jiao, Zequn Jie, Shaoxiang Chen, Jingjing Chen, Lin Ma, Yu-Gang Jiang

IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Vancouver, Canada, 2023.

CVPR 2023

Detection Hub: Unifying Object Detection Datasets via Query Adaptation on Language Embedding.

Lingchen Meng, Xiyang Dai, Yinpeng Chen, Pengchuan Zhang, Dongdong Chen, Mengchen Liu, Jianfeng Wang, Zuxuan Wu, Lu Yuan, Yu-Gang Jiang

IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Vancouver, Canada, 2023.

CVPR 2023

Prototypical Residual Networks for Anomaly Detection and Localization.

Hui Zhang, Zuxuan Wu, Zheng Wang, Zhineng Chen, Yu-Gang Jiang

IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Vancouver, Canada, 2023.

CVPR 2023

Meta Style Adversarial Training for Cross-Domain Few-Shot Learning.

Yuqian Fu, Yu Xie, Yanwei Fu, Yu-Gang Jiang

IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Vancouver, Canada, 2023.

CVPR 2023

ResFormer: Scaling ViTs with Multi-Resolution Training.

Rui Tian, Zuxuan Wu, Qi Dai, Han Hu, Yu Qiao, Yu-Gang Jiang

IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Vancouver, Canada, 2023.

CVPR 2023

Unlearnable Clusters: Towards Label-agnostic Unlearnable Examples.

Jiaming Zhang, Xingjun Ma, Qi Yi, Jitao Sang, Yu-Gang Jiang, Yaowei Wang, Changsheng Xu

IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Vancouver, Canada, 2023.

CVPR 2023

Enhancing the Self-Universality for Transferable Targeted Attacks.

Zhipeng Wei, Jingjing Chen, Zuxuan Wu, Yu-Gang Jiang

IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Vancouver, Canada, 2023.

CVPR 2023

Bi-directional Feature Fusion Generative Adversarial Network for Ultra-high Resolution Pathological Image Virtual Re-staining.

Kexin Sun, Zhineng Chen, Gongwei Wang, Jun Liu, Xiongjun Ye, Yu-Gang Jiang

IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Vancouver, Canada, 2023.

NeurIPS 2023

Multi-Prompt Alignment for Multi-Source Unsupervised Domain Adaptation.

Haoran Chen, Xintong Han, Zuxuan Wu, Yu-Gang Jiang

Advances in Neural Information Processing Systems (NeurIPS), 2023.

NeurIPS 2023

Learning from Rich Semantics and Coarse Locations for Long-tailed Object Detection.

Lingchen Meng, Xiyang Dai, Jianwei Yang, Dongdong Chen, Yinpeng Chen, Mengchen Liu, Yi-Ling Chen, Zuxuan Wu, Lu Yuan, Yu-Gang Jiang

Advances in Neural Information Processing Systems (NeurIPS), 2023.

ICCV 2023

Implicit Temporal Modeling with Learnable Alignment for Video Recognition.

Shuyuan Tu, Qi Dai, Zuxuan Wu, Zhi-Qi Cheng, Han Hu, Yu-Gang Jiang

International Conference on Computer Vision (ICCV), 2023.

ICCV 2023

MRN: Multiplexed Routing Network for Incremental Multilingual Text Recognition.

Tianlun Zheng, Zhineng Chen, BingChen Huang, Wei Zhang, Yu-Gang Jiang

International Conference on Computer Vision (ICCV), 2023.

Pattern Recognition 2023

Knowledge driven weights estimation for large-scale few-shot image recognition.

Jingjing Chen, Linhai Zhuo, Zhipeng Wei, Hao Zhang, Huazhu Fu, Yu-Gang Jiang

Pattern Recognition, 2023.

TIP 2023

Towards Transferable Adversarial Attacks on Image and Video Transformers.

Zhipeng Wei, Jingjing Chen, Micah Goldblum, Zuxuan Wu, Tom Goldstein, Yu-Gang Jiang, Larry S. Davis

IEEE Transactions on Image Processing (TIP), vol. 32, pp. 6346-6358, 2023.

TMM 2023

Cross-Domain Contrastive Learning for Unsupervised Domain Adaptation.

Rui Wang, Zuxuan Wu, Zejia Weng, Jingjing Chen, Guo-Jun Qi, Yu-Gang Jiang

IEEE Transactions on Multimedia (TMM), vol. 25, pp. 1665-1673, 2023.

TMM 2023

FT-TDR: Frequency-Guided Transformer and Top-Down Refinement Network for Blind Face Inpainting.

Junke Wang, Shaoxiang Chen, Zuxuan Wu, Yu-Gang Jiang

IEEE Transactions on Multimedia (TMM), vol. 25, pp. 2382-2392, 2023.

TMM 2023

Scene Graph Refinement Network for Visual Question Answering.

Tianwen Qian, Jingjing Chen, Shaoxiang Chen, Bo Wu, Yu-Gang Jiang

IEEE Transactions on Multimedia (TMM), vol. 25, pp. 3950-3961, 2023.

TMM 2023

Dynamic Mixup for Multi-Label Long-Tailed Food Ingredient Recognition.

Jixiang Gao, Jingjing Chen, Huazhu Fu, Yu-Gang Jiang

IEEE Transactions on Multimedia (TMM), vol. 25, pp. 4764-4773, 2023.

TMM 2023

Self-Supervised Learning for Semi-Supervised Temporal Language Grounding.

Fan Luo, Shaoxiang Chen, Jingjing Chen, Zuxuan Wu, Yu-Gang Jiang

IEEE Transactions on Multimedia (TMM), vol. 25, pp. 7747-7757, 2023.

ACM MM 2023

On the Importance of Spatial Relations for Few-shot Action Recognition.

Yilun Zhang, Yuqian Fu, Xingjun Ma, Lizhe Qi, Jingjing Chen, Zuxuan Wu, Yu-Gang Jiang

ACM International Conference on Multimedia (ACM MM), 2023.

ACM MM 2023

Generalizing Face Forgery Detection via Uncertainty Learning.

Yanqi Wu, Xue Song, Jingjing Chen, Yu-Gang Jiang

ACM International Conference on Multimedia (ACM MM), 2023.

ACM MM 2023

GCMA: Generative Cross-Modal Transferable Adversarial Attacks from Images to Videos.

Kai Chen, Zhipeng Wei, Jingjing Chen, Zuxuan Wu, Yu-Gang Jiang

ACM International Conference on Multimedia (ACM MM), 2023.

ACM MM 2023

Suspected Object Matters: Rethinking Model's Prediction for One-stage Visual Grounding.

Yang Jiao, Zequn Jie, Jingjing Chen, Lin Ma, Yu-Gang Jiang

ACM International Conference on Multimedia (ACM MM), 2023.

ACM MM 2023

Relation Triplet Construction for Cross-modal Text-to-Video Retrieval.

Xue Song, Jingjing Chen, Yu-Gang Jiang

ACM International Conference on Multimedia (ACM MM), 2023.

IJCAI 2023

TPS++: Attention-Enhanced Thin-Plate Spline for Scene Text Recognition.

Tianlun Zheng, Zhineng Chen, Jinfeng Bai, Hongtao Xie, Yu-Gang Jiang

International Joint Conference on Artificial Intelligence (IJCAI), Macao, S.A.R, 2023.

ICML 2023

Reconstructive Neuron Pruning for Backdoor Defense.

Yige Li, Xixiang Lyu, Xingjun Ma, Nodens Koren, Lingjuan Lyu, Bo Li, Yu-Gang Jiang

International Conference on Machine Learning (ICML), 2023.

ICML 2023

Open-VCLIP: Transforming CLIP to an Open-vocabulary Video Model via Interpolated Weight Optimization.

Zejia Weng, Xitong Yang, Ang Li, Zuxuan Wu, Yu-Gang Jiang

International Conference on Machine Learning (ICML), 2023.

SaTML 2023

Backdoor Attacks on Time Series: A Generative Approach.

Yujing Jiang, Xingjun Ma, Sarah M. Erfani, James Bailey

IEEE Conference on Secure and Trustworthy Machine Learning (SaTML), 2023.

ICLR 2023

Transferable Unlearnable Examples.

Jie Ren, Han Xu, Yuxuan Wan, Xingjun Ma, Lichao Sun, Jiliang Tang

International Conference on Learning Representations (ICLR), 2023.

ICLR 2023

Distilling Cognitive Backdoor Patterns within an Image.

Hanxun Huang, Xingjun Ma, Sarah M. Erfani, James Bailey

International Conference on Learning Representations (ICLR), 2023.

ICME 2023

Adaptive Split-Fusion Transformer.

Zixuan Su, Jingjing Chen, Lei Pang, Chong-Wah Ngo, Yu-Gang Jiang

IEEE International Conference on Multimedia and Expo (ICME), 2023.

ICME 2023

Downstream Task-agnostic Transferable Attacks on Language-Image Pre-training Models.

Yiqiang Lv, Jingjing Chen, Zhipeng Wei, Kai Chen, Zuxuan Wu, Yu-Gang Jiang

IEEE International Conference on Multimedia and Expo (ICME), 2023.

AAAI 2023

PolarFormer: Multi-Camera 3D Object Detection with Polar Transformer.

Yanqin Jiang, Li Zhang, Zhenwei Miao, Xiatian Zhu, Jin Gao, Weiming Hu, Yu-Gang Jiang

The AAAI Conference on Artificial Intelligence (AAAI), Washington DC, USA, 2023.

AAAI 2023

Resolving Task Confusion in Dynamic Expansion Architectures for Class Incremental Learning.

Bingchen Huang, Zhineng Chen, Peng Zhou, Jiayin Chen, Zuxuan Wu

The AAAI Conference on Artificial Intelligence (AAAI), Washington DC, USA, 2023.

CVPR 2022

Cross-Modal Transferable Adversarial Attacks from Images to Videos.

Zhipeng Wei, Jingjing Chen, Zuxuan Wu, Yu-Gang Jiang

IEEE Conference on Computer Vision and Pattern Recognition (CVPR), New Orleans, USA, 2022.

CVPR 2022

Balanced Contrastive Learning for Long-Tailed Visual Recognition.

Jianggang Zhu, Zheng Wang, Jingjing Chen, Yi-Ping Phoebe Chen, Yu-Gang Jiang

IEEE Conference on Computer Vision and Pattern Recognition (CVPR), New Orleans, USA, 2022.

CVPR 2022

ObjectFormer for Image Manipulation Detection and Localization.

Junke Wang, Zuxuan Wu, Jingjing Chen, Xintong Han, Abhinav Shrivastava, Ser-Nam Lim, Yu-Gang Jiang

IEEE Conference on Computer Vision and Pattern Recognition (CVPR), New Orleans, USA, 2022.

CVPR 2022

AdaViT: Adaptive Vision Transformers for Efficient Image Recognition.

Lingchen Meng, Hengduo Li, Bor-Chun Chen, Shiyi Lan, Zuxuan Wu, Yu-Gang Jiang, Ser-Nam Lim

IEEE Conference on Computer Vision and Pattern Recognition (CVPR), New Orleans, USA, 2022.

CVPR 2022

BEVT: BERT Pretraining of Video Transformers.

Rui Wang, Dongdong Chen, Zuxuan Wu, Yinpeng Chen, Xiyang Dai, Mengchen Liu, Yu-Gang Jiang, Luowei Zhou, Lu Yuan

IEEE Conference on Computer Vision and Pattern Recognition (CVPR), New Orleans, USA, 2022.

ECCV 2022

MORE: Multi-Order RElation Mining for Dense Captioning in 3D Scenes.

Yang Jiao, Shaoxiang Chen, Zequn Jie, Jingjing Chen, Lin Ma, Yu-Gang Jiang

European Conference on Computer Vision (ECCV), Tel-Aviv, Israel, 2022.

ECCV 2022

Semi-Supervised Single-View 3D Reconstruction via Prototype Shape Priors.

Zhen Xing, Hengduo Li, Zuxuan Wu, Yu-Gang Jiang

European Conference on Computer Vision (ECCV), Tel-Aviv, Israel, 2022.

ECCV 2022

Semi-Supervised Vision Transformers.

Zejia Weng, Xitong Yang, Ang Li, Zuxuan Wu, Yu-Gang Jiang

European Conference on Computer Vision (ECCV), Tel-Aviv, Israel, 2022.

ECCV 2022

Efficient Video Transformers with Spatial-Temporal Token Selection.

Junke Wang, Xitong Yang, Hengduo Li, Li Liu, Zuxuan Wu, Yu-Gang Jiang

European Conference on Computer Vision (ECCV), Tel-Aviv, Israel, 2022.

TIP 2022

Generalized Meta-FDMixup: Cross-Domain Few-Shot Learning Guided by Labeled Target Data.

Yuqian Fu, Yanwei Fu, Jingjing Chen, Yu-Gang Jiang

IEEE Transactions on Image Processing (TIP), vol. 31, pp. 7078-7090, 2022.

ACM MM 2022

TGDM: Target Guided Dynamic Mixup for Cross-Domain Few-Shot Learning.

Linhai Zhuo, Yuqian Fu, Jingjing Chen, Yixin Cao, Yu-Gang Jiang

ACM International Conference on Multimedia (ACM MM), Lisbon, Portugal, 2022.

ACM MM 2022

ME-D2N: Multi-Expert Domain Decompositional Network for Cross-Domain Few-Shot Learning.

Yuqian Fu, Yu Xie, Yanwei Fu, Jingjing Chen, Yu-Gang Jiang

ACM International Conference on Multimedia (ACM MM), Lisbon, Portugal, 2022.

ACM MM 2022

Mix-DANN and Dynamic-Modal-Distillation for Video Domain Adaptation.

Yuehao Yin, Bin Zhu, Jingjing Chen, Lechao Cheng, Yu-Gang Jiang

ACM International Conference on Multimedia (ACM MM), Lisbon, Portugal, 2022.

IJCAI 2022

SVTR: Scene Text Recognition with a Single Visual Model.

Yongkun Du, Zhineng Chen, Caiyan Jia, Xiaoting Yin, Tianlun Zheng, Chenxia Li, Yuning Du, Yu-Gang Jiang

International Joint Conference on Artificial Intelligence (IJCAI), Vienna, Austria, 2022 (Oral).

AAAI 2022

Boosting the Transferability of Video Adversarial Examples via Temporal Translation.

Zhipeng Wei, Jingjing Chen, Zuxuan Wu, Yu-Gang Jiang

The 36th AAAI Conference on Artificial Intelligence (AAAI), Honolulu, USA, 2022.

AAAI 2022

Attacking Video Recognition Models with Bullet-Screen Comments.

Kai Chen, Zhipeng Wei, Jingjing Chen, Zuxuan Wu, Yu-Gang Jiang

The 36th AAAI Conference on Artificial Intelligence (AAAI), Honolulu, USA, 2022.

AAAI 2022

Towards Transferable Adversarial Attacks on Vision Transformers.

Zhipeng Wei, Jingjing Chen, Micah Goldblum, Zuxuan Wu, Tom Goldstein, Yu-Gang Jiang

The 36th AAAI Conference on Artificial Intelligence (AAAI), Honolulu, USA, 2022.

AAAI 2022

Rethinking Pseudo Labels for Semi-Supervised Object Detection.

Hengduo Li, Zuxuan Wu, Abhinav Shrivastava, Larry S. Davis

The 36th AAAI Conference on Artificial Intelligence (AAAI), Honolulu, USA, 2022.

TPAMI 2022

A Dynamic Frame Selection Framework for Fast Video Recognition.

Zuxuan Wu, Hengduo Li, Caiming Xiong, Yu-Gang Jiang, Larry S. Davis

IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), vol. 44, issue 4, pp. 1699-1711, 2022.

TMM 2022

SAM: Modeling Scene, Object and Action With Semantics Attention Modules for Video Recognition.

Xing Zhang, Zuxuan Wu, Yu-Gang Jiang

IEEE Transactions on Multimedia (TMM), vol. 24, pp. 313-322, 2022.

ICCV 2021

Revisiting Adversarial Robustness Distillation: Robust Soft Labels Make Student Better.

Bojia Zi, Shihao Zhao, Xingjun Ma, Yu-Gang Jiang

International Conference on Computer Vision (ICCV), Virtual, 2021.

ICCV 2021

Motion Guided Region Message Passing for Video Captioning.

Shaoxiang Chen, Yu-Gang Jiang

International Conference on Computer Vision (ICCV), Virtual, 2021.

ICCV 2021

VideoLT: Large-scale Long-tailed Video Recognition.

Xing Zhang, Zuxuan Wu, Zejia Weng, Huazhu Fu, Jingjing Chen, Yu-Gang Jiang, Larry Davis

International Conference on Computer Vision (ICCV), Virtual, 2021.

ACM MM 2021

Visual Co-Occurrence Alignment Learning for Weakly-Supervised Video Moment Retrieval.

Zheng Wang, Jingjing Chen, Yu-Gang Jiang

ACM International Conference on Multimedia (ACM MM), Chengdu, China, 2021.

ACM MM 2021

Meta-FDMixup: Cross-Domain Few-Shot Learning Guided by Labeled Target Data.

Yuqian Fu, Yanwei Fu, Yu-Gang Jiang

ACM International Conference on Multimedia (ACM MM), Chengdu, China, 2021.

ACM MM 2021

Two-stage Visual Cues Enhancement Network for Referring Image Segmentation.

Yang Jiao, Zequn Jie, Weixin Luo, Jingjing Chen, Yu-Gang Jiang, Xiaolin Wei, Lin Ma

ACM International Conference on Multimedia (ACM MM), Chengdu, China, 2021.

ACM MM 2021

A Multimodal Framework for Video Ads Understanding.

Zejia Weng, Lingchen Meng, Rui Wang, Zuxuan Wu, Yu-Gang Jiang

ACM International Conference on Multimedia (ACM MM), Chengdu, China, 2021.

ICMR 2021

Can Action be Imitated? Learn to Reconstruct and Transfer Human Dynamics from Videos.

Yuqian Fu, Yanwei Fu, Yu-Gang Jiang

ACM International Conference on Multimedia Retrieval (ICMR), Virtual, 2021.

ICMR 2021

Bag of Tricks for Building an Accurate and Slim Object Detector for Embedded Applications.

Yongkun Du, Zhineng Chen, Caiyan Jia, Xuanya Li, Yu-Gang Jiang

ACM International Conference on Multimedia Retrieval (ICMR), Virtual, 2021.

CVPR 2021

Towards Bridging Event Captioner and Sentence Localizer for Weakly Supervised Dense Event Captioning.

Shaoxiang Chen, Yu-Gang Jiang

IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Virtual, 2021.

TPAMI 2021

Pixel2Mesh: 3D Mesh Model Generation via Image Guided Deformation.

Nanyang Wang, Yinda Zhang, Zhuwen Li, Yanwei Fu, Hang Yu, Wei Liu, Xiangyang Xue, Yu-Gang Jiang

IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), vol. 43, issue 10, pp. 3600-3613, 2021.

TCSVT 2021

Predicting Content Similarity via Multimodal Modeling for Video-In-Video Advertising.

Xue Song, Baohan Xu, Yu-Gang Jiang

IEEE Transactions on Circuits and Systems for Video Technology (TCSVT), vol. 31, issue 2, pp. 569-581, 2021.

TIP 2021

A Study of Multi-Task and Region-Wise Deep Learning for Food Ingredient Recognition.

Jingjing Chen, Bin Zhu, Chong-Wah Ngo, Tat-Seng Chua, Yu-Gang Jiang

IEEE Transactions on Image Processing (TIP), vol. 30, pp. 1514-1526, 2021.

TKDE 2021

Co-Attention Memory Network for Multimodal Microblog's Hashtag Recommendation.

Renfeng Ma, Xipeng Qiu, Qi Zhang, Xiangkun Hu, Yu-Gang Jiang, Xuanjing Huang

IEEE Transactions on Knowledge and Data Engineering (TKDE), vol. 33, issue 2, pp. 388-400, 2021.

IJCV 2021

A Coarse-to-Fine Framework for Resource Efficient Video Recognition.

Zuxuan Wu, Hengduo Li, Yingbin Zheng, Caiming Xiong, Yu-Gang Jiang, Larry S. Davis

International Journal of Computer Vision (IJCV), vol. 129, issue 11, pp. 2965-2977, 2021.

TMM 2021

Story-driven Video Editing.

Zheng Wang, Jianguo Li, Yu-Gang Jiang

IEEE Transactions on Multimedia (TMM), vol. 23, pp. 4027-4036, 2021.

CVPR 2020

Clean-Label Backdoor Attacks on Video Recognition Models.

Shihao Zhao, Xingjun Ma, Xiang Zheng, James Bailey, Jingjing Chen, Yu-Gang Jiang

IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Seattle, USA, 2020.

CVPR 2020

Hyperbolic Visual Embedding Learning for Zero-Shot Recognition.

Shaoteng Liu, Jingjing Chen, Liangming Pan, Chong-Wah Ngo, Tat-Seng Chua, Yu-Gang Jiang

IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Seattle, USA, 2020.

CVPR 2020

Sketch-BERT: Learning Sketch Bidirectional Encoder Representation from Transformers by Self-supervised Learning of Sketch Gestalt.

Hangyu Lin, Yanwei Fu, Yu-Gang Jiang, Xiangyang Xue

IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Seattle, USA, 2020.

CVPR 2020

FM^2u-Net: Face Morphological Multi-branch Network for Makeup-invariant Face Verification.

Wenxuan Wang, Yanwei Fu, Xuelin Qian, Yu-Gang Jiang, Qi Tian, Xiangyang Xue

IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Seattle, USA, 2020.

AAAI 2020

Heuristic Black-box Adversarial Attacks on Video Recognition Models.

Zhi-Peng Wei, Jingjing Chen, Xingxing Wei, Linxi Jiang, Tat-Seng Chua, Fengfeng Zhou, Yu-Gang Jiang

The 34th AAAI Conference on Artificial Intelligence (AAAI), New York, USA, 2020.

AAAI 2020

Feature Deformation Meta-Networks in Image Captioning of Novel Objects.

Tingjia Cao, Ke Han, Xiaomei Wang, Lin Ma, Yanwei Fu, Yu-Gang Jiang, Xiangyang Xue

The 34th AAAI Conference on Artificial Intelligence (AAAI), New York, USA, 2020.

TMM 2020

A Multi-task Neural Approach for Emotion Attribution, Classification and Summarization.

Guoyun Tu, Yanwei Fu, Jiarui Gao, Boyang Li, Yu-Gang Jiang, Xiangyang Xue

IEEE Transactions on Multimedia (TMM), vol. 22, issue 1, pp. 148-159, 2020.

TPAMI 2020

Object Detection from Scratch with Deep Supervision.

Zhiqiang Shen, Zhuang Liu, Jianguo Li, Yu-Gang Jiang, Yurong Chen, Xiangyang Xue

IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), vol. 42, issue 2, pp. 398-412, 2020.

TPAMI 2020

Leader-based Multi-Scale Attention Deep Architecture for Person Re-identification.

Xuelin Qian, Yanwei Fu, Tao Xiang, Yu-Gang Jiang, Xiangyang Xue

IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), vol. 42, issue 2, pp. 371-385, 2020.

TIP 2020

Re-Caption: Saliency-Enhanced Image Captioning through Two-Phase Learning.

Lian Zhou, Yuejie Zhang, Yu-Gang Jiang, Tao Zhang, Weiguo Fan

IEEE Transactions on Image Processing (TIP), vol. 29, issue 1, pp. 694-709, 2020.

TIP 2020

Deep Ranking for Image Zero-Shot Multi-Label Classification.

Zhong Ji, Biying Cui, Huihui Li, Yu-Gang Jiang, Tao Xiang, Timothy M. Hospedales, Yanwei Fu

IEEE Transactions on Image Processing (TIP), vol. 29, issue 1, pp. 6549-6560, 2020.

TIP 2020

Learning Layer-Skippable Inference Network.

Yu-Gang Jiang, Changmao Cheng, Hangyu Lin, Yanwei Fu

IEEE Transactions on Image Processing (TIP), vol. 29, issue 1, pp. 8747-8759, 2020.

TIP 2020

Pose-Guided Person Image Synthesis in the Non-Iconic Views.

Chengming Xu, Yanwei Fu, Chao Wen, Ye Pan, Yu-Gang Jiang, Xiangyang Xue

IEEE Transactions on Image Processing (TIP), vol. 29, issue 1, pp. 9060-9072, 2020.

TPAMI 2020

Vocabulary-Informed Zero-Shot and Open-Set Learning.

Yanwei Fu, Xiaomei Wang, Hanze Dong, Yu-Gang Jiang, Meng Wang, Xiangyang Xue, Leonid Sigal

IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), vol. 42, issue 12, pp. 3136-3152, 2020.

TCSVT 2020

Matching Image and Sentence With Multi-Faceted Representations.

Lin Ma, Wenhao Jiang, Zequn Jie, Yu-Gang Jiang, Wei Liu

IEEE Transactions on Circuits and Systems for Video Technology (TCSVT), vol. 30, issue 12, pp. 4578-4590, 2020.

TCSVT 2020

Learning to Score Figure Skating Sport Videos.

Chengming Xu, Yanwei Fu, Bing Zhang, Zitian Chen, Yu-Gang Jiang, Xiangyang Xue

IEEE Transactions on Circuits and Systems for Video Technology (TCSVT), vol. 30, issue 12, pp. 4578-4590, 2020.

NeurIPS 2019

LiteEval: A Coarse-to-Fine Framework for Resource Efficient Video Recognition.

Zuxuan Wu, Caiming Xiong, Yu-Gang Jiang, Larry Davis

Advances in Neural Information Processing Systems (NeurIPS), Vancouver, Canada, 2019.

AAAI 2019

Motion Guided Spatial Attention for Video Captioning.

Shaoxiang Chen, Yu-Gang Jiang

The 33rd AAAI Conference on Artificial Intelligence (AAAI), Honolulu, Hawaii, USA, 2019.

AAAI 2019

Semantic Proposal for Activity Localization in Videos via Sentence Query.

Shaoxiang Chen, Yu-Gang Jiang

The 33rd AAAI Conference on Artificial Intelligence (AAAI), Honolulu, Hawaii, USA, 2019.

AAAI 2019

Image Block Augmentation for One-Shot Learning.

Zitian Chen, Yanwei Fu, Kaiyu Chen, Yu-Gang Jiang

The 33rd AAAI Conference on Artificial Intelligence (AAAI), Honolulu, Hawaii, USA, 2019.

AAAI 2019

Composite Binary Decomposition Network.

Qiaoben You, Zheng Wang, Jianguo Li, Yinpeng Dong, Yu-Gang Jiang, Jun Zhu

The 33rd AAAI Conference on Artificial Intelligence (AAAI), Honolulu, Hawaii, USA, 2019.

AAAI 2019

Trainable Undersampling for Class-Imbalance Learning.

Minlong Peng, Qi Zhang, Xiaoyu Xing, Tao Gui, Xuanjing Huang, Yu-Gang Jiang, Keyu Ding, Zhigang Chen

The 33rd AAAI Conference on Artificial Intelligence (AAAI), Honolulu, Hawaii, USA, 2019.

IJCAI 2019

Deep Learning for Video Captioning: A Review.

Shaoxiang Chen, Ting Yao, Yu-Gang Jiang

International Joint Conference on Artificial Intelligence (IJCAI), Macao, China, 2019.

IJCAI 2019

Lexicon Rethinking Chinese Named Entity Recognition.

Tao Gui, Ruotian Ma, Qi Zhang, Lujun Zhao, Yu-Gang Jiang, Xuanjing Huang

International Joint Conference on Artificial Intelligence (IJCAI), Macao, China, 2019.

ACM MM 2019

Black-box Adversarial Attacks on Video Recognition Models.

Linxi Jiang, Xingjun Ma, Shaoxiang Chen, James Bailey, Yu-Gang Jiang

ACM International Conference on Multimedia (ACM MM), Nice, France, 2019.

ACM MM 2019

Embodied One-Shot Video Recognition: Learning from Actions of a Virtual Embodied Agent.

Yuqian Fu, Chengrong Wang, Yanwei Fu, Yu-Xiong Wang, Cong Bai, Xiangyang Xue, Yu-Gang Jiang

ACM International Conference on Multimedia (ACM MM), Nice, France, 2019.

ACM MM 2019

TC-GAN: Triangle Cycle-Consistent GANs for Face Frontalization with Facial Features Preserved.

Juntong Cheng, Yi-Ping Phoebe Chen, Minjun Li, Yu-Gang Jiang

ACM International Conference on Multimedia (ACM MM), Nice, France, 2019.

ACM MM 2019

Sparse Temporal Causal Convolution for Efficient Action Modeling.

Changmao Cheng, Chi Zhang, Yichen Wei, Yu-Gang Jiang

ACM International Conference on Multimedia (ACM MM), Nice, France, 2019.

ACM MM 2019

TC-Net for iSBIR: Triplet Classification Network for Instance-level Sketch Based Image Retrieval.

Hangyu Lin, Yanwei Fu, Peng Lu, Shaogang Gong, Xiangyang Xue, Yu-Gang Jiang

ACM International Conference on Multimedia (ACM MM), Nice, France, 2019.

ACM MM 2019

Comp-GAN: Compositional Generative Adversarial Network in Synthesizing and Recognizing Facial Expression.

Wenxuan Wang, Qiang Sun, Yanwei Fu, Tao Chen, Chenjie Cao, Ziqi Zheng, Guoqiang Xu, Han Qiu, Yu-Gang Jiang, Xiangyang Xue

ACM International Conference on Multimedia (ACM MM), Nice, France, 2019.

SIGIR 2019

Hot Topic-Aware Retweet Prediction with Masked Self-attentive Model.

Renfeng Ma, Qi Zhang, Xiangkun Hu, Xuanjing Huang, Yu-Gang Jiang

ACM SIGIR Conference on Research & Development on Information Retrieval (SIGIR), Paris, France, 2019.

ICME 2019

An End-to-End Architecture for Class-Incremental Object Detection with Knowledge Distillation.

Yu Hao, Yanwei Fu, Yu-Gang Jiang, Qi Tian

IEEE International Conference on Multimedia & Expo (ICME), Shanghai, China, 2019.

ICMR 2019

Take Goods from Shelves: A Dataset for Class-incremental Object Detection.

Yu Hao, Yanwei Fu, Yu-Gang Jiang

ACM International Conference on Multimedia Retrieval (ICMR), Ottawa, Canada, 2019.

Science China Information Sciences 2019

Reformulating Natural Language Queries using Sequence-to-Sequence Models.

Xiaoyu Liu, Shunda Pan, Qi Zhang, Yu-Gang Jiang, Xuanjing Huang

Science China Information Sciences, vol. 62, issue 12, 229103:1-229103:3, 2019.

TIP 2019

Dense Dilated Network for Video Action Recognition.

Baohan Xu, Hao Ye, Yingbin Zheng, Heng Wang, Tianyu Luwang, Yu-Gang Jiang

IEEE Transactions on Image Processing (TIP), vol. 28, issue 10, pp. 4941-4953, 2019.

TIP 2019

Multi-level Semantic Feature Augmentation for One-shot Learning.

Zitian Chen, Yanwei Fu, Yinda Zhang, Yu-Gang Jiang, Xiangyang Xue, Leonid Sigal

IEEE Transactions on Image Processing (TIP), vol. 28, issue 9, pp. 4594-4605, 2019.

TOMM 2019

Visual Content Recognition by Exploiting Semantic Feature Map with Attention and Multi-task Learning.

Rui-Wei Zhao, Zuxuan Wu, Qi Zhang, Jianguo Li, Yu-Gang Jiang

ACM Transactions on Multimedia Computing, Communications and Applications (TOMM), vol. 15, issue 1, pp. 6:1-6:22, 2019.

TPAMI 2019

Social Anchor-Unit Graph Regularized Tensor Completion for Large-Scale Image Retagging.

Jinhui Tang, Xiangbo Shu, Zechao Li, Yu-Gang Jiang, Qi Tian

IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), vol. 41, issue 8, pp. 2027-2034, 2019.

ECCV 2018

Pixel2Mesh: Generating 3D Mesh Models from Single RGB Images.

Nanyang Wang, Yinda Zhang, Zhuwen Li, Yanwei Fu, Wei Liu, Yu-Gang Jiang

European Conference on Computer Vision (ECCV), Munich, Germany, 2018.

ECCV 2018

Unsupervised Image-to-Image Translation with Stacked Cycle-Consistent Adversarial Networks.

Minjun Li, Haozhi Huang, Lin Ma, Wei Liu, Tong Zhang, Yu-Gang Jiang

European Conference on Computer Vision (ECCV), Munich, Germany, 2018.

ECCV 2018

Pose-Normalized Image Generation for Person Re-identification.

Xuelin Qian, Yanwei Fu, Tao Xiang, Wenxuan Wang, Jie Qiu, Yang Wu, Yu-Gang Jiang, Xiangyang Xue

European Conference on Computer Vision (ECCV), Munich, Germany, 2018.

ECCV 2018

Recurrent Fusion Network for Image Captioning.

Wenhao Jiang, Lin Ma, Yu-Gang Jiang, Wei Liu, Tong Zhang

European Conference on Computer Vision (ECCV), Munich, Germany, 2018.

CVPR 2018

Dual Skipping Networks.

Changmao Cheng, Yanwei Fu, Yu-Gang Jiang, Wei Liu, Wenlian Lu, Jianfeng Feng, Xiangyang Xue

IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Salt Lake City, USA, 2018.

IJCAI 2018

Harnessing Synthesized Abstraction Images to Improve Facial Attribute Recognition.

Keke He, Yanwei Fu, Wuhao Zhang, Chengjie Wang, Yu-Gang Jiang, Feiyue Huang, Xiangyang Xue

International Joint Conference on Artificial Intelligence (IJCAI), Stockholm, Sweden, 2018.

ICMR 2018

Dense Dilated Network for Few Shot Action Recognition.

Baohan Xu, Hao Ye, Yingbin Zheng, Heng Wang, Tianyu Luwang, Yu-Gang Jiang

ACM International Conference on Multimedia Retrieval (ICMR), Yokohama, Japan, 2018.

ACL 2018

Cross-Domain Sentiment Classification with Target Domain Specific Information.

Minlong Peng, Qi Zhang, Yu-Gang Jiang, Xuanjing Huang

The 56th Annual Meeting of the Association for Computational Linguistics (ACL), Melbourne, Australia, 2018.

CIKM 2018

Generating Keyword Queries for Natural Language Queries to Alleviate Lexical Chasm Problem.

Xiaoyu Liu, Shunda Pan, Qi Zhang, Yu-Gang Jiang, Xuanjing Huang

The 27th ACM International Conference on Information and Knowledge Management (CIKM), Torino, Italy, 2018.

TMM 2018

Modeling Multimodal Clues in a Hybrid Deep Learning Framework for Video Classification.

Yu-Gang Jiang, Zuxuan Wu, Jinhui Tang, Zechao Li, Xiangyang Xue, Shih-Fu Chang

IEEE Transactions on Multimedia (TMM), vol. 20, issue 11, pp. 3137-3147, 2018.

TKDE 2018

NAIS: Neural Attentive Item Similarity Model for Recommendation.

Xiangnan He, Zhankui He, Jingkuan Song, Zhenguang Liu, Yu-Gang Jiang, Tat-Seng Chua

IEEE Transactions on Knowledge and Data Engineering (TKDE), vol. 30, issue 12, pp. 2354-2366, 2018.

Multimedia Tools and Applications 2018

Stacked Multichannel Autoencoder - An Efficient Way of Learning Synthetic Data.

Xi Zhang, Yanwei Fu, Shanshan Jiang, Yu-Gang Jiang, Xiangyang Xue, Gady Agam

Multimedia Tools and Applications, vol. 77, issue 20, pp. 26563-26580, 2018.

TOMM 2018

DeepProduct: Mobile Product Search with Portable Deep Features.

Yu-Gang Jiang, Minjun Li, Xi Wang, Wei Liu, Xian-Sheng Hua

ACM Transactions on Multimedia Computing, Communications and Applications (TOMM), vol. 14, issue 2, pp. 50:1-50:18, 2018.

TAC 2018

Heterogeneous Knowledge Transfer in Video Emotion Recognition, Attribution and Summarization.

Baohan Xu, Yanwei Fu, Yu-Gang Jiang, Boyang Li, Leonid Sigal

IEEE Transactions on Affective Computing (TAC), vol. 9, issue 2, pp. 255-270, 2018.

Frontiers of Multimedia Research 2018

Deep Learning for Video Classification and Captioning.

Zuxuan Wu, Ting Yao, Yanwei Fu, Yu-Gang Jiang

In Frontiers of Multimedia Research, Shih-Fu Chang (Ed.), Association for Computing Machinery and Morgan & Claypool, New York, NY, USA, pp. 3-29, 2018.

TIP 2018

Hookworm Detection in Wireless Capsule Endoscopy Images with Deep Learning.

Jun-Yan He, Xiao Wu, Yu-Gang Jiang, Qiang Peng, Ramesh Jain

IEEE Transactions on Image Processing (TIP), vol. 27, issue 5, pp. 2379-2392, 2018.

TCSVT 2018

Image Classification with Tailored Fine-grained Dictionaries.

Xiangbo Shu, Jinhui Tang, Guo-Jun Qi, Zechao Li, Yu-Gang Jiang, Shuicheng Yan

IEEE Transactions on Circuits and Systems for Video Technology (TCSVT), vol. 28, issue 2, pp. 454-467, 2018.

TPAMI 2018

Exploiting Feature and Class Relationships in Video Categorization with Regularized Deep Neural Networks.

Yu-Gang Jiang, Zuxuan Wu, Jun Wang, Xiangyang Xue, Shih-Fu Chang

IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), vol. 40, issue 2, pp. 352-364, 2018.

SPM 2018

Recent Advances in Zero-shot Recognition: Toward Data-Efficient Understanding of Visual Content.

Yanwei Fu, Tao Xiang, Yu-Gang Jiang, Xiangyang Xue, Leonid Sigal, Shaogang Gong

IEEE Signal Processing Magazine (SPM), vol. 35, issue 1, pp. 112-125, 2018.

CVIU 2017

The THUMOS Challenge on Action Recognition for Videos 'in the Wild'.

Haroon Idrees, Amir R. Zamir, Yu-Gang Jiang, Alex Gorban, Ivan Laptev, Rahul Sukthankar, Mubarak Shah

Computer Vision and Image Understanding (CVIU), vol. 155, pp. 1-23, 2017.

ICCV 2017

DSOD: Learning Deeply Supervised Object Detectors from Scratch.

Zhiqiang Shen, Zhuang Liu, Jianguo Li, Yu-Gang Jiang, Yurong Chen, Xiangyang Xue

International Conference on Computer Vision (ICCV), Venice, Italy, 2017.

ICCV 2017

Multi-scale Deep Learning Architectures for Person Re-identification.

Xuelin Qian, Yanwei Fu, Yu-Gang Jiang, Tao Xiang, Xiangyang Xue

International Conference on Computer Vision (ICCV), Venice, Italy, 2017.

ACM MM 2017

Learning Fashion Compatibility with Bidirectional LSTMs.

Xintong Han, Zuxuan Wu, Yu-Gang Jiang, Larry Davis

ACM International Conference on Multimedia (ACM MM), Mountain View, USA, 2017.

ACM MM 2017

Learning Semantic Feature Map for Visual Content Recognition.

Rui-Wei Zhao, Zuxuan Wu, Jianguo Li, Yu-Gang Jiang

ACM International Conference on Multimedia (ACM MM), Mountain View, USA, 2017.

ACM MM 2017

Sketch Recognition with Deep Visual-Sequential Fusion Model.

Jun-Yan He, Xiao Wu, Yu-Gang Jiang, Bo Zhang, Qiang Peng

ACM International Conference on Multimedia (ACM MM), Mountain View, USA, 2017.

ACM MM 2017

Learning to Generate and Edit Hairstyles.

Weidong Yin, Yanwei Fu, Yiqing Ma, Yu-Gang Jiang, Xiangyang Xue

ACM International Conference on Multimedia (ACM MM), Mountain View, USA, 2017.

ACM MM 2017

Adaptively Weighted Multi-task Deep Network for Person Attribute Classification.

Keke He, Zhanxiong Wang, Yanwei Fu, Yu-Gang Jiang, Rui Feng, Xiangyang Xue

ACM International Conference on Multimedia (ACM MM), Mountain View, USA, 2017.

CVPR 2017

Weakly Supervised Dense Video Captioning.

Zhiqiang Shen, Jianguo Li, Zhou Su, Minjun Li, Yurong Chen, Yu-Gang Jiang, Xiangyang Xue

IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Hawaii, USA, 2017.

ICMR 2017

Frame-Transformer Emotion Classification Network.

Jiarui Gao, Yanwei Fu, Yu-Gang Jiang, Xiangyang Xue

ACM International Conference on Multimedia Retrieval (ICMR), Bucharest, Romania, 2017.

ICMR 2017

Multi-task Deep Neural Network for Joint Face Recognition and Facial Attribute Prediction.

Zhanxiong Wang, Keke He, Yanwei Fu, Rui Feng, Yu-Gang Jiang, Xiangyang Xue

ACM International Conference on Multimedia Retrieval (ICMR), Bucharest, Romania, 2017.

ICME 2017

Iterative Object and Part Transfer for Fine-Grained Recognition.

Zhiqiang Shen, Yu-Gang Jiang, Xiangyang Xue

IEEE International Conference on Multimedia & Expo (ICME), Hong Kong, China, 2017.

AAAI 2017

Adaptive Proximal Average Approximation for Composite Convex Minimization.

Li Shen, Wei Liu, Junzhou Huang, Yu-Gang Jiang, Shiqian Ma

The 31st AAAI Conference on Artificial Intelligence (AAAI), San Francisco, USA, 2017.

Journal of Computer Research and Development 2017

Video Copy Detection Method: A Review (In Chinese).

Jiawei Gu, Rui-Wei Zhao, Yu-Gang Jiang

Journal of Computer Research and Development, vol. 54, issue 6, pp. 1238-1250, 2017. Special Issue Highlighting Research Works of NSF China Outstanding Young Researcher Awardees.

ACM MM 2016

Multi-Stream Multi-Class Fusion of Deep Networks for Video Classification.

Zuxuan Wu, Yu-Gang Jiang, Xi Wang, Hao Ye, Xiangyang Xue

ACM International Conference on Multimedia (ACM MM), Amsterdam, The Netherlands, 2016.

ACM MM 2016

Binary Optimized Hashing.

Qi Dai, Jianguo Li, Jingdong Wang, Yu-Gang Jiang

ACM International Conference on Multimedia (ACM MM), Amsterdam, The Netherlands, 2016.

CVPR 2016

Harnessing Object and Scene Semantics for Large-Scale Video Understanding.

Zuxuan Wu, Yanwei Fu, Yu-Gang Jiang, Leonid Sigal

IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Las Vegas, USA, 2016.

ICMR 2016

Matching User Photos to Online Products with Robust Deep Features.

Xi Wang, Zhenfeng Sun, Wenqiang Zhang, Yu Zhou, Yu-Gang Jiang

ACM International Conference on Multimedia Retrieval (ICMR), New York, USA, 2016.

ICMR 2016

Video Emotion Recognition with Transferred Deep Feature Encodings.

Baohan Xu, Yanwei Fu, Yu-Gang Jiang, Boyang Li, Leonid Sigal

ACM International Conference on Multimedia Retrieval (ICMR), New York, USA, 2016.

BMVC 2016

Regional Gating Neural Networks for Multi-Label Image Classification.

Rui-Wei Zhao, Jianguo Li, Yurong Chen, Jia-Ming Liu, Yu-Gang Jiang, Xiangyang Xue

British Machine Vision Conference (BMVC), York, UK, 2016.

ECAI 2016

On Stochastic Primal-Dual Hybrid Gradient Approach for Compositely Regularized Minimization.

Linbo Qiao, Tianyi Lin, Yu-Gang Jiang, Fan Yang, Wei Liu, Xicheng Lu

European Conference on Artificial Intelligence (ECAI), The Hague, The Netherlands, 2016.

TRECVID 2016

NTTFudan Team at TRECVID 2016: Multimedia Event Detection.

Yongqing Sun, Rui-Wei Zhao, Minjun Li, Chuan Lu, Hiroyuki Arai, Tetsuya Kinebuchi, Yu-Gang Jiang

NIST TRECVID Workshop (TRECVID), Gaithersburg, USA, 2016.

MediaEval 2016

BigVid at MediaEval 2016: Predicting Interestingness in Images and Videos.

Baohan Xu, Yanwei Fu, Yu-Gang Jiang

MediaEval 2016 Workshop, 2016.

TMM 2016

Hierarchical Visualization of Video Search Results for Topic-based Browsing.

Yu-Gang Jiang, Jiajun Wang, Qiang Wang, Wei Liu, Chong-Wah Ngo

IEEE Transactions on Multimedia (TMM), vol. 18, issue 11, pp. 2161-2170, 2016.

IEEE Transactions on Big Data 2016

Partial Copy Detection in Videos: A Benchmark and an Evaluation of Popular Methods.

Yu-Gang Jiang, Jiajun Wang

IEEE Transactions on Big Data, vol. 2, issue 1, pp. 32-42, 2016.

IEEE Multimedia 2016

Fast Summarization of User-Generated Videos: Exploiting Semantic, Emotional and Quality Clues.

Baohan Xu, Xi Wang, Yu-Gang Jiang

IEEE Multimedia, vol. 23, issue 3, pp. 23-33, 2016.

ACM MM 2015

Modeling Spatial-Temporal Clues in a Hybrid Deep Learning Framework for Video Classification.

Zuxuan Wu, Xi Wang, Yu-Gang Jiang, Hao Ye, Xiangyang Xue

ACM International Conference on Multimedia (ACM MM), Brisbane, Australia, 2015.

ICMR 2015

Evaluating Two-Stream CNN for Video Classification.

Hao Ye, Zuxuan Wu, Rui-Wei Zhao, Xi Wang, Yu-Gang Jiang, Xiangyang Xue

ACM International Conference on Multimedia Retrieval (ICMR), Shanghai, China, 2015.

IJCAI 2015

Optimal Bayesian Hashing for Efficient Face Recognition.

Qi Dai, Jianguo Li, Jun Wang, Yurong Chen, Yu-Gang Jiang

The 24th International Joint Conference on Artificial Intelligence (IJCAI), Buenos Aires, Argentina, 2015.

IJCAI 2015

Portfolio Choices with Orthogonal Bandit Learning.

Weiwei Shen, Jun Wang, Yu-Gang Jiang, Hongyuan Zha

The 24th International Joint Conference on Artificial Intelligence (IJCAI), Buenos Aires, Argentina, 2015.

IEEE BigMM 2015

Categorizing Big Video Data on the Web: Challenges and Opportunities.

Yu-Gang Jiang

IEEE International Conference on Multimedia Big Data (IEEE BigMM), Beijing, China, April 2015. IEEE & ACM BigMM Summit 2015, Invited Position Paper.

MediaEval 2015

Fudan-Huawei at MediaEval 2015: Detecting Violent Scenes and Affective Impact in Movies with Deep Learning.

Qi Dai, Rui-Wei Zhao, Zuxuan Wu, Xi Wang, Zichen Gu, Wenhai Wu, Yu-Gang Jiang

MediaEval 2015 Workshop, Wurzen, Germany, 2015.

TRECVID 2015

Fudan at TRECVID 2015: Adaptive Feature Fusion for Multimedia Event Detection in Videos.

Zuxuan Wu, Hao Ye, Yu-Gang Jiang, Xiangyang Xue

NIST TRECVID Workshop (TRECVID), Gaithersburg, USA, 2015.

TRECVID 2015

NTT-Fudan Team @ TRECVID 2015: Multimedia Event Detection.

Yongqing Sun, Zuxuan Wu, Xi Wang, Kyoko Sudo, Yukinobu Taniguchi, Tetsuya Kinebuchi, Yu-Gang Jiang

NIST TRECVID Workshop (TRECVID), Gaithersburg, USA, 2015.

CBMI 2015

VSD2014: A Dataset for Violent Scenes Detection in Hollywood Movies and Web Videos.

Markus Schedl, Mats Sjoberg, Ionut Mironica, Bogdan Ionescu, Vu Lam Quang, Yu-Gang Jiang, Claire-Helene Demarty

International Workshop on Content-Based Multimedia Indexing (CBMI), Prague, Czech Republic, 2015.

Multimedia Tools and Applications 2015

GPU-based MapReduce for Large-scale Near-duplicate Video Retrieval.

Hanli Wang, Fengkuangtian Zhu, Bo Xiao, Lei Wang, Yu-Gang Jiang

Multimedia Tools and Applications, vol. 74, issue 23, pp. 10515-10534, 2015.

DMKD 2015

A Relative Similarity Based Method for Interactive Patient Risk Prediction.

Buyue Qian, Xiang Wang, Nan Cao, Hongfei Li, Yu-Gang Jiang

Data Mining and Knowledge Discovery (DMKD), vol. 29, issue 4, pp. 1070-1093, 2015.

TIP 2015

Human Action Recognition in Unconstrained Videos by Explicit Motion Modeling.

Yu-Gang Jiang, Qi Dai, Wei Liu, Xiangyang Xue, Chong-Wah Ngo

IEEE Transactions on Image Processing (TIP), vol. 24, issue 11, pp. 3781-3795, 2015.

TMM 2015

Super Fast Event Recognition in Internet Videos.

Yu-Gang Jiang, Qi Dai, Tao Mei, Yong Rui, Shih-Fu Chang

IEEE Transactions on Multimedia (TMM), vol. 17, issue 8, pp. 1-13, 2015.

TCSVT 2015

CHCF: A Cloud-based Heterogeneous Computing Framework for Large-Scale Image Retrieval.

Hanli Wang, Bo Xiao, Lei Wang, Fengkuangtian Zhu, Yu-Gang Jiang, Jun Wu

IEEE Transactions on Circuits and Systems for Video Technology (TCSVT), vol. 25, issue 12, pp. 1900-1913, 2015.

ACM MM 2014

Exploring Inter-feature and Inter-class Relationships with Deep Neural Networks for Video Classification.

Zuxuan Wu, Yu-Gang Jiang, Jun Wang, Jian Pu, Xiangyang Xue

ACM International Conference on Multimedia (ACM MM), Orlando, USA, 2014.

ECCV 2014

VCDB: A Large-Scale Database for Partial Copy Detection in Videos.

Yu-Gang Jiang, Yudong Jiang, Jiajun Wang

European Conference on Computer Vision (ECCV), Zurich, Switzerland, 2014.

ECCV 2014

Which Looks Like Which: Exploring Inter-Class Relationships in Fine-Grained Visual Categorization.

Jian Pu, Yu-Gang Jiang, Jun Wang, Xiangyang Xue

European Conference on Computer Vision (ECCV), Zurich, Switzerland, 2014.

AAAI 2014

Predicting Emotions in User-Generated Videos.

Yu-Gang Jiang, Baohan Xu, Xiangyang Xue

The 28th AAAI Conference on Artificial Intelligence (AAAI), Quebec City, Canada, 2014.

ACM MM 2014

Organizing Video Search Results to Adapted Semantic Hierarchies for Topic-based Browsing.

Jiajun Wang, Yu-Gang Jiang, Qiang Wang, Kuiyuan Yang, Chong-Wah Ngo

ACM International Conference on Multimedia (ACM MM), Orlando, USA, 2014.

ACM MM 2014

Real-Time Summarization of User-Generated Videos Based on Semantic Recognition.

Xi Wang, Yu-Gang Jiang, Zhenhua Chai, Zichen Gu, Xinyu Du, Dong Wang

ACM International Conference on Multimedia (ACM MM), Orlando, USA, 2014.

ICDM 2014

News Credibility Evaluation on Microblog with a Hierarchical Propagation Model.

Zhiwei Jin, Juan Cao, Yu-Gang Jiang, Yongdong Zhang

IEEE International Conference on Data Mining (ICDM), Shenzhen, China, 2014.

CBMI 2014

Benchmarking Violent Scenes Detection in Movies.

Claire-Helene Demarty, Bogdan Ionescu, Yu-Gang Jiang, Vu Lam Quang, Markus Schedl, Cedric Penet

The 12th International Workshop on Content-Based Multimedia Indexing (CBMI), Klagenfurt, Austria, 2014.

ICME 2014

Challenge Huawei Challenge: Fusing Multimodal Features with Deep Neural Networks for Mobile Video Annotation.

Jian Tu, Zuxuan Wu, Qi Dai, Yu-Gang Jiang, Xiangyang Xue

IEEE International Conference on Multimedia & Expo (ICME), Chengdu, China, 2014. (Grand Challenge Session)

MediaEval 2014

The MediaEval 2014 Affect Task: Violent Scenes Detection.

Mats Sjoberg, Bogdan Ionescu, Yu-Gang Jiang, Vu Lam Quang, Markus Schedl, Claire-Helene Demarty

MediaEval 2014 Workshop, Barcelona, Spain, 2014. (Task Overview Paper)

MediaEval 2014

Fudan-NJUST at MediaEval 2014: Violent Scenes Detection Using Deep Neural Networks.

Qi Dai, Zuxuan Wu, Yu-Gang Jiang, Xiangyang Xue, Jinhui Tang

MediaEval 2014 Workshop, Barcelona, Spain, 2014.

MMM 2014

A Framework of Video Coding for Compressing Near-Duplicate Videos.

Hanli Wang, Ming Ma, Yu-Gang Jiang, Zhihua Wei

International Conference on Multimedia Modeling (MMM), Dublin, Ireland, 2014.

TOMM 2014

Placing Videos on a Semantic Hierarchy for Search Result Navigation.

Song Tan, Yu-Gang Jiang, Chong-Wah Ngo

ACM Transactions on Multimedia Computing, Communications and Applications (TOMM), vol. 10, issue 4, 2014.

TMM 2014

Video Event Detection Using Motion Relativity and Feature Selection.

Feng Wang, Zhanhu Sun, Yu-Gang Jiang, Chong-Wah Ngo

IEEE Transactions on Multimedia (TMM), vol. 16, issue 5, pp. 1303-1315, 2014.

MVA 2014

Discovering Joint Audio-Visual Codewords for Video Event Detection.

I-Hong Jhuo, Guangnan Ye, Shenghua Gao, Dong Liu, Yu-Gang Jiang, D. T. Lee, Shih-Fu Chang

Machine Vision and Applications (MVA), vol. 25, issue 1, pp. 33-47, 2014.

Invited Paper

Name-Face Association in Web Videos: A Large-Scale Dataset, Baselines, and Open Issues.

Zhi-Neng Chen, Chong-Wah Ngo, Wei Zhang, Juan Cao, Yu-Gang Jiang

Invited Paper, Special Issue on Current Advances in NSFC Joint Research Projects.

TIP 2014

Learning Multiple Relative Attributes With Humans in the Loop.

Buyue Qian, Xiang Wang, Nan Cao, Yu-Gang Jiang, Ian Davidson

IEEE Transactions on Image Processing (TIP), vol. 23, issue 12, pp. 5573-5585, 2014.

AAAI 2013

Understanding and Predicting Interestingness of Videos.

Yu-Gang Jiang, Yanran Wang, Rui Feng, Xiangyang Xue, Yingbin Zheng, Hanfang Yang

The 27th AAAI Conference on Artificial Intelligence (AAAI), Bellevue, Washington, USA, 2013.

ICCV 2013

Learning Hash Codes with Listwise Supervision.

Jun Wang, Wei Liu, Andy X. Sun, Yu-Gang Jiang

IEEE International Conference on Computer Vision (ICCV), Sydney, Australia, 2013.

IJCAI 2013

Multiple Task Learning Using Iteratively Reweighted Least Square.

Jian Pu, Yu-Gang Jiang, Jun Wang, Xiangyang Xue

The 23rd International Joint Conference on Artificial Intelligence (IJCAI), Beijing, China, 2013.

ACM MM 2013

Beauty is Here: Evaluating Aesthetics in Videos Using Multimodal Features and Free Training Data.

Yanran Wang, Qi Dai, Rui Feng, Yu-Gang Jiang

ACM International Conference on Multimedia (ACM MM), Barcelona, Spain, 2013.

ACM MM 2013

Strong Geometry Consistency for Large Scale Partial-Duplicate Image Search.

Junqiang Wang, Jinhui Tang, Yu-Gang Jiang

ACM International Conference on Multimedia (ACM MM), Barcelona, Spain, 2013.

MediaEval 2013

The MediaEval 2013 Affect Task: Violent Scenes Detection.

Claire-Helene Demarty, Cedric Penet, Markus Schedl, Bogdan Ionescu, Vu Lam Quang, Yu-Gang Jiang

MediaEval 2013 Workshop, Barcelona, Spain, 2013. (Task Overview Paper)

MediaEval 2013

Fudan at MediaEval 2013: Violent Scenes Detection Using Motion Features and Part-Level Attributes.

Qi Dai, Jian Tu, Ziqiang Shi, Yu-Gang Jiang, Xiangyang Xue

MediaEval 2013 Workshop, Barcelona, Spain, 2013.

IJMIR 2013

High-Level Event Recognition in Unconstrained Videos.

Yu-Gang Jiang, Subhabrata Bhattacharya, Shih-Fu Chang, Mubarak Shah

International Journal of Multimedia Information Retrieval (IJMIR), vol. 2, issue 2, pp. 73-101, 2013.

TMM 2013

Query-Adaptive Image Search with Hash Codes.

Yu-Gang Jiang, Jun Wang, Xiangyang Xue, Shih-Fu Chang

IEEE Transactions on Multimedia (TMM), vol. 15, issue 2, pp. 442-453, 2013.

ECCV 2012

Trajectory-Based Modeling of Human Actions with Motion Reference Points.

Yu-Gang Jiang, Qi Dai, Xiangyang Xue, Wei Liu, Chong-Wah Ngo

European Conference on Computer Vision (ECCV), Firenze, Italy, 2012.

ECCV 2012

Learning Hybrid Part Filters for Scene Recognition.

Yingbin Zheng, Yu-Gang Jiang, Xiangyang Xue

European Conference on Computer Vision (ECCV), Firenze, Italy, 2012.

CVPR 2012

Supervised Hashing with Kernels.

Wei Liu, Jun Wang, Rongrong Ji, Yu-Gang Jiang, Shih-Fu Chang

IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Rhode Island, USA, 2012.

ICMR 2012

SUPER: Towards Real-time Event Recognition in Internet Videos.

Yu-Gang Jiang

ACM International Conference on Multimedia Retrieval (ICMR), Hong Kong, China, 2012.

ACM MM 2012

A Fast Video Event Recognition System and Its Application to Video Search.

Yu-Gang Jiang, Qi Dai, Yingbin Zheng, Xiangyang Xue, Jie Liu, Dong Wang

ACM International Conference on Multimedia (ACM MM), Nara, Japan, 2012. (Demo session)

MediaEval 2012

The Shanghai-Hongkong Team at MediaEval2012: Violent Scene Detection Using Trajectory-based Features.

Yu-Gang Jiang, Qi Dai, Chun Chet Tan, Xiangyang Xue, Chong-Wah Ngo

MediaEval 2012 Workshop, Pisa, Italy, October 4-5, 2012.

ICMR 2012

Joint Audio-Visual Bi-Modal Codewords for Video Event Detection.

Guangnan Ye, I-Hong Jhuo, Dong Liu, Yu-Gang Jiang, D. T. Lee, Shih-Fu Chang

ACM International Conference on Multimedia Retrieval (ICMR), Hong Kong, China, 2012.

TMM 2012

Sampling and Ontologically Pooling Web Images for Visual Concept Learning.

Shiai Zhu, Chong-Wah Ngo, Yu-Gang Jiang

IEEE Transactions on Multimedia (TMM), vol. 14, issue 4, pp. 1068-1078, 2012.

TIP 2012

Fast Semantic Diffusion for Large Scale Context-Based Image and Video Annotation.

Yu-Gang Jiang, Qi Dai, Jun Wang, Chong-Wah Ngo, Xiangyang Xue, Shih-Fu Chang

IEEE Transactions on Image Processing (TIP), vol. 21, issue 6, pp. 3080-3091, 2012.

ACM MM 2011

Towards Textually Describing Complex Video Contents with Audio-Visual Concept Classifiers.

Chun-Chet Tan, Yu-Gang Jiang, Chong-Wah Ngo

ACM International Conference on Multimedia (ACM MM), Arizona, USA, 2011.

ACM MM 2011

On the Pooling of Positive Examples with Ontology for Visual Concept Learning.

Shiai Zhu, Chong-Wah Ngo, Yu-Gang Jiang

ACM International Conference on Multimedia (ACM MM), Arizona, USA, 2011.

ICMR 2011

Consumer Video Understanding: A Benchmark Database and An Evaluation of Human and Machine Performance.

Yu-Gang Jiang, Guangnan Ye, Shih-Fu Chang, Daniel Ellis, Alexander C. Loui

ACM International Conference on Multimedia Retrieval (ICMR), Trento, Italy, 2011.

ICMR 2011

Lost in Binarization: Query-Adaptive Ranking for Similar Image Search with Compact Codes.

Yu-Gang Jiang, Jun Wang, Shih-Fu Chang

ACM International Conference on Multimedia Retrieval (ICMR), Trento, Italy, 2011.

CVPR 2011

Noise Resistant Graph Ranking for Improved Web Image Search.

Wei Liu, Yu-Gang Jiang, Jiebo Luo, Shih-Fu Chang

IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Colorado Springs, 2011.

TRECVID 2011

The MediaMill TRECVID 2011 Semantic Video Search Engine.

Cees G. M. Snoek, Koen E. A. van de Sande, Xirong Li, Masoud Mazloom, Yu-Gang Jiang, Dennis C. Koelma, Arnold W. M. Smeulders

NIST TRECVID Workshop (TRECVID), Gaithersburg, USA, 2011.

TCSVT 2011

Modeling Scene and Object Contexts for Human Action Retrieval with Few Examples.

Yu-Gang Jiang, Zhenguo Li, Shih-Fu Chang

IEEE Transactions on Circuits and Systems for Video Technology (TCSVT), vol. 21, issue 5, pp. 674-681, 2011.

TCSVT 2011

Concept-Driven Multi-Modality Fusion for Video Search.

Xiao-Yong Wei, Yu-Gang Jiang, Chong-Wah Ngo

IEEE Transactions on Circuits and Systems for Video Technology (TCSVT), vol. 21, issue 1, pp. 62-73, 2011.

TMM 2010

Representations of Keypoint-Based Semantic Concept Detection: A Comprehensive Study.

Yu-Gang Jiang, Jun Yang, Chong-Wah Ngo, Alexander G. Hauptmann

IEEE Transactions on Multimedia (TMM), vol. 12, issue 1, pp. 42-53, 2010.

TRECVID 2010

Columbia-UCF TRECVID2010 Multimedia Event Detection: Combining Multiple Modalities, Contextual Concepts, and Temporal Matching.

Yu-Gang Jiang, Xiaohong Zeng, Guangnan Ye, Subh Bhattacharya, Dan Ellis, Mubarak Shah, Shih-Fu Chang

NIST TRECVID Workshop (TRECVID), Gaithersburg, USA, 2010.

CIVR 2010

On the Sampling of Web Images for Learning Visual Concept Classifiers.

Shiai Zhu, Gang Wang, Chong-Wah Ngo, Yu-Gang Jiang

ACM International Conference on Image and Video Retrieval (CIVR), Xi'an, China, 2010.

ACM MM 2009

Brain State Decoding for Rapid Image Retrieval.

Jun Wang, Eric Pohlmeyer, Barbara Hanna, Yu-Gang Jiang, Paul Sajda, Shih-Fu Chang

ACM International Conference on Multimedia (ACM MM), Beijing, China, 2009.

ACM MM 2009

Semantic Context Transfer across Heterogeneous Sources for Domain Adaptive Video Search.

Yu-Gang Jiang, Chong-Wah Ngo, Shih-Fu Chang

ACM International Conference on Multimedia (ACM MM), Beijing, China, 2009.

ICCV 2009

Domain Adaptive Semantic Diffusion for Large Scale Context-Based Video Annotation.

Yu-Gang Jiang, Jun Wang, Shih-Fu Chang, Chong-Wah Ngo

IEEE International Conference on Computer Vision (ICCV), Kyoto, Japan, 2009.

CVPR 2009

Label Diagnosis through Self Tuning for Web Image Search.

Jun Wang, Yu-Gang Jiang, Shih-Fu Chang

IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Miami Beach, 2009.

CIVR 2009

Exploring Inter-Concept Relationship with Context Space for Semantic Video Indexing.

Xiao-Yong Wei, Yu-Gang Jiang, Chong-Wah Ngo

ACM International Conference on Image and Video Retrieval (CIVR), Santorini, Greece, 2009.

TRECVID 2009

VIREO/DVMM at TRECVID 2009: High-Level Feature Extraction, Automatic Video Search, and Content-Based Copy Detection.

Chong-Wah Ngo, Yu-Gang Jiang, Xiao-Yong Wei

NIST TRECVID Workshop (TRECVID), Gaithersburg, USA, 2009.

CVIU 2009

Visual Word Proximity and Linguistics for Semantic Video Indexing and Near-Duplicate Retrieval.

Yu-Gang Jiang, Chong-Wah Ngo

Computer Vision and Image Understanding (CVIU), vol. 113, issue 3, pp. 405-414, 2009.

ACM MM 2008

Video Event Detection Using Motion Relativity and Visual Relatedness.

Feng Wang, Yu-Gang Jiang, Chong-Wah Ngo

ACM International Conference on Multimedia (ACM MM), Vancouver, Canada, 2008.

SIGIR 2008

Bag-of-Visual-Words Expansion Using Visual Relatedness for Video Indexing.

Yu-Gang Jiang, Chong-Wah Ngo

ACM SIGIR Conference on Research & Development on Information Retrieval (SIGIR), Singapore, 2008.

ICME 2008

Ontology-Based Visual Word Matching for Near-Duplicate Retrieval.

Yu-Gang Jiang, Chong-Wah Ngo

IEEE International Conference on Multimedia & Expo (ICME), Hannover, Germany, 2008.

Columbia University ADVENT Technical Report 2008

CU-VIREO374: Fusing Columbia374 and VIREO-374 for Large Scale Semantic Concept Detection.

Yu-Gang Jiang, Akira Yanagawa, Shih-Fu Chang, Chong-Wah Ngo

Columbia University ADVENT Technical Report #223-2008-1, 2008.

TRECVID 2008

Columbia University/VIREO-CityU/IRIT TRECVID2008 High-Level Feature Extraction and Interactive Video Search.

Shih-Fu Chang, Junfeng He, Yu-Gang Jiang, Elie El Khoury, Chong-Wah Ngo, Akira Yanagawa, Eric Zavesky

NIST TRECVID Workshop (TRECVID), Gaithersburg, USA, 2008.

TRECVID 2008

Beyond Semantic Search: What You Observe May Not Be What You Think.

Chong-Wah Ngo, Yu-Gang Jiang, Xiaoyong Wei

NIST TRECVID Workshop (TRECVID), Gaithersburg, USA, 2008.

TMM 2008

Selection of Concept Detectors for Video Search by Ontology-Enriched Semantic Spaces.

Xiao-Yong Wei, Chong-Wah Ngo, Yu-Gang Jiang

IEEE Transactions on Multimedia (TMM), vol. 10, issue 6, pp. 1085-1096, 2008.

CIVR 2007

Towards Optimal Bag-of-Features for Object Categorization and Semantic Video Retrieval.

Yu-Gang Jiang, Chong-Wah Ngo, Jun Yang

ACM International Conference on Image and Video Retrieval (CIVR), Amsterdam, The Netherlands, 2007.

ACM MM 2007

Evaluating Bag-of-Visual-Words Representations in Scene Classification.

Jun Yang, Yu-Gang Jiang, Alexander G. Hauptmann, Chong-Wah Ngo

ACM SIGMM Workshop on Multimedia Information Retrieval (MIR), in conjunction with ACM International Conference on Multimedia (ACM MM), Germany, 2007.

TRECVID 2007

Experimenting VIREO-374: Bag-of-Visual-Words and Visual-Based Ontology for Semantic Video Indexing and Search.

Chong-Wah Ngo, Yu-Gang Jiang, Xiaoyong Wei

NIST TRECVID Workshop (TRECVID), Gaithersburg, USA, 2007.

ACM MM 2006

Fast Tracking of Near-Duplicate Keyframes in Broadcast Domain with Transitivity Propagation.

Chong-Wah Ngo, Wan-Lei Zhao, Yu-Gang Jiang

ACM International Conference on Multimedia (ACM MM), Santa Barbara, CA, USA, 2006.

Asia-Pacific Workshop on Visual Information Processing 2006

Exploring Semantic Concept Using Local Invariant Features.

Yu-Gang Jiang, Wan-Lei Zhao, Chong-Wah Ngo

Asia-Pacific Workshop on Visual Information Processing, Beijing, China, 2006. (Invited paper & talk)

CIVR 2006

Keyframe Retrieval by Keypoints: Can Point-to-Point Matching Help?

Wanlei Zhao, Yu-Gang Jiang, Chong-Wah Ngo

International Conference on Image and Video Retrieval (CIVR), Tempe, USA, 2006.

TRECVID 2006

Modeling Local Interest Points for Semantic Detection and Video Search.

Yu-Gang Jiang, Xiaoyong Wei, Chong-Wah Ngo

NIST TRECVID Workshop (TRECVID), Gaithersburg, USA, 2006.