Bolei Zhou

Assistant Professor
Department of Information Engineering, The Chinese University of Hong Kong
Office: Room 717, Ho Sin-Hang Engineering Building
Email:
CVGoogle ScholarTwitterZhihu


Intro

News

All Publications
Selected Publications
GenForce and DecisionForce contain more recent works on generative models and machine autonomy.
2021
DeepMiner: Discovering Interpretable Representations for Mammogram Classification and Explanation
Jimmy Wu, Bolei Zhou, Diondra Peck, Scott Hsieh, Vandana Dialani, Lester Mackey, and Genevieve Patterson.
Harvard Data Science Review (HDSR), 2021. [PDF]
Improving the Generalization of End-to-End Driving through Procedural Generation
Quanyi Li*, Zhenghao Peng*, Qihang Zhang, Cong Qiu, Chunxiao Liu, Bolei Zhou.
arXiv:2012.13681. [PDF][Webpage][Code]
Closed-Form Factorization of Latent Semantics in GANs
Yujun Shen, Bolei Zhou.
CVPR 2021, Oral
[PDF][Webpage][Code]
Generative Hierarchical Features from Synthesizing Images
Yinghao Xu*, Yujun Shen*, Jiapeng Zhu, Ceyuan Yang, Bolei Zhou.
CVPR 2021, Oral
[PDF][Webpage][Code]
Multimodal Motion Prediction with Stacked Transformers
Yicheng Liu*, Jinghuai Zhang*, Liangji Fang, Qinhong Jiang, Bolei Zhou.
CVPR 2021
[PDF][Webpage]
Positional Encoding as Spatial Inductive Bias in GANs
Rui Xu, Xintao Wang, Kai Chen, Bolei Zhou, Chen Change Loy.
CVPR 2021
[PDF]
Instance Localization for Self-supervised Detection Pretraining
Ceyuan Yang, Zhirong Wu, Bolei Zhou, Stephen Lin.
CVPR 2021
[PDF]
Adversarial Inverse Reinforcement Learning with Self-attention Dynamics Model
Jiankai Sun, Lantao Yu, Pinqian Dong, Bo Lu, Bolei Zhou.
ICRA 2021 and IEEE Robotics and Automation Letters (RA-L)[PDF][Webpage][Code]
HiABP: Hierarchical Initialized ABP for Unsupervised Representation Learning
Jiankai Sun, Rui Liu, Bolei Zhou.
AAAI 2021
[PDF]
2020
Improving the Fairness of Deep Generative Models without Retraining
Shuhan Tan, Yujun Shen, Bolei Zhou.
arXiv:2012.04842, 2020. [PDF][Webpage]
Learning a Decision Module by Imitating Driver Control Behaviors
Junning Huang, Sirui Xie, Jiankai Sun, Qiurui Ma, Chunxiao Liu, Dahua Lin, Bolei Zhou.
The Conference on Robot Learning (CoRL), 2020. [PDF]
Neural-Symbolic Program Search: Towards Automatic Autonomous Driving System Design
Jiankai Sun, Hao Sun, Tian Han, Bolei Zhou.
The Conference on Robot Learning (CoRL), 2020. [PDF]
InterFaceGAN: Interpreting the Disentangled Face Representation Learned by GANs
Yujun Shen, Ceyuan Yang, Xiaoou Tang, Bolei Zhou
IEEE TPAMI, Oct 2020
[PDF][Webpage][Colab]
GenForce: Generative Modeling for Everyone
Yujun Shen, Yinghao Xu, Ceyuan Yang, Jiapeng Zhu, Bolei Zhou.
Efficient PyTorch library for deep generative modeling. [Code][Colab]
Understanding the Role of Individual Units in a Deep Neural Network
David Bau, Jun-Yan Zhu, Hendrik Strobelt, Agata Lapedriza, Bolei Zhou, and Antonio Torralba.
Proceedings of the National Academy of Sciences (PNAS), 2020. [PDF]
Non-local Policy Optimization via Diversity-regularized Collaborative Exploration
Zhenghao Peng, Hao Sun, Bolei Zhou
arXiv:2006.07781
[PDF][Webpage][Code]
Semantic Hierarchy Emerges in Deep Generative Representations for Scene Synthesis
Ceyuan Yang*, Yujun Shen*, Bolei Zhou
International Journel on Computer Vision (IJCV), Dec. 2020
[PDF][Webpage][Code]
Cross-view Semantic Segmentation for Sensing Surroundings
Bowen Pan, Jiankai Sun, Ho Yin Tiga Leung, Alex Andonian, Bolei Zhou
IROS 2020 and IEEE Robotics and Automation Letters (RA-L)
[PDF][Webpage][Code]
Novel Policy Seeking with Constrained Optimization
Hao Sun, Zhenghao Peng, Bo Dai, Jian Guo, Dahua Lin, Bolei Zhou
arXiv:2005.10696
[PDF]
In-Domain GAN Inversion for Real Image Editing
Jiapeng Zhu, Yujun Shen, Deli Zhao, Bolei Zhou
ECCV 2020
[PDF][Webpage][Code]
A Unified Framework for Shot Type Classification Based on Subject Centric Lens
Anyi Rao, Jiaze Wang, Linning Xu, Xuekun Jiang, Qingqiu Huang, Bolei Zhou, Dahua Lin
ECCV 2020
[Webpage]
Evolutionary Stochastic Policy Distillation
Hao Sun, Xinyu Pan, Bo Dai, Dahua Lin, Bolei Zhou
arXiv:2004.12909
[PDF][Code]
Temporal Pyramid Network for Action Recognition
Ceyuan Yang*, Yinghao Xu*, Jianping Shi, Bo Dai, Bolei Zhou
CVPR 2020
[PDF][Webpage][Code]
Interpreting Latent Space of GANs for Semantic Face Editing
Yujun Shen, Jinjin Gu, Xiaoou Tang, Bolei Zhou
CVPR 2020
[PDF][Webpage]
Image Processing Using Multi-Code GAN Prior
Jinjin Gu, Yujun Shen, Bolei Zhou
CVPR 2020
[PDF][Webpage]
TPNet: Trajectory Proposal Network for Motion Prediction
Liangji Fang, Qinhong Jiang, Jianping Shi, Bolei Zhou.
CVPR 2020
[PDF][Webpage]
A Local-to-Global Approach to Multi-modal Movie Scene Segmentation
Anyi Rao, Linning Xu, Yu Xiong, Guodong Xu, Qingqiu Huang, Bolei Zhou, Dahua Lin
CVPR 2020
[PDF][Webpage][Code]
Video Motion Retargeting via Invariance-Driven Unsupervised Representation Disentanglement
Zhuoqian Yang, Wentao Zhu, Wayne Wu, Chen Qian, Qiang Zhou, Bolei Zhou, Chen Change Loy
CVPR 2020
[PDF][Webpage]
Every Frame Counts: Joint Learning of Video Segmentation and Optical Flow
Mingyu Ding, Zhe Wang, Bolei Zhou, Jianping Shi, Zhiwu Lu, Ping Luo
AAAI 2020
[PDF]
2019
Policy Continuation with Hindsight Inverse Dynamics
Hao Sun, Zhizhong Li, Xiaotong Liu, Dahua Lin, Bolei Zhou
NeurIPS 2019, Spotlight
[PDF][Webpage]
Seeing What a GAN Cannot Generate
David Bau, Jun-Yan Zhu, Jonas Wulff, William Peebles, Hendrik Strobelt, Bolei Zhou, Antonio Torralba
ICCV 2019, Oral
[PDF][Webpage]
A Graph-Based Framework to Bridge Movies and Synopses
Yu Xiong, Qingqiu Huang, Lingfen Guo, Hang Zhou, Bolei Zhou, Dahua Lin.
ICCV 2019, Oral
[PDF]
Reasoning About Human-Object Interactions Through Dual Attention Networks
Tete Xiao, Quanfu Fan, Dan Gutfreund, Mathew Monfort, Aude Oliva, Bolei Zhou
ICCV 2019
[PDF][Webpage]
Semantic Photo Manipulation with a Generative Image Prior
David Bau, Hendrik Strobelt, William Peebles, Jonas Wulff, Bolei Zhou, Jun-Yan Zhu, Antonio Torralba
SIGGRAPH 2019
[PDF][Webpage][Live Demo][MIT News]
Deep Flow-Guided Video Inpainting
Rui Xu, Xiaoxiao Li, Bolei Zhou, Chen Change Loy
CVPR 2019
[PDF][Webpage][Code]
DrivingStereo: A large-scale dataset for stereo matching in autonomous driving scenarios.
Guorun Yang*, Xiao Song*, Chaoqin Huang, Zhidong Deng, Jianping Shi, Bolei Zhou.
CVPR 2019
[PDF][Dataset]
GAN Dissection: Visualizing and Understanding Generative Adversarial Networks.
David Bau, Jun-Yan Zhu, Hendrik Strobelt, Bolei Zhou, Joshua B. Tenenbaum, William T. Freeman, Antonio Torralba.
ICLR 2019.
[PDF][Webpage][Code]
Discovering place-informative scenes and objects using social media photos.
Fan Zhang, Bolei Zhou, Carlo Ratti, Yu Liu.
Royal Society Open Science, 2019
[PDF]
Measuring human perceptions of a large-scale urban region using machine learning.
Fan Zhang, Bolei Zhou, Liu Liu, Yu Liu, Helene Fung, Hui Lin, Carlo Ratti.
Landscape and Urban Planning, 2018
[PDF]
Moments in Time Dataset: one million videos for event understanding.
Mathew Monfort, Alex Andonian, Bolei Zhou, Sarah Adel Bargal, Tom Yan, Kandan Ramakrishnan, Lisa Brown, Quanfu Fan, Dan Gutfreund, Carl Vondrick, Aude Oliva.
IEEE Transaction on Pattern Analysis and Machine Intelligence, March 2019.
[PDF][Website][Code+Model]
2018
Semantic Understanding of Scenes through ADE20K Dataset.
Bolei Zhou, Hang Zhao, Xavier Puig, Tete Xiao, Sanja Fidler, Adela Barriuso and Antonio Torralba.
International Journal on Computer Vision (IJCV), 2018.
[PDF][Dataset][Pretrained Models][Benchmark Page][Demo]
Revisiting the Importance of Individual Units in CNNs via Ablation.
Bolei Zhou, Yiyou Sun, David Bau, and Antonio Torralba
arXiv:1806.02891, 2018.
[arXiv]
Temporal Relational Reasoning in Videos.
Bolei Zhou, Alex Andonian, Aude Oliva, and Antonio Torralba
ECCV 2018.
[PDF][arXiv][Webpage][Demo Video][Code][MIT News]
Interpretable Basis Decomposition for Visual Explanation.
Bolei Zhou*, Yiyou Sun*, David Bau*, Antonio Torralba.
ECCV 2018.
[PDF][Code]
Unified Perceptual Parsing for Scene Understanding.
Tete Xiao*, Yingcheng Liu*, Bolei Zhou*, Yuning Jiang, Jian Sun
ECCV 2018.
[PDF][Code & Data]
Single Image Intrinsic Decomposition without a Single Intrinsic Image.
Wei-Chiu Ma, Hang Chu, Bolei Zhou, Raquel Urtasun, Antonio Torralba.
ECCV 2018.
[PDF]
Factorizable Net: An Efficient Subgraph based Framework for Scene Graph Generation.
Yikang Li, Wanli Ouyang, Bolei Zhou, Yawen Cui, Jianping Shi, Xiaogang Wang.
ECCV 2018.
[PDF]
Real-Time Object Pose Estimation with Pose Interpreter Networks.
Jimmy Wu, Bolei Zhou, Rebecca Russell, Vincent Kee, Syler Wagner, Mitchell Hebert, Antonio Torralba, and David M.S. Johnson
IROS 2018.
[PDF][Code][Video]
Interpretable Representation Learning for Visual Intelligence.
Bolei Zhou
PhD thesis submitted to MIT EECS, May 17, 2018.
Committee: Antonio Torralba, Aude Oliva, Bill Freeman.
[PDF][Defense Talk]
DeepMiner: Discovering Interpretable Representations for Mammogram Classification and Explanation.
Jimmy Wu, Bolei Zhou, Diondra Peck, Scott Hsieh, Vandana Dialani, Vasilis Syrgkanis, Lester Mackey, and Genevieve Patterson
arXiv:1805.12323, 2018.
[arXiv]
Expert identification of visual primitives used by CNNs during mammogram classification.
Jimmy Wu, Diondra Peck, Scott Hsieh, Vandana Dialani, Constance D. Lehman, Bolei Zhou, Vasilis Syrgkanis, Lester Mackey, and Genevieve Patterson
SPIE Medical Imaging, 2018.
[PDF]
Visual Question Generation as Dual Task of Visual Question Answering.
Yikang Li, Nan Duan, Bolei Zhou, Xiao Chu, Wanli Ouyang, and Xiaogang Wang
CVPR 2018, spotlight.
[arXiv][Webpage][Code]
Recurrent Residual Module for Fast Inference in Videos.
Bowen Pan, Wuwei Lin, Xiaolin Fang, Chaoqin Huang, Bolei Zhou, Cewu Lu
CVPR 2018.
[arXiv]
Interpreting Deep Visual Representations via Network Dissection.
Bolei Zhou*, David Bau*, Aude Oliva, and Antonio Torralba.
IEEE Transactions on Pattern Analysis and Machine Intelligence, June 2018. *-indicates equal contributions
[arXiv][Webpage][Code]
2017
Places: A 10 Million Image Database for Scene Recognition.
Bolei Zhou, Agata Lapedriza, Aditya Khosla, Aude Oliva, and Antonio Torralba.
IEEE Transactions on Pattern Analysis and Machine Intelligence, July 2017.
[PDF][Places2 Dataset][Challenge Page][Places365 CNN models][Demo]
Scene Graph Generation from Objects, Phrases and Region Captions.
Yikang Li, Wanli Ouyang, Bolei Zhou, Kun Wang, and Xiaogang Wang
ICCV 2017.
[PDF][Code]
Open Vocabulary Scene Parsing.
Hang Zhao, Xavier Puig, Bolei Zhou, Sanja Fidler, and Antonio Torralba
ICCV 2017.
[PDF][arXiv][Webpage]
Scene Parsing through ADE20K Dataset.
Bolei Zhou, Hang Zhao, Xavier Puig, Sanja Fidler, Adela Barriuso and Antonio Torralba.
CVPR 2017.
[PDF][Dataset][Benchmark Page][Challenge Page][Toolkit&Code][Demo]
Network Dissection: Quantifying Interpretability of Deep Visual Representations.
David Bau*, Bolei Zhou*, Aditya Khosla, Aude Oliva, and Antonio Torralba.
CVPR 2017. as oral. *-indicates equal contribution.
[PDF][arXiv][webpage][code][Talk Video]
Person Search with Natural Language Description.
Shuang Li, Tong Xiao, Hongsheng Li, Bolei Zhou, Dayu Yue, and Xiaogang Wang.
CVPR 2017.
[PDF][Dataset]
SegICP: Integrated Deep Semantic Segmentation and Pose Estimation.
J. Wong, V. Kee, T. Le, S.Wagner, G. Mariottini, A. Schneider, L. Hamilton, R. Chiaplkatty, M. Herbert, D. Johnson
J. Wu, B. Zhou, and A. Torralba.
IROS 2017, Oral
[PDF]
2016
Learning Deep Features for Discriminative Localization.
Bolei Zhou, Aditya Khosla, Agata Lapedriza, Aude Oliva, and Antonio Torralba
CVPR 2016.
[PDF] [arXiv][Project Page][Video of CNN shifting its attention]
Optimization as Estimation with Gaussian Processes in Bandit Settings.
Zi Wang, Bolei Zhou, Stephanie Jegelka
AISTATS 2016, Oral.
[PDF][Project][Code]
2015
Understanding Intra-Class Knowledge inside CNN.
Donglai Wei, Bolei Zhou, Antonio Torralba, William Freeman
arXiv:1507.02379, 2015.
[PDF][Page][Code]
Simple Baseline for Visual Question Answering.
Bolei Zhou, Yuandong Tian, Sainbar Suhkbaatar, Arthur Szlam, Rob Fergus
arXiv:1512.02167, 2015.
[PDF][Demo][Code]
Object Detectors Emerge in Deep Scene CNNs.
Bolei Zhou, Aditya Khosla, Agata Lapedriza, Aude Oliva, and Antonio Torralba
ICLR 2015, Oral.
[PDF][Project Page][More Visualization][Code]
ConceptLearner: Discovering Visual Concepts from Weakly Labeled Image Collections.
Bolei Zhou, Vignesh Jagadeesh, and Robinson Piramuthu
CVPR 2015.
[PDF][Project Page & Demo]
2014
Learning Deep Features for Scene Recognition using Places Database.
Bolei Zhou, Agata Lapedriza, Jianxiong Xiao, Antonio Torralba, and Aude Oliva
NIPS 2014, Spotlight
[PDF][Project Page][Demo]
Recognizing City Identity via Attribute Analysis of Geo-tagged Images.
Bolei Zhou, Liu Liu, Aude Oliva and Antonio Torralba
ECCV 2014
[PDF][Project Page]
Liu Liu, Bolei Zhou, Jinhua Zhao, Brent D. Ryan
C-IMAGE: City Cognitive Mapping through Geo-tagged Photos
GeoJournal, Springer, 2016.
[PDF]
Measuring Crowd Collectiveness.
Bolei Zhou, Xiaoou Tang, Hepeng Zhang and Xiaogang Wang
IEEE transaction on Pattern Analysis and Machine Intelligence (PAMI), 2014.
CVPR 2013, Oral
[PDF(CVPR)][PDF(TPAMI)][Project Page]
2013 and earlier
Learning Collective Crowd Behaviors with Dynamic Pedestrian-Agents.
Bolei Zhou, Xiaoou Tang and Xiaogang Wang.
International Journal of Computer Vision (IJCV), 2014.
CVPR 2012, Oral
[PDF(CVPR)] [PDF(IJCV)][Project Page]
Coherent Filtering: Detecting Coherent Motions from Crowd Clutters.
Bolei Zhou, Xiaoou Tang and Xiaogang Wang.
ECCV 2012.
[PDF] [Project Page]
Random Field Topic Model for Semantic Region Analysis in Crowded Scenes from Tracklets.
Bolei Zhou, Xiaogang Wang and Xiaoou Tang.
CVPR 2011
[PDF][Project Page]

Students

Teachings

Honors

Services

  • Area Chairs for ICCV'21, CVPR'21, AAAI'21(SPC), BMVC'21, WACV'22, CVPR'22, AAAI'22(SPC)
  • Publicity Chair for ICCV'19
  • Associate Editor for Pattern Recognition

Personal

I like outdoor activities skiing in winter and rock climbing in summer. I also enjoy reading (see my reading list).