I have been a PhD student in the Department of Computer Science, The University of Hong Kong (HKU), since 2019, supervised by Prof. Ping Luo and co-supervised by Prof. Wenping Wang. I also work very closely with my friend Wenhai Wang and Prof. Chunhua Shen. I obtained my B.S. from Nanjing University of Aeronautics and Astronautics (2016) and my M.S. from Tongji University (2019). Since 2018, I have collaborated with researchers from industry, including Face++ (Megvii), SenseTime, Facebook, Huawei, and NVIDIA.

My research interest is computer vision. I have worked on instance-level detection and self-, semi-, and weakly supervised learning. I developed a few well-known computer vision algorithms, including PolarMask, which was selected as one of the CVPR 2020 Top-10 Influential Papers. I co-developed OpenSelfSup (1k+ stars), a popular self-supervised learning framework.

Publications and Manuscripts

(* indicates equal contribution)

Conference & Journal

PolarMask: Single Shot Instance Segmentation with Polar Representation

Enze Xie*, Peize Sun*, Xiaoge Song*, Wenhai Wang, Ding Liang, Chunhua Shen, Ping Luo
CVPR2020 (Oral) [arXiv] [code] [Chinese-language explainer] [talk] [CVPR20 Top-10 Influential Papers]
We introduced a new Polar Representation to reformulate instance segmentation.

PolarMask++: Enhanced Polar Representation for Single-Shot Instance Segmentation and Beyond

Enze Xie*, Wenhai Wang*, Mingyu Ding, Ruimao Zhang, Ping Luo
TPAMI2021 [arXiv] [code]
We extend PolarMask (CVPR'20) to several instance-level detection tasks.

PAN++: Towards Efficient and Accurate End-to-End Spotting of Arbitrarily-Shaped Text

Wenhai Wang*, Enze Xie*, Xiang Li, Xuebo Liu, Ding Liang, Zhibo Yang, Tong Lu, Chunhua Shen
TPAMI2021 [arXiv] [code]
We extend PSENet (CVPR'19) and PAN (ICCV'19) to a text spotting system.

Segmenting Transparent Objects in the Wild with Transformer

Enze Xie, Wenjia Wang, Wenhai Wang, Peize Sun, Hang Xu, Ding Liang, Ping Luo
IJCAI2021 [arXiv] [code & dataset]

Segmenting Transparent Objects in the Wild

Enze Xie, Wenjia Wang, Wenhai Wang, Mingyu Ding, Chunhua Shen, Ping Luo
ECCV2020 [arXiv] [code & dataset]

Scene Text Image Super-Resolution in the Wild

Wenjia Wang*, Enze Xie*, Xuebo Liu, Wenhai Wang, Ding Liang, Chunhua Shen, Xiang Bai
ECCV2020 [arXiv] [code & dataset]

Differentiable Hierarchical Graph Grouping for Multi-Person Pose Estimation

Sheng Jin, Wentao Liu, Enze Xie, Wenhai Wang, Chen Qian, Wanli Ouyang, Ping Luo
ECCV2020 [arXiv]

AE TextSpotter: Learning Visual and Linguistic Representation for Ambiguous Text Spotting

Wenhai Wang, Xuebo Liu, Xiaozhong Ji, Enze Xie, Ding Liang, Zhibo Yang, Tong Lu, Chunhua Shen, Ping Luo
ECCV2020 [arXiv] [Project Web]

Efficient and Accurate Arbitrary-Shaped Text Detection with Pixel Aggregation Network

Wenhai Wang*, Enze Xie*, Xiaoge Song, Yuhang Zang, Wenjia Wang, Tong Lu, Gang Yu, Chunhua Shen
ICCV 2019 [arXiv] [code]

Shape Robust Text Detection with Progressive Scale Expansion Network

Wenhai Wang*, Enze Xie*, Xiang Li, Wenbo Hou, Tong Lu, Gang Yu, Shuai Shao
CVPR 2019 [arXiv] [code]

Scene Text Detection with Supervised Pyramid Context Network

Enze Xie*, Yuhang Zang*, Shuai Shao, Gang Yu, Cong Yao, Guangyao Li
AAAI 2019 [arXiv]

Tech Report

Towards Ultra-Resolution Neural Style Transfer via Thumbnail Instance Normalization

Zhe Chen, Wenhai Wang, Enze Xie, Tong Lu, Ping Luo
Tech report, arXiv [arXiv] [code]

Pyramid Vision Transformer: A Versatile Backbone for Dense Prediction without Convolutions

Wenhai Wang, Enze Xie, Xiang Li, Deng-Ping Fan, Kaitao Song, Ding Liang, Tong Lu, Ping Luo, Ling Shao
Tech report, arXiv [arXiv] [code]

DetCo: Unsupervised Contrastive Learning for Object Detection

Enze Xie*, Jian Ding*, Wenhai Wang, Xiaohang Zhan, Hang Xu, Zhenguo Li, Ping Luo
Tech report, arXiv [arXiv] [code]

Unsupervised Pretraining for Object Detection by Patch Reidentification

Jian Ding*, Enze Xie*, Hang Xu, Chenhan Jiang, Zhenguo Li, Ping Luo, Gui-Song Xia
Tech report, arXiv [arXiv] [code]

TransTrack: Multiple-Object Tracking with Transformer

Peize Sun, Yi Jiang, Rufeng Zhang, Enze Xie, Jinkun Cao, Xinting Hu, Tao Kong, Zehuan Yuan, Changhu Wang, Ping Luo
Tech report, arXiv [arXiv] [code]

OneNet: Towards End-to-End One-Stage Object Detection

Peize Sun, Yi Jiang, Enze Xie, Zehuan Yuan, Changhu Wang, Ping Luo
Tech report, arXiv [arXiv] [code]

SelfText Beyond Polygon: Unconstrained Text Detection with Box Supervision and Dynamic Self-Training

Weijia Wu*, Enze Xie*, Ruimao Zhang, Wenhai Wang, Guan Pang, Zhen Li, Hong Zhou, Ping Luo
Tech report, arXiv [arXiv] [code]

1st Place Solutions for OpenImage2019--Object Detection and Instance Segmentation

Yu Liu, Guanglu Song, Yuhang Zang, Yan Gao, Enze Xie, Junjie Yan, Chen Change Loy, Xiaogang Wang
Tech report, arXiv [arXiv]

TextSR: Content-Aware Text Super-Resolution Guided by Recognition

Wenjia Wang*, Enze Xie*, Peize Sun, Wenhai Wang, Lixun Tian, Chunhua Shen, Ping Luo
Tech report, arXiv [arXiv] [code]
An improved version has been accepted to ECCV2020.


NVIDIA Research
2021.03 – Now

Research Intern
Working on the 3D detection, tracking, and forecasting pipeline in autonomous driving with Zhiding Yu, Jose M. Alvarez, and Sanja Fidler

AI Theory Group, HUAWEI Noah's Ark Lab
2020.06 – 2021.02

Research Intern
Working on self-supervised learning and Transformers for dense prediction with Hang Xu, Zhenguo Li, and Ping Luo

Applied Machine Learning (AML) Team, Facebook AI
2020.05 – 2020.07

Research Intern -> Project Co-Operator (due to COVID-19)
Working on weakly and semi-supervised OCR with Guan Pang

General Model Team, SenseTime Research
2019.07 – 2020.03

Research Intern
Working on instance-level detection with Ding Liang

Detection Team, Megvii (Face++) Research
2018.04 – 2019.07

Research Intern
Working on OCR and instance-level detection with Gang Yu


Rank 1 in the National Artificial Intelligence Competition - Remote Sensing Segmentation (bonus 1,000,000 RMB)

Rank 1 in Google Open Images 2019 - Instance Segmentation

Rank 1 in ICDAR 2019 Arbitrary-Shaped Text Detection

Rank 2 in ICDAR 2019 Large-scale Street View Text Detection

Professional Activities


SPC for IJCAI2021


Huawei Noah's Ark Lab - AI Theory Group: "Instance Level Detection and Beyond"

SenseTime: "Self-Supervised Learning for Classification and Beyond"

Microsoft Research Asia (MSRA) VCG: "Polar Representation in Instance Segmentation"

Hong Kong Computer Vision Workshop (HKCVW): "Real-Time Scene Text Detection"

Honours and Awards

Outstanding Master Thesis Award, Tongji University

