Mingkai Zheng

Assistant Professor

Department of Computer Science and Engineering

Southern University of Science and Technology

zhengmk@sustech.edu.cn
Google Scholar
LinkedIn

I am currently an Assistant Professor in the Department of Computer Science and Engineering at Southern University of Science and Technology (SUSTech). I received my Ph.D. in Computer Science from the University of Sydney in 2025, where I was advised by Prof. Chang Xu. During my doctoral studies, I also had the privilege of working with Prof. Samuel Albanie during a research visit to the University of Cambridge, and I collaborated closely with Dr. Shan You at SenseTime Research. Prior to my Ph.D., I earned a Bachelor’s degree from the University of New South Wales in 2018, followed by an Honours degree from the University of Sydney in 2019.

Research Interest

Visual representation learning
Self-supervised, semi-supervised, and supervised learning
Efficient Machine Learning
Multimodal Large Language Models (with an emphasis on visual perception)
Generative Model, World Model
Embodied AI

Recruitment

I am currently looking for self-motivated Research Assistants, Master's, and Ph.D. students with strong coding skills and excellent research intuition to join my team. Successful candidates will engage in cutting-edge research and practical projects. Interested applicants are welcome to send their detailed resumes directly to me.

Working Experience

Assistant Professor, Southern University of Science and Technology (2025.03 – Present)
Machine Learning Engineer Intern, TikTok Sydney (2024.11 – 2025.03)
Computer Vision Research Intern, SenseTime Research (2020.08 – 2024.11)

Publications

Advancing Contrastive Learning for Visual Representation: From Unsupervised to Supervised Paradigms

Mingkai Zheng

University of Sydney, Ph.D. Thesis, 2025

[Link]

SAU: A Dual-Branch Network to Enhance Long-Tailed Recognition via Generative Models

Guangxi Li, Yinsheng Song, Mingkai Zheng* (Corresponding Author)

International Conference on Neural Information Processing (ICONIP), 2024

[Link]

Decoupled Video Generation with Chain of Training-free Diffusion Model Experts

Wenhao Li, Yichao Cao, Xiu Su, Xi Lin, Shan You, Mingkai Zheng, Yi Chen, Chang Xu

arXiv preprint, 2024

[Link]

Weak Augmentation Guided Relational Self-Supervised Learning

Mingkai Zheng, Shan You, Fei Wang, Chen Qian, Changshui Zhang, Xiaogang Wang, Chang Xu

IEEE Transactions on Pattern Analysis and Machine Intelligence (T-PAMI), 2024

[Link]

Knowledge diffusion for distillation

Tao Huang, Yuan Zhang, Mingkai Zheng, Shan You, Fei Wang, Chen Qian, Chang Xu

Advances in Neural Information Processing Systems (NeurIPS), 2023

[Link]

CoNe: Contrast your neighbours for supervised image classification

Mingkai Zheng, Shan You, Lang Huang, Xiu Su, Fei Wang, Chen Qian, Xiaogang Wang, Chang Xu

arXiv preprint, 2023

[Link]

Boosting semi-supervised semantic segmentation with probabilistic representations

Haoyu Xie, Changqi Wang, Mingkai Zheng, Minjing Dong, Shan You, Chong Fu, Chang Xu

AAAI conference on artificial intelligence (AAAI), 2023

[Link]

Can gpt-4 perform neural architecture search?

Mingkai Zheng, Xiu Su, Shan You, Fei Wang, Chen Qian, Chang Xu, Samuel Albanie

arXiv preprint, 2023

[Link]

SimMatchv2: Semi-supervised learning with graph consistency

Mingkai Zheng, Shan You, Lang Huang, Chen Luo, Fei Wang, Chen Qian, Chang Xu

International Conference on Computer Vision (ICCV), 2023

[Link]

Green hierarchical vision transformer for masked image modeling

Lang Huang, Shan You, Mingkai Zheng, Fei Wang, Chen Qian, Toshihiko Yamasaki

Advances in Neural Information Processing Systems (NeurIPS), 2022

[Link]

ViTAS: Vision transformer architecture search

Xiu Su, Shan You, Jiyang Xie, Mingkai Zheng, Fei Wang, Chen Qian, Changshui Zhang, Xiaogang Wang, Chang Xu

European Conference on Computer Vision (ECCV), 2022

[Link]

SimMatch: Semi-supervised learning with similarity matching

Mingkai Zheng , Shan You, Lang Huang, Fei Wang, Chen Qian, Chang Xu

The IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2022

[Link]

Learning where to learn in cross-view self-supervised learning

Lang Huang, Shan You, Mingkai Zheng , Fei Wang, Chen Qian, Toshihiko Yamasaki

The IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2022

[Link]

ReSSL: Relational self-supervised learning with weak augmentation

Mingkai Zheng, Shan You, Fei Wang, Chen Qian, Changshui Zhang, Xiaogang Wang, Chang Xu

Advances in Neural Information Processing Systems (NeurIPS), 2021

[Link]

K-shot nas: Learnable weight-sharing for nas with k-shot supernets

Xiu Su, Shan You, Mingkai Zheng, Fei Wang, Chen Qian, Changshui Zhang, Chang Xu

International Conference on Machine Learning (ICML), 2021

[Link]

Weakly supervised contrastive learning

Mingkai Zheng, Fei Wang, Shan You, Chen Qian, Changshui Zhang, Xiaogang Wang, Chang Xu

International Conference on Computer Vision (ICCV), 2021

[Link]