Qi Lei (雷琦)

Assistant Professor of Mathematics and Data Science, and, by courtesy,
Assistant Professor of Computer Science,
Member of CILVR lab,
Member of Math and Data,
Google DeepMind Faculty,
Courant Institute of Mathematical Sciences and Center for Data Science,
New York University

Email: ql518 at nyu.edu

Research Overview

My research aims to bridge the gap between the theory and practice of modern machine learning algorithms, with a particular interest in AI safety and a focus on data privacy, distributionally robust algorithms, and sample- and parameter-efficient learning.

Recent research highlights: (Weak-to-Strong Generalization), (Data Reconstruction Attack and Defense), (Data and Model Pruning), (Theoretical Foundations of Pre-trained Models)

(Curriculum Vitae, Github, Google Scholar)

Advertisement

I am actively looking for self-motivated and proactive students to work with. You are welcome to send me an email with your CV and a short statement of your research plans/interests. (You may refer to this link to see whether our research interests match.)

For Ph.D. applicants, please apply to Courant Mathematics or the Center for Data Science, whichever you see fit, and mention my name in your application. For students admitted through Courant CS, I can only co-advise rather than serve as sole advisor.

For prospective post-doc applicants, I encourage you to apply for the positions of CDS Faculty Fellow, Courant Instructor, and Flatiron Research Fellow.

News and Announcements

04/2026 Papers accepted at ICML 2026

01/2026 Paper accepted at ICLR 2026

01/2026 Invited talk at NJIT on weak-to-strong generalization

01/2026 Invited talk at LASR workshop@London on Data Reconstruction Attacks and Induced Optimal Defense

09/2025 Papers accepted at NeurIPS 2025

09/2025 Invited talk at Maryland Numerical Analysis Group on Virtues and Pitfalls of Weak-to-Strong Generalization: Intrinsic Dimension and Spurious Correlation

08/2025 Invited talk at the Inaugural Workshop on Frontiers in Statistical Machine Learning on Weak-to-Strong Generalization

07/2025 Invited talk at Inverse Methods for Complex Systems under Uncertainty Workshop on Data Reconstruction Attacks in AI models are Inverse Problems

Selected Papers

(full publication list)

8. Yijun Dong, Yicheng Li, Yunai Li, Jason D. Lee, Qi Lei. “Discrepancies are Virtue: Weak-to-Strong Generalization through Lens of Intrinsic Dimension”, ICML 2025

7. Sheng Liu*, Zihan Wang*, Yuxiao Chen, Qi Lei. “Data Reconstruction Attacks and Defenses: A Systematic Evaluation”, AISTATS 2025

6. Zihan Wang, Jason D. Lee, Qi Lei. “Reconstructing Training Data from Model Gradient, Provably”, AISTATS 2023

5. Baihe Huang*, Kaixuan Huang*, Sham M. Kakade*, Jason D. Lee*, Qi Lei*, Runzhe Wang*, and Jiaqi Yang*. “Optimal Gradient-based Algorithms for Non-concave Bandit Optimization”, NeurIPS 2021

4. Jason D. Lee*, Qi Lei*, Nikunj Saunshi*, Jiacheng Zhuo*. “Predicting What You Already Know Helps: Provable Self-Supervised Learning”, NeurIPS 2021

3. Simon S. Du*, Wei Hu*, Sham M. Kakade*, Jason D. Lee*, Qi Lei*. “Few-Shot Learning via Learning the Representation, Provably”, ICLR 2021

2. Qi Lei*, Lingfei Wu*, Pin-Yu Chen, Alexandros G. Dimakis, Inderjit S. Dhillon, Michael Witbrock. “Discrete Adversarial Attacks and Submodular Optimization with Applications to Text Classification”, SysML 2019 (code, slides)

1. Rashish Tandon, Qi Lei, Alexandros G. Dimakis, Nikos Karampatziakis. “Gradient Coding: Avoiding Stragglers in Distributed Learning”, ICML 2017 (code)