Zhuangdi Zhu

Assistant Professor (Tenure-Track)

George Mason University

Biography

I am Zhu, Zhuangdi (朱庄翟). I am an assistant professor at the Department of Cyber Security Engineering of George Mason University. Prior to that I worked as a senior Data & Applied Scientist for Microsoft. I received my Ph.D. degree from the Department of Computer Science and Engineering, Michigan State University, advised by Dr. Jiayu Zhou.

My research focuses on making AI models safe and aligned. Our lab studies how foundation models comprehend intent, govern knowledge, and behave in real-world settings.

📢📢 Prospective Ph.D. students and research interns: Please email me your CV, transcript, and a Statement of Purpose if you are interested!

Interests

Foundation Model Reasoning and Alignment
Knowledge Transfer
Federated Learning
Reinforcement Learning
Robustness, Fairness, Privacy, and Security for AI
Wireless Networking; IoT; Edge Computing

Education

PhD in Computer Science, 2017 - 2022
Michigan State University
BSc in Computer Science, 2011 - 2015
Nanjing University of Science and Technology

Professional Activities

News

June, 2026: 📃 Our paper regarding benchmarking LLMs for proving robotic path planning optimality has been accepted by IEEE IROS 2026. Congratulations to Zhengbang and my collaborator Dr. Wei!
June, 2026: Invited Talk at NAIRR Pilot AI Unlocked Workshop 2026 about Mentoring Graduate Students in the Age of AI. [photo]
May, 2026: 🎉 Grateful to receive an award from the NAIRR Pilot to support collaborative research on AI for Evolutionary Biology.
April, 2026: 📃 Check out our new preprint on multi-objective LLM unlearning.
March, 2026: 📃 Check out our new preprint paper about benchmarking LLMs for proving robotic path planning optimality.
Jan, 2026: 📃 Our paper DUET: Distilled LLM Unlearning from an Efficiently Contextualized Teacher has been accepted by ICLR 2026!
Jan, 2026: 📃 Our paper Dialogue is Better Than Monologue: Instructing Medical LLMs via Strategical Conversations has been accepted by EACL 2026.
Dec, 2025: 📃 Our paper CATNIP: LLM Unlearning via Calibrated and Tokenized Negative Preference Alignment has been accepted by ResponsibleFM Workshop @ NeurIPS 2025!
Nov, 2025: 📃 Our paper about Attacks and Defenses in Federated Learning has been accepted by IEEE CSCLOUD 2025.
Aug, 2025: 📃 Our long paper, Web Intellectual Property at Risk: Preventing Unauthorized Real-Time Retrieval by Large Language Models, has been accepted by the EMNLP Main Conference. Check out the arXiv version here. Congratulations to the leading students Yisheng and Yizhu, and my collaborator Dr. Hanqing Guo!
Aug, 2025: Our poster about Preventing Unauthorized Real-Time Retrieval by LLMs has been accepted for presentation at USENIX Security 2025. We will present at the poster session on Aug 13th in Seattle, WA.
June, 2025: 🎉 Grateful to receive the NVIDIA Academic Grant Program Award to support our research on LLM unlearning.
May, 2025: 📃 Our paper about Class-Granular Attacks and Robust Defense in Federated Learning has been accepted by FedKDD 2025.
May, 2025: 📃 Our paper about Hierarchical Federated Unlearning for Large Language Models has been accepted by FedKDD 2025.
May, 2025: 📃 One paper got accepted by KDD 2025 Workshop SciSocLLM (PDF).
Mar, 2025: 🎉 Our Workshop on Federated Learning for Data Mining and Graph Analytics (FedKDD) has been accepted by KDD 2025.
Mar, 2025: 🎤 Invited Talk about Federated Learning at the George Washington University ECE Colloquium.
Feb, 2025: Checkout our preprint about AI-Powered Engaging Conversations for Enhancing Senior Cognitive Wellbeing.
Dec, 2024: 🎉 I am grateful to receive a Grant from CCI (The Commonwealth Cyber Initiative) on Secure and Privacy-Conscious Threat Detection via Federated Learning and GNN. Thanks to CCI and my collaborator Dr. Wajih Ul Hassan from University of Virginia.
Nov, 2024: 🎉 I am grateful to receive the NAIRR Pilot Program Grant.
Oct, 2024: Invited talk at CCI AI for Cybersecurity Workshop on Trustworthy Federated Learning.
Aug, 2024: 🎉 Two PhD students, Zhengbang Yang and Eason Zhong have joined my research lab.
May, 2024: 🆚 Invited debate at ASCIS on Teaching in AI Era: Challenges and Opportunities (and yes, we won the championship! :P).
April, 2024: 📢 Call for Participation: Please join our first International Joint Workshop on Federated Learning for Data Mining and Graph Analytics, co-located with KDD2024, August 25-26th, at Barcelona.
April, 2024: Our survey paper on Topology-aware Federated Learning in Edge Computing is accepted by ACM Computing Surveys and selected in the ACM Showcase.
Jan, 2024: I joined GMU as an assistant professor.
August, 2023 🎉 We hosted a KDD workshop on federated learning for distributed data mining (FL4Data-Mining). Check more details at fl4data-mining.github.io.
June, 2023 🎉 Our survey paper, Transfer Learning in Deep Reinforcement Learning has been accepted for publication in the IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI) journal.
Feb, 2023: Check out our preprint paper about Topology-aware Federated Learning in Edge Computing.
Sep, 2022: I joined Microsoft as a Senior Data & Applied Scientist.
Aug, 2022: Our paper about Robust Unsupervised Domain Adaptation has been accepted by ICDM 2022 [paper].
May, 2022: Our paper about Resilient and Communication Efficient Federated Learning has been accepted by ICML 2022 [paper].
Dec, 2021: Our paper about Self-Adaptive Imitation Learning has been accepted by AAAI 2022 [paper].
June, 2021: I joined the Ads Core Machine Learning team of Meta as a PhD SDE intern.
May, 2021: Our paper about Knowledge Transfer in Federated Learning has been accepted by ICML 2021 [paper] [code].
May, 2021: Our paper about Debiasing in Federated Learning has been accepted by KDD 2021 [paper] [project].
Sep, 2020 Our paper about Imitation Learning has been accepted by NeurIPS 2020 [paper] [code].

Invited Talks

Aug, 2023, Invited talk on AI2Healthcare. [video]
Jan, 2023, Invited talk at GMU: Knowledge Distillation for Efficient Learning in Heterogeneous Federated Systems.
Dec, 2022, Invited talk at UT Austin: Efficient Knowledge Transfer for Heterogeneous Machine Learning Domains.
ICML 2022 Spotlight Presentation: Resilient and Communication Efficient Learning for Heterogeneous Federated Systems. [video]
AAAI 2022 Short Presentation: Self Adaptive Imitation Learning: Learning Sparse Rewarded Tasks from Sub-Optimal Demonstrations.
ICML 2021 Poster Presentation: Data-free knowledge Distillation for Heterogeneous Federated Learning.
NeurIPS 2020 Poster Presentation: Off-Policy Imitation Learning from Observations. [slides].

Services

Program Chair:
Review Panel:
- NAIRR Pilot, 2024 - Present
Session Chair:
- 29th ACM SIGKDD Conference On Knowledge Discovery and Data Mining (KDD), 2023
Program Committee Member:
- AAAI Conference on Artificial Intelligence (AAAI), 2021-2023
- 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD), 2022
Conference Reviewer:
- Conference on Neural Information Processing Systems (NeurIPS), 2021 - 2023
- International Conference on Machine Learning (ICML), 2021 -2023
- AAAI Conference on Artificial Intelligence (AAAI), 2020 - 2023
- International Conference on Learning Representations (ICLR), 2022 - 2023
- ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD), 2021 - 2023
- IEEE International Conference on Robotics and Automation (ICRA), 2022 - 2023
- IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 2022 - 2023
Journal Reviewer:
- IEEE TPAMI, 2022
- IEEE Network Magazine, 2021 - 2022
- IEEE Journal of Automatica Sinica, 2022
- IEEE Robotics and Automation Letters, 2021 - 2022
- NeuroComputing, 2020 -2023
- Information Sciences, 2021 - 2022

Teaching

GMU CYSE 686: Introduction to Federated Learning (Spring 2024 - 2026)
GMU CYSE 550: Cyber Security Engineering Fundamentals (Fall 2024)
GMU CYSE 499-004: Machine Learning and Artificial Intelligence (Fall 2025, Summer 2026)

During my PhD program, I served as a teaching assistant for the following courses at MSU. I enjoy helping students master skills on analytical thinking, mathematics, and programming.

MSU CSE 847: Machine Learning (Spring 2020, Spring 2021)
- Volunteer teaching assistant for graduate-level machine learning class.
- Instructor for pre-exam Q & A lab sessions.
- Proposed lecture materials for CSE 847 advanced topics including Reinforcement Learning and Federated Learning.
MSU CSE 231: Introduction to Programming (Spring 2017, Spring 2018, Fall 2018)
- Instructor for weekly lab sessions to teach Python programming techniques.
- Tutor for weekly in-person Q & A sessions for hundreds of students.
- Designed homework projects about Python data structures, including Class and String.
MSU CSE 260: Discrete Structures in Computer Science (Fall 2017)
- Teaching assistant for undergraduate-level classes; Served for grading, office-hours, and Q & A sessions.

Team

Yisheng (Eason) Zhong

Homepage · LinkedIn · Google Scholar

Zhengbang Yang

Homepage · LinkedIn · Google Scholar

Yumeng Zhang

Experience

Senior Data & Applied Scientist

Microsoft

Sep 2022 – Oct 2023 WA, USA

Revolutionizing AI-powered Search Engine with Large Language Models.

PhD Intern - Machine Learning Track

Meta (Facebook)

May 2021 – Aug 2021 WA, USA

Improved facebook users’ long-term engagement via Reinforcement Learning.

Reasearch Associate

CyberX

Jan 2019 – May 2019 Beijing, China

Improved digital market making with AI empowered risk prediction.

PhD Intern

Google

May 2018 – Aug 2018 CA, USA

Built HCI applications with one-handed guesture recognition.

Research Intern

IBM

Jan 2015 – May 2015 Beijing, China

Desigend scheduling algorithms for an FPGA cloud system.

Selected Publications

Please check Google Scholar for my complete publications

Quickly discover relevant content by filtering publications.

Zhengbang Yang, Yisheng Zhong, Junyuan Hong, Zhuangdi Zhu

February 2026 ResponsibleFM @ NeurIPS 2025

CATNIP: LLM Unlearning via Calibrated and Tokenized Negative Preference Alignment

Pretrained knowledge memorized in LLMs raises critical concerns over safety and privacy, which has motivated LLM Unlearning as a technique for selectively removing the influences of undesirable knowledge. Existing approaches, rooted in Gradient Ascent (GA), often degrade general domain knowledge while relying on retention data or curated contrastive pairs, which can be either impractical or data and computationally prohibitive. Negative Preference Alignment has been explored for unlearning to tackle the limitations of GA, which, however, remains confined by its choice of reference model and shows undermined performance in realistic data settings. These limitations raise two key questions: i) Can we achieve effective unlearning that quantifies model confidence in undesirable knowledge and uses it to calibrate gradient updates more precisely, thus reducing catastrophic forgetting? ii) Can we make unlearning robust to data scarcity and length variation? We answer both questions affirmatively with CATNIP (Calibrated and Tokenized Negative Preference Alignment), a principled method that rescales unlearning effects in proportion to the model’s token-level confidence, thus ensuring fine-grained control over forgetting.

Yisheng Zhong, Zhengbang Yang, Zhuangdi Zhu

January 2026 ICLR 2026

DUET: Distilled LLM Unlearning from an Efficiently Contextualized Teacher

LLM unlearning is a technique to remove the impacts of undesirable knowledge from the model without retraining from scratch, which is indispensable towards trustworthy AI. Existing unlearning methods face significant limitations: conventional tuning-based unlearning is computationally heavy and prone to catastrophic forgetting. In contrast, in-contextualized unlearning is lightweight for precise unlearning but vulnerable to prompt removal or reverse engineering attacks. In response, we propose Distilled Unlearning from an Efficient Teacher (DUET), a novel distillation-based unlearning method that combines the merits of these two lines of work.

Yisheng Zhong, Zhengbang Yang, Zhuangdi Zhu

October 2025 FedKDD 2025

Hierarchical Federated Unlearning for Large Language Models

Large Language Models (LLMs) are increasingly integrated into real-world applications, raising concerns about privacy, security and the need to remove undesirable knowledge. We propose a federated unlearning approach for LLMs that is scalable and privacy preserving, with task-specific adapter learning and hierarchical merging.

Yisheng Zhong, Yizhu Wen, Junfeng Guo, Mehran Kafai, Heng Huang, Hanqing Guo, Zhuangdi Zhu

May 2025 EMNLP 2025

Web Intellectual Property at Risk: Preventing Unauthorized Real-Time Retrieval by Large Language Models

The protection of cyber Intellectual Property (IP) such as web content is an increasingly critical concern. The rise of large language models (LLMs) with online retrieval capabilities enables convenient access to information but often undermines the rights of original content creators. In response, we propose a novel defense framework that empowers web content creators to safeguard their web-based IP from unauthorized LLM real-time extraction and redistribution.

Zhengbang Yang, Junyuan Hong, Yijiang Pang, Jiayu Zhou, Zhuangdi Zhu

February 2025 SciSocLLM @ KDD 2025

ChatWise: AI-Powered Engaging Conversations for Enhancing Senior Cognitive Wellbeing

Cognitive health in older adults presents a growing challenge. We propose a strategy-guided AI chatbot named ChatWise that follows a dual-level conversation reasoning framework with macro-level strategy planning and micro-level utterance generation.

Zhuangdi Zhu, Kaxiang Lin, Anil K. Jain, Jiayu Zhou

September 2023 TPAMI

Transfer Learning in Deep Reinforcement Learning: A Survey

Reinforcement learning is a learning paradigm for solving sequential decision-making problems. Recent years have witnessed remarkable progress in reinforcement learning upon the fast development of deep neural networks. Along with the promising prospects of reinforcement learning in numerous domains such as robotics and game-playing, transfer learning has arisen to tackle various challenges faced by reinforcement learning, by transferring knowledge from external expertise to facilitate the efficiency and effectiveness of the learning process. In this survey, we systematically investigate the recent progress of transfer learning approaches in the context of deep reinforcement learning. Specifically, we provide a framework for categorizing the state-of-the-art transfer learning approaches, under which we analyze their goals, methodologies, compatible reinforcement learning backbones, and practical applications. We also draw connections between transfer learning and other relevant topics from the reinforcement learning perspective and explore their potential challenges that await future research progress.

Zhuangdi Zhu, Junyuan Hong, Steve Drew, Jiayu Zhou

June 2022 ICML

Resilient and Communication Efficient Learning for Heterogeneous Federated Systems

The rise of Federated Learning (FL) is bringing machine learning to edge computing by utilizing data scattered across edge devices. However, the heterogeneity of edge network topologies and the uncertainty of wireless transmission are two major obstructions of FL’s wide application in edge computing, leading to prohibitive convergence time and high communication cost. In this work, we propose an FL scheme to address both challenges simultaneously. Specifically, we enable edge devices to learn self-distilled neural networks that are readily prunable to arbitrary sizes, which capture the knowledge of the learning domain in a nested and progressive manner. Not only does our approach tackle system heterogeneity by serving edge devices with varying model architectures, but it also alleviates the issue of connection uncertainty by allowing transmitting part of the model parameters under faulty network connections, without wasting the contributing knowledge of the transmitted parameters. Extensive empirical studies show that under system heterogeneity and network instability, our approach demonstrates significant resilience and higher communication efficiency compared to the state-of-the-art.

Zhuangdi Zhu, Kaixiang Lin, Bo Dai, Jiayu Zhou

June 2022 AAAI

Self-Adaptive Imitation Learning: Learning Tasks with Delayed Rewards from Sub-Optimal Demonstrations.

Reinforcement learning (RL) has demonstrated its superiority in solving sequential decision-making problems. However, heavy dependence on immediate reward feedback impedes the wide application of RL. On the other hand, imitation learning (IL) tackles RL without relying on environmental supervision by leveraging external demonstrations. In practice, however, collecting sufficient expert demonstrations can be prohibitively expensive, yet the quality of demonstrations typically limits the performance of the learning policy. To address a practical scenario, in this work, we propose SelfAdaptive Imitation Learning (SAIL), which, provided with a few demonstrations from a sub-optimal teacher, can perform well in RL tasks with extremely delayed rewards, where the only reward feedback is trajectory-wise ranking. SAIL bridges the advantages of IL and RL by interactively exploiting the demonstrations to catch up with the teacher and exploring the environment to yield demonstrations that surpass the teacher. Extensive empirical results show that not only does SAIL significantly improve the sample efficiency, but it also leads to higher asymptotic performance across different continuous control tasks, compared with the state-of-the-art.

Junyuan Hong, Zhuangdi Zhu, Shuyang Yu, Zhangyang Wang, Hiroko H Dodge, Jiayu Zhou

August 2021 KDD

Federated adversarial debiasing for fair and transferable representations

Federated learning is a distributed learning framework that is communication efficient and provides protection over participating users’ raw training data. One outstanding challenge of federate learning comes from the users’ heterogeneity, and learning from such data may yield biased and unfair models for minority groups. While adversarial learning is commonly used in centralized learning for mitigating bias, there are significant barriers when extending it to the federated framework. In this work, we study these barriers and address them by proposing a novel approach Federated Adversarial DEbiasing (FADE). FADE does not require users’ sensitive group information for debiasing and offers users the freedom to optout from the adversarial component when privacy or computational costs become a concern. We show that ideally, FADE can attain the same global optimality as the one by the centralized algorithm. We then analyze when its convergence may fail in practice and propose a simple yet effective method to address the problem. Finally, we demonstrate the effectiveness of the proposed framework through extensive empirical studies, including the problem settings of unsupervised domain adaptation and fair learning.

Zhuangdi Zhu, Junyuan Hong, Jiayu Zhou

June 2021 ICML

Data-Free Knowledge Distillation for Heterogeneous Federated Learning

Federated Learning (FL) is a decentralized machine-learning paradigm, in which a global server iteratively averages the model parameters of local users without accessing their data. User heterogeneity has imposed significant challenges to FL, which can incur drifted global models that are slow to converge. Knowledge Distillation has recently emerged to tackle this issue, by refining the server model using aggregated knowledge from heterogeneous users, other than directly averaging their model parameters. This approach, however, depends on a proxy dataset, making it impractical unless such a prerequisite is satisfied. Moreover, the ensemble knowledge is not fully utilized to guide local model learning, which may in turn affect the quality of the aggregated model. Inspired by the prior art, we propose a data-free knowledge distillation approach to address heterogeneous FL, where the server learns a lightweight generator to ensemble user information in a data-free manner, which is then broadcasted to users, regulating local training using the learned knowledge as an inductive bias. Empirical studies powered by theoretical implications show that our approach facilitates FL with better generalization performance using fewer communication rounds, compared with the state-of-the-art.

Zhuangdi Zhu, Kaixiang Lin, Bo Dai, Jiayu Zhou

June 2020 NuerIPs

Off-Policy Imitation Learning from Observations

Learning from Observations (LfO) is a practical reinforcement learning scenario from which many applications can benefit through the reuse of incomplete resources. Compared to conventional imitation learning (IL), LfO is more challenging because of the lack of expert action guidance. In both conventional IL and LfO, distribution matching is at the heart of their foundation. Traditional distribution matching approaches are sample-costly which depend on on-policy transitions for policy learning. Towards sample-efficiency, some off-policy solutions have been proposed, which, however, either lack comprehensive theoretical justifications or depend on the guidance of expert actions. In this work, we propose a sample-efficient LfO approach which enables off-policy optimization in a principled manner. To further accelerate the learning procedure, we regulate the policy update with an inverse action model, which assists distribution matching from the perspective of mode-covering. Extensive empirical results on challenging locomotion tasks indicate that our approach is comparable with state-of-the-art in terms of both sample-efficiency and asymptotic performance.

My Cat "RiceCake"

Silly Handsome

More About Me

I have a cat named RiceCake.
I play Just Dance like a pro.

Interests

Reading
Jazz
Musical Romance
Tennis
Snowboarding
Traveling