I am Zhipei Xu (εΎεΏζ²), a second-year Masterβs student at the School of Electronic and Computer Engineering,
Peking University, advised by Prof. Jian Zhang. Previously, I received my B.Eng degree from the School of Electronic and Information Engineering,
South China University of Technology. Please feel free to reach out via email (zhipeixu@stu.pku.edu.cn).
My research focuses on Trustworthy Multi-modal AI, with specific interests in Multi-modal Large Language Models, Image/Video Forgery Detection, and AIGC Security. I have published multiple papers at top-tier venues including CVPR, ICLR, NeurIPS, and ACM MM, with works spanning explainable forgery localization, multi-agent detection frameworks, and robust watermarking/steganography for copyright protection. For more details on my publications, please visit my profiles on Google Scholar ().
π₯ News
- 2025.08.02: Β ππ Gaussianseal has been accepted by MIR!
- 2025.02.27: Β ππ OmniGuard have been accepted by CVPR 2025!
- 2025.01.23: Β ππ FakeShield and SecureGS has been accepted by ICLR 2025!
Old News
- 2024.09.25: ππ GS-hider has been accepted by NeurIPS 2024!
- 2024.07.05: ππ V2A-Mark has been accepted by ACM MM 2024!
π Selected Publications
(* Equal contribution, β Project Leader, β‘ Corresponding author)

FakeShield: Explainable Image Forgery Detection and Localization via Multi-modal Large Language Models
Zhipei Xu, Xuanyu Zhang, Runyi Li, Zecheng Tang, Qing Huang, Jian Zhang
International Conference on Learning Representations (ICLR), 2025
- We propose the explainable IFDL task and design FakeShield, a multi-modal framework capable of evaluating image authenticity, generating tampered region masks, and providing a judgment basis based on pixel-level and image-level tampering clues.
AvatarShield: Visual Reinforcement Learning for Human-Centric Synthetic Video Detection
Zhipei Xu, Xuanyu Zhang, Qing Huang, Xing Zhou, Jian Zhang
Under Review
- We focus on pose, audio, and text-driven human video forgery and propose the first human-centered video forgery dataset, FakeHumanVid, along with the first reinforcement learning-based human video forgery detection framework, AvatarShield.

UniShield: An Adaptive Multi-Agent Framework for Unified Forgery Image Detection and Localization
Qing Huang*, Zhipei Xu*, Xuanyu Zhang, Xiangyu Yu, Jian Zhang
Under Review
- We propose the first multi-agent framework, UniShield, designed to address image forgery detection tasks. It efficiently utilizes a Perception Agent and a Detection Agent to call upon various expert detectors, enabling unified image forgery detection, including DeepFake, AIGC, image forgery, and document forgery.
π Selected Honors and Awards
- 2023.10 National Scholarship.
- 2023.10 National College Student Mathematical Modeling Competition - Provincial First Prize.
- 2022.10 National Scholarship.
- 2021.10 Samsung Scholarship.
π Educations
- 2024.09 - 2027.06 (expected), MSc., Computer Science and Technology, Peking University.
- 2020.09 - 2024.06, B.S., Communication Engineering, South China University of Technology.
π¬ Services
- Conference Reviewer for ICLR, ICML, NeurIPS, ECCV, ICCV, ACM MM.
- Journal Reviewer for IEEE TIP, IEEE TCSVT, VCIP.