Welcome to Hanning Zhang’s Personal Website
I am a first-year MSCS student at the University of Illinois Urbana-Champaign (UIUC), advised by Professor Tong Zhang. Previously, I graduated from The Hong Kong University of Science and Technology (HKUST) in 2024, majoring in Computer Science. Previously, I worked as a research intern on the topic of LLM hallucination and alignment, advised by Professor Tong Zhang. In 2023 Summer, I had the privilege to work as a research intern at Blender Lab, advised by Professor Heng Ji.
Research Interest
My research interests include Natural Language Processing (NLP) and Large Language Models (LLMs). I have a broad interest in LLM alignment. I am now working on Reward Modeling for Mathematical Reasoning. I also worked on LLM hallucination in the past.
Open-Source Contribution
RLHF-Reward-Modeling https://github.com/RLHFlow/RLHF-Reward-Modeling (1K Stars)
I am the main contributor to the math-rm project, where we train process-supervised reward (PRM) and outcome-supervised reward (ORM) using the next-token prediction. We open-source the data, code, hyper-parameter, and model for a robust recipe that is easy to reproduce.
Research Papers
R-Tuning: Teaching Large Language Models to Refuse Unknown Questions
Hanning Zhang*, Shizhe Diao*, Yong Lin*, Yi R. Fung, Qing Lian, Xingyao Wang, Yangyi Chen, Heng Ji, Tong Zhang. (* denotes equal contribution)
NAACL-2024 (Oral)
Outstanding Paper Award, 6/2434 = 0.25%Entropy-Regularized Process Reward Model
Hanning Zhang*, Pengcheng Wang*, Shizhe Diao, Yong Lin, Rui Pan, Hanze Dong, Dylan Zhang, Pavlo Molchanov, Tong Zhang. (* denotes equal contribution)
Under ReviewRAG-Reward: Optimizing RAG with Reward Modeling and RLHF
Hanning Zhang, Juntong Song, Juno Zhu, Yuanhao Wu, Tong Zhang, Cheng Niu
Under ReviewTowards understanding the efficiency of ensemble in fine-tuning
Yifan Hao, Xingyuan Pan, Hanning Zhang, Chenlu Ye, Rui Pan, Tong Zhang.
Under ReviewMitigating the Alignment Tax of RLHF
Yong Lin*, Hangyu Lin*, Wei Xiong*, Shizhe Diao*, Jianmeng Liu, Jipeng Zhang, Rui Pan, Haoxiang Wang, Wenbin Hu, Hanning Zhang, Hanze Dong, Renjie Pi, Han Zhao, Nan Jiang, Heng Ji, Yuan Yao, and Tong Zhang
EMNLP-2024 (Main)InfoPattern: Unveiling Information Propagation Patterns in Social Media
Chi Han*, Manling Li*, Jialiang Xu*, Hanning Zhang*, Tarek Abdelzaher, Heng Ji (* denotes equal contribution)
Demo Report
Education
University of Illinois Urbana-Champaign (2024-2026)
Master of Science in Computer ScienceThe Hong Kong University of Science and Technology (2020-2024)
Bachelor of Science in Computer ScienceUniversity of Illinois Urbana-Champaign (2023)
Exchange Program in Computer Science