Welcome to Hanning Zhang’s Personal Website

I am a first-year MSCS student at the University of Illinois Urbana-Champaign (UIUC), advised by Professor Tong Zhang. Previously, I graduated from The Hong Kong University of Science and Technology (HKUST) in 2024, majoring in Computer Science. Previously, I worked as a research intern on the topic of LLM hallucination and alignment, advised by Professor Tong Zhang. In 2023 Summer, I had the privilege to work as a research intern at Blender Lab, advised by Professor Heng Ji.

Research Interest

My research interests include Natural Language Processing (NLP) and Large Language Models (LLMs). I have a broad interest in LLM alignment. I am now working on Reward Modeling for Mathematical Reasoning. I also worked on LLM hallucination in the past.

Open-Source Contribution

RLHF-Reward-Modeling GitHub Icon https://github.com/RLHFlow/RLHF-Reward-Modeling (1K Stars)

I am the main contributor to the math-rm project, where we train process-supervised reward (PRM) and outcome-supervised reward (ORM) using the next-token prediction. We open-source the data, code, hyper-parameter, and model for a robust recipe that is easy to reproduce.

Research Papers

Education

  • University of Illinois Urbana-Champaign (2024-2026)
    Master of Science in Computer Science

  • The Hong Kong University of Science and Technology (2020-2024)
    Bachelor of Science in Computer Science

  • University of Illinois Urbana-Champaign (2023)
    Exchange Program in Computer Science