I am a first-year CS Ph.D. student at National University of Singapore (NUS), supervised by Prof. Min-Yen Kan and Prof. Soujanya Poria. Previously, I obtained my Master’s degree in Computer Science and Bachelor’s degree in Inforamtion Engineering from Shanghai Jiao Tong University (SJTU), fortunately advised by Prof. Weinan Zhang.

Recently, my research works mainly focus on:

Evaluation and analysis of LLMs.
LLM reasoning, planning, and rule-following.
Interactive LLM agents.

Feel free to reach out if you are interested in my research or want to collaborate / chat with me.

🔥 News

2025.06: 🎉🎉 I will present our work RuleArena and AntiLeak-Bench at ACL 2025 in Vienna, Austria.
2025.05: 🎉🎉 Two papers RuleArena and AntiLeak-Bench are accepted by ACL 2025 (Main).
2025.04: 🎉🎉 I will present our work RuleArena at ICLR 2025 Workshops (Reason&Plan, SCI-FM). Hope to see you there.
2025.04: 🎉🎉 I will join National University of Singapore (NUS) for my Ph.D. journey, starting Aug. 2025.
2024.12: 🎉🎉 We release AntiLeak-Bench, an automated anti-leakage LLM benchmarking framework.
2024.12: 🎉🎉 We release RuleArena, an LLM rule-guided reasoning benchmark.

📝 Publications

RuleArena: A Benchmark for Rule-Guided Reasoning with LLMs in Real-World Scenarios

Ruiwen Zhou, Wenyue Hua, Liangming Pan, Sitao Cheng, Xiaobao Wu, En Yu, William Yang Wang

ACL 2025 [ Paper | Code ]

AntiLeak-Bench: Preventing Data Contamination by Automatically Constructing Benchmarks with Updated Real-World Knowledge

Xiaobao Wu, Liangming Pan, Yuxi Xie, Ruiwen Zhou, Shuai Zhao, Yubo Ma, Mingzhe Du, Rui Mao, Shuai Zhao, Anh Tuan Luu, William Yang Wang

ACL 2025 (Oral) [ Paper | Code ]

TRAD: Enhancing LLM Agents with Step-Wise Thought Retrieval and Aligned Decision

Ruiwen Zhou, Yingxuan Yang, Muning Wen, Ying Wen, Wenhao Wang, Chunling Xi, Guoqiang Xu, Yong Yu, Weinan Zhang

SIGIR 2024 [ Paper | Code ]

Is Risk-Sensitive Reinforcement Learning Properly Resolved?

Ruiwen Zhou, Minghuan Liu, Kan Ren, Xufang Luo, Weinan Zhang, Dongsheng Li

arXiv preprint [ Paper ]

Learning Enhanced Representations for Tabular Data via Neighborhood Propagation

Kounianhua Du, Weinan Zhang, Ruiwen Zhou, Yangkun Wang, Xilong Zhao, Jiarui Jin, Quan Gan, Zheng Zhang, David Wipf

NeurIPS 2022 [ Paper | Code ]

🎖 Honors and Awards

2024.11 Huatai Securities Scholarship (~Top 10% out of 179).
2024.11 First-Class Excellence Scholarship (Top 30% out of 179).
2022.11 First-Class Excellence Scholarship (Top 30% out of 179).
2021.12 B-Class Excellence Scholarship (Top 10% out of 144).
2021.12 Zhiyuan Honors Scholarship (Top 5% students in Zhiyuan Honors Program).
2021.04 Outstanding Winner of MCM/ICM 2021 (~Top 0.15% among the world).
2020.12 A-Class Excellence Scholarship (Top 1 out of 144).
2020.12 National Scholarship (Top 2 out of 144).
2020.12 Zhiyuan Honors Scholarship (Top 5% students in Zhiyuan Honors Program).
2019.11 B-Class Excellence Scholarship (Top 10% out of 144).
2019.11 Zhiyuan Honors Scholarship (Top 5% students in Zhiyuan Honors Program).

📖 Educations

2025.08 - Present, Ph.D. in Computer Science, NUS.
2022.09 - 2025.03, M.Eng. in Computer Science and Technology, SJTU.
2018.09 - 2022.06, B.Eng. in Information Engineering, SJTU.

💻 Internships

2025.04 - 2025.08, Shanghai AI Lab, Advised by: Jie Fu.
2024.07 - 2024.12, UCSB NLP Group, Advised by: Prof. William Yang Wang.
2022.02 - 2023.02, Amazon Web Service, Advised by: Quan Gan.
2021.08 - 2022.01, Microsoft Research Asia, Advised by: Kan Ren.

👀 Miscellaneous

In my spare time, I love:

Stroll: I often go for a walk to beautiful sites nearby and recover my energy.
Music: I listen to pop. songs, musicals, symphonies, etc. I also play the piano and sing.
Sports: I watch NBA, F1, etc. games. I am a fan of James Harden and Lewis Hamilton.