Guoqing Liu (刘国庆)
Senior Researcher
Microsoft Research AI for Science

Google Scholar | Github

Contact: guoqingliu (at) microsoft.com
Building 2, No. 5 DanLing Street, Haidian District, Beijing, China.
News
Short Bio
Guoqing Liu is a Senior Researcher at Microsoft Research AI for Science. His research interests include reinforcement learning, intersection of foundation models and decision making/reinforcement learning, and their applications in the scientific domain. In Project Suphx, he and his team built the world-best Mahjong AI, Suphx, which achieved 10 DAN on the Tenhou platform in mid 2019. In Project Mariana, he built the first general pixel-based automated game testing agent, Inspector, which was utilized by Xbox Studios Quality. Most recently, he has been focusing on developing Foundation models and RL agents in scientific discovery. Before joining MSR, he obtained his Ph.D. degree from University of Science and Technology of China (USTC) in 2021, under the joint Ph.D. program between USTC and Microsoft Research Asia (MSRA), advised by Dr. Tie-Yan Liu and Prof. Nenghai Yu.

Publications
    ("*": equal contribution; "†": correspondence)
  1. Chimera: Accurate retrosynthesis prediction by ensembling models with diverse inductive biases [Paper]
    Krzysztof Maziarz*, Guoqing Liu*, Hubert Misztela, Aleksei Kornev, Piotr Gaiński, Holger Hoefling, Mike Fortunato, Rishi Gupta, Marwin Segler. arXiv 2024.
  2. Accelerating Protein Engineering with Fitness Landscape Modeling and Reinforcement Learning [Paper]
    Haoran Sun, Liang He, Pan Deng, Guoqing Liu, Haiguang Liu, Chuan Cao, Fusong Ju, Lijun Wu, Tao Qin, Tie-Yan Liu. bioRxiv 2024.
  3. Reinforcement Learning from Bagged Reward [Paper]
    Yuting Tang, Xin-Qiang Cai, Yao-Xiang Ding, Qiyu Wu, Guoqing Liu, Masashi Sugiyama
    ICML 2024 Workshop on Aligning Reinforcement Learning Experimentalists and Theorists (ICML-W 2024)
  4. Token-level Direct Preference Optimization [Paper]
    Yongcheng Zeng, Guoqing Liu, Weiyu Ma, Ning Yang, Haifeng Zhang, Jun Wang
    Forty-first International Conference on Machine Learning (ICML 2024)
  5. Connecting Large Language Models with Evolutionary Algorithms Yields Powerful Prompt Optimizers [Paper]
    Qingyan Guo, Rui Wang, Junliang Guo, Bei Li, Kaitao Song, Xu Tan, Guoqing Liu, Jiang Bian, Yujiu Yang
    Twelfth International Conference on Learning Representations (ICLR 2024)
  6. Re-evaluating Retrosynthesis Algorithms with Syntheseus [Paper]
    Krzysztof Maziarz, Austin Tripp, Guoqing Liu, Megan Stanley, Shufang Xie, Piotr Gaiński, Philipp Seidl, Marwin Segler
    NeurIPS 2023 Workshop on AI for Science (NeurIPS-W 2023)
  7. De novo Drug Design using Reinforcement Learning with Multiple GPT Agents [Paper]
    Xiuyuan Hu, Guoqing Liu†, Yang Zhao, Hao Zhang
    Thirty-seventh Conference on Neural Information Processing Systems (NeurIPS 2023)
  8. QuinNet: Efficiently Incorporating Quintuple Interactions into Geometric Deep Learning Force Fields [Paper]
    Zun Wang, Guoqing Liu, Yichi Zhou, Tong Wang, Bin Shao
    Thirty-seventh Conference on Neural Information Processing Systems (NeurIPS 2023)
  9. Retrosynthetic Planning with Dual Value Networks [Paper]
    Guoqing Liu*, Di Xue*, Shufang Xie, Yingce Xia, Austin Tripp, Krzysztof Maziarz, Marwin Segler, Tao Qin, Zongzhang Zhang, Tie-Yan Liu
    Fortieth International Conference on Machine Learning (ICML 2023)
  10. Re-evaluating Chemical Synthesis Planning Algorithms [Paper]
    Austin Tripp, Krzysztof Maziarz, Sarah Lewis, Guoqing Liu, Marwin Segler
    NeurIPS 2022 Workshop on AI for Science (NeurIPS-W 2022)
  11. You May Not Need Ratio Clipping in PPO [Paper]
    Mingfei Sun, Vitaly Kurin, Guoqing Liu, Sam Devlin, Tao Qin, Katja Hofmann, Shimon Whiteson. arXiv 2022.
  12. Inspector: Pixel-based Automated Game Testing via Exploration, Detection, and Investigation [Paper]
    Guoqing Liu, Mengzhang Cai, Li Zhao, Tao Qin, Adrian Brown, Jimmy Bischoff and Tie-Yan Liu
    IEEE Conference on Games 2022 (COG 2022, Oral)
  13. Independence-aware Advantage Estimation [Paper]
    Pushi Zhang, Li Zhao, Guoqing Liu, Jiang Bian, Minlie Huang, Tao Qin, Tie-Yan Liu
    30th International Joint Conference on Artificial Intelligence (IJCAI 2021)
  14. Return-based Contrastive Representation Learning for Reinforcement Learning [Paper]
    Guoqing Liu*, Chuheng Zhang*, Li Zhao, Tao Qin, Jinhua Zhu, Jian Li, Nenghai Yu, Tie-Yan Liu
    Ninth International Conference on Learning Representations (ICLR 2021)
  15. Demonstration Actor Critic [Paper]
    Guoqing Liu, Li Zhao, Pushi Zhang, Jiang Bian, Tao Qin, Nenghai Yu, Tie- Yan Liu
    Neurocomputing, Volume 434, 28 April 2021, Pages 194-202 (Neurocomputing 2021)
  16. Suphx: Mastering Mahjong with Deep Reinforcement Learning [Paper][News]
    Junjie Li, Sotetsu Koyamada, Qiwei Ye, Guoqing Liu, Chao Wang, Ruihan Yang, Li Zhao, Tao Qin, Tie-Yan Liu, Hsiao-Wuen Hon. arXiv 2020.
  17. Trust Region Evolution Strategies [Paper]
    Guoqing Liu, Li Zhao, Feidiao Yang, Jiang Bian, Tao Qin, Nenghai Yu, Tie-Yan Liu
    Thirty-Third AAAI Conference on Artificial Intelligence (AAAI 2019)
Education
Professional Activities
Honors and Awards

Last update: Dec. 9, 2024