Zequn Liu (刘泽群)

Research Interests

Large language models, drug discovery


Education

  • Ph.D., School of Electronics Engineering and Computer Science (信息科学技术学院), Peking University. Advisor: Ming Zhang.


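Biography

Zequn Liu received her Ph.D. from the School of Electronics Engineering and Computer Science (信息科学技术学院), Peking University, and is currently a researcher at Microsoft Research AI for Science (Beijing). Her research focuses on applying large-model techniques to key problems in drug discovery, and she is a principal developer of the scientific large models MolXPT and NatureLM. She has published multiple papers in top artificial intelligence journals and conferences, including Nature Machine Intelligence, ACL, and EMNLP, and her h-index is 11.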


Selected Publications

Zequn Liu, Wei Zhang, Yingce Xia, Lijun Wu, Shufang Xie, Tao Qin, Ming Zhang, and Tie-Yan Liu. "MolXPT: Wrapping Molecules with Text for Generative Pre-training." In Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), pp. 1606-1616. 2023.

Zequn Liu, Shukai Wang, Yiyang Gu, Ruiyi Zhang, Ming Zhang, and Sheng Wang. "Graphine: A Dataset for Graph-aware Terminology Definition Generation." In Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, pp. 3453-3463. 2021.

Zequn Liu, Kefei Duan, Junwei Yang, Hanwen Xu, Ming Zhang, and Sheng Wang. "MetaFill: Text Infilling for Meta-Path Generation on Heterogeneous Information Networks." In Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, pp. 5110-5122. 2022.

Wei Ju*, Zequn Liu*, Yifang Qin, Bin Feng, Chen Wang, Zhihui Guo, Xiao Luo, and Ming Zhang. "Few-shot molecular property prediction via hierarchically structured learning on relation graphs." Neural Networks 163 (2023): 122-131.

Bin Feng, Zequn Liu, Nanlan Huang, Zhiping Xiao, Haomiao Zhang, Srbuhi Mirzoyan, Hanwen Xu, Jiaran Hao, Yinghui Xu, Ming Zhang, and Sheng Wang. "A bioactivity foundation model using pairwise meta-learning." Nature Machine Intelligence 6, no. 8 (2024): 962-974.

Yiping Song, Zequn Liu, Wei Bi, Rui Yan, and Ming Zhang. "Learning to Customize Model Structures for Few-shot Dialogue Generation Tasks." In Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, pp. 5832-5841. 2020.

Junwei Yang, Zequn Liu, Ming Zhang, and Sheng Wang. "Pathway2text: Dataset and method for biomedical pathway description generation." In Findings of the Association for Computational Linguistics: NAACL 2022, pp. 1441-1454. 2022.

Junwei Yang, Hanwen Xu, Srbuhi Mirzoyan, Tong Chen, Zixuan Liu, Zequn Liu, Wei Ju, Luchen Liu, Zhiping Xiao, Ming Zhang, and Sheng Wang. "Poisoning medical knowledge using large language models." Nature Machine Intelligence 6, no. 10 (2024): 1156-1168.