Haojie Pan
Senior Staff Research Scientist @
Bytedance Seed, working on
pushing LLMs to master competitive programming (ICPC, etc.).
๐ M.Phil., HKUST โ supervised by
Prof. Yangqiu Song
๐ B.E., ZJU โ supervised by
Prof. Deng Cai
Prev: Research @ Kuaishou, Alibaba Cloud | Intern @ Tencent WeChat, Alibaba DAMO NLP, NetEase Fuxi AI Lab
๐ก Research: Computational World Knowledge ยท Large Language Models ยท Super Machine Intelligence
๐
Github |
LinkedIn |
Google Scholar |
Zhihu
Publications
* means equal contribution
Papers
- ByteDance Seed. Seed-thinking-v1. 5: Advancing superb reasoning models with reinforcement learning. 2025. [paper]
- Jiafeng Liang, Shixin Jiang, Zekun Wang, Haojie Pan, Zerui Chen, Zheng Chu, Ming Liu, Ruiji Fu, Zhongyuan Wang, Bing Qin. GUIDE: A Guideline-Guided Dataset for Instructional Video Comprehension. IJCAI. 2024. [paper] [code]
- Yaojia Lv, Haojie Pan, Zekun Wang, Jiafeng Liang, Yuanxing Liu, Ruiji Fu, Ming Liu, Zhongyuan Wang, Bing Qin. Coggpt: Unleashing the power of cognitive dynamics on large language models. EMNLP 2024. [paper] [code]
- Haojie Pan, Zepeng Zhai, Hao Yuan, Yaojia Lv, Ruiji Fu, Ming Liu, Zhongyuan Wang, Bing Qin. KwaiAgents: Generalized Information-seeking Agent System with Large Language Models. CoRR. abs/2312.04889. 2023. [paper] [code]
- Jiaxin Deng, Dong Shen, Haojie Pan, Xiangyu Wu, Ximan Liu, Gaofeng Meng, Fan Yang, Tingting Gao, Ruiji Fu, Zhongyuan Wang. A Unified Model for Video Understanding and Knowledge Embedding with Heterogeneous Knowledge Graph Dataset. ICMR. 2023. (CCF-B) [paper] (Best Paper Candidate)
- Haojie Pan, Zepeng Zhai, Yuzhou Zhang, Ruiji Fu, Ming Liu, Yangqiu Song, Zhongyuan Wang and Bing Qin. Kuaipedia: a Large-scale Multi-modal Short-video Encyclopedia. CoRR. abs/2211.00732. 2022. [paper] [code]
- Hongming Zhang*, Xin Liu*, Haojie Pan*, Haowen Ke, Jiefu Ou, Tianqing Fang, and Yangqiu Song. ASER: Towards Large-scale Commonsense Knowledge Acquisition via Higher-order Selectional Preference over Eventualities. Artificial Intelligence(AI). 2022. (CCF-A) [paper] [code]
- Haojie Pan*, Chengyu Wang*, Minghui Qiu, Yichang Zhang, Yaliang Li, Jun Huang. Meta-KD: A Meta Knowledge Distillation Framework for Language Model Compression across Domains. ACL. 2021. (CCF-A) [paper] [code]
- Haojie Pan, Cen Chen, Chengyu Wang, Minghui Qiu, Liu Yang, Feng Ji and Jun Huang. Learning to Expand: Reinforced Response Expansion for Information-seeking Conversations. CIKM. 2021. (CCF-B) [paper]
- Chengyu Wang*, Haojie Pan*, Yuan Liu, Kehan Chen, Minghui Qiu, Wei Zhou, Jun Huang, Haiqing Chen, Wei Lin, Deng Cai. MeLL: Large-scale Extensible User Intent Classification for Dialogue Systems with Meta Lifelong Learning. KDD. 2021. (CCF-A) [paper] [code]
- Chengyu Wang, Haojie Pan, Minghui Qiu, Jun Huang, Fei Yang, Yin Zhang. Meta Distant Transfer Learning for Pre-trained Language Models. EMNLP. 2021. (CCF-B) [paper]
- Tianqing Fang, Haojie Pan, Hongming Zhang, Yangqiu Song, Kun Xu, Dong Yu. Do Boat and Ocean Suggest Beach? Dialogue Summarization with External Knowledge. AKBC. 2021. [paper] [code]
- Wenyi Xiao, Huan Zhao, Haojie Pan, Yangqiu Song, Vincent W. Zheng, Qiang Yang. Social explorative attention based recommendation for content distribution platforms. Data Mining and Knowledge Discovery(DMKD). 2021. (CCF-B) [paper] [code]
- Minghui Qiu, Peng Li*, Chengyu Wang*, Haojie Pan*, Ang Wang, Xianyan Jia, Le Jiang, Yaliang Li, Jun Huang, Jun Yang, Lin Wang, Deng Cai, Wei Lin. EasyTransfer: A Simple and Scalable Deep Transfer Learning Platform for NLP Applications. CIKM. 2021. (CCF-B) [paper] [code]
- Haojie Pan, Rongqin Yang, Xin Zhou, Rui Wang, Deng Cai, Xiaozhong Liu. Large Scale Abstractive Multi-Review Summarization (LSARS) via Aspect Alignment. SIGIR. 2020. (CCF-A) [paper] [data]
- Hongming Zhang*, Xin Liu*, Haojie Pan*, Yangqiu Song, and Cane Wing-Ki Leung. ASER: A Large-scale Eventuality Knowledge Graph. WWW. 2020. (CCF-A) [paper] [code]
- Xin Liu, Haojie Pan, Mutian He, Yangqiu Song, Xin Jiang, Neural Subgraph Isomorphism Counting. KDD. 2020 (CCF-A) [paper] [code]
- Zhou Zhao*, Haojie Pan*, Changjie Fan, Yan Liu, Linlin Li, Min Yang and Deng Cai. Abstractive Meeting Summarization via Hierarchical Adaptive Segmental Network Learning. WWW. 2019. (CCF-A) [paper] [code]
- Wenyi Xiao, Huan Zhao, Haojie Pan, Yangqiu Song, Vincent W. Zheng, Qiang Yang. Beyond Personalization - Social Content Recommendation for Creator Equality and Consumer Satisfaction. KDD. 2019. (CCF-A) [paper] [code]
- Haojie Pan, Junpei Zhou, Zhou Zhao, Yan Liu, Deng Cai and Min Yang. Dial2desc: end-to-end dialogue description generation. CoRR. abs/1811.00185. 2018. [paper]
Patents
- Hongming Zhang*, Xin Liu*, Haojie Pan* and Yangqiu Song. Knowledge graph (kg) construction method for eventuality prediction and eventuality prediction method. US Patent App. 17/613,940. 2022