I am a Ph.D. candidate at the University of Chinese Academy of Sciences (UCAS) and the Institute of Information Engineering, Chinese Academy of Sciences (IIE, CAS), under the supervision of Prof. Songlin Hu. I received my bachelor’s degree in Physics from Shenzhen University and my master’s degree in Software Engineering from the National Computer System Engineering Research Institute of China (joint program with UCAS). Prior to my Ph.D., I worked as an Algorithm Researcher at Ping An.

My research interests include natural language processing and machine learning. My research goal is to enable AI to better understand and generate human language. To achieve this, I am currently focused on representation learning principles and techniques to enhance the generalization, robustness, efficiency, and trustworthiness of language models.

Contact: hudou [AT] iie.ac.cn

Research Interests

Machine Learning for NLP
- Information-theoretic Representation Learning, Model Generalization & Robustness, Multi-task Learning
Trustworthy Large Language Models
- Truthfulness, Safety, Fairness
NLP Applications
- Sentiment Analysis, Misinformation Detection, Harmful Content Detection

🔥 News

Jun. 2025: 🎖 I received the CAS Presidential Scholarship (Special Prize).
May 2025: 🎉🎉 Two papers are accepted at ACL 2025.
Jan. 2025: 🎖 I was named a Baidu Scholarship Nominee (Global Top 40).
Dec. 2024: 🎉 One paper is accepted at AAAI 2025.
Nov. 2024: 🎖 I received the National Scholarship.
May 2024: 🎉 One paper is accepted at ACL 2024.
Feb. 2024: 🎉 One paper is accepted at LREC-COLING 2024. Congrats to Yinan Bao!
Dec. 2023: 🎉 One paper is accepted at ICASSP 2024. Congrats to Lingwei Wei!
Dec. 2023: 🎉 One paper is accepted at AAAI 2024 (Oral).
Jul. 2023: 🏆 I received the Best System Award at Afrisenti-SemEval 2023.
May 2023: 🎉🎉 Two papers are accepted at ACL 2023.
Mar. 2023: 🏆 We won 1st Place in SemEval 2023 Task 12.

📖 Selected Publications (Full List)

(* denotes equal contribution, # represents corresponding author)

Machine Learning for NLP

ACL 2025

Impartial Multi-task Representation Learning via Variance-invariant Probabilistic Decoding.
Dou Hu, Lingwei Wei, Wei Zhou, Songlin Hu. ACL 2025. (CCF-A)
[Paper] [Code]

AAAI 2025

An Information-theoretic Multi-task Representation Learning Framework for Natural Language Understanding.
Dou Hu, Lingwei Wei, Wei Zhou, Songlin Hu. AAAI 2025. (CCF-A)
[Paper]

ACL 2024

Representation Learning with Conditional Information Flow Maximization.
Dou Hu, Lingwei Wei, Wei Zhou, Songlin Hu. ACL 2024. (CCF-A)
[Paper] [Code]

AAAI 2024

Structured Probabilistic Coding.
Dou Hu, Lingwei Wei, Yaxin Liu, Wei Zhou, Songlin Hu. AAAI 2024. (CCF-A, Oral)
[Paper] [Code]

EMNLP 2022

VarMAE: Pre-training of Variational Masked Autoencoder for Domain-adaptive Language Understanding.
Dou Hu, Xiaolong Hou, Xiyang Du, Mengyuan Zhou, Lianxin Jiang, Yang Mo, and Xiaofeng Shi. Findings of EMNLP 2022. (CCF-B Findings)
[Paper]

NLP Applications

Sentiment Analysis:

Multi-stream Information Fusion Framework for Emotional Support Conversation.
Yinan Bao, Dou Hu, Lingwei Wei, Shuchong Wei, Wei Zhou, Songlin Hu. LREC-COLING 2024. (CCF-B)
[Paper]
Supervised Adversarial Contrastive Learning for Emotion Recognition in Conversations.
Dou Hu, Yinan Bao, Lingwei Wei, Wei Zhou, Songlin Hu. ACL 2023. (CCF-A)
[Paper] [Code]
UCAS-IIE-NLP at SemEval-2023 Task 12: Enhancing Generalization of Multilingual BERT for Low-resource Sentiment Analysis.
Dou Hu, Lingwei Wei, Yaxin Liu, Wei Zhou, Songlin Hu. SemEval@ACL 2023. (1st Place, Best System Award)
[Paper] [Code]
MM-DFN: Multimodal Dynamic Fusion Network for Emotion Recognition in Conversations.
Dou Hu, Xiaolong Hou, Lingwei Wei, Lianxin Jiang, Yang Mo. ICASSP 2022. (CCF-B)
[Paper] [Code]
DialogueCRN: Contextual Reasoning Networks for Emotion Recognition in Conversations.
Dou Hu, Lingwei Wei, Xiaoyong Huai. ACL-IJCNLP 2021. (CCF-A, Oral)
[Paper] [Code]
Hierarchical Interaction Networks with Rethinking Mechanism for Document-level Sentiment Analysis.
Lingwei Wei*, Dou Hu*, Wei Zhou, Xuehai Tang, Xiaodan Zhang, Xin Wang, Jizhong Han, Songlin Hu. ECML-PKDD 2020. (CCF-B)
[Paper] [Code]

Misinformation Detection:

Structure-adaptive Adversarial Contrastive Learning for Multi-Domain Fake News Detection.
Lingwei Wei, Dou Hu#, Wei Zhou, Philip S. Yu, Songlin Hu#. Findings of ACL 2025. (CCF-A Findings)
Transferring Structure Knowledge: A New Task to Fake News Detection Towards Cold-Start Propagation.
Lingwei Wei, Dou Hu, Wei Zhou, Songlin Hu. ICASSP 2024. (CCF-B)
[Paper]
Modeling the Uncertainty of Information Propagation for Rumor Detection: A Neuro-Fuzzy Approach.
Lingwei Wei, Dou Hu, Wei Zhou, Xin Wang, Songlin Hu. TNNLS 2024. (CCF-B, SCI-Q1)
[Paper]
Modeling Both Intra- and Inter-Modality Uncertainty for Multimodal Fake News Detection.
Lingwei Wei, Dou Hu, Wei Zhou, Songlin Hu. TMM 2023. (CCF-B, SCI-Q1)
[Paper]
Towards Propagation Uncertainty: Edge-enhanced Bayesian Graph Convolutional Networks for Rumor Detection.
Lingwei Wei, Dou Hu, Wei Zhou, Zhaojuan Yue, Songlin Hu. ACL-IJCNLP 2021. (CCF-A, Oral)
[Paper]
A Rumor Detection Approach based on Multi-relational Propagation Tree (一种基于多关系传播树的谣言检测方法).
Dou Hu, Lingwei Wei, Wei Zhou, Xiaoyong Huai, Jizhong Han, Songlin Hu. Journal of Computer Research and Development 2021. (CCF-A Chinese)
[Paper]

Harmful Content Detection:

PALI at SemEval-2022 Task 7: Identifying Plausible Clarifications of Implicit and Underspecified Phrases in Instructional Texts.
Mengyuan Zhou, Dou Hu, Mengfei Yuan, Meizhi Jin, Xiyang Du, Lianxin Jiang, Yang Mo, Xiaofeng Shi. SemEval@NAACL 2022. (1st Place)
[Paper]
PALI-NLP at SemEval-2022 Task 6: iSarcasmEval- Fine-tuning the Pre-trained Model for Detecting Intended Sarcasm.
Xiyang Du, Dou Hu, Meizhi Jin, Lianxin Jiang, Xiaofeng Shi. SemEval@NAACL 2022. (1st Place)
[Paper]
PAIC at SemEval-2022 Task 5: Multi-Modal Misogynous Detection in MEMES with Multi-Task Learning And Multi-model Fusion.
Meizhi Jin, Mengyuan Zhou, Mengfei Yuan, Dou Hu, Xiyang Du, Lianxin Jiang, Yang Mo, Xiaofeng Shi. SemEval@NAACL 2022. (2nd Place)
[Paper]
PALI-NLP at SemEval-2022 Task 4: Discriminative Fine-tuning of Transformers for Patronizing and Condescending Language Detection.
Dou Hu, Zhou Mengyuan, Xiyang Du, Mengfei Yuan, Jin Zhi, Lianxin Jiang, Mo Yang, Xiaofeng Shi. SemEval@NAACL 2022. (1st Place)
[Paper]

📈 Industry Applications

Galaxy Generative AI Safety Evaluation Platform, IIE, CAS, 2023-2025.
We developed a comprehensive platform for evaluating the safety of LLMs in China. The platform leverages three representation learning techniques, namely Supervised Adversarial Contrastive Learning (Hu et al., 2023a), Structured Probabilistic Coding (Hu et al., 2023b), and Conditional Information Flow Maximization (Hu et al., 2024), to achieve highly generalized and effective risk identification in generated content. It has generated over 100 automated risk reports for more than 60 mainstream LLMs in China, with detection performance significantly surpassing that of Google/OpenAI APIs.
AI Cloud Interview Platform, Ping An Life Insurance, 2021-2022.
We developed an AI interview platform for recruitment and employee training. The platform uses multi-task learning techniques for efficient model training and inference, enabling better intent understanding and interview evaluation. It has supported over 20 million interviews, totaling 3.4 million hours, and provided annual support for 4.72 million recruitment interviews.
Intelligent Visit Assistant Tool, Ping An Life Insurance, 2021-2022.
We developed an AI-assisted visit tool for insurance sales. The tool leveraged the domain-specific pre-training technique Variational Masked Autoencoder (Hu et al., 2022) to enhance the language understanding capabilities of general PLMs for financial domain data. It has provided intelligent assistance and visit summaries for over 1 million agents during client interactions, achieving more than 10 million online client contacts annually.

🎖 Honors and Awards

Beijing Outstanding Graduate, Beijing Municipal Commission of Education, 2025
CAS Presidential Scholarship - Special Prize (中国科学院院长特别奖), CAS, 2025
Baidu Scholarship Nominee (Global Top 40), Baidu, 2024
National Scholarship, China’s Ministry of Education, 2024
Best System Award (First Contributor), Afrisenti-SemEval 2023
1st Place (1/213, First Contributor), Sentiment Analysis for African Languages Task, SemEval 2023
1st Place (1/300, First Contributor), Patronizing and Condescending Language Detection Task, SemEval 2022
1st Place, Intended Sarcasm Detection Task, SemEval 2022
1st Place, Multimedia Automatic Misogyny Identification Task, SemEval 2022
2nd Place, Identifying Plausible Clarifications Task, SemEval 2022

💬 Invited Talks

May 2025, Invited talk, “Information-theoretic Representation Learning and Its Applications in Content Security” (信息论表示学习及其在内容安全领域的应用), IIE, CAS.
May 2025, Invited talk, “An Information-theoretic Multi-task Representation Learning Framework for Natural Language Understanding”, The 7th Academic Forum on Artificial Intelligence of Beijing Universities.
Apr. 2025, Invited talk, “Information-theoretic Representation Learning and Its Applications in the Media Field” (信息论表示学习及其在传媒领域的应用), Communication University of China.
Dec. 2024, Invited talk, “Information-theoretic Representation Learning for Natural Language Understanding” (面向自然语言理解的信息论表示学习), School of Artificial Intelligence, Beijing Normal University.
Jan. 2024, Invited talk, “Structured Probabilistic Coding”, Youth Working Committee, Chinese Information Processing Society of China (CIPS).
Jan. 2024, Youth PhD Talk, “Structured Probabilistic Coding”, AI TIME.
Sep. 2023, Youth PhD Talk, “Supervised Adversarial Contrastive Learning”, AI TIME.

📝 Academic Services

Area Chair:
- ACL ARR: ACL 2025, EMNLP 2025
Conference Reviewer or Program Committee Member:
- ACL 2023/2024, EMNLP 2023/2024, NAACL 2024, NeurIPS 2024, ICLR 2025
Journal Reviewer:
- Engineering Applications of Artificial Intelligence (EAAI)
- IEEE Transactions on Multimedia (TMM)
- IEEE Transactions on Circuits and Systems for Video Technology (TCSVT)
- IEEE Transactions on Mobile Computing (TMC)

Dou Hu