Nan DUAN (段楠)
Microsoft Research | Google Scholar | LinkedIn
Please contact me with this email: nanduan.nlp AT outlook.com.
Dr. Nan DUAN is a senior principal researcher and research manager of the Natural Language Computing group at Microsoft Research Asia. He is an adjunct Ph.D. supervisor at University of Science and Technology of China and Xi’an Jiaotong University, and an adjunct professor at Tianjin University. His research interests include natural language processing, multimodal intelligence, code intelligence, and machine reasoning. He served as the program chair and area chair at NLP/AI conferences. He published 100+ research papers with 20000+ Google Scholar citations and holds 20+ patents. He won CVPR Best Demo Award (2022). He was awarded as Distinguished Member of China Computer Federation (CCF), CCF-NLPCC Distinguished Young Scientist (2019), DeepTech Intelligent Computing Innovators China (2022).
段楠博士,微软亚洲研究院资深首席研究员,自然语言计算团队研究经理,中国科学技术大学、西安交通大学兼职博导,天津大学兼职教授,主要从事自然语言处理、多模态智能、代码智能、机器推理等研究,多次担任NLP/AI学术会议程序主席和领域主席,发表学术论文100余篇,Google Scholar引用20000余次,持有专利20余项。 他获得CVPR最佳演示奖(2022)。 他被评为中国计算机协会(CCF)杰出会员、CCF-NLPCC青年科学家(2019年)、DeepTech中国智能计算科技创新人物(2022年)。
Highlight
- Natural Language Processing
- text pre-training: Unicoder (EMNLP, 2019), XLM-K (AAAI, 2022), LogiGAN (NeurIPS, 2022).
- text generation: ProphetNet (EMNLP, 2020), BANG (ICML, 2021), GENIE (ICML, 2023).
- benchmark: NLPCC-KBQA/NLPCC-DBQA (NLPCC, 2016-2019), MSParS (NLPCC, 2019), XGLUE (EMNLP, 2020), GLGE (ACL, 2021), AGIEval (NAACL, 2024), CMMLU (ACL, 2024).
- Code Intelligence
- code pre-training: CodeBERT (EMNLP, 2020), GraphCodeBERT (ICLR, 2021), UniXcoder (ACL, 2022), CodeExecutor (ACL, 2023).
- code generation: XGPT-C (EMNLP, 2021), Grammformer (ICLR, 2022), ReACC (ACL, 2022), CodeReviewer (ESEC/FSE, 2022), LongCoder (ICML, 2023), MPSC (ACL, 2023), Selene (ACL, 2024).
- benchmark: CodeXGLUE (NeurIPS, 2021), CoSQA (ACL, 2021), CodeExp (EMNLP, 2022).
- Multimodal Intelligence
- multimodal pre-training (image): Unicoder-VL (AAAI, 2020), M3P (CVPR, 2021), KD-VLP (NAACL, 2022), BridgeTower (AAAI, 2023), ManagerTower (ACL, 2023).
- multimodal pre-training (video): UniVL (Preprint, 2020), CLIP4Clip (Neurocomputing, 2022).
- visual generation: GODIVA (Preprint, 2021), NUWA(女娲) (ECCV, 2022), NUWA-Infinity (NeurIPS, 2022), NUWA-LIP (CVPR, 2023), NUWA-3D (IJCAI, 2023), NUWA-XL (ACL, 2023), DragNUWA (Preprint, 2023), LayoutNUWA (ICLR, 2024), StrokeNUWA (ICML, 2024).
- Machine Reasoning
- (neural-)symbolic reasoning: Complex Reasoning in LSAT (TASLP, 2022) Analytical Reasoning (NAACL, 2022), Logical Reasoning (ACL, 2022), CRITIC (ICLR, 2024), ToRA (ICLR, 2024).
- retrieval-augmented reasoning: Fact Checking (ACL, 2020), Commonsense QA (AAAI, 2020), Text+Table QA (IJCAI, 2022).
- neural moduler reasoning: ReasonFormer (ACL, 2023).
- Compositional AI
- general framework: TaskMatrix.AI (Intelligent Computing, 2023).
- visual task completion: Visual ChatGPT (Preprint, 2023).
- task planning: Learning-to-Program (Preprint, 2023).
Talk
- 跨模态生成式人工智能. CCF-ADL, 2023-11.
- 从生成式人工智能到组合式人工智能. 三维AIGC与视觉大模型技术研讨会, 2023-11.
- Compositional AI - Advanced Abilities Emerge When All Necessary Basic Abilities Are Strong. IAS Workshop on Mathematical Theory for Emergent Intelligence, 2023-07.
- Code Intelligence: Models, Applications and Future. Korea AI Summit, 2022-12. (slides)
- 多维度编程语言预训练及实际应用. CNCC, 2022-12. (slides)
- AI赋能视觉内容创作. CNCC, 2022-12. (slides)
- Empowering Content Creation with AI. Keynote at CCMT, 2022-08. (slides)
- 基于多模态预训练的文本和视觉生成. CNCC, 2021-12. (slides)
- 视觉语言预训练:现状和挑战. CAAI, 2021-08. (slides)
- Pre-trained Models and Benchmark for Code Intelligence. Keynote at KDD Workshop on PLP, 2021-08. (slides)
- Really Reaching Human Parity? - Addressing Benchmark Issues on Robustness, Bias and Metric. ACL Workshop on BPPF, 2021-08. (slides)
- SOTAs, Benchmarks and Future of Pre-trained Models for Multilingual, Multimodal, Code and Generation. CCF-ADL, 2021-05. (slides)
- Multilingual Multimodal Pre-training. Keynote at Korean Artificial Intelligence Association, 2020-11. (slides)
- Machine Reasoning: Technology, Dilemma and Future. EMNLP Tutorial, 2020-11. (slides)
- Learning Universal Representations via Multitask Multilingual Multimodal Pre-training. CNCC, 2020-10. (slides)
- Machine Reasoning: Combining Knowledge and Pre-trained Models for Better NLU. NLPCC Technical Workshop, 2019-10.
- 预训练模型最新进展及其在跨任务、跨语言和跨模态场景下的应用. CCF-Tech Frontier, 2019-08.
- 当语言遇上视觉:基于可视化内容的跨模态自然语言处理. Language & Intelligence Summit, 2019-08. (slides)
- Question Answering with Heterogeneous Data. CIPS-ATT Tutorial, 2019-07. (slides)
- 基于预训练的自然语言处理和多模态学习. SMP, 2019-04.
- Building Informational Bot (InfoBot) with Question Answering & Generation. CCF-ADL, 2017-11. (slides)
- Knowledge-based Question Answering. CCF-ADL, 2014-12. (slides)
Book
- 《智能问答》.
段楠,周明.
高等教育出版社,2018. - 《人工智能导论 - 第11章:自然语言处理》.
周明,段楠,刘树杰,吴俣.
中国科学技术出版社,2018. - 《统计机器翻译中的一致性解码方法研究》. (slides)
段楠.
Ph.D. Dissertation, 2011.
Preprint
Publication
- Voila-A: Aligning Vision-Language Models with User’s Gaze Attention
Kun Yan, Lei Ji, Zeyu Wang, Yuntao Wang, Nan Duan, Shuai Ma.
NeurIPS, 2024. - Not All Tokens Are What You Need for Pretraining
Zhenghao Lin, Zhibin Gou, Yeyun Gong, Xiao Liu, yelong shen, Ruochen Xu, Chen Lin, Yujiu Yang, Jian Jiao, Nan Duan, Weizhu Chen.
NeurIPS, 2024. - Learning to Plan by Updating Natural Language
Yiduo Guo, Yaobo Liang, Chenfei Wu, Wenshan Wu, Dongyan Zhao, Nan Duan.
Findings of EMNLP, 2024. - PPTC-R benchmark: Towards Evaluating the Robustness of Large Language Models for PowerPoint Task Completion
Zekai Zhang, Yiduo Guo, Yaobo Liang, Dongyan Zhao, Nan Duan.
Findings of EMNLP, 2024. - Enhancing Large Language Models in Coding Through Multi-Perspective Self-Consistency
Baizhou Huang, Shuai Lu, Xiaojun Wan, Nan Duan.
ACL, 2024. - Selene: Pioneering Automated Proof in Software Verification
Lichen Zhang, Shuai Lu, Nan Duan.
ACL, 2024. - PPTC Benchmark: Evaluating Large Language Models for PowerPoint Task Completion
Yiduo Guo, Zekai Zhang, Yaobo Liang, Dongyan Zhao, Nan Duan.
Findings of ACL, 2024. - Large Language Models Can Learn Representation in Natural Language
Yiduo Guo, Yaobo Liang, Dongyan Zhao, Nan Duan.
Findings of ACL, 2024. - CMMLU: Measuring massive multitask language understanding in Chinese
Haonan Li, Yixuan Zhang, Fajri Koto, Yifei Yang, hai zhao, Yeyun Gong, Nan Duan, Timothy Baldwin.
Findings of ACL, 2024. - Competition-Level Problems are Effective LLM Evaluators
Yiming Huang, Zhenghao Lin, Xiao Liu, Yeyun Gong, Shuai Lu, Fangyu Lei, Yaobo Liang, yelong shen, Chen Lin, Nan Duan, Weizhu Chen.
Findings of ACL, 2024. - Using Left and Right Brains Together: Towards Vision and Language Planning
Jun CEN, Chenfei Wu, Xiao Liu, Shengming Yin, Yixuan Pei, Jinglong Yang, Qifeng Chen, Nan Duan, Jianguo Zhang.
ICML, 2024. - StrokeNUWA—Tokenizing Strokes for Vector Graphic Synthesis
Zecheng Tang, Chenfei Wu, Zekai Zhang, Minheng Ni, Shengming Yin, Yu Liu, Zhengyuan Yang, Lijuan Wang, Zicheng Liu, Juntao Li, Nan Duan.
ICML, 2024. - scGPT: Towards Building a Foundation Model for Single-Cell Multi-omics Using Generative AI.
Haotian Cui, Chloe Wang, Hassaan Maan, Kuan Pang, Fengning Luo, Nan Duan, Bo Wang.
Nature Method, 2024. - AGIEval: A Human-Centric Benchmark for Evaluating Foundation Models.
Wanjun Zhong, Ruixiang Cui, Yiduo Guo, Yaobo Liang, Shuai Lu, Yanlin Wang, Amin Saied, Weizhu Chen, Nan Duan.
NAACL, 2024. - Enhancing Chain-of-Thoughts Prompting with Iterative Bootstrapping in Large Language Models.
Jiashuo Sun, Yi Luo, Yeyun Gong, Chen Lin, Yelong Shen, Jian Guo, Nan Duan.
NAACL, 2024. - LayoutNUWA: Revealing the Hidden Layout Expertise of Large Language Models.
Zecheng Tang, Chenfei Wu, Juntao Li, Nan Duan.
ICLR, 2024. - CRITIC: Large Language Models Can Self-Correct with Tool-Interactive Critiquing.
Zhibin Gou, Zhihong Shao, Yeyun Gong, Yelong Shen, Yujiu Yang, Nan Duan, Weizhu Chen.
ICLR, 2024. - ToRA: A Tool-Integrated Reasoning Agent for Mathematical Problem Solving.
Zhibin Gou, Zhihong Shao, Yeyun Gong, Yelong Shen, Yujiu Yang, Minlie Huang, Nan Duan, Weizhu Chen.
ICLR, 2024. - Machine-Created Universal Language for Cross-lingual Transfer.
Yaobo Liang, Quanzhi Zhu, Junhe Zhao, Nan Duan.
AAAI, 2024. - ORES: Open-vocabulary Responsible Visual Synthesis.
Minheng Ni, Chenfei Wu, Xiaodong Wang, Shengming Yin, Lijuan Wang, Zicheng Liu, Nan Duan.
AAAI, 2024. - HORIZON: High-Resolution Semantically Controlled Panorama Synthesis.
Kun Yan, Lei Ji, Jian Liang, Chenfei Wu, Ming Zhou, Nan Duan, Shuai Ma.
AAAI, 2024. - LEAD: Liberal Feature-based Distillation for Dense Retrieval.
Hao Sun, Xiao Liu, Yeyun Gong, Anlei Dong, Jian Jiao, Jingwen Lu, Yan Zhang, Daxin Jiang, Linjun Yang, Rangan Majumder, Nan Duan.
WSDM, 2024. - TaskMatrix.AI: Completing Tasks by Connecting Foundation Models with Millions of APIs. (GitHub)
Yaobo Liang, Chenfei Wu, Ting Song, Wenshan Wu, Yan Xia, Yu Liu, Yang Ou, Shuai Lu, Lei Ji, Shaoguang Mao, Yun Wang, Linjun Shou, Ming Gong, Nan Duan.
Intelligent Computing, 2023. - Baize: An Open-Source Chat Model with Parameter-Efficient Tuning on Self-Chat Data. (GitHub)
Canwen Xu, Daya Guo, Nan Duan, Julian McAuley.
EMNLP, 2023. - Query Rewriting in Retrieval-Augmented Large Language Models.
Xinbei Ma, Yeyun Gong, Pengcheng He, hai zhao, Nan Duan.
EMNLP, 2023. - CAPSTONE: Curriculum Sampling for Dense Retrieval with Document Expansion.
Xingwei He, Yeyun Gong, A-Long Jin, Hang Zhang, Anlei Dong, Jian Jiao, Siu Ming Yiu, Nan Duan.
EMNLP, 2023. - Intervention-Based Alignment of Code Search with Execution Feedback.
Hojae Han, Minsoo Kim, seung-won hwang, Nan Duan, Shuai Lu.
Findings of EMNLP, 2023. - Enhancing Retrieval-Augmented Large Language Models with Iterative Retrieval-Generation Synergy.
Zhihong Shao, Yeyun Gong, yelong shen, Minlie Huang, Nan Duan, Weizhu Chen.
Findings of EMNLP, 2023. - Allies: Prompting Large Language Model with Beam Search.
Hao Sun, Xiao Liu, Yeyun Gong, Yan Zhang, Daxin Jiang, Linjun Yang, Nan Duan.
Findings of EMNLP, 2023. - AR-Diffusion: Auto-Regressive Diffusion Model for Text Generation.
Tong Wu, Zhihao Fan, Xiao Liu, Yeyun Gong, Yelong Shen, Jian Jiao, Hai-Tao Zheng, Juntao Li, Zhongyu Wei, Jian Guo, Nan Duan, Weizhu Chen.
NeurIPS, 2023. - NUWA-XL: Diffusion over Diffusion for eXtremely Long Video Generation. (Homepage)
Shengming Yin, Chenfei Wu, Huan Yang, Jianfeng Wang, Xiaodong Wang, Minheng Ni, Zhengyuan Yang, Linjie Li, Shuguang Liu, Fan Yang, Jianlong Fu, Gong Ming, Lijuan Wang, Zicheng Liu, Houqiang Li, Nan Duan.
ACL, 2023. - Analysing and Reducing the Performance Gap in Cross-Lingual Fine-tuning with Learning Slow and Fast.
Yiduo Guo, Yaobo Liang, Dongyan Zhao, Bing Liu, Nan Duan.
ACL, 2023. - ManagerTower: Aggregating the Insights of Uni-Modal Experts for Vision-Language Representation Learning.
Xiao Xu, Bei Li, Chenfei Wu, Shao-Yen Tseng, Anahita Bhiwandiwalla, Shachar Rosenman, Vasudev Lal, Wanxiang Che, Nan Duan.
ACL, 2023. - CONE: An Efficient COarse-to-fiNE Alignment Framework for Long Video Temporal Grounding.
Zhijian Hou, Wanjun Zhong, Lei Ji, Difei Gao, Kun Yan, Wing-Kwong Chan, Chong-Wah Ngo, Zheng Shou, Nan Duan.
ACL, 2023. - Disentangling Reasoning Capabilities from Language Models with Compositional Reasoning Transformers.
Wanjun Zhong, Tingting Ma, Jiahai Wang, Jian Yin, Tiejun Zhao, Chin-Yew Lin, Nan Duan.
Findings of ACL, 2023. - Code Execution with Pre-trained Language Models.
Chenxiao Liu, Shuai Lu, Weizhu Chen, Daxin Jiang, Alexey Svyatkovskiy, Shengyu Fu, Neel Sundaresan, Nan Duan.
Findings of ACL, 2023. - Joint Generator-Ranker Learning for Natural Language Generation.
Weizhou Shen, Yeyun Gong, Yelong Shen, Song Wang, Xiaojun Quan, Nan Duan, Weizhu Chen.
Findings of ACL, 2023. - LongCoder: A Long-Range Pre-trained Language Model for Code Completion.
Daya Guo, Canwen Xu, Nan Duan, Jian Yin, Julian McAuley.
ICML, 2023. - Synthetic Prompting: Generating Chain-of-Thought Demonstrations for Large Language Models.
Zhihong Shao, Yeyun Gong, Yelong Shen, Minlie Huang, Nan Duan, Weizhu Chen.
ICML, 2023. - Text Generation with Diffusion Language Models: A Pre-training Approach with Continuous Paragraph Denoise.
Zhenghao Lin, Yeyun Gong, Yelong Shen, Tong Wu, Zhihao Fan, Chen Lin, Nan Duan, Weizhu Chen.
ICML, 2023. - Learning 3D Photography Videos via Self-supervised Diffusion on Single Images.
Xiaodong Wang, Chenfei Wu, Shengming Yin, Minheng Ni, Jianfeng Wang, Linjie Li, Zhengyuan Yang, Fan Yang, Lijuan Wang, Zicheng Liu, Yuejian Fang, Nan Duan.
IJCAI, 2023. - NÜWA-LIP: Language Guided Image Inpainting with Defect-free VQGAN.
Minheng Ni, Chenfei Wu, Haoyang Huang, Daxin Jiang, Wangmeng Zuo, Nan Duan.
CVPR, 2023. - ReCo: Region-Controlled Text-to-Image Generation.
Zhengyuan Yang, Jianfeng Wang, Zhe Gan, Linjie Li, Kevin Lin, Chenfei Wu, Nan Duan, Zicheng Liu, Ce Liu, Michael Zeng, Lijuan Wang.
CVPR, 2023. - PROD: Progressive Distillation for Dense Retrieval.
Zhenghao Lin, Yeyun Gong, Xiao Liu, Hang Zhang, Chen Lin, Anlei Dong, Jian Jiao, Jingwen Lu, Daxin Jiang, Rangan Majumder, Nan Duan.
WWW, 2023. - Modeling Sequential Sentence Relation to Improve Cross-lingual Dense Retrieval.
Shunyu Zhang, Yaobo Liang, MING GONG, Daxin Jiang, Nan Duan.
ICLR, 2023. - Bridge-Tower: Building Bridges Between Encoders in Vision-Language Representation Learning. (GitHub)
Xiao Xu, Chenfei Wu, Shachar Rosenman, Vasudev Lal, Nan Duan.
AAAI, 2023. - An Efficient COarse-to-fiNE Alignment Framework @ Ego4D Natural Language Queries Challenge 2022
Zhijian Hou, Wanjun Zhong, Lei Ji, Difei Gao, Kun Yan, Wing-Kwong Chan, Chong-Wah Ngo, Zheng Shou, Nan Duan.
ECCV Ego4D Workshop, 2022. - Towards Compositional Generalization in Code Search.
Hojae Han, Seung-won Hwang, Shuai Lu, Nan Duan, Seungtaek Choi.
EMNLP, 2022. - Metric-guided Distillation: Distilling Knowledge from the Metric to Ranker and Retriever for Generative Commonsense Reasoning.
Xingwei He, Yeyun Gong, A-Long Jin, Weizhen Qi, Hang Zhang, Jian Jiao, Bartuer Zhou, Biao Cheng, SM Yiu, Nan Duan.
EMNLP, 2022. - Sentiment-Aware Word and Sentence Level Pre-training for Sentiment Analysis.
Shuai Fan, Chen Lin, Haonan Li, Zhenghao Lin, Jinsong Su, Hang Zhang, Yeyun Gong, JIan Guo, Nan Duan.
EMNLP, 2022. - CodeRetriever: A Large Scale Contrastive Pre-Training Method for Code Search.
Xiaonan Li, Yeyun Gong, Yelong Shen, Xipeng Qiu, Hang Zhang, Bolun Yao, Weizhen Qi, Daxin Jiang, Weizhu Chen, Nan Duan.
EMNLP, 2022. - CodeExp: Explanatory Code Document Generation. (GitHub)
Haotian Cui, Chenglong Wang, Junjie Huang, Jeevana Priya Inala, Todd Mytkowicz, Bo Wang, Jianfeng Gao, Nan Duan.
Findings of EMNLP, 2022. - Mixed-modality Representation Learning and Pre-training for Joint Table-and-Text Retrieval in OpenQA. (GitHub)
Junjie Huang, Wanjun Zhong, Qian Liu, Ming Gong, Daxin Jiang, Nan Duan.
Findings of EMNLP, 2022. - Soft-Labeled Contrastive Pre-Training for Function-Level Code Representation.
Xiaonan Li, Daya Guo, Yeyun Gong, Yun Lin, Yelong Shen, Xipeng Qiu, Daxin Jiang, Weizhu Chen, Nan Duan.
Findings of EMNLP, 2022. - SimANS: Simple Ambiguous Negatives Sampling for Dense Text Retrieval.
Kun Zhou, Yeyun Gong, Xiao Liu, Wayne Xin Zhao, Yelong Shen, Anlei Dong, Jingwen Lu, Rangan Majumder, Ji-Rong Wen, Nan Duan, Weizhu Chen.
Industry Track of EMNLP, 2022. - Execution-based Evaluation for Data Science Code Generation Models. (GitHub)
Junjie Huang, Chenglong Wang, Jipeng Zhang, cong yan, Haotian Cui, Jeevana Priya Inala, Colin Clement, Nan Duan, Jianfeng Gao.
EMNLP Workshop on Data Science with Human in the Loop, 2022. - NUWA-Infinity: Autoregressive over Autoregressive Generation for Infinite Visual Synthesis. (GitHub) (Homepage)
Chenfei Wu, Jian Liang, Xiaowei Hu, Zhe Gan, Jianfeng Wang, Lijuan Wang, Zicheng Liu, Yuejian Fang, Nan Duan.
NeurIPS, 2022. - Less-forgetting Multi-lingual Fine-tuning.
Yuren Mao, Yaobo Liang, Nan Duan, Haobo Wang, Kai Wang, Lu Chen, Yunjun Gao.
NeurIPS, 2022. - LogiGAN: Learning Logical Reasoning via Adversarial Pre-training.
Xinyu Pi, Wanjun Zhong, Yan Gao, Nan Duan, Jian-Guang Lou.
NeurIPS, 2022. - NÜWA: Visual Synthesis Pre-training for Neural visUal World creAtion. (GitHub)
Chenfei Wu, Jian Liang, Lei Ji, Fan Yang, Yuejian Fang, Daxin Jiang, Nan Duan.
ECCV, 2022. - Trace Controlled Text to Image Generation.
Kun Yan, Lei Ji, Chenfei Wu, Jianmin Bao, Ming Zhou, Nan Duan, Shuai Ma.
ECCV, 2022. - AdsCVLR: Commercial Visual-Linguistic Representation Modeling in Sponsored Search.
Yongjie Zhu, Chunhui Han, Yuefeng Zhan, Bochen Pang, Zhaoju Li, Hao Sun, Si Li, Boxin Shi, Nan Duan, Weiwei Deng, Ruofei Zhang, Liangjie Zhang, Qi Zhang.
ACM Multimedia, 2022. - CodeReviewer: Pre-Training for Automating Code Review Activities. (GitHub)
Zhiyu Li, Shuai Lu, Daya Guo, Nan Duan, Shailesh Jannu, Grant Jenks, Deep Majumder, Jared Green, Alexey Svyatkovskiy, Shengyu Fu, Neel Sundaresan.
ESEC/FSE, 2022. - VL-InterpreT: An Interactive Visualization Tool for Interpreting Vision-Language Transformers.
Estelle Aflalo, Meng Du, Shao-Yen Tseng, Yongfei Liu, Chenfei Wu, Nan Duan, Vasudev Lal.
CVPR, 2022. - Reasoning over Hybrid Chain for Table-and-Text Open Domain Question Answering.
Wanjun Zhong, Junjie Huang, Qian Liu, Ming Zhou, Jiahai Wang, Jian Yin, Nan Duan.
IJCAI, 2022. - Unsupervised Context Aware Sentence Representation Pretraining for Multi-lingual Dense Retrieval.
Ning Wu, Yaobo Liang, Houxing Ren, Linjun Shou, Nan Duan, Ming Gong, Daxin Jiang.
IJCAI, 2022. - ProQA: Structural Prompt-based Pre-training for Unified Question Answering. (GitHub)
Wanjun Zhong, Yifan Gao, Ning Ding, Yujia Qin, Zhiyuan Liu, Ming Zhou, Jiahai Wang, Jian Yin, Nan Duan.
NAACL, 2022. - KD-VLP: Improving End-to-End Vision-and-Language Pretraining with Object Knowledge Distillation.
Yongfei Liu, Chenfei Wu, Shao-yen Tseng, Vasudev Lal, Xuming He, Nan Duan.
Findings of NAACL, 2022. - Analytical Reasoning of Text.
Wanjun Zhong, Siyuan Wang, Duyu Tang, Zenan Xu, Daya Guo, Yining Chen, Jiahai Wang, Jian Yin, Ming Zhou, Nan Duan.
Findings of NAACL, 2022. - CULG: Commercial Universal Language Generation.
Haonan Li, yameng huang, Yeyun Gong, Jian Jiao, Ruofei Zhang, Timothy Baldwin, Nan Duan.
Industry Track of NAACL, 2022. - ReACC: A Retrieval-Augmented Code Completion Framework. (GitHub)
Shuai Lu, Nan Duan, Hojae Han, Daya Guo, seung-won hwang, Alexey Svyatkovskiy.
ACL, 2022. - UniXcoder: Unified Cross-Modal Pre-training for Code Representation. (GitHub)
Daya Guo, Shuai Lu, Nan Duan, Yanlin Wang, Ming Zhou, Jian Yin.
ACL, 2022. - Cross-Lingual Ability of Multilingual Masked Language Models: A Study of Language Structure.
Yuan Chai, Yaobo Liang, Nan Duan.
ACL, 2022. - Multi-View Document Representation Learning for Open-Domain Dense Retrieval.
Shunyu Zhang, Yaobo Liang, MING GONG, Daxin Jiang, Nan Duan.
ACL, 2022. - DialogVED: A Pre-trained Latent Variable Encoder-Decoder Model for Dialog Response Generation.
Wei Chen, Yeyun Gong, Song Wang, Bolun Yao, Weizhen Qi, zhongyu wei, Xiaowu Hu, Bartuer Zhou, Yi Mao, Weizhu Chen, Biao Cheng, Nan Duan.
ACL, 2022. - Contextual Fine-to-Coarse Distillation for Coarse-grained Response Selection in Open-Domain Conversations.
Wei Chen, Yeyun Gong, Can Xu, Huang Hu, Bolun Yao, zhongyu wei, Zhihao Fan, Xiaowu Hu, Bartuer Zhou, Biao Cheng, Daxin Jiang, Nan Duan.
ACL, 2022. - Logic-Driven Context Extension and Data Augmentation for Logical Reasoning of Text.
Siyuan Wang, Wanjun Zhong, Duyu Tang, zhongyu wei, Zhihao Fan, Daxin Jiang, Ming Zhou, Nan Duan.
Findings of ACL, 2022. - LaPraDoR: Unsupervised Pretrained Dense Retriever for Zero-Shot Text Retrieval.
Canwen Xu, Daya Guo, Nan Duan, Julian McAuley.
Findings of ACL, 2022. - Adversarial Retriever-Ranker for Dense Text Retrieval. (GitHub)
Hang Zhang, Yeyun Gong, Yelong Shen, Jiancheng Lv, Nan Duan, Weizhu Chen.
ICLR, 2022. - Learning to Generate Code Sketches.
Daya Guo, Alexey Svyatkovskiy, Jian Yin, Nan Duan, Marc Brockschmidt, Miltiadis Allamanis.
ICLR, 2022. - XLM-K: Improving Cross-Lingual Language Model Pre-Training with Multilingual Knowledge. (GitHub)
Xiaoze Jiang, Yaobo Liang, Weizhu Chen, Nan Duan.
AAAI, 2022. - Learning Temporal Video Procedure Segmentation from an Automatically Collected Large Dataset.
Lei Ji, Chenfei Wu, Daisy Zhou, Kun Yan, Edward Cui, Xilin Chen, Nan Duan.
WACV, 2022. - From LSAT: The Progress and Challenges of Complex Reasoning.
Siyuan Wang, Zhongkun Liu, Wanjun Zhong, Ming Zhou, Zhongyu Wei, Zhumin Chen, Nan Duan.
IEEE TASLP, 2022. - CLIP4Clip: An Empirical Study of CLIP for End to End Video Clip Retrieval. (GitHub)
Huaishao Luo, Lei Ji, Ming Zhong, Yang Chen, Wen Lei, Nan Duan, Tianrui Li.
Neurocomputing, 2022. - Long-Range Modeling of Source Code Files with eWASH: Extended Window Access by Syntax Hierarchy.
Colin Clement, Shuai Lu, Xiaoyu Liu, Michele Tufano, Dawn Drain, Nan Duan, Neel Sundaresan, Alexey Svyatkovskiy.
EMNLP, 2021. - Discovering Representation Sprachbund For Multilingual Pre-Training.
Yimin Fan, Yaobo Liang, Alexandre Muzio, Hany Hassan, Houqiang Li, Ming Zhou, Nan Duan.
Findings of EMNLP, 2021. - KFC: Knowledge Filtering and Contrastive Learning for Generative Commonsense Reasoning.
Haonan Li, Yeyun Gong, Jian Jiao, Ruofei Zhang, Timothy Baldwin, Nan Duan.
Findings of EMNLP, 2021. - WhiteningBERT: An Easy Unsupervised Sentence Embedding Approach.
Junjie Huang, Duyu Tang, Wanjun Zhong, Shuai Lu, Linjun Shou, Ming Gong, Daxin Jiang, Nan Duan.
Findings of EMNLP, 2021. - Learning from Inside: Self-driven Siamese Sampling and Reasoning for Video Question Answering.
Weijiang Yu, Haoteng Zheng, Mengfei Li, Lei Ji, Lijun Wu, Nong Xiao, Nan Duan.
NeurIPS, 2021. - CodeXGLUE: A Machine Learning Benchmark Dataset for Code Understanding and Generation. (GitHub)
Shuai Lu, Daya Guo, Shuo Ren, Junjie Huang, Alexey Svyatkovskiy, Ambrosio Blanco, Colin Clement, Dawn Drain, Daxin Jiang, Duyu Tang, Ge Li, Lidong Zhou, Linjun Shou, Long Zhou, Michele Tufano, Ming Gong, Ming Zhou, Nan Duan, Neel Sundaresan, Shao Kun Deng, Shengyu Fu, Shujie Liu.
NeurIPS (Datasets and Benchmarks Track), 2021. - Question Generation from Code Snippets and Programming Error Messages.
Bolun Yao, Wei Chen, Yeyun Gong, Bartuer Zhou, Jin Xie, Zhongyu Wei, Biao Cheng, Nan Duan.
NLPCC, 2021. - XGPT: Cross-modal Generative Pre-Training for Image Captioning.
Qiaolin Xia, Haoyang Huang, Nan Duan, Dongdong Zhang, Lei Ji, Zhifang Sui, Edward Cui, Taroon Bharti, Ming Zhou.
NLPCC, 2021. - Hybrid Reasoning Network for Video-based Commonsense Captioning.
Weijiang Yu, Jian Liang, Lei Ji, Lu Li, Yuejian Fang, Nong Xiao, Nan Duan.
ACM Multimedia, 2021. - ProphetNet-X: Large-Scale Pre-training Models for English, Chinese, Multi-lingual, Dialog, and Code Generation. (GitHub)
Weizhen Qi, Yeyun Gong, Yu Yan, Can Xu, Bolun Yao, Bartuer Zhou, Biao Cheng, Daxin Jiang, Jiusheng Chen, Ruofei Zhang, Houqiang Li, Nan Duan.
ACL-Demo, 2021. - FastSeq: Make Sequence Generation Faster.
Yu Yan, Fei Hu, Jiusheng Chen, Nikhil Bhendawade, Ting Ye, Yeyun Gong, Nan Duan, Desheng Cui, Bingyu Chi, Ruofei Zhang.
ACL-Demo, 2021. - BANG: Bridging Autoregressive and Non-autoregressive Generation with Large Scale Pretraining.
Weizhen Qi, Yeyun Gong, Jian Jiao, Yu Yan, Dayiheng Liu, Weizhu Chen, Kewen Tang, Houqiang Li, Jiusheng Chen, Ruofei Zhang, Ming Zhou, Nan Duan.
ICML, 2021. - EL-Attention: Memory Efficient Lossless Attention for Generation.
Yu Yan, Jiusheng Chen, Weizhen Qi, Nikhil Bhendawade, Yeyun Gong, Nan Duan, Ruofei Zhang.
ICML, 2021. - Poolingformer: Long Document Modeling with Pooling Attention.
Hang Zhang, Yeyun Gong, Yelong Shen, Weisheng Li, Jiancheng Lv, Nan Duan, Weizhu Chen.
ICML, 2021. - Control Image Captioning Spatially and Temporally.
Kun Yan, Lei Ji, Huaishao Luo, Ming Zhou, Nan Duan, Shuai Ma.
ACL, 2021. - Syntax-Enhanced Pre-trained Model.
Zenan Xu, Daya Guo, Duyu Tang, Qinliang Su, Linjun Shou, Ming Gong, Wanjun Zhong, Xiaojun Quan, Daxin Jiang, Nan Duan.
ACL, 2021. - CoSQA: 20,000+ Web Queries for Code Search and Question Answering.
Junjie Huang, Duyu Tang, Linjun Shou, Ming Gong, Ke Xu, Daxin Jiang, Ming Zhou, Nan Duan.
ACL, 2021. - Compare to The Knowledge: Graph Neural Fake News Detection with External Knowledge.
Linmei Hu, Tianchi Yang, Luhao Zhang, Wanjun Zhong, Duyu Tang, Chuan Shi, Nan Duan, Ming Zhou.
ACL, 2021. - GEM: A General Evaluation Benchmark for Multimodal Tasks.
Lin Su, Nan Duan, Edward Cui, Lei Ji, Chenfei Wu, Huaishao Luo, Yongfei Liu, Ming Zhong, Taroon Bharti, Arun Sacheti.
Findings of ACL, 2021. - GLGE: A New General Language Generation Evaluation Benchmark. (GitHub)
Dayiheng Liu, Yu Yan, Yeyun Gong, Weizhen Qi, Hang Zhang, Jian Jiao, Weizhu Chen, Jie Fu, Linjun Shou, Ming Gong, Pengcheng Wang, Jiusheng Chen, Daxin Jiang, Jiancheng Lv, Ruofei Zhang, Winnie Wu, Ming Zhou, Nan Duan.
Findings of ACL, 2021. - Hashing based Efficient Inference for Image-Text Matching.
Rong-Cheng Tu, Lei Ji, Huaishao Luo, Botian Shi, Heyan Huang, Nan Duan, Xian-Ling Mao.
Findings of ACL, 2021. - K-Adapter: Infusing Knowledge into Pre-Trained Models with Adapters. (GitHub)
Ruize Wang, Duyu Tang, Nan Duan, Zhongyu Wei, Xuanjing Huang, Jianshu Ji, Guihong Cao, Daxin Jiang, Ming Zhou.
Findings of ACL, 2021. - UserAdapter: Few-Shot User Learning in Sentiment Analysis.
Wanjun Zhong, Duyu Tang, Jiahai Wang, Jian Yin, Nan Duan.
Findings of ACL, 2021. - Tree-Capsule: Tree-Structured Capsule Network for Improving Relation Extraction.
Tianchi Yang, Linmei Hu, Luhao Zhang, Chuan Shi, Cheng Yang, Nan Duan, Ming Zhou.
PAKDD, 2021. - M3P: Learning Universal Representations via Multitask Multilingual Multimodal Pre-training. (GitHub)
Minheng Ni, Haoyang Huang, Lin Su, Edward Cui, Taroon Bharti, Lijuan Wang, Dongdong Zhang, Nan Duan.
CVPR, 2021. - GraphCodeBERT: Pre-training Code Representations with Data Flow. (GitHub)
Daya Guo, Shuo Ren, Shuai Lu, Zhangyin Feng, Duyu Tang, Shujie Liu, Long Zhou, Nan Duan, Alexey Svyatkovskiy, Shengyu Fu, Michele Tufano, Shao Kun Deng, Colin Clement, Dawn Drain, Neel Sundaresan, Jian Yin, Daxin Jiang, Ming Zhou.
ICLR, 2021. - Mask Attention Networks: Rethinking and Strengthen Transformer.
Zhihao Fan, Yeyun Gong, Dayiheng Liu, Zhongyu Wei, Siyuan Wang, Jian Jiao, Nan Duan, Ruofei Zhang, Xuanjing Huang.
NAACL, 2021. - Machine Reasoning: Technology, Dilemma and Future.
Nan Duan, Duyu Tang, Ming Zhou.
EMNLP Tutorial, 2020. - A Benchmark for Structured Procedural Knowledge Extraction from Cooking Videos.
Frank F. Xu, Lei Ji, Botian Shi, Junyi Du, Graham Neubig, Yonatan Bisk, Nan Duan.
NLPBT, 2020. - An Enhanced Knowledge Injection Model for Commonsense Generation.
Zhihao Fan, Yeyun Gong, Zhongyu Wei, Siyuan Wang, Yameng Huang, Jian Jiao, Xuanjing Huang, Nan Duan, Ruofei Zhang.
COLING, 2020. - Multi-level Alignment Pretraining for Multi-lingual Semantic Parsing.
Bo Shao, Yeyun Gong, Weizhen Qi, Nan Duan, Xiaola Lin.
COLING, 2020. - XGLUE: A New Benchmark Dataset for Cross-lingual Pre-training, Understanding and Generation. (GitHub)
Yaobo Liang, Nan Duan, Yeyun Gong, Ning Wu, Fenfei Guo, Weizhen Qi, Ming Gong, Linjun Shou, Daxin Jiang, Guihong Cao, Xiaodong Fan, Bruce Zhang, Rahul Agrawal, Edward Cui, Sining Wei, Taroon Bharti, Jiun-Hung Chen, Winnie Wu, Shuguang Liu, Fan Yang, Ming Zhou.
EMNLP, 2020. - CodeBERT: A Pre-Trained Model for Programming and Natural Language. (GitHub)
Zhangyin Feng, Daya Guo, Duyu Tang, Nan Duan, Xiaocheng Feng, Ming Gong, Linjun Shou, Bing Qin, Ting Liu, Daxin Jiang, Ming Zhou.
EMNLP, 2020. - ProphetNet: Predicting Future N-gram for Sequence-to-Sequence Pre-training. (GitHub)
Yeyun Gong, Yu Yan, Weizhen Qi, Dayiheng Liu, Nan Duan, Jiusheng Chen, Bruce Zhang, Ming Zhou.
EMNLP, 2020. - GRACE: Gradient Harmonized and Cascaded Labeling for Aspect-based Sentiment Analysis.
Huaishao Luo, Lei Ji, Tianrui Li, Daxin Jiang, Nan Duan.
EMNLP, 2020. - Diverse, Controllable, and Keyphrase-Aware: A Corpus and Method for News Multi-Headline Generation.
Dayiheng Liu, Yeyun Gong, Yu Yan, Jie Fu, Bo Shao, Daxin Jiang, Jiancheng Lv, Nan Duan.
EMNLP, 2020. - Neural Deepfake Detection with Factual Structure of Text.
Wanjun Zhong, Duyu Tang, Zenan Xu, Ruize Wang, Nan Duan, Ming Zhou, Jiahai Wang, Jian Yin.
EMNLP, 2020. - Tell Me How to Ask Again: Question Data Augmentation with Controllable Rewriting in Continuous Space.
Dayiheng Liu, Yeyun Gong, Jie Fu, Yu Yan, Jiusheng Chen, Jiancheng Lv, Nan Duan, Ming Zhou.
EMNLP, 2020. - Leveraging Declarative Knowledge in Text and First-Order Logic for Fine-Grained Propaganda Detection.
Ruize Wang, Duyu Tang, Nan Duan, Wanjun Zhong, Zhongyu Wei, Xuanjing Huang, Daxin Jiang, Ming Zhou.
EMNLP, 2020. - No Answer is Better Than Wrong Answer: A Reflection Model for Document Level Machine Reading Comprehension.
Xuguang Wang, Linjun Shou, Ming Gong, Nan Duan, Daxin Jiang.
EMNLP, 2020. - ProphetNet-Ads: A Looking Ahead Strategy for Generative Retrieval Models in Sponsored Search Engine.
Weizhen Qi, Yeyun Gong, Yu Yan, Jian Jiao, Bo Shao, Ruofei Zhang, Houqiang Li, Nan Duan and Ming Zhou.
NLPCC, 2020. - Learning Semantic Concepts and Temporal Alignment for Narrated Video Procedural Captioning.
Botian Shi, Lei Ji, Zhendong Niu, Nan Duan, Ming Zhou, Xilin Chen.
ACM Multimedia, 2020. - RikiNet: Reading Wikipedia Pages for Natural Question Answering.
Dayiheng Liu, Yeyun Gong, Jie Fu, Yu Yan, Jiusheng Chen, Daxin Jiang, Jiancheng Lv, Nan Duan.
ACL, 2020. - Document Modeling with Graph Attention Networks for Multi-grained Machine Reading Comprehension.
Bo Zheng, Haoyang Wen, Yaobo Liang, Nan Duan, Wanxiang Che, Daxin Jiang, Ming Zhou, Ting Liu.
ACL, 2020. - Reasoning Over Semantic-Level Graph for Fact Checking.
Wanjun Zhong, Jingjing Xu, Duyu Tang, Zenan Xu, Nan Duan, Ming Zhou, Jiahai Wang, Jian Yin.
ACL, 2020. - LogicalFactChecker: Leveraging Logical Operations for Fact Checking with Graph Module Network.
Wanjun Zhong, Duyu Tang, Zhangyin Feng, Nan Duan, Ming Zhou, Ming Gong, Linjun Shou, Daxin Jiang, Jiahai Wang, Jian Yin.
ACL, 2020. - Evidence-Aware Inferential Text Generation with Vector Quantised Variational AutoEncoder.
Daya Guo, Duyu Tang, Nan Duan, Jian Yin, Daxin Jiang, Ming Zhou.
ACL, 2020. - Enhancing Answer Boundary Detection for Multilingual Machine Reading Comprehension.
Fei Yuan, Linjun Shou, Xuanyu Bai, Ming Gong, Yaobo Liang, Nan Duan, Daxin Jiang, Yan Fu.
ACL, 2020. - Graph Neural News Recommendation with Unsupervised Preference Disentanglement.
Linmei Hu, Siyong Xu, Chen Li, Cheng Yang, Chuan Shi, Nan Duan, Xing Xie, Ming Zhou.
ACL, 2020. - Joint Learning of Question Answering and Question Generation.
Yibo Sun, Duyu Tang, Nan Duan, Tao Qin, Shujie Liu, Zhao Yan, Ming Zhou, Yuanhua Lv, Wenpeng Yin, Xiaocheng Feng, Bing Qin, Ting Liu.
IEEE TKDE, 2020. - Progress in Neural NLP: Modeling, Learning, and Reasoning.
Ming Zhou, Nan Duan, Shujie Liu, Heung-Yeung Shum.
Engineering, 2020. - Unicoder-VL: A Universal Encoder for Vision and Language by Cross-modal Pre-training.
Gen Li, Nan Duan, Yuejian Fang, Ming Gong, Daxin Jiang, Ming Zhou.
AAAI, 2020. - Segment-then-Rank: Non-factoid Question Answering on Instructional Videos.
Kyungjae Lee, Nan Duan, Lei Ji, Jason Li, Seung-won Hwang.
AAAI, 2020. - Graph-based Reasoning over Heterogeneous External Knowledge for Commonsense Question Answering.
Shangwen Lv, Daya Guo, Jingjing Xu, Duyu Tang, Nan Duan, Ming Gong, Linjun Shou, Daxin Jiang, Guihong Cao, Songlin Hu.
AAAI, 2020. - Neural Semantic Parsing in Low-Resource Settings with Back-Translation and Meta-Learning.
Yibo Sun, Duyu Tang, Nan Duan, Yeyun Gong, Xiaocheng Feng, Bing Qin, Daxin Jiang.
AAAI, 2020. - A Tensorized Transformer for Language Modeling.
Xindian Ma, Peng Zhang, Shuai Zhang, Nan Duan, Yuexian Hou, Dawei Song, Ming Zhou.
NeurIPS, 2019. - PasteGAN: A Semi-Parametric Method to Generate Image from Scene Graph.
Yikang Li, Nan Duan, Tao Ma, Yeqi Bai, Sining Wei, Xiaogang Wang.
NeurIPS, 2019. - Unicoder: A Universal Language Encoder by Pre-training with Multiple Cross-lingual Tasks. (GitHub)
Haoyang Huang, Yaobo Liang, Nan Duan, Ming Gong, Linjun Shou, Daxin Jiang, Ming Zhou.
EMNLP, 2019. - Multi-task Learning for Conversational Question Answering Over a Large-Scale Knowledge Base.
Tao Shen, Xiubo Geng, Tao Qin, Daya Guo, Duyu Tang, Nan Duan, Guodong Long, Daxin Jiang.
EMNLP, 2019. - Aggregating Bidirectional Encoder Representations Using MatchLSTM for Sequence Matching.
Bo Shao, Yeyun Gong, Weizhen Qi, Xiaola Lin, Nan Duan.
EMNLP, 2019. - Asking Clarification Questions in Knowledge-Based Question Answering.
Jingjing Xu, Yuechen Wang, Duyu Tang, Nan Duan, Pengcheng Yang, Qi Zeng, Ming Zhou, Xu Sun.
EMNLP, 2019. - Overview of the NLPCC 2019 Shared Task: Open Domain Semantic Parsing.
Nan Duan.
NLPCC, 2019. - Knowledge-Aware Conversational Semantic Parsing Over Web Tables.
Yibo Sun, Duyu Tang, Jingjing Xu, Nan Duan, Xiaocheng Feng, Bing Qin, Ting Liu, Ming Zhou.
NLPCC, 2019. - Improving Question Answering by Commonsense-Based Pre-Training.
Wanjun Zhong, Duyu Tang, Nan Duan, Ming Zhou, Jiahai Wang, Jian Yin.
NLPCC, 2019. - Deep Reason: A Strong Baseline for Real-World Visual Reasoning.
Chenfei Wu, Yanzhao Zhou, Gen Li, Nan Duan, Duyu Tang, Xiaojie Wang.
CVPR VQA Workshop, 2019. - Dense Procedure Captioning in Narrated Instructional Videos.
Botian Shi, Lei Ji, Yaobo Liang, Zhendong NIU, Nan Duan, Ming Zhou.
ACL, 2019. - Coupling Retrieval and Meta-Learning for Context-Dependent Semantic Parsing.
Daya Guo, Duyu Tang, Nan Duan, Ming Zhou, Jian Yin.
ACL, 2019. - Joint Type Inference on Entities and Relations via Graph Convolutional Networks.
Changzhi Sun, Yeyun Gong, Nan Duan, Ming Gong, Daxin Jiang, Shiliang Sun, Man Lan, Yuanbin Wu, Ming Zhou.
ACL, 2019. - Complex Question Decomposition for Semantic Parsing.
Haoyu Zhang, Yeyun Gong, Nan Duan, Jianjun Xu, Ji Wang, Ming Zhou.
ACL, 2019. - Knowledge Aware Semantic Concept Expansion for Image-Text Matching.
Botian Shi, Lei Ji, Pan Lu, Zhendong Niu, Nan Duan.
IJCAI, 2019. - Weakly Supervised Multi-task Learning for Semantic Parsing.
Bo Shao, Yeyun Gong, Junwei Bao, Xiaola Lin, Jianshu Ji, Guihong Cao, Nan Duan.
IJCAI, 2019. - Text Generation from Tables.
Junwei Bao, Duyu Tang, Nan Duan, Zhao Yan, Ming Zhou, Tiejun Zhao.
Transactions on Audio, Speech and Language Processing, 2019. - Content-Based Table Retrieval for Web Queries.
Yibo Sun, Zhao Yan, Duyu Tang, Nan Duan, Bing Qin.
Neurocomputing, 2019. - Knowledge-Aware Conversational Semantic Parsing Over Web Tables.
Yibo Sun, Duyu Tang, Nan Duan, Jingjing Xu, Xiaocheng Feng, Bing Qin.
NLPCC, 2018. - Dialog-to-Action: Conversational Question Answering over a Large-Scale Knowledge Base.
Daya Guo, Duyu Tang, Nan Duan, Jian Yin, Ming Zhou.
NeurIPS, 2018. - Question Generation from SQL Queries Improves Neural Semantic Parsing.
Daya Guo, Yibo Sun, Duyu Tang, Nan Duan, Ming Zhou, Jian Yin.
EMNLP, 2018. - Question Generation with Doubly-Adversarial Nets.
Junwei Bao, Yeyun Gong, Nan Duan, Ming Zhou, Tiejun Zhao.
Transactions on Audio, Speech and Language Processing, 2018. - R-VQA: Learning Visual Relation Facts with Semantic Attention for Visual Question Answering.
Pan Lu, Lei Ji, Wei Zhang, Nan Duan, Ming Zhou, Jianyong Wang.
KDD, 2018. - Semantic Parsing with Syntax- and Table-Aware SQL Generation.
Yibo Sun, Duyu Tang, Nan Duan, Jianshu Ji, Guihong Cao, Xiaocheng Feng, Bing Qin, Ting Liu, Ming Zhou.
ACL, 2018. - Response Selection from Unstructured Documents for Human-Computer Conversation Systems.
Zhao Yan, Nan Duan, Junwei Bao, Peng Chen, Ming Zhou, Zhoujun Li.
Knowledge-Based System, 2018. - Visual Question Generation as Dual Task of Visual Question Answering.
Yikang Li, Nan Duan, Bolei Zhou, Xiao Chu, Wanli Ouyang, Xiaogang Wang, Ming Zhou.
CVPR, 2018. - Learning to Collaborate for Question Answering and Asking.
Duyu Tang, Nan Duan, Zhao Yan, Zhirui Zhang, Yibo Sun, Shujie Liu, Yuanhua Lv, Ming Zhou.
NAACL, 2018. - Table-to-Text: Describing Table Region with Natural Language.
Junwei Bao, Duyu Tang, Nan Duan, Zhao Yan, Yuanhua Lv, Ming Zhou, Tiejun Zhao.
AAAI, 2018. - Assertion-based QA with Question-Aware Open Information Extraction.
Zhao Yan, Duyu Tang, Nan Duan, Shujie Liu, Wendi Wang, Daxin Jiang, Ming Zhou, Zhoujun Li.
AAAI, 2018. - Overview of the NLPCC 2017 Shared Task: Open Domain QA.
Nan Duan.
NLPCC, 2017. - Question Generation for Question Answering.
Nan Duan, Duyu Tang, Peng Chen, Ming Zhou.
EMNLP, 2017. - Building Task-Oriented Dialogue Systems for Online Shopping.
Zhao Yan, Nan Duan, Peng Chen, Ming Zhou, Jianshe Zhou, Zhoujun Li.
AAAI, 2017. - An Open Domain Topic Prediction Model for Answer Selection.
Zhao Yan, Nan Duan, Ming Zhou, Zhoujun Li.
NLPCC-ICCPOL, 2016. - Overview of the NLPCC-ICCPOL 2016 Shared Task: Open Domain QA.
Nan Duan.
NLPCC-ICCPOL, 2016. - Constraint-Based Question Answering with Knowledge Graph.
Junwei Bao, Nan Duan, Zhao Yan, Ming Zhou, Tiejun Zhao.
COLING, 2016. - DocChat: An Information Retrieval Approach for Chatbot Engines Using Unstructured Documents.
Zhao Yan, Nan Duan, Junwei Bao, Peng Chen, Ming Zhou, Zhoujun Li, Jianshe Zhou.
ACL, 2016. - Overview of the NLPCC 2015 Shared Task: Open Domain QA.
Nan Duan.
NLPCC, 2015. - Answering Questions with Complex Semantic Constraints on Open Knowledge Bases.
Pengcheng Yin, Nan Duan, Ben Kao, Junwei Bao, Ming Zhou.
CIKM, 2015. - Joint Relational Embeddings for Knowledge-based Question Answering.
Min-Chul Yang, Nan Duan, Ming Zhou, Hae-Chang Rim.
EMNLP, 2014. - Knowledge-based Question Answering as Machine Translation.
Junwei Bao, Nan Duan, Ming Zhou, Tiejun Zhao.
ACL, 2014. - 从图谱搜索看搜索技术的发展趋势.
段楠.
《中国计算机学会通讯》, 2013. - Minimum Bayes Risk based Answer Re-ranking for Question Answering.
Nan Duan.
ACL, 2013. - Paraphrasing Adaptation for Web Search Ranking.
Chenguang Wang, Nan Duan, Ming Zhou, Ming Zhang.
ACL, 2013. - Answer Extraction from Passage Graph for Factoid Question Answering.
Hong Sun, Nan Duan, Yajuan Duan, Ming Zhou.
IJCAI, 2013. - Forced Derivation Tree based Model Training to Statistical Machine Translation.
Nan Duan, Mu Li, Ming Zhou.
EMNLP, 2012. - Improving Phrase Extraction via MBR Phrase Scoring and Pruning.
Nan Duan, Mu Li, Ming Zhou.
MT Summit XIII, 2011. - A Comparative Analysis of Consensus Decoding Methods for Statistical Machine Translation.
Nan Duan, Mu Li, Ming Zhou.
Journal of Chinese Information Processing, 2011. (in Chinese) - Hypothesis Mixture Decoding for Statistical Machine Translation.
Nan Duan, Mu Li, Ming Zhou.
ACL, 2011. - The MSRA Machine Translation System for IWSLT 2010.
Chi-Ho Li, Nan Duan, Yinggong Zhao, Shujie Liu, Lei Cui, Mei-yuh Hwang, Amittai Axelrod, Jianfeng Gao, Yaodong Zhang, Li Deng.
IWSLT, 2010. - Translation Model Generalization using Probability Averaging for Machine Translation.
Nan Duan, Hong Sun, Ming Zhou.
COLING, 2010. - Mixture Model-based Minimum Bayes Risk Decoding using Multiple Machine Translation Systems.
Nan Duan, Mu Li, Dongdong Zhang, Ming Zhou.
COLING, 2010. - The Feature Subspace Method for SMT System Combination.
Nan Duan, Mu Li, Tong Xiao, Ming Zhou.
EMNLP, 2009. - Collaborative Decoding: Partial Hypothesis Re-ranking using Translation Consensus between Decoders.
Mu Li, Nan Duan, Dongdong Zhang, Chi-Ho Li, Ming Zhou.
ACL, 2009. - MSRA Technical Report for the 5th China Workshop on Machine Translation.
Dongdong Zhang, Chi-Ho Li, Nan Duan, Shujie Liu, Mu Li, Ming Zhou.
CWMT, 2009. - Measure Word Generation for English-Chinese SMT Systems.
Dongdong Zhang, Mu Li, Nan Duan, Chi-Ho Li, Ming Zhou.
ACL, 2008.
Academic Service & Award
- Adjunct Ph.D. Supervisor at Xi’an Jiaotong University (西安交通大学), 2023-present.
- Adjunct Ph.D. Supervisor at University of Science and Technology of China (中国科学技术大学), 2022-present.
-
Adjunct Professor at Tianjin University (天津大学), 2020-2022.
- Program Committee Chair of NLPCC, 2023.
- Evaluation Chair of NLPCC, 2019/2018.
- Senior Action Editor of ACL Rolling Review (ARR), 2022-present.
- Senior Area Chair/Area Chair of NeurIPS/ACL/EMNLP/NAACL/SIGKDD.
- Standing Reviewer of TACL, 2020-present.
-
Program Committee Member of ACL/EMNLP/NAACL/COLING/NeurIPS/ICLR/CVPR/AAAI/SIGKDD/IJCAI/etc.
- Distinguished Member of CCF, 2021.
- Senior Member of CCF, 2021.
- Member of CIPS Technical Committee of NLG, 2021-present.
- Member of CCF Committee on Academic Affairs, 2020-present.
- Member of CCF Technical Committee of NLP, 2018-present.
-
Secretary of CCF Committee on Terminology, 2016-2018.
- TaskMatrix was selected as Open100 (2022-2023).
- AGIEval was selected as Bench100 (2022-2023).
- World’s Top 2% Scientists by Stanford, 2023/2022.
- The Intelligent Computing Innovators China (中国智能计算科技创新人物), 2023.
- CVPR Best Demo Award, 2022.
- CCF-NLPCC Distinguished Young Scientist Award (CCF-NLPCC青年科学家奖), 2019.
- CCF Distinguished Speaker (CCF杰出演讲者), 2021/2020/2017.
Lecture (2017-present)
- Peking University (news)
- Tsinghua University
- Nankai University (news)
- University of Science and Technology of China (news)
- Chinese Academy of Sciences
- Xi’an Jiaotong University (news)
- Southwest Jiaotong University
- Northeastern University
- Jiangnan University (news)
- Fudan University
- North China Electric Power University
- Xiamen University (news)
- Tianjin University (news) (news)
- Beijing University of Posts and Telecommunications
- Beijing University of Aeronautics and Astronautics
- Nanjing University of Aeronautics and Astronautics (news)
Patent
- Code Execution with Pre-trained Language Models, 2023.
- Pretraining for Automating Code Review Activities, 2022.
- SimANS: Simple Ambiguous Negative Sampling for Dense Text Retrieval, 2022.
- Distilling Knowledge from Metric to Ranker and Retriever, 2022.
- Retrieval Augmented Code Completion, 2022.
- Sentence representation generation for cross-lingual retrieval, 2022.
- Code Bug Detection, 2021.
- Performing multiple tasks with continual adaptation, 2021.
- Resource-Efficient Attention in a Neural Network, 2021.
- Interpretable Bug Detection for Codes with Structural Attention Constraints, 2021.
- Generation of data models for predicting data, 2020.
- Knowledge injection model for generative commonsense reasoning, 2020.
- A look-ahead strategy for trie-based beam search in generative retrieval, 2020.
- Transformer-Based Neural Network including a Mask Attention Network, 2020.
- Fact checking based on semantic graphs, 2019.
- Cross-lingual task training, 2019.
- Text generation with customizable style, 2019.
- Matching based intent understanding with transfer learning, 2019.
- VideoChat, 2018.
- Natural language question answering, 2018.
- Assertion-based question answering, 2017.
- Generation of text from structured data, 2017.
- Conversation oriented machine-user interaction, 2016.