 
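Song Yan (宋彦)
Affiliation: School of Information Science and Technology
Address: Room C410, Academic Building 1, USTC High-tech Campus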
Personal homepage: http://dslx.ustc.edu.cn/?menu=expert_paper&expertid=6569681
 
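Biography
Professor Song Yan has been listed among the world's top 2% of scientists (2022, 2023, 2024) and is a Distinguished Professor under the Anhui Province Leading Talent program. Professor Song's research spans artificial intelligence, including natural language processing, information retrieval and extraction, text representation learning, multimodal content processing, and the theory and technology of large models. Professor Song has published more than 100 papers in top international AI journals and conferences and has filed more than 70 patent applications, of which more than 30 have been granted. Professor Song regularly serves as a program committee member and senior program committee member for top international conferences, and is currently an executive member of the CCF-NLP technical committee. Before joining USTC, Professor Song worked in industry for many years as a core researcher in the artificial intelligence teams at Microsoft and Tencent, was a founding team member of the Microsoft XiaoIce (微软小冰) project, and led the construction of Tencent's large-scale Chinese word embedding dataset. This work brought extensive experience in turning research into practice, including chatbots and large-scale foundational resources and systems for Chinese text representation. The large-scale Chinese word embeddings filled a gap in text representation technology in China, provided high-quality open-source data for related research and applications, and were named by foreign media as one of the world's top ten open-source AI datasets of 2018.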
 
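Research Directions
Natural language processing, text representation learning, large models, and machine learning
Multimodal content processing, information extraction, and knowledge mining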
 
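Admissions Information
The group builds on USTC's research platform, is equipped with matching computing resources, collaborates closely with leading universities and companies in China and abroad, and supports international conference travel and joint training programs.
Academic and engineering doctoral students are recruited from China and overseas. Outstanding students who are committed to academic research and show strong creative ability are warmly welcome to apply.
Interested students should send a CV (including a list of academic achievements and 1-3 representative papers) to songyan@ustc.edu.cn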
 
Publications
1) Dependency-driven Relation Extraction with Attentive Graph Convolutional Networks - Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing - 2021
2) Improving Chinese Word Segmentation with Wordhood Memory Networks - Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics - 2020
3) Generating Radiology Reports via Memory-driven Transformer - Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing - 2020
4) ZEN: Pre-training Chinese Text Encoder Enhanced by N-gram Representations - Findings of the Association for Computational Linguistics: EMNLP 2020 - 2020
5) Aspect-based Sentiment Analysis with Type-aware Graph Convolutional Networks and Layer Ensemble - Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies - 2021
6) Combinatory Grammar Tells Underlying Relevance among Entities - Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing - 2022
7) Improving English-Arabic Transliteration with Phonemic Memories - Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing - 2022
8) Composing Ci with Reinforced Non-autoregressive Text Generation - Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing - 2022
9) Enhancing Structure-aware Encoder with Extremely Limited Data for Graph-based Dependency Parsing - Proceedings of the 29th International Conference on Computational Linguistics - 2022
10) Chinese Couplet Generation with Syntactic Information - Proceedings of the 29th International Conference on Computational Linguistics - 2022
11) Learning Semantic Relationship among Instances for Image-Text Matching - 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) - 2023
12) Emotion Cause Extraction in Conversations with Response Graphing - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) - 2024
13) Preserving Content in Text Style Transfer via Normalizing Flow and Adversarial Learning - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) - 2024
14) Improving radiology report generation with multi-grained abnormality prediction - Neurocomputing - 2024
15) Diffusion Networks with Task-Specific Noise Control for Radiology Report Generation - MM 2024 - Proceedings of the 32nd ACM International Conference on Multimedia - 2024
16) ChiMed-GPT: A Chinese Medical Large Language Model with Full Training Regime and Better Alignment to Human Preferences - Proceedings of the Annual Meeting of the Association for Computational Linguistics - 2024
17) Learning Multimodal Contrast with Cross-modal Memory and Reinforced Contrast Recognition - Proceedings of the Annual Meeting of the Association for Computational Linguistics - 2024
18) Dialogue Summarization with Mixture of Experts based on Large Language Models - Proceedings of the Annual Meeting of the Association for Computational Linguistics - 2024
19) Challenging Large Language Models with New Tasks: A Study on their Adaptability and Robustness - Proceedings of the Annual Meeting of the Association for Computational Linguistics - 2024
20) RESEMO: A Benchmark Chinese Dataset for Studying Responsive Emotion from Social Media Content - Proceedings of the Annual Meeting of the Association for Computational Linguistics - 2024
21) Large Language Models Are No Longer Shallow Parsers - Proceedings of the Annual Meeting of the Association for Computational Linguistics - 2024
22) Prompting Few-shot Multi-hop Question Generation via Comprehending Type-aware Semantics - Findings of the Association for Computational Linguistics: NAACL 2024 - 2024
23) Aspect-based Sentiment Analysis with Context Denoising - Findings of the Association for Computational Linguistics: NAACL 2024 - 2024
24) Bootstrapping Large Language Models for Radiology Report Generation - Proceedings of the AAAI Conference on Artificial Intelligence - 2024
25) Lyrics: Boosting Fine-grained Language-Vision Alignment and Comprehension via Semantic-aware Visual Objects - arXiv - 2023
26) Improving Image Captioning via Predicting Structured Concepts - Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing - 2023
27) Ziya2: Data-centric Learning is All LLMs Need - arXiv - 2023
28) A Systematic Review of Deep Learning-based Research on Radiology Report Generation - arXiv - 2023
29) ChiMed-GPT: A Chinese Medical Large Language Model with Full Training Regime and Better Alignment to Human Preferences - arXiv - 2023
30) Ziya-VL: Bilingual Large Vision-Language Model via Multi-Task Instruction Tuning - arXiv - 2023
31) End-to-end Aspect-based Sentiment Analysis with Combinatory Categorial Grammar - Proceedings of the Annual Meeting of the Association for Computational Linguistics - 2023
32) Text Style Transfer with Contrastive Transfer Pattern Mining - Proceedings of the Annual Meeting of the Association for Computational Linguistics - 2023
33) Hashtag-Guided Low-Resource Tweet Classification - Proceedings of the International Conference of World Wide Web - 2023
 