Journal of South China University of Technology(Natural Science Edition) ›› 2022, Vol. 50 ›› Issue (4): 10-25.doi: 10.12141/j.issn.1000-565X.210152

Special Issue: 2022年计算机科学与技术

• Computer Science & Technology • Previous Articles     Next Articles

A Survey on Document-Level Relation Extraction

ZHOU Youhua1 HUANG Han2 LIU Haolong3 HAO Zhifeng4   

  1. 1. School of Software Engineering,South China University of Technology,Guangzhou 510006,Guangdong,China;
    2. School of Mathematics and Big Data,Foshan University,Foshan 528225,Guangdong,China
  • Received:2021-03-21 Revised:2021-08-09 Online:2022-04-25 Published:2021-08-27
  • Contact: 黄翰 (1980-),男,教授,博士生导师,主要从事智能算法理论与应用研究 E-mail:hhan@ scut. edu. cn
  • About author:周友华 (1986-),男,博士生,主要从事大数据审计与知识图谱研究
  • Supported by:
    National Natural Science Foundation of China

Abstract: Relation extraction (RE) is one of the most important tasks in information extraction of NLP, the result of RE can be used to downstream missions such as construction of knowledge graphs, knowledge base question answering, semantic search et al. which means RE has wide-ranging application scenarios and important research value. Recent years, RE achieves frutiful results, but most of them are limited in sentence-level RE, which focus on extract relation between two mentions within a single sentence. Reserches shows that a large number of relations can’t extract from a single sentence, in rencent years, document-level RE faces new opportunities and challenges with the development of deep learning and NLP. This study reviews the recent advances in document-level RE research, summarize a general technology roadmap of this task, and then analyzes the encoding and aggregation methods used in the researches, We also introduce the common datasets and evaluation metrics of this task. This paper ends up with forecasting the future development trend of this task.

Key words: document-level, relation extraction, encoding, aggregation

CLC Number: