Abstract
Relation extraction (RE) is a well-known NLP application often treated as a sentence or document-level task. However, a handful of recent efforts explore it across documents or in the cross-document setting (CrossDocRE). This is distinct from the single document case because different documents often focus on disparate themes, while text within a document tends to have a single goal. Current CrossDocRE efforts do not consider domain knowledge, which are often assumed to be known to the reader when documents are authored. Here, we propose a novel approach, KXDocRE, that embed domain knowledge of entities with input text for cross-document RE. Our proposed framework has three main benefits over baselines: 1) it incorporates domain knowledge of entities along with documents' text; 2) it offers interpretability by producing explanatory text for predicted relations between entities 3) it improves performance over the prior methods. Code and models are available at https://github.com/kracr/cross-doc-relation-extraction.
| Original language | English |
|---|---|
| Title of host publication | The 62nd Annual Meeting of the Association for Computational Linguistics |
| Subtitle of host publication | Findings of the Association for Computational Linguistics, ACL 2024 |
| Editors | Lun-Wei Ku, Andre Martins, Vivek Srikumar |
| Pages | 3787-3797 |
| Number of pages | 11 |
| ISBN (Electronic) | 9798891760998 |
| DOIs | |
| State | Published - 2024 |
| Event | Findings of the 62nd Annual Meeting of the Association for Computational Linguistics, ACL 2024 - Hybrid, Bangkok, Thailand Duration: Aug 11 2024 → Aug 16 2024 |
Publication series
| Name | Proceedings of the Annual Meeting of the Association for Computational Linguistics |
|---|---|
| ISSN (Print) | 0736-587X |
Conference
| Conference | Findings of the 62nd Annual Meeting of the Association for Computational Linguistics, ACL 2024 |
|---|---|
| Country/Territory | Thailand |
| City | Hybrid, Bangkok |
| Period | 8/11/24 → 8/16/24 |
Bibliographical note
Publisher Copyright:© 2024 Association for Computational Linguistics.
Funding
We express our sincere gratitude to the Infosys Centre for Artificial Intelligence (CAI) at IIIT-Delhi for their support. RK's effort has been supported by the U.S. National Library of Medicine (through grant R01LM013240.)
| Funders | Funder number |
|---|---|
| Infosys Centre for Artificial Intelligence, Indraprastha institute of Information Technology | |
| Caja de Ahorros de la Inmaculada de Aragón | |
| U.S. National Library of Medicine | R01LM013240 |
| U.S. National Library of Medicine |
ASJC Scopus subject areas
- Computer Science Applications
- Linguistics and Language
- Language and Linguistics