Knowledge-Driven Cross-Document Relation Extraction

Monika Jain, Raghava Mutharaju, Kuldeep Singh, Ramakanth Kavuluru

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

3 Scopus citations

Abstract

Relation extraction (RE) is a well-known NLP application often treated as a sentence or document-level task. However, a handful of recent efforts explore it across documents or in the cross-document setting (CrossDocRE). This is distinct from the single document case because different documents often focus on disparate themes, while text within a document tends to have a single goal. Current CrossDocRE efforts do not consider domain knowledge, which are often assumed to be known to the reader when documents are authored. Here, we propose a novel approach, KXDocRE, that embed domain knowledge of entities with input text for cross-document RE. Our proposed framework has three main benefits over baselines: 1) it incorporates domain knowledge of entities along with documents' text; 2) it offers interpretability by producing explanatory text for predicted relations between entities 3) it improves performance over the prior methods. Code and models are available at https://github.com/kracr/cross-doc-relation-extraction.

Original languageEnglish
Title of host publicationThe 62nd Annual Meeting of the Association for Computational Linguistics
Subtitle of host publicationFindings of the Association for Computational Linguistics, ACL 2024
EditorsLun-Wei Ku, Andre Martins, Vivek Srikumar
Pages3787-3797
Number of pages11
ISBN (Electronic)9798891760998
DOIs
StatePublished - 2024
EventFindings of the 62nd Annual Meeting of the Association for Computational Linguistics, ACL 2024 - Hybrid, Bangkok, Thailand
Duration: Aug 11 2024Aug 16 2024

Publication series

NameProceedings of the Annual Meeting of the Association for Computational Linguistics
ISSN (Print)0736-587X

Conference

ConferenceFindings of the 62nd Annual Meeting of the Association for Computational Linguistics, ACL 2024
Country/TerritoryThailand
CityHybrid, Bangkok
Period8/11/248/16/24

Bibliographical note

Publisher Copyright:
© 2024 Association for Computational Linguistics.

Funding

We express our sincere gratitude to the Infosys Centre for Artificial Intelligence (CAI) at IIIT-Delhi for their support. RK's effort has been supported by the U.S. National Library of Medicine (through grant R01LM013240.)

FundersFunder number
Infosys Centre for Artificial Intelligence, Indraprastha institute of Information Technology
Caja de Ahorros de la Inmaculada de Aragón
U.S. National Library of MedicineR01LM013240
U.S. National Library of Medicine

    ASJC Scopus subject areas

    • Computer Science Applications
    • Linguistics and Language
    • Language and Linguistics

    Fingerprint

    Dive into the research topics of 'Knowledge-Driven Cross-Document Relation Extraction'. Together they form a unique fingerprint.

    Cite this