A sequence-to-sequence approach for document-level relation extraction

Giorgi, John; Bader, Gary; Wang, Bo

doi:10.18653/v1/2022.bionlp-1.2

A sequence-to-sequence approach for document-level relation extraction

Open in Zotero

View on zotero.org

Resource type

Conference Paper

Authors/contributors

Giorgi, John (Author)
Bader, Gary (Author)
Wang, Bo (Author)
Demner-Fushman, Dina (Editor)
Cohen, Kevin Bretonnel (Editor)
Ananiadou, Sophia (Editor)
Tsujii, Junichi (Editor)

Title

A sequence-to-sequence approach for document-level relation extraction

Abstract

Motivated by the fact that many relations cross the sentence boundary, there has been increasing interest in document-level relation extraction (DocRE). DocRE requires integrating information within and across sentences, capturing complex interactions between mentions of entities. Most existing methods are pipeline-based, requiring entities as input. However, jointly learning to extract entities and relations can improve performance and be more efficient due to shared parameters and training steps. In this paper, we develop a sequence-to-sequence approach, seq2rel, that can learn the subtasks of DocRE (entity extraction, coreference resolution and relation extraction) end-to-end, replacing a pipeline of task-specific components. Using a simple strategy we call entity hinting, we compare our approach to existing pipeline-based methods on several popular biomedical datasets, in some cases exceeding their performance. We also report the first end-to-end results on these datasets for future comparison. Finally, we demonstrate that, under our model, an end-to-end approach outperforms a pipeline-based approach. Our code, data and trained models are available at https://github.com/johngiorgi/seq2rel. An online demo is available at https://share.streamlit.io/johngiorgi/seq2rel/main/demo.py.

Date

2022-05

Proceedings Title

Proceedings of the 21st Workshop on Biomedical Language Processing

Conference Name

BioNLP 2022

Place

Dublin, Ireland

Publisher

Association for Computational Linguistics

Pages

10–25

DOI

10.18653/v1/2022.bionlp-1.2

URL

https://aclanthology.org/2022.bionlp-1.2

Accessed

24/02/2024, 17:42

Library Catalogue

ACLWeb

Citation

Giorgi, J., Bader, G., & Wang, B. (2022). A sequence-to-sequence approach for document-level relation extraction. In D. Demner-Fushman, K. B. Cohen, S. Ananiadou, & J. Tsujii (Eds.), Proceedings of the 21st Workshop on Biomedical Language Processing (pp. 10–25). Association for Computational Linguistics. https://doi.org/10.18653/v1/2022.bionlp-1.2

Theme

Artificial Intelligence

Link to this record

https://docs.opendeved.net/lib/2QQB29T2