Extracting chemical-protein relations with ensembles of SVM and deep learning models

Yifan Peng, Anthony Rios, Ramakanth Kavuluru, Zhiyong Lu

Research output: Contribution to journalArticlepeer-review

55 Scopus citations


Mining relations between chemicals and proteins from the biomedical literature is an increasingly important task. The CHEMPROT track at BioCreative VI aims to promote the development and evaluation of systems that can automatically detect the chemical- protein relations in running text (PubMed abstracts). This work describes our CHEMPROT track entry, which is an ensemble of three systems, including a support vector machine, a convolutional neural network, and a recurrent neural network. Their output is combined using majority voting or stacking for final predictions. Our CHEMPROT system obtained 0.7266 in precision and 0.5735 in recall for an F-score of 0.6410 during the challenge, demonstrating the effectiveness of machine learning-based approaches for automatic relation extraction from biomedical literature and achieving the highest performance in the task during the 2017 challenge.

Original languageEnglish
Article numberbay073
Issue number2018
StatePublished - Jan 1 2018

Bibliographical note

Funding Information:
This work was supported by the Intramural Research Programs of the National Institutes of Health, National Library of Medicine [Y.P. and Z.L.] and through an extramural National Library of Medicine grant [R21LM012274 to A.R. and R.K.]. A.R. was a summer intern at the NCBI/NIH and supported by the NIH Intramural Research Training Award.

Publisher Copyright:
© 2018 Oxford University Press. All rights reserved.

ASJC Scopus subject areas

  • Information Systems
  • Biochemistry, Genetics and Molecular Biology (all)
  • Agricultural and Biological Sciences (all)


Dive into the research topics of 'Extracting chemical-protein relations with ensembles of SVM and deep learning models'. Together they form a unique fingerprint.

Cite this