UNF Faculty Research and Scholarship

DeepPPPred: Deep Ensemble Learning with Transformers, Recurrent and Convolutional Neural Networks for Human Protein-Phenotype Co-mention Classification

Morteza Pourreza Shahri, Montana State University
Katrina Lyon, Montana State University
Julia Schearer, Montana State University
Indika Kahanda, University of North FloridaFollow

Document Type

Conference Proceeding

Publication Date

1-1-2021

Abstract

The extensive collection of biomedical literature is arguably the best source of knowledge and information on the latest scientific findings and fundamental problems for the biological and clinical communities. However, these articles contain unstructured text; therefore, this valuable knowledge may remain hidden without manual curation, which is tedious and time-consuming due to the rapid growth of publication. The relationships and associations between human proteins and phenotypic abnormalities associated with human disease are one such area of valuable knowledge. This situation calls for the development of accurate computational tools capable of automatically inferring these associations from text data, assisting human curators in expediting their triage and information extraction tasks. This work develops DeepPPPred, a deep ensemble learning model for protein-phenotype co-mention classification at the sentence level. In particular, DeepPPPred combines Support Vector Machines, Transformer models, Recurrent Neural Networks, and Convolutional Neural Networks via stacking. Our experimental results obtained using a manually curated gold-standard dataset demonstrate that DeepPPPred can provide state-of-the-art performance while outperforming all its competitors. This is the first study that develops deep learning models for the problem of classifying human protein-phenotype co-mentions. Our findings have implications for the biological and clinical communities and text mining and natural language processing developers working on biomedical relation extraction.

Publication Title

Proceedings - 2021 IEEE International Conference on Bioinformatics and Biomedicine, BIBM 2021

First Page

2869

Last Page

2876

Digital Object Identifier (DOI)

10.1109/BIBM52615.2021.9669352

ISBN

9781665401265

Citation Information

M. P. Shahri, K. Lyon, J. Schearer and I. Kahanda, "DeepPPPred: Deep Ensemble Learning with Transformers, Recurrent and Convolutional Neural Networks for Human Protein-Phenotype Co-mention Classification," 2021 IEEE International Conference on Bioinformatics and Biomedicine (BIBM), 2021, pp. 2869-2876, doi: 10.1109/BIBM52615.2021.9669352.

Link to Full Text

COinS

UNF Faculty Research and Scholarship

DeepPPPred: Deep Ensemble Learning with Transformers, Recurrent and Convolutional Neural Networks for Human Protein-Phenotype Co-mention Classification

Document Type

Publication Date

Abstract

Publication Title

First Page

Last Page

Digital Object Identifier (DOI)

ISBN

Citation Information

Search

Links

Browse

Author Corner

UNF Faculty Research and Scholarship

DeepPPPred: Deep Ensemble Learning with Transformers, Recurrent and Convolutional Neural Networks for Human Protein-Phenotype Co-mention Classification

Authors

Document Type

Publication Date

Abstract

Publication Title

First Page

Last Page

Digital Object Identifier (DOI)

ISBN

Citation Information

Share

Search

Links

Browse

Author Corner