Skip to main navigation Skip to search Skip to main content

Measuring semantic similarities by combining gene ontology annotations and gene co-function networks

  • Jiajie Peng
  • , Sahra Uygun
  • , Taehyong Kim
  • , Yadong Wang
  • , Seung Y. Rhee
  • , Jin Chen

Research output: Contribution to journalArticlepeer-review

43 Scopus citations

Abstract

Gene Ontology (GO) has been used widely to study functional relationships between genes. The current semantic similarity measures rely only on GO annotations and GO structure. This limits the power of GO-based similarity because of the limited proportion of genes that are annotated to GO in most organisms. Results: We introduce a novel approach called NETSIM (network-based similarity measure) that incorporates information fromgene co-function networks in addition to using the GO structure and annotations. Using metabolic reaction maps of yeast, Arabidopsis, and human, we demonstrate that NETSIM can improve the accuracy of GO term similarities. We also demonstrate that NETSIM works well even for genomes with sparser gene annotation data. We applied NETSIM on large Arabidopsis gene families such as cytochrome P450 monooxygenases to group the members functionally and show that this grouping could facilitate functional characterization of genes in these families. Conclusions: Using NETSIM as an example, we demonstrated that the performance of a semantic similarity measure could be significantly improved after incorporating genome-specific information. NETSIM incorporates both GO annotations and gene co-function network data as a priori knowledge in the model. Therefore, functional similarities of GO terms that are not explicitly encoded in GO but are relevant in a taxon-specific manner become measurable when GO annotations are limited. Supplementary information and software are available at http://www.msu.edu/~jinchen/NETSIM.

Original languageEnglish
Article number44
JournalBMC Bioinformatics
Volume16
Issue number1
DOIs
StatePublished - Dec 14 2015

Bibliographical note

Publisher Copyright:
© 2015 Peng et al.

Funding

The authors wish to thank Dr. See-Kiong Ng for useful discussions. This work is supported in part by Chemical Sciences, Geosciences and Biosciences Division, Office of Basic Energy Sciences, Office of Science, U.S. Department of Energy (grant no. DE-FG02-91ER20021) to JC; National Science Foundation (grants IOS-1026003, DBI-0640769, and MCB-1052348) and U.S. Department of Energy (grant no. BER65472) to SYR; the National High Technology Research and Development Program of China (grant no. 2012AA020404, 2012AA02A602 and 2012AA02A604) to YW; and China Scholarship Council to JP.

FundersFunder number
National Science Foundation (NSF)MCB-1052348, IOS-1026003, BER65472, DBI-0640769
Michigan State University-U.S. Department of Energy (MSU-DOE) Plant Research LaboratoryDE-FG02-91ER20021
National Institute on Alcohol Abuse and AlcoholismR01AA020404
Directorate for Biological Sciences1026003, 1052348, 0640769
Office of Science Programs
Office of Basic Energy Sciences
Chemical Sciences, Geosciences, and Biosciences Division
China Scholarship Council
National High-tech Research and Development Program2012AA02A602, 2012AA02A604, 2012AA020404

    Keywords

    • Co-Function network
    • Gene function annotation
    • Gene ontology
    • Semantic similarity

    ASJC Scopus subject areas

    • Structural Biology
    • Biochemistry
    • Molecular Biology
    • Computer Science Applications
    • Applied Mathematics

    Fingerprint

    Dive into the research topics of 'Measuring semantic similarities by combining gene ontology annotations and gene co-function networks'. Together they form a unique fingerprint.

    Cite this