Correlation Engine 2.0
Clear Search sequence regions


  • algorithms (3)
  • cellular (2)
  • cluster analysis (1)
  • family (1)
  • genes (21)
  • human (2)
  • mice (1)
  • molecular function (1)
  • proteins by (1)
  • tree (2)
  • yeast (2)
  • Sizes of these terms reflect their relevance to your search.

    To describe the cellular functions of proteins and genes, a potential dynamic vocabulary is Gene Ontology (GO), which comprises of three sub-ontologies namely, Biological-process, Cellular-component, and Molecular-function. It has several applications in the field of bioinformatics like annotating/measuring gene-gene or protein-protein semantic similarity, identifying genes/proteins by their GO annotations for disease gene and target discovery, etc. To determine semantic similarity between genes, several semantic measures have been proposed in literature, which involve information content of GO-terms, GO tree structure, or the combination of both. But, most of the existing semantic similarity measures do not consider different topological and information theoretic aspects of GO-terms collectively. Inspired by this fact, in this article, we have first proposed three novel semantic similarity/distance measures for genes covering different aspects of GO-tree. These are further implanted in the frameworks of well-known multi-objective and single-objective based clustering algorithms to determine functionally similar genes. For comparative analysis, 10 popular existing GO based semantic similarity/distance measures and tools are also considered. Experimental results on Mouse genome, Yeast, and Human genome datasets evidently demonstrate the supremacy of multi-objective clustering algorithms in association with proposed multi-factored similarity/distance measures. Clustering outcomes are further validated by conducting some biological/statistical significance tests. Supplementary information is available at https://www.iitp.ac.in/sriparna/journals.html.

    Citation

    Sudipta Acharya, Sriparna Saha, Prasanna Pradhan. Multi-Factored Gene-Gene Proximity Measures Exploiting Biological Knowledge Extracted from Gene Ontology: Application in Gene Clustering. IEEE/ACM transactions on computational biology and bioinformatics. 2020 Jan-Feb;17(1):207-219

    Expand section icon Mesh Tags

    Expand section icon Substances


    PMID: 29994130

    View Full Text