Show simple item record

dc.contributor.authorDotan-Cohen, Diklaen_US
dc.contributor.authorKasif, Simonen_US
dc.contributor.authorMelkman, Avraham A.en_US
dc.date.accessioned2012-01-11T00:37:41Z
dc.date.available2012-01-11T00:37:41Z
dc.date.issued2009-6-3
dc.identifier.citationDotan-Cohen, Dikla, Simon Kasif, Avraham A. Melkman. "Seeing the forest for the trees: using the Gene Ontology to restructure hierarchical clustering" Bioinformatics 25(14): 1789-1795. (2009)
dc.identifier.issn1460-2059
dc.identifier.urihttps://hdl.handle.net/2144/3010
dc.description.abstractMotivation: There is a growing interest in improving the cluster analysis of expression data by incorporating into it prior knowledge, such as the Gene Ontology (GO) annotations of genes, in order to improve the biological relevance of the clusters that are subjected to subsequent scrutiny. The structure of the GO is another source of background knowledge that can be exploited through the use of semantic similarity. Results: We propose here a novel algorithm that integrates semantic similarities (derived from the ontology structure) into the procedure of deriving clusters from the dendrogram constructed during expression-based hierarchical clustering. Our approach can handle the multiple annotations, from different levels of the GO hierarchy, which most genes have. Moreover, it treats annotated and unannotated genes in a uniform manner. Consequently, the clusters obtained by our algorithm are characterized by significantly enriched annotations. In both cross-validation tests and when using an external index such as protein–protein interactions, our algorithm performs better than previous approaches. When applied to human cancer expression data, our algorithm identifies, among others, clusters of genes related to immune response and glucose metabolism. These clusters are also supported by protein–protein interaction data. Contact: dotna@cs.bgu.ac.il Supplementary information: Supplementary data are available at Bioinformatics online.en_US
dc.description.sponsorshipLynne and William Frankel Center for Computer Science; Paul Ivanier center for robotics research and production; National Institutes of Health (R01 HG003367-01A1)en_US
dc.language.isoen
dc.publisherOxford University Pressen_US
dc.rightsThis is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/2.0/uk/) which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.en_US
dc.rights.urihttp://creativecommons.org/licenses/by-nc/2.0/uk/
dc.titleSeeing the Forest for the Trees: Using the Gene Ontology to Restructure Hierarchical Clusteringen_US
dc.typeArticleen_US
dc.identifier.doi10.1093/bioinformatics/btp327
dc.identifier.pmid19497934
dc.identifier.pmcid2705235


This item appears in the following Collection(s)

Show simple item record

This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/2.0/uk/) which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.
Except where otherwise noted, this item's license is described as This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/2.0/uk/) which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.