TY - GEN
T1 - Exploiting domain knowledge to improve biological significance of biclusters with key missing genes
AU - Chen, Jin
AU - Ji, Liping
AU - Hsu, Wynne
AU - Tan, Kian Lee
AU - Rhee, Seung Y.
PY - 2009
Y1 - 2009
N2 - In an era of increasingly complex biological datasets, one of the key steps in gene functional analysis comes rom clustering genes based on co-expression. Biclustering algorithmscan identify gene clusters with local co-expressed patterns, which are more likely to define genes functioning togetherthan global clustering methods. However, these algorithms are not effective in uncovering gene regulatory networks because the mined biclusters lack genes that may be critical in the function but may not be co-expressed with the clustered genes. In this paper, we introduce a biclustering method called SKeleton Biclustering (SKB), which builds high quality biclusters from microarray data, creates relationships among the biclustered genes based on Gene Ontology annotations, and identifies genes that are missing in the biclusters. SKB thus defines inter-bicluster and intra-bicluster functional relationships. The delineation of functional relationships and incorporation of such missing genes may help biologists to discover biological processes that are important in a given study and provides clues for how the processes may be functioning together. Experimental results show that, with SKB, the biological significance of the biclusters is considerably improved.
AB - In an era of increasingly complex biological datasets, one of the key steps in gene functional analysis comes rom clustering genes based on co-expression. Biclustering algorithmscan identify gene clusters with local co-expressed patterns, which are more likely to define genes functioning togetherthan global clustering methods. However, these algorithms are not effective in uncovering gene regulatory networks because the mined biclusters lack genes that may be critical in the function but may not be co-expressed with the clustered genes. In this paper, we introduce a biclustering method called SKeleton Biclustering (SKB), which builds high quality biclusters from microarray data, creates relationships among the biclustered genes based on Gene Ontology annotations, and identifies genes that are missing in the biclusters. SKB thus defines inter-bicluster and intra-bicluster functional relationships. The delineation of functional relationships and incorporation of such missing genes may help biologists to discover biological processes that are important in a given study and provides clues for how the processes may be functioning together. Experimental results show that, with SKB, the biological significance of the biclusters is considerably improved.
UR - http://www.scopus.com/inward/record.url?scp=67649649591&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=67649649591&partnerID=8YFLogxK
U2 - 10.1109/ICDE.2009.205
DO - 10.1109/ICDE.2009.205
M3 - Conference contribution
AN - SCOPUS:67649649591
SN - 9780769535456
T3 - Proceedings - International Conference on Data Engineering
SP - 1219
EP - 1222
BT - Proceedings - 25th IEEE International Conference on Data Engineering, ICDE 2009
T2 - 25th IEEE International Conference on Data Engineering, ICDE 2009
Y2 - 29 March 2009 through 2 April 2009
ER -