Aller au contenu. | Aller à la navigation

Outils personnels

Navigation
Vous êtes ici : Accueil / Équipes / Systems Biology of Decision Making - O. Gandrillon / Publications (not up to date) / Clustering formal concepts to discover biologically relevant knowledge from gene expression data.

Clustering formal concepts to discover biologically relevant knowledge from gene expression data.

Sylvain Blachon, Ruggero G Pensa, Jeremy Besson, Celine Robardet, Jean-Francois Boulicaut, and Olivier Gandrillon (2007)

In Silico Biol, 7(4-5):467-83.

The production of high-throughput gene expression data has generated a crucial need for bioinformatics tools to generate biologically interesting hypotheses. Whereas many tools are available for extracting global patterns, less attention has been focused on local pattern discovery. We propose here an original way to discover knowledge from gene expression data by means of the so-called formal concepts which hold in derived Boolean gene expression datasets. We first encoded the over-expression properties of genes in human cells using human SAGE data. Ithas given rise to a Boolean matrix from which we extracted the complete collection of formal concepts, i.e., all the largest sets of over-expressed genes associated to a largest set of biological situations in which their over-expression is observed. Complete collections of such patterns tend to be huge. Since their interpretation is a time-consuming task, we propose a new method to rapidly visualize clusters of formal concepts. This designates a reasonable number of Quasi-Synexpression-Groups (QSGs) for further analysis. Theinterest of our approach is illustrated using human SAGE data and interpreting one of the extracted QSGs. The assessment of its biological relevancy leads to the formulation of both previously proposed and new biological hypotheses.

 
automatic medline import

Actions sur le document