-->

Dissertationsserver


Springe direkt zu:Inhalt


Service-Navigation


Hauptnavigation/Hauptmenü: Links auf direkt erreichbare, übergeordnete Webseiten


Grafischer Identitätsbereich:




Navigation/Menü: Links auf weitere Seiten dieser Website


Navigationspfad:

Navigation: FU Dissertationen Online / Mycore 2.0.2

Drucken Icon


Objekt-Metadaten

Mixture Models for the Analysis of Gene Expression
Gesteira Costa Filho, Ivan

HaupttitelMixture Models for the Analysis of Gene Expression
TitelzusatzIntegration of Multiple Experiments and Cluster Validation
TitelvarianteMischmodelle für die Analyse von Genexpression
Zusatz zur TitelvarianteIntegration multipler Experimente und Validierung von Clustern
AutorGesteira Costa Filho, Ivan
Geburtsort: Rio de Janeiro, Brasilien
GutachterProf. Dr. Martin Vingron
weitere GutachterProf. Dr. Joachim Selbig
Freie Schlagwörtermixture models, gene expression, clustering, hidden Markov models, dependence treesI.5, J.3
DDC570 Biowissenschaften; Biologie
ZusammenfassungThe main focus of this thesis is the problem of finding groups of co-expressed genes from data obtained in DNA microarray experiments. As we assume co-expressed genes to: (1) perform related functional task, and (2) be regulated by the same transcription regulation program, such an analysis is helpful in identifying the biological function and the regulatory roles of genes. One traditional approach for finding co-expressed genes is the use of clustering methods. In this thesis, we use mixture models as a statistical formalism for clustering gene expression data. Mixture models are robust to noise, can model uncertainty about cluster assignments, allow the inclusion of prior knowledge, such as intrinsic dependencies of the experimental design, and offer a flexible framework for integration of additional biological data. In Chapter 2, we introduce the mixture model formalism. Then, in Chapter 3, we describe how mixture models can be used to solve the clustering problem, and how questions as choosing the number of clusters and cluster validation can be answered in the context of mixture models. Additionally, in Chapter 3 we propose a novel external index for validating clusterings computed by mixtures. Mixture models allow, with a proper choice of component models, to make explicit assumptions about the data. We propose here two novel types of components models for analyzing gene expression. The use of hidden Markov models with linear topologies to analyze gene expression time courses will be the focus of Chapter 4. With a benchmark data set, we show that mixture of HMMs have better class recovery than other methods proposed for time course analysis. In Chapter 5, we propose a new type of probabilistic model, dependence trees, to model gene expression profiles during a developmental process. We also explore the benefits of using priors of model parameters to obtain maximum-a-posteriori point estimates, and show how this improves the robustness of the method. For data collected in lymphoid development, mixtures of dependence trees compare favorably to other methods used for finding groups of co-expressed genes. Furthermore, by incorporating microRNA binding data, we identify promising novel regulatory roles of genes and their functional assignments. We propose in Chapter 6 an extension of the mixture model estimation. This semi-supervised learning can integrate additional biological data and improve clusterings of gene expression time-courses. We propose a novel method, which combines gene expression time-courses with spatial patterns of gene expression in Drosophila embryos, for finding groups of syn-expressed genes. Our results demonstrate that the cluster results, obtained after integrating additional data, demonstrate a better recovery of syn-expressed genes then cluster results obtained with the gene expression data alone.
Dokumente
FUDISS_derivate_000000003441
Falls Ihr Browser eine Datei nicht öffnen kann, die Datei zuerst herunterladen und dann öffnen.
 
Fachbereich/EinrichtungFB Mathematik und Informatik
Erscheinungsjahr2008
Dokumententyp/-SammlungenDissertation
Medientyp/FormatText
SpracheEnglisch
RechteNutzungsbedingungen
Tag der Disputation29.05.2008
Erstellt am03.06.2008 - 00:00:00
Letzte Änderung19.02.2010 - 10:59:59
 
Alte Darwin URLhttp://www.diss.fu-berlin.de/2008/345/
Statische URLhttp://www.diss.fu-berlin.de/diss/receive/FUDISS_thesis_000000003441
URNurn:nbn:de:kobv:188-fudissthesis000000003441-2
Zugriffsstatistik
 

 
© 2010 Universitätsbibliothek der Freien Universität Berlin | Feedback | powered by <MyCoRe>

Stand: 28.02.2010

Diese Grafiken werden nur in der Druckvorschau verwendet: