English
 
Help Privacy Policy Disclaimer
  Advanced SearchBrowse

Item

ITEM ACTIONSEXPORT

Released

Thesis

Context-specific independence mixture models for cluster analysis of biological data

MPS-Authors

Georgi,  Benjamin
Max Planck Society;

External Resource
No external resources are shared
Fulltext (restricted access)
There are currently no full texts shared for your IP range.
Fulltext (public)

Georgi.zip
(Any fulltext), 3MB

Supplementary Material (public)
There is no public supplementary material available
Citation

Georgi, B. (in preparation). Context-specific independence mixture models for cluster analysis of biological data.


Cite as: https://hdl.handle.net/11858/00-001M-0000-0010-7D6F-0
Abstract
Clustering is a crucial first step in the exploratory analysis of biological data. This thesis is concerned with cluster analysis of biological data using mixture models. Mixture models is a class of powerful and versatile statistical models. We develop an extension to the conventional mixtures in form of the context-specific independence (CSI) framework. CSI mixtures are particularly suited for the analysis of biological data since they perform robustly in the presence of noise and uninformative features in the data. This is achieved by adapting the model complexity to the degree of variation observed in a given data set. We present a learning algorithm for CSI mixtures in a Bayesian framework. We apply CSI mixture clustering on data sets of transcription factor binding sites, protein sequences and complex disease phenotype data.