Jaana Wessman defends her PhD thesis on April 13th, 2012, on Mixture Model Clustering in the Analysis of Complex Diseases
Lic. Med., MSc Jaana Wessman will defend her doctoral thesis Mixture Model Clustering in the Analysis of Complex Diseases on Friday the 13th of April, 2012 at noon in the University of Helsinki Main Building, Unioninkatu 34, Auditorium XIV (old part), 3rd floor. The defense will be held in Finnish.
Mixture Model Clustering in the Analysis of Complex Diseases
The topic of this thesis is the analysis of complex diseases, and specifically the use of certain clustering methods for that purpose. We concern ourselves mostly with the modeling of complex disease phenotypes: the symptoms and signs of diseases, and the multiple other co-phenotypes that accompany them. We seek answers to two related questions: 1) how can we use these clustering methods to summarize complex, multivariate phenotype data, for example as a simple phenotype in genetic analyses, and 2) how can we use these clustering methods to find subgroups of patients with a particular disease who might share the same causal factors?
Current methods in medical genetics ideally call for a single univariate phenotype, or at most a handful of them, to be compared to genetic markers. Multidimensional phenotypes cannot be handled by the standard methods, and treating each variable as independent and testing one hundred phenotypes with an unclear true dependency structure against thousands of markers results in problems with both running times and multiple testing correction.
In this work, clustering is used to summarize a multidimensional phenotype into something that can then be used in association studies of both genetic and other types of potential causes.
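As a rough illustration of this idea, and not the thesis's actual method or data, the sketch below fits a small Gaussian mixture model by expectation-maximization and reduces each multivariate observation to a single cluster label. Everything in it is an assumption made for the example: the diagonal covariances, the farthest-first initialization, the function names, and the synthetic three-variable "phenotypes".

```python
import math
import random


def sq_dist(a, b):
    """Squared Euclidean distance between two equal-length vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def farthest_first_centers(data, k):
    """Deterministic start: the first point, then repeatedly the point
    whose nearest already-chosen center is farthest away."""
    centers = [list(data[0])]
    while len(centers) < k:
        best = max(data, key=lambda row: min(sq_dist(row, c) for c in centers))
        centers.append(list(best))
    return centers


def fit_diagonal_gmm(data, k, n_iter=50):
    """Fit a k-component Gaussian mixture with diagonal covariances by EM.

    Returns (weights, means, variances, responsibilities).
    """
    n, d = len(data), len(data[0])
    means = farthest_first_centers(data, k)
    # Start every component from the pooled per-variable variance.
    grand = [sum(row[j] for row in data) / n for j in range(d)]
    pooled = [sum((row[j] - grand[j]) ** 2 for row in data) / n + 1e-6
              for j in range(d)]
    variances = [list(pooled) for _ in range(k)]
    weights = [1.0 / k] * k
    resp = []
    for _ in range(n_iter):
        # E-step: posterior probability of each component for each row,
        # computed in log space and normalized for numerical stability.
        resp = []
        for row in data:
            log_p = []
            for c in range(k):
                lp = math.log(weights[c])
                for j in range(d):
                    var = variances[c][j]
                    lp -= 0.5 * (math.log(2 * math.pi * var)
                                 + (row[j] - means[c][j]) ** 2 / var)
                log_p.append(lp)
            top = max(log_p)
            unnorm = [math.exp(v - top) for v in log_p]
            total = sum(unnorm)
            resp.append([u / total for u in unnorm])
        # M-step: re-estimate weights, means and variances from the
        # responsibility-weighted data. Small constants guard against
        # empty components and zero variances.
        for c in range(k):
            mass = sum(r[c] for r in resp) + 1e-12
            weights[c] = mass / n
            for j in range(d):
                mu = sum(r[c] * row[j] for r, row in zip(resp, data)) / mass
                means[c][j] = mu
                variances[c][j] = sum(
                    r[c] * (row[j] - mu) ** 2 for r, row in zip(resp, data)
                ) / mass + 1e-6
    return weights, means, variances, resp


def hard_labels(resp):
    """Summarize each row by its most probable component."""
    return [max(range(len(r)), key=lambda c: r[c]) for r in resp]


# Synthetic stand-in for phenotype data (an assumption, not real data):
# two well-separated groups measured on three variables.
rng = random.Random(0)
group_a = [[rng.gauss(0.0, 1.0) for _ in range(3)] for _ in range(30)]
group_b = [[rng.gauss(6.0, 1.0) for _ in range(3)] for _ in range(30)]
phenotypes = group_a + group_b

weights, means, variances, resp = fit_diagonal_gmm(phenotypes, k=2)
labels = hard_labels(resp)
print(labels)
```

The resulting `labels` list is the kind of one-dimensional summary that could then be compared against markers in place of the original variables. On real data the number of components is not known in advance and would itself have to be chosen.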
We describe a clustering process and some clustering methods used in this work, with comments on practical issues and references to the relevant literature. After some experiments on artificial data to gain insight into the properties of these methods, we present four case studies on real data, highlighting both ways to use these methods successfully and problems that can arise in the process.
Availability of the dissertation
An electronic version of the doctoral dissertation is available on the e-thesis site at http://urn.fi/URN:ISBN:978-952-10-7898-9.
Printed copies are available on request from Jaana Wessman: 050-5416024 or jaana.wessman(at)iki.fi.