Clustering of SNPs by a Structural EM Algorithm

2009 
In population-based human genetic studies, unrelated individuals are sampled and their SNPs are genotyped. Several kinds of generative models have been proposed for data containing a large number of SNP loci, based on the characteristics of the human genome. However, such models can only handle ordered loci. In this paper, we try to model the same data without using the order information. First, we present a clustering model for SNPs obtained by modifying the multi-block model used in GERBIL. It is a two-layer Bayesian network with multiple latent variables, and it does not use the order information of the loci. Second, we fit the model with a structural EM algorithm combined with a simulated annealing mechanism. A real data set was analyzed with the model, and the results show that the SNPs can be clustered effectively. Such a model is potentially useful for clustering distantly correlated SNP loci.
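The abstract does not spell out the search procedure, so the following is only a minimal sketch of how a simulated annealing step can drive a structure search over SNP-to-cluster assignments. It is not the paper's algorithm. Every name here (`anneal_accept`, `anneal_assignments`, `pair_agreement_score`) is hypothetical, and the toy score merely stands in for the expected log-likelihood that a structural EM procedure would compute.

```python
import math
import random


def anneal_accept(delta, temperature, rng):
    """Metropolis rule: always accept improvements, sometimes accept worse moves."""
    if delta >= 0:
        return True
    return rng.random() < math.exp(delta / temperature)


def pair_agreement_score(assign, genotypes):
    """Toy stand-in score (assumption): reward same-cluster SNP pairs that agree."""
    total = 0.0
    for i in range(len(assign)):
        for j in range(i + 1, len(assign)):
            if assign[i] == assign[j]:
                same = sum(a == b for a, b in zip(genotypes[i], genotypes[j]))
                total += 2.0 * same / len(genotypes[i]) - 1.0
    return total


def anneal_assignments(n_snps, n_clusters, score, steps=2000,
                       t0=1.0, cooling=0.995, seed=0):
    """Search over cluster assignments; SNP order is never used."""
    rng = random.Random(seed)
    assign = [rng.randrange(n_clusters) for _ in range(n_snps)]
    current = score(assign)
    best, best_score = list(assign), current
    temperature = t0
    for _ in range(steps):
        # Structural move: reassign one randomly chosen SNP to a random cluster.
        snp = rng.randrange(n_snps)
        old = assign[snp]
        assign[snp] = rng.randrange(n_clusters)
        candidate = score(assign)
        if anneal_accept(candidate - current, temperature, rng):
            current = candidate
            if current > best_score:
                best, best_score = list(assign), current
        else:
            assign[snp] = old  # reject the move and restore the assignment
        temperature = max(temperature * cooling, 1e-6)  # geometric cooling
    return best, best_score


# Toy data (assumption): 4 SNPs measured on 4 individuals, alleles coded 0/1.
toy_genotypes = [(0, 0, 1, 1), (0, 0, 1, 1), (1, 0, 1, 0), (1, 0, 1, 0)]
best, best_score = anneal_assignments(
    4, 2, lambda a: pair_agreement_score(a, toy_genotypes), seed=1)
```

Accepting occasional score-decreasing moves at high temperature is what lets such a search leave poor local optima; as the temperature cools, the procedure behaves more like greedy hill climbing.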