Correlation between Gene Expression and Clinical Data through Linear and Nonlinear Principal Components Analyses: Muscular Dystrophies as Case Studies

2009 
Abstract The large dimension of microarray data and the complex dependence structure among genes make data analysis extremely challenging. In the last decade several statistical techniques have been proposed to tackle genome-wide expression data; however, clinical and molecular data associated to pathologies have often been considered as separate dimensions of the same phenomenon, especially when clinical variables lie on a multidimensional space. A better comprehension of the relationships between clinical and molecular data can be obtained if both data types are combined and integrated. In this work we adopt a multidimensional correlation strategy together with linear and nonlinear principal component, to integrate genetic and clinical information obtained from two sets of dystrophic patients. With this approach we decompose different aspects of clinical manifestations and correlate these features with the correspondent patterns of differential gene expression.
    • Correction
    • Source
    • Cite
    • Save
    • Machine Reading By IdeaReader
    33
    References
    7
    Citations
    NaN
    KQI
    []