Whole Genome Sequencing of Four CD34+-Derived iPSC Polycythemia Vera Clones From a Single Female

2012 
Abstract 1755 Polycythemia vera (PV) is an acquired clonal hematopoietic disorder characterized by the JAK2 V617F somatic mutation, which, however, constitutes only a variable part of the PV clone. Even within the JAK2 V617F subclone, heterozygous and homozygous JAK2 V617F progenitors coexist. Further, family clustering of PV suggests the existence of predisposing germline mutations. The nature of these pre- JAK2 V617F somatic and germline mutations has not yet been identified, and the search for these predisposing genetic lesions using marrow and blood cells is hampered by their genetic heterogeneity. In order to overcome this obstacle, we generated 5 inducible pluripotent stem cell lines (iPSC) from the same PV female with different JAK2 genomic configuration (4 from CD34+ blood cells, one from marrow mesenchymal cells having germ-line DNA configuration). Genomic DNA was isolated from the cell lines, followed by whole genome sequencing (WGS) at >20X depth, and whole exome sequencing (WES) at >130X depth. We have developed a comprehensive WGS/WES pipeline for sequence alignment and variant calling. The major components are: a) Pre-processing and quality control of raw reads outputted from sequencing instruments; b) Align the read against the Human Genome Project reference using BWA (Burrows-Wheeler Aligner); c) Local realignment at indels using GATK (Genome Analysis Toolkit); d) Identify germline/somatic single nucleotide variants (SNVs), copy number variants (CNV) and structural variants (SV) using GATK, Varscan2, Breakdancer, CNVnator, and Pindel. The variant calls are intersected and merged. Calls detected by two or more algorithms are regarded as high confidence. Two somatic mutations, rs1047840, and rs3795677 were identified by WGS and WES in the CD34+ cell derived iPSCs. rs1047840 (E589K) is a SNP in EXO1 gene that encodes a protein with 59 to 39 exonuclease activity as well as an RNase H activity. EXO1 is involved in DNA mismatch repair and homologous recombination. rs1047840 conferred increased risk of breast, lung, and gastric cancer. rs3795677 (R64H) is a SNP in FMN2 gene, which is required for microtubule-independent chromatin positioning during metaphase. Interestingly, these two mutations are all common SNVs, and they are located in a duplicated region on 1q43. Whole genome analysis is ongoing to characterize and validate the complex mixture of SVs in this region, identify potential translocations and determine the exact genomic breakpoints. The variants will be validated in non-iPSC cells from the same individual to rule out iPSC artifacts. This work identifies somatic mutations in EXO1 and FMN2 , and CNVs in 1q43 as the potential genetic defects involved in the early pathogenesis of PV. It also demonstrates the value of combining WGS and WES for the identification of somatic mutations. Disclosures: No relevant conflicts of interest to declare.
    • Correction
    • Source
    • Cite
    • Save
    • Machine Reading By IdeaReader
    0
    References
    1
    Citations
    NaN
    KQI
    []