IGenomic answers for children: Dynamic analyses of >1000 pediatric rare disease genomes
2021
PURPOSE: To provide comprehensive diagnostic and candidate analyses in a pediatric rare disease cohort through the Genomic Answers for Kids (GA4K) program.
METHODS: Extensive analyses of 960 families with suspected genetic disorders including short-read exome (ES) and genome sequencing (srGS); PacBio HiFi long-read GS (HiFi-GS); variant calling for small-nucleotide (SNV), structural (SV) and repeat variants; and machine-learning variant prioritization. Structured phenotypes, prioritized variants and pedigrees are stored in PhenoTips database, with data sharing through controlled access (dbGAP).
RESULTS: Diagnostic rates ranged from 11% for cases with prior negative genetic tests to 34.5% in naive patients. Incorporating SVs from GS added up to 13% of new diagnoses in previously unsolved cases. HiFi-GS yielded increased discovery rate with >4-fold more rare coding SVs than srGS. Variants and genes of unknown significance (VUS/GUS) remain the most common finding (58% of non-diagnostic cases).
CONCLUSION: Computational prioritization is efficient for diagnostic SNVs. Thorough identification of non-SNVs remains challenging and is partly mitigated by HiFi-GS sequencing. Importantly, community research is supported by sharing real-time data to accelerate gene validation, and by providing HiFi variant (SNV/SV) resources from >1,000 human alleles to facilitate implementation of new sequencing platforms for rare disease diagnoses.
Keywords:
- Correction
- Source
- Cite
- Save
- Machine Reading By IdeaReader
42
References
0
Citations
NaN
KQI