Dynamic Scan Procedure for Detecting Rare-Variant Association Regions in Whole-Genome Sequencing Studies. | BCM-HGSC Publications

Title	Dynamic Scan Procedure for Detecting Rare-Variant Association Regions in Whole-Genome Sequencing Studies.
Publication Type	Journal Article
Year of Publication	2019
Authors	Li, Z, Li, X, Liu, Y, Shen, J, Chen, H, Zhou, H, Morrison, AC, Boerwinkle, E, Lin, X
Journal	Am J Hum Genet
Volume	104
Issue	5
Pagination	802-814
Date Published	2019 May 02
ISSN	1537-6605
Keywords	Algorithms, Computational Biology, Genetic Variation, Genome, Human, Genome-Wide Association Study, Humans, Linkage Disequilibrium, Models, Genetic, Whole Genome Sequencing
Abstract	Whole-genome sequencing (WGS) studies are being widely conducted in order to identify rare variants associated with human diseases and disease-related traits. Classical single-marker association analyses for rare variants have limited power, and variant-set-based analyses are commonly used by researchers for analyzing rare variants. However, existing variant-set-based approaches need to pre-specify genetic regions for analysis; hence, they are not directly applicable to WGS data because of the large number of intergenic and intron regions that consist of a massive number of non-coding variants. The commonly used sliding-window method requires the pre-specification of fixed window sizes, which are often unknown as a priori, are difficult to specify in practice, and are subject to limitations given that the sizes of genetic-association regions are likely to vary across the genome and phenotypes. We propose a computationally efficient and dynamic scan-statistic method (Scan the Genome [SCANG]) for analyzing WGS data; this method flexibly detects the sizes and the locations of rare-variant association regions without the need to specify a prior, fixed window size. The proposed method controls for the genome-wise type I error rate and accounts for the linkage disequilibrium among genetic variants. It allows the detected sizes of rare-variant association regions to vary across the genome. Through extensive simulated studies that consider a wide variety of scenarios, we show that SCANG substantially outperforms several alternative methods for detecting rare-variant-associations while controlling for the genome-wise type I error rates. We illustrate SCANG by analyzing the WGS lipids data from the Atherosclerosis Risk in Communities (ARIC) study.
DOI	10.1016/j.ajhg.2019.03.002
Alternate Journal	Am J Hum Genet
PubMed ID	30982610
PubMed Central ID	PMC6507043
Grant List	RC2 HL102419 / HL / NHLBI NIH HHS / United States R35 CA197449 / CA / NCI NIH HHS / United States U19 CA203654 / CA / NCI NIH HHS / United States R01 HL113338 / HL / NHLBI NIH HHS / United States U54 HG003273 / HG / NHGRI NIH HHS / United States HHSN268201700001I / HL / NHLBI NIH HHS / United States HHSN268201700004I / HL / NHLBI NIH HHS / United States P01 CA134294 / CA / NCI NIH HHS / United States HHSN268201700002I / HL / NHLBI NIH HHS / United States HHSN268201700005I / HL / NHLBI NIH HHS / United States U01 HG009088 / HG / NHGRI NIH HHS / United States UM1 HG008898 / HG / NHGRI NIH HHS / United States / RA / ARRA NIH HHS / United States

Similar Publications

Cinciripini PM, Wetter DW, Wang J, Yu R, Kypriotakis G, Kumar T, et al.. Deep sequencing of candidate genes identified 14 variants associated with smoking abstinence in an ethnically diverse sample. Sci Rep. 2024;14(1):6385.

Wright A, Wilkinson MD, Mungall C, Cain S, Richards S, Sternberg P, et al.. FAIR Header Reference genome: a TRUSTworthy standard. Brief Bioinform. 2024;25(3).

Wang Z, Peters BA, Yu B, Grove ML, Wang T, Xue X, et al.. Gut Microbiota and Blood Metabolites Related to Fiber Intake and Type 2 Diabetes. Circ Res. 2024;134(7):842-854.