Show simple item record

dc.identifier.urihttp://hdl.handle.net/1951/60212
dc.identifier.urihttp://hdl.handle.net/11401/71024
dc.description.sponsorshipThis work is sponsored by the Stony Brook University Graduate School in compliance with the requirements for completion of degree.en_US
dc.formatMonograph
dc.format.mediumElectronic Resourceen_US
dc.language.isoen_US
dc.publisherThe Graduate School, Stony Brook University: Stony Brook, NY.
dc.typeDissertation
dcterms.abstractGenome-wide association studies (GWAS) are widely used to detect genotypes associated with complex diseases. Such GWAS studies of disease progression over time may be clinically significant. Longitudinal quantitative trait locus (LQTL) methods are used in these studies to simulate disease progression. However, population stratification (PS) can lead to false positive or negative findings when conducting a GWAS study. PS is induced by a candidate marker's variation in allele frequency across ancestral populations. One of the approaches used to adjust for population stratification in GWAS is the global principal component analysis (PCA) approach. In this thesis I examine the statistical properties of GWAS analysis procedures using principal component adjustments across the whole genome. I use additive risk allele models to test the association between rare genetic variants and the longitudinal quantitative phenotypes across the whole genome. The genotype data are taken from the Hapmap 3 dataset for 1198 unrelated individuals. The simulated quantitative phenotype data are estimated using the Bayesian posterior probabilities (BPPs) that a participant belongs to a clinically important trajectory curve. The PCA method implemented in the EIGENSTRAT program is then used to reduce the data to ten variables containing most of the genetic variability information. The power and rejection rates are evaluated based on 1000 simulated replicates. The association test follows a chi-square distribution with one degree of freedom under the null hypothesis of no association. The p-values of the test of the coefficient of a genotype with and without a PC adjustment for PS are documented. For each disease gene, I select 25 matching SNPs (the ones with high correlation coefficient of allele frequencies with the disease gene across population) and 25 non-correlated SNPs (the ones with low correlation coefficient of allele frequencies with the disease gene across population). All SNPs considered are in overall Hardy Weinberg equilibrium (HWE). The additive risk allele model LQTL models have strong empirical power. The model with global PCA adjustment for PS is able to consistently maintain correct false positive rates.
dcterms.available2013-05-24T16:38:14Z
dcterms.available2015-04-24T14:45:36Z
dcterms.contributorMendell, Nancy R.Wu, Songen_US
dcterms.contributorFinch, Stephen J.en_US
dcterms.contributorGordon, Derek.en_US
dcterms.creatorWang, Yifan
dcterms.dateAccepted2013-05-24T16:38:14Z
dcterms.dateAccepted2015-04-24T14:45:36Z
dcterms.dateSubmitted2013-05-24T16:38:14Z
dcterms.dateSubmitted2015-04-24T14:45:36Z
dcterms.descriptionDepartment of Applied Mathematics and Statisticsen_US
dcterms.extent164 pg.en_US
dcterms.formatApplication/PDFen_US
dcterms.formatMonograph
dcterms.identifierhttp://hdl.handle.net/1951/60212
dcterms.identifierhttp://hdl.handle.net/11401/71024
dcterms.issued2012-08-01
dcterms.languageen_US
dcterms.provenanceMade available in DSpace on 2013-05-24T16:38:14Z (GMT). No. of bitstreams: 1 StonyBrookUniversityETDPageEmbargo_20130517082608_116839.pdf: 41286 bytes, checksum: 425a156df10bbe213bfdf4d175026e82 (MD5) Previous issue date: 1en
dcterms.provenanceMade available in DSpace on 2015-04-24T14:45:36Z (GMT). No. of bitstreams: 3 StonyBrookUniversityETDPageEmbargo_20130517082608_116839.pdf.jpg: 1934 bytes, checksum: c116f0e1e7be19420106a88253e31f2e (MD5) StonyBrookUniversityETDPageEmbargo_20130517082608_116839.pdf.txt: 336 bytes, checksum: 84c0f8f99f2b4ae66b3cc3ade09ad2e9 (MD5) StonyBrookUniversityETDPageEmbargo_20130517082608_116839.pdf: 41286 bytes, checksum: 425a156df10bbe213bfdf4d175026e82 (MD5) Previous issue date: 1en
dcterms.publisherThe Graduate School, Stony Brook University: Stony Brook, NY.
dcterms.subjectgenome wide association study, longitudinal quantitative trait locus, population stratification, principal component analysis
dcterms.subjectStatistics
dcterms.titleAdjusting for population stratification in longitudinal quantitative trait locus identification
dcterms.typeDissertation


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record