Dissertations, Master's Theses and Master's Reports

Off-campus Michigan Tech users: To download campus access theses or dissertations, please use the following button to log in with your Michigan Tech ID and password: log in to proxy server

Non-Michigan Tech users: Please talk to your librarian about requesting this thesis or dissertation through interlibrary loan.

STATISTICAL METHOD of GENETIC ASSOCIATION STUDIES

Hongjing Xie, Michigan Technological UniversityFollow

Date of Award

2022

Document Type

Campus Access Dissertation

Degree Name

Doctor of Philosophy in Statistics (PhD)

Administrative Home Department

Department of Mathematical Sciences

Advisor 1

Qiuying Sha

Committee Member 1

Shuanglin Zhang

Committee Member 2

Kui Zhang

Committee Member 3

Jingfeng Jiang

Abstract

In genome-wide association studies (GWAS) for thousands of phenotypes in biobanks, most binary phenotypes have substantially fewer cases than controls. Many widely used approaches for joint analysis of multiple phenotypes in association studies produce inflated type I error rates for such extremely unbalanced case-control phenotypes. In our research, we develop two novel methods to jointly analyze multiple unbalanced case-control phenotypes to circumvent this issue. In the first method, we cluster multiple phenotypes into different clusters based on a hierarchical clustering method, then we merge phenotypes in each cluster into a single phenotype. In each cluster, we use the saddlepoint approximation to estimate the p-value of an association test between the merged phenotype and a SNP which eliminates the issue of inflated type I error rate of the test for extremely unbalanced case-control phenotypes. Finally, we use the Cauchy combination method to obtain an integrated p-value for all clusters to test the association between multiple phenotypes and a SNP. In the second method, we first construct a Multi-Layer Network (MLN) using all individuals with at least one case status among all phenotypes. Then, we introduce a computational efficient community detection method to group phenotypes into different disjoint clusters based on the MLN. The phenotypes in the same cluster are merged to a single phenotype which mainly eliminates the issue of inflated type I error rate of test for extremely unbalanced binary phenotypes. Finally, to test the association between all phenotypes and a SNP, we use the score test statistic to test the association between each merged phenotype and a SNP and then use the Omnibus test to obtain an overall p-value (MLN-O). Extensive simulation studies reveal that the newly proposed approaches can control type I error rates and are more powerful than other methods we compared with. The real data analyses also show that our methods outperform other methods we compared with.

Recommended Citation

Xie, Hongjing, "STATISTICAL METHOD of GENETIC ASSOCIATION STUDIES", Campus Access Dissertation, Michigan Technological University, 2022.

https://doi.org/10.37099/mtu.dc.etdr/1429

Download

COinS

ORCID

0000-0003-2709-5612

Dissertations, Master's Theses and Master's Reports

STATISTICAL METHOD of GENETIC ASSOCIATION STUDIES

Date of Award

Document Type

Degree Name

Administrative Home Department

Advisor 1

Committee Member 1

Committee Member 2

Committee Member 3

Abstract

Recommended Citation

ORCID

LINKS

Browse

Search

Author Corner

Dissertations, Master's Theses and Master's Reports

STATISTICAL METHOD of GENETIC ASSOCIATION STUDIES

Author

Date of Award

Document Type

Degree Name

Administrative Home Department

Advisor 1

Committee Member 1

Committee Member 2

Committee Member 3

Abstract

Recommended Citation

Share

ORCID

LINKS

Browse

Search

Author Corner