Date of Award

2021

Document Type

Open Access Dissertation

Degree Name

Doctor of Philosophy in Statistics (PhD)

Administrative Home Department

Department of Mathematical Sciences

Advisor 1

Kui Zhang

Committee Member 1

Qiuying Sha

Committee Member 2

Hairong Wei

Committee Member 3

Shuanglin Zhang

Committee Member 4

Xiao Zhang

Abstract

This dissertation includes three Chapters. A brief description of each chapter is organized as follows.

In Chapter 1, we proposed a new method, called MF-TOWmuT, for genome-wide association studies with multiple genetic variants and multiple phenotypes using family samples. MF-TOWmuT uses kinship matrix to account for sample relatedness. It is worth mentioning that in simulations, we considered hidden polygenic effects and varied the proportion of variance contributed by it to generate phenotypes. Simulation studies show that MF-TOWmuT can preserve the type I error rates and is more powerful than several existing methods in different simulation scenarios, MFTOWmuT is also quite robust to the proportion of variance explained by invisible polygenic effects and to the direction of effects of genetic variants.

In Chapter 2, we proposed a fast and efficient low rank penalized regression with the Elastic Net penalty for the eQTL mapping, called LORSEN. By considering the Elastic Net penalty instead of the L1 penalty, our method can overcome two crucial drawbacks of the L1 penalty, and outperforms two commonly used methods for the eQTL mapping, LORS and FastLORS, in many simulation scenarios in terms of average Area Under the Curve (AUC).

In Chapter 3, we proposed a bipartite network-based penalized regression model for the eQTL mapping, called BiNetPeR. This method takes into account the SNPgene marginal association evidence to construct the SNP-gene bipartite network, then uses such a bipartite network to obtain the projected SNP network. Based on the normalized Laplacian matrix of the projected SNP network, we then formulate the eQTL mapping into a penalized regression model. Our simulation results show that our proposed method can maintain the appropriate false positive rate and outperforms two competing methods for the eQTL mapping, FastLORS and mtLasso2G.

Recommended Citation

GAO, CHENG, "STATISTICAL METHODS IN GENETIC STUDIES", Open Access Dissertation, Michigan Technological University, 2021.

https://doi.org/10.37099/mtu.dc.etdr/1284

Download

Included in

Applied Statistics Commons, Biostatistics Commons, Computational Biology Commons, Genetics Commons, Genomics Commons, Multivariate Analysis Commons, Statistical Methodology Commons, Statistical Models Commons

COinS

Dissertations, Master's Theses and Master's Reports

STATISTICAL METHODS IN GENETIC STUDIES

Date of Award

Document Type

Degree Name

Administrative Home Department

Advisor 1

Committee Member 1

Committee Member 2

Committee Member 3

Committee Member 4

Abstract

Recommended Citation

Included in

LINKS

Browse

Search

Author Corner

Dissertations, Master's Theses and Master's Reports

STATISTICAL METHODS IN GENETIC STUDIES

Author

Date of Award

Document Type

Degree Name

Administrative Home Department

Advisor 1

Committee Member 1

Committee Member 2

Committee Member 3

Committee Member 4

Abstract

Recommended Citation

Included in

Share

LINKS

Browse

Search

Author Corner