Michigan Tech Publications, Part 2

CLCLSA: Cross-omics linked embedding with contrastive learning and self attention for integration with incomplete multi-omics data

Chen Zhao, Kennesaw State University
Anqi Liu, Tulane University
Xiao Zhang, Tulane University
Xuewei Cao, Michigan Technological University
Zhengming Ding, Tulane University
Qiuying Sha, Michigan Technological University
Hui Shen, Tulane University
Hong-Wen Deng, Tulane University
Weihua Zhou, Michigan Technological University

Document Type

Article

Publication Date

1-28-2024

Abstract

Integration of heterogeneous and high-dimensional multi-omics data is becoming increasingly important in understanding etiology of complex genetic diseases. Each omics technique only provides a limited view of the underlying biological process and integrating heterogeneous omics layers simultaneously would lead to a more comprehensive and detailed understanding of diseases and phenotypes. However, one obstacle faced when performing multi-omics data integration is the existence of unpaired multi-omics data due to instrument sensitivity and cost. Studies may fail if certain aspects of the subjects are missing or incomplete. In this paper, we propose a deep learning method for multi-omics integration with incomplete data by Cross-omics Linked unified embedding with Contrastive Learning and Self Attention (CLCLSA). Utilizing complete multi-omics data as supervision, the model employs cross-omics autoencoders to learn the feature representation across different types of biological data. The multi-omics contrastive learning is employed, which maximizes the mutual information between different types of omics. In addition, the feature-level self-attention and omics-level self-attention are employed to dynamically identify the most informative features for multi-omics data integration. Finally, a Softmax classifier is employed to perform multi-omics data classification. Extensive experiments were conducted on four public multi-omics datasets. The experimental results indicate that our proposed CLCLSA produces promising results in multi-omics data classification using both complete and incomplete multi-omics data.

Publication Title

Computers in biology and medicine

Recommended Citation

Zhao, C., Liu, A., Zhang, X., Cao, X., Ding, Z., Sha, Q., Shen, H., Deng, H., & Zhou, W. (2024). CLCLSA: Cross-omics linked embedding with contrastive learning and self attention for integration with incomplete multi-omics data. Computers in biology and medicine, 170, 108058. http://doi.org/10.1016/j.compbiomed.2024.108058
Retrieved from: https://digitalcommons.mtu.edu/michigantech-p2/503

Link to Full Text

COinS

Michigan Tech Publications, Part 2

CLCLSA: Cross-omics linked embedding with contrastive learning and self attention for integration with incomplete multi-omics data

Document Type

Publication Date

Abstract

Publication Title

Recommended Citation

LINKS

Browse

Search

Author Corner

Michigan Tech Publications, Part 2

CLCLSA: Cross-omics linked embedding with contrastive learning and self attention for integration with incomplete multi-omics data

Authors

Document Type

Publication Date

Abstract

Publication Title

Recommended Citation

Share

LINKS

Browse

Search

Author Corner