Abstract
Integrating multi-omics data may help researchers understand the genetic underpinnings of complex traits and diseases. However, the best ways to integrate multi-omics data and use them to address pressing scientific questions remain a challenge. One important and topical problem is how to assess the aggregate effect of multiple genomic data types (e.g. genotypes and gene expression levels) on a phenotype, particularly while accommodating routine issues, such as having related subjects' data in analyses. In this paper, we extend an existing composite kernel machine regression model to integrate two multi-omics data types, while accommodating for general correlation structures amongst outcomes. Due to the kernel machine regression framework, our methods allow for the integration of high-dimensional omics data with small, nonlinear, and interactive effects, and accommodation of general study designs. Here, we focus on scientific questions that aim to assess the association between a functional grouping (such as a gene or a pathway) and a quantitative trait of interest. We use a kernel machine regression to integrate the two multi-omics data types, as they may relate to the trait, and perform a global test of association. We demonstrate the advantage of this approach over single data type association tests via simulation. Finally, we apply this method to a large, multi-ethnic data set to investigate how predicted gene expression and rare genetic variation may be related to two platelet traits.
Original language | English |
---|---|
Article number | e22610 |
Journal | Genetic Epidemiology |
Volume | 49 |
Issue number | 1 |
Number of pages | 17 |
ISSN | 0741-0395 |
DOIs | |
Publication status | Published - 2025 |
Bibliographical note
© 2025 Wiley Periodicals LLC.Keywords
- Humans
- Genome-Wide Association Study/methods
- Genomics/methods
- Phenotype
- Algorithms
- Models, Genetic
- Polymorphism, Single Nucleotide
- Genotype
- Computer Simulation
- Machine Learning
- Multiomics