Chapter 10 Some other analyses
Read more in this PowerPoint presentation.
Here we just mention some other types of analyses that haven’t really been covered in the course, but that are useful to know that they exist. We simply mention a few tools; you can find a list with more here.
Estimation of SNP-based heritability: The proportion of phenotypic variance in a trait that can be explained by genetic variation captured by measured SNPs. A simple way to estimate it is with LD Score regression (LDSC) using GWAS summary statistics (B. K. Bulik-Sullivan et al., 2015). R packages bigsnpr and GenomicSEM also provide some implementation of LDSC. There exist many other tools to estimate SNP-heritability, such as LDpred2-auto (mentioned before).
Genetic Correlation: LDSC can be also used to estimate the genome-wide genetic correlation between two traits using GWAS summary statistics (B. Bulik-Sullivan et al., 2015), and is also implemented in R package GenomicSEM. Local genetic correlations (e.g. within LD blocks) can also be estimated (Darlay et al., 2025; Zhang, Zhang, Zhang, & Zhao, 2023).
Partitioned heritability: Heritability partitioned by functional annotation (which may be cell-type or state-specific), which can be estimated using S-LDSC (Finucane et al., 2015), SumHer (Speed & Balding, 2019) and SBayesRC (Zheng et al., 2024).
Gene-based tests: These determine the genes associated with a trait from a GWAS by using proximity, such as mBAT-combo as an R package or a command-line software (Li et al., 2023).
Summary statistic imputation: imputation of GWAS summary statistics help increase the SNP overlap between GWAS summary statistics and LD reference panels.
Meta-analysis of GWAS:
- same trait, different cohorts: METAL (Willer, Li, & Abecasis, 2010)
- multiple traits: MTAG (Turley et al., 2018)
- multiple ancestries: MR-MEGA (Mägi et al., 2017)
Colocalisation: The same variant may be significant for two traits, but not necessarily be causal for both. Tools like COLOC + SuSiE directly assess this, yielding probabilities the signal is driven by the same variant, or two different variants (Wallace, 2021). SMR is a complementary approach designed for testing if a molQTL is also a causal variant for a trait, yielding an association statistic (Zhu et al., 2016).
Gene expression signature of a trait: Use GWAS summary stats with eQTL/gene expression data. Early tools include FUSION (Gusev et al., 2016) and PrediXcan (Gamazon et al., 2015), although many more have been developed since.
Structural Equation Modelling: GenomicSEM is a tool for structural equation modelling of GWAS summary statistics, allowing you to model the relationships between traits using their genetic architecture (Grotzinger et al., 2019).