Package: colocboost 1.0.9

Xuewei Cao

colocboost: Multi-Context Colocalization Analysis for QTL and GWAS Studies

A multi-task learning approach to variable selection regression with highly correlated predictors and sparse effects, based on frequentist statistical inference. It provides statistical evidence to identify which subsets of predictors have non-zero effects on which subsets of response variables, motivated and designed for colocalization analysis across genome-wide association studies (GWAS) and quantitative trait loci (QTL) studies. The ColocBoost model is described in Cao et. al. (2025) <doi:10.1101/2025.04.17.25326042>.

Authors:Xuewei Cao [cre, aut, cph], Haochen Sun [aut, cph], Ru Feng [aut, cph], Daniel Nachun [aut, cph], Kushal Dey [aut, cph], Gao Wang [aut, cph]

colocboost_1.0.9.tar.gz
colocboost_1.0.9.zip(r-4.7)colocboost_1.0.9.zip(r-4.6)colocboost_1.0.9.zip(r-4.5)
colocboost_1.0.9.tgz(r-4.6-any)colocboost_1.0.9.tgz(r-4.5-any)
colocboost_1.0.9.tar.gz(r-4.7-any)colocboost_1.0.9.tar.gz(r-4.6-any)
colocboost_1.0.9.tgz(r-4.6-emscripten)
manual.pdf |manual.html
DESCRIPTION
card.svg |card.png
colocboost/json (API)

# Install 'colocboost' in R:
install.packages('colocboost', repos = c('https://statfungen.r-universe.dev', 'https://cloud.r-project.org'))

Bug tracker:https://github.com/statfungen/colocboost/issues

Datasets:

On CRAN:

Conda:

8.39 score 15 stars 25 scripts 387 downloads 12 exports 6 dependencies

Last updated from:2f4e3067f3. Checks:9 OK. Indexed: yes.

TargetResultTimeFilesSyslog
linux-devel-x86_64OK226
source / vignettesOK300
linux-release-x86_64OK236
macos-release-arm64OK289
macos-oldrel-arm64OK186
windows-develOK185
windows-releaseOK173
windows-oldrelOK170
wasm-releaseOK111

Exports:colocboostcolocboost_plotget_ambiguous_colocalizationget_colocboost_summaryget_cormatget_cosget_cos_purityget_cos_summaryget_hierarchical_clustersget_robust_colocalizationget_robust_ucosget_ucos_summary

Dependencies:matrixStatsRcppRcppArmadilloRcppParallelRfastzigg

News
ColocBoost on the media | Software updates

Last update: 2026-06-07
Started: 2025-04-13

Bioinformatics Pipeline for ColocBoost
1. ColocBoost analysis with basic QC steps | 2. Loading Data using colocboost_pipeline function | 2.1. Loading individual-level data from multiple cohorts | 2.2. Loading summary statistics from multiple cohorts or datasets | 3. Perform ColocBoost using colocboost_pipeline function

Last update: 2026-06-06
Started: 2025-04-20

Single-trait Fine-mapping with FineBoost
1. Fine-mapping with individual-level data | 2. Fine-mapping with summary statistics | 3. LD-free fine-mapping with one causal variant assumption

Last update: 2026-06-06
Started: 2025-04-19

Summary Statistics Data Colocalization
1. The Sumstat_5traits Dataset | Causal variant structure | Important data format for summary data | 2. Multiple summary statistics data with shared LD reference | Results Interpretation | 3. Other summary statistics and LD input combinations | 3.1. Matched LD with multiple sumstat (Trait-specific LD) | 3.2. LD matrix is a superset of variants across different summary statistics | 3.3. Arbitrary LD and sumstat with dictionary provided | 3.4. Using a reference panel genotype matrix (X_ref) instead of LD | 3.5. HyPrColoc compatible format: effect size and standard error matrices

Last update: 2026-06-06
Started: 2025-04-17

Handling partial overlapping variants across traits in ColocBoost
Causal variant structure | 1. Run ColocBoost with partial overlapping variants | 2. Limitations of using only overlapping variables | 3. Disease-prioritized colocalization analysis with variables in the focal trait

Last update: 2026-04-22
Started: 2025-04-17

Input Data Format
1. Individual Level Data | 2. Summary Statistics | 3. Optional: mapping between arbitrary input $X$ and $Y$ | 4. HyPrColoc compatible format: effect size and standard error matrices

Last update: 2026-04-22
Started: 2025-04-16

Ambiguous Colocalization from Trait-Specific Effects
1. The Ambiguous_Colocalization Dataset | 2. ColocBoost results | 2. Fine-mapping results from SuSiE and colocalization with COLOC | 3. Get the ambiguous colocalization results and summary | 3.1. Get the ambiguous colocalization results | 3.2. Get the summary of ambiguous colocalization results | 4. Take home message | Acknowledgment

Last update: 2025-11-22
Started: 2025-04-22

Individual Level Data Colocalization
1. The Ind_5traits Dataset | Causal variant structure | 2. Matched individual level input $X$ and $Y$ | Results Interpretation | 3. Other structures of individual level data | 3.1. Single genotype matrix | 3.2. Genotype matrix is a superset of individuals across different phenotypes | 3.3. Arbitrary input matrices with mapping dictionary provided

Last update: 2025-11-22
Started: 2025-04-16

Interpret ColocBoost Output
1. Summarize ColocBoost results | Causal variant structure | 2. Filter colocalization events by relative strength of evidence | 3. More details on ColocBoost output | 3.1. Variant colocalization probability (vcp) | 3.2. Analyzed data information (data_info) | 3.3. Colocalization details (cos_details) | 3.3.1. Colocalized variants for each CoS (cos) | 3.3.2. Colocalized traits for each 95% CoS (cos_outcomes) | 3.4. Model information (model_info) | 3.5. Trait-specific effects information (ucos_details) | 3.5.1. Trait-specific (uncolocalized) confidence sets (ucos) | 3.5.2. Trait-specific (uncolocalized) outcomes (ucos_outcomes) | 3.5.3. Purity across CoS and uCoS (cos_ucos_purity) | 3.5.4. Other components | 3.6. Diagnostic details (diagnostic_details)

Last update: 2025-11-22
Started: 2025-04-17

LD mismatch and LD-free Colocalization
1. LD mismatch diagnosis | Why LD Mismatch Matters | Example of including LD mismatch | Running ColocBoost with LD Mismatch | 2. LD-free and LD-mismatch colocalization analysis

Last update: 2025-11-22
Started: 2025-04-19

Mixed Data-type and Disease Prioritized Colocalization
1. Loading individual and summary statistics data | Causal variant structure | 2. ColocBoost in disease-agnostic mode | Results Interpretation | 3. ColocBoost in disease-prioritized mode

Last update: 2025-11-22
Started: 2025-04-17

Visualization of ColocBoost Results
1. Default plot function | 2. Advanced options | 2.1. Plot with a zoom-in region | 2.2. Plot with marked top variants | 2.3. Plot CoS variants to uncolocalized traits to diagnostic the colocalization. | 2.4. Plot with added highlight points | 2.5. Plot with trait-specific sets if exists | 2.6 Plot with focal trait for disease prioritized colocalization

Last update: 2025-11-22
Started: 2025-04-17

Installation
CRAN (Stable Release) | GitHub | Conda

Last update: 2025-05-06
Started: 2025-04-16

Animation Example: Proximity Gradient Boosting Algorithm

Last update: 2025-04-23
Started: 2025-04-23

Pairwise Colocalization with Flexible Input Formats

Last update: 2025-04-20
Started: 2025-04-19

Readme and manuals