Interpreting polygenic scores#

Overview#

The PGS Catalog Calculator (pgsc_calc) produces standardised outputs and reports that make polygenic score (PGS) analyses transparent, reproducible, and interpretable.

While the report explanation describes file structures and field definitions, this page explains how to read, interpret, and communicate those results responsibly.

1. Interpreting PGS Values#

A polygenic score represents a relative estimate of genetic predisposition, derived by summing the effects of many genetic variants weighted by their estimated effect sizes.

The SUM (or normalised) value in pgsc_calc output reflects an individual’s relative genetic tendency within the dataset analysed, with some important caveats:

  • Higher scores indicate greater genetic predisposition relative to others in the same cohort – if basic mode is run on a genetically homogenous population, or if ancestry adjustment is performed on a diverse population.

  • The distribution of PGS scores can differ between genetic ancestry groups, but this does not necessarily correspond to differences in risk (e.g., disease prevalence or mean biomarker values). These differences arise from changes in allele frequencies and linkage disequilibrium (LD) patterns. We recommend using the ancestry normalisation feature to address this confounding.

  • PGS values are not absolute measures of risk or causation. Even after adjusting PGS distributions for genetic ancestry, the effect size of the PGS might still differ between ancestry groups.

  • Predictive accuracy depends strongly on similarity between the target population and the discovery cohort.

Note

Interpretation

  • Treat PGS as statistical predictors, not diagnostic tests.

  • Individuals with high scores may never develop the phenotype, and individuals with low scores sometimes will.

  • Interpretation should always consider population context, phenotype definition, and validation performance.

See Abu-El-Hija et al. in 7. Further Reading for more details.

2. Variant Matching and Quality Control#

Variant matching underpins reliable score calculation.

The match outputs and matching tables in the HTML report show how many variants in each PGS scoring file were successfully aligned with the target genotypes.

Key indicators include:

  • High match rate (>95%) indicates good compatibility.

  • A high proportion of excluded or flipped variants can signal build or reference mismatches.

  • Scores that require reducing the --min_overlap threshold to calculate should be reviewed carefully.

Note

Interpretation

Variant matching statistics are direct indicators of dissimilarity from the original PGS and uncertainty. A poorly matched score may not reproduce published performance (e.g., explain less variation in the trait) and can yield biased results. Always check the variant overlap summaries in report.html before using scores in downstream analyses.

3. Understanding the Report#

Each pgsc_calc run produces a summary HTML report (report.html) containing:

  • Metadata for the PGS files applied.

  • Variant matching and overlap statistics.

  • Ancestry projections (PCA plots).

  • PGS distributions (density plots).

These components give a quick overview of data quality and interpretability.

Note

Interpretation

Use these plots and tables to identify potential issues:

  • Narrow or truncated score distributions may indicate low variant overlap.

  • Outliers in PCA plots may represent misclassified or poor-quality samples.

  • Large shifts in ancestry-adjusted distributions may reflect population stratification.

  • The report is designed to be reviewed before further analysis or publication to ensure scores have been applied appropriately.

4. Ancestry and Normalisation#

Tip

Currently ancestry normalisation is only available in pgsc_calc v2.

When run with the --run_ancestry option, pgsc_calc produces additional ancestry-aware outputs (Z_MostSimilarPop, Z_norm1, Z_norm2). These provide normalised scores that can be compared across ancestry groups, improving equity and interpretability.

Note

Interpretation

Ancestry normalisation helps make scores more comparable across populations, but it introduces its own uncertainties. The “Most Similar Population” label is based on statistical similarity in genetic space; it is not a definitive ancestry classification.

Z-scores and percentiles should always be interpreted relative to the specific reference data used. Percentiles and Z-scores are more interpretable scales for cross-population comparison, but the precision of these adjustments depends on reference population diversity and sample size. The effect size of standardised PGS may still differ between different ancestry groups. Read the full documentation to understand what transformations are performed:

https://pgsc-calc.readthedocs.io/en/latest/explanation/geneticancestry.html

5. Communicating Uncertainty#

Polygenic scores are probabilistic estimates, not deterministic outcomes.

Uncertainty arises from multiple sources, including:

  • Statistical error in GWAS effect estimates or unadjusted confounders (e.g. heritable environmental and lifestyle factors, indirect effects)

  • Differences between discovery and target populations

  • Technical artefacts in variant matching or imputation

Note

Interpretation

Communicate PGS results as relative and probabilistic, never as fixed measures of risk.

Always include:

  • The PGS Catalog ID(s) or discovery GWAS details if the score is not in the PGS Catalog.

  • Target population characteristics (ancestry, sample size).

  • pgsc_calc version and parameter settings.

  • Performance metrics (e.g., R² or AUC) when available.

  • Where possible, visualise results as distributions or deciles rather than single-point estimates.

Explicitly acknowledging uncertainty strengthens reproducibility and responsible communication.

6. Best Practices#

  • Review variant matching and overlap thresholds before interpreting results.

  • Use ancestry-normalised or Z-score outputs when applying PGS to individuals of diverse ancestries.

  • Interpret scores within the population context; avoid making absolute statements about risk.

  • When using population descriptors, use language describing similarity to reference genetic ancestry groups, e.g. 1KG-EUR-like (See Box 5-3, NASEM, 2023).

  • Report full metadata, target ancestry, and pgsc_calc version in any downstream use.

  • Treat uncertainty as an inherent and informative feature of PGS interpretation, not as an error term to be ignored.

7. Further Reading#