A1
A1
Article https://doi.org/10.1038/s41588-024-02035-8
Received: 16 August 2023 Sile Hu 1,2,3 , Lino A. F. Ferreira 1,2, Sinan Shi 1, Garrett Hellenthal4,5,9,
Jonathan Marchini 6,9, Daniel J. Lawson 7,8,9 & Simon R. Myers 1,2,9
Accepted: 18 November 2024
Genome-wide association studies (GWAS) have uncovered numerous biases effect size estimates6,7, complicating studies of temporal trait
genetic influences on complex human traits, regulated by many loci evolution6 and comparisons between human groups. Moreover, PGS
with small effect sizes. For traits such as height, large sample sizes in accuracy drops in populations with different ancestry from the GWAS8,9
European groups have allowed PGS to explain a substantial fraction of cohort, particularly those with strong genetic differentiation. This lack
heritability by summing effects across many single nucleotide poly- of portability is partly caused by genetic drift making some causal vari-
morphisms (SNPs) genome-wide1. Because the true causal variants are ants group-specific, and population-specific linkage disequilibrium
unknown, SNPs in PGS are expected not to influence a trait directly, but (LD) patterns affecting tagging accuracy.
rather correlate with (‘tag’) a true causal mutation. Genetic stratifica- As well as such ‘local’ differences, recent studies10–14 have sug-
tion is a major confounder in GWAS, potentially causing false positives gested that causal variants have different effect sizes on traits in dif-
when phenotypes correlate with stratification. Methods including ferent groups, because of ‘global’ (nonlocal) factors. Changes in causal
mixed models2,3 and principal component analysis4,5 have proved SNP effect sizes, defined as their mean effects on a trait of interest, can
powerful in solving major issues, but subtle population structure still arise by either gene–gene interactions not captured by the additive
Department of Statistics, University of Oxford, Oxford, UK. 2Wellcome Centre for Human Genetics, University of Oxford, Oxford, UK. 3Human Genetics
1
Centre of Excellence, Novo Nordisk Research Centre Oxford, Oxford, UK. 4Department of Genetics, Evolution and Environment, University College
London, London, UK. 5UCL Genetics Institute, University College London, London, UK. 6Regeneron Genetic Center, Tarrytown, NY, USA. 7Department of
Statistical Science, School of Mathematics, University of Bristol, Bristol, UK. 8MRC Integrative Epidemiology Unit, Population Health Sciences, Bristol
Medical School, University of Bristol, Bristol, UK. 9These authors contributed equally: Garrett Hellenthal, Jonathan Marchini, Daniel J. Lawson, Simon R.
Myers. e-mail: [email protected]; [email protected]; [email protected]
model, or gene–environment interactions, differing across popula- Our pipeline uses this panel to generate an ancestry decom-
tions. Varying effect sizes have been documented in populations9, so position for a new ‘target’ sample (Fig. 1a), extending our previous
potentially contribute to differences between populations. However, approaches22,33, by closely mirroring the steps used to generate the
other studies suggest similarities across groups in underlying effect reference panel itself. Unified panel markers are phased and imputed
sizes15–19 and, for stronger GWAS hits, in direction of effect at least20. in the target, using the phasing and imputation panel. For UKB, hap-
One recent study14 leverages individuals of mixed African and Euro- lotype data are prephased. Imputed haplotypes are then matched
pean ancestry, decomposing local ancestry to test whether causal vari- against the labeled painting reference panel using ChromoPainter
ants on African versus European chromosomes show different effect to quantify recent ancestor sharing. Finally, a non-negative least
sizes. By focusing on within-individual comparisons, this approach square (NNLS) based approach (Methods)22 is used to infer ancestry
eliminated factors including gene–environment interactions14,21 and coefficients from ChromoPainter output by fitting the observed
long-range gene–gene interactions21 that might differ across popu- haplotype-matching vector as a mixture of those from the 127 ref-
lations. It instead examines whether, for example, local gene–gene erence panel groups. This approach leverages haplotype informa-
interactions alter effect sizes on different ancestral backgrounds. tion to infer population structure33 alongside Hidden Markov Model
The study inferred strong sharing of underlying effect sizes at this approximations to the coalescent33, analyzing each sample in minutes
within-individual level, suggesting local interactions are not major (Methods).
factors for most traits. However, it also inferred strong differences We applied our pipeline to 487,409 UKB participants. Because
in causal effects across individuals possessing different continen- our pipeline uses ‘out-of-sample’ comparisons based on population
tal ancestries, possibly because of gene–environment interactions. structure, analyzing individuals independently, it captures popula-
Resolving the role of such interactions is vital to successfully apply tion structure information in the United Kingdom and Ireland but not
genetic findings across groups, design efficient studies and understand genetic relatedness between samples. Ancestry is defined through
whether evolution or environmental differences drive interpopulation subjective groupings of reference individuals using genetic clusters
trait variation. Here, we develop an approach to do this, using models identifiable via fineSTRUCTURE. Because the true genetic ancestry of
in which genetic effect sizes are shared across ancestries within indi- UKB individuals is unknown, we assessed the pipeline’s effectiveness by
viduals (as observed in ref. 14), but may vary between individuals with examining the relationship between inferred ancestry and individuals’
mainly European or African ancestry. birthplace and self-reported ethnicity, the ability of Ancestry Compo-
To analyze the impacts of population structure, we introduce a nents (ACs) to predict principal components (PCs) and the capacity of
fine-scale ancestry pipeline. We use ‘ancestry’ throughout to refer to our ACs to correct for ancestry in GWAS, compared with widely used
sharing inferred most-recent ancestors with someone from a particular approaches including PCs and BOLT-LMM4,34. We also developed an
self-reported ethnicity or geographic region. Expanding on our previous expectation-maximization algorithm (Methods and Supplementary
work22–24, we created a pipeline inferring individuals’ ancestry contribu- Note 1) to estimate allele frequencies across SNPs. This method pro-
tions from 127 geographically meaningful and genetically recoverable vides frequency estimates for each of 25,485,700 UKB imputed muta-
(Methods) regions worldwide. We applied this pipeline to all 487,409 tions25 within 127 reference groups, aiding studies of regional allele
participating UK Biobank25 (UKB) individuals. We show that using frequencies within the United Kingdom.
detailed ancestry information in GWAS better corrects for population
stratification versus other state-of-the-art methods2,4,26, reduces likely Fine-scale population structure across the United Kingdom
false positives and uncovers previously undiscovered associations, but As a first test, the mean ancestries for the 434,781 UKB participants
nonportability remains strong. We also develop ANCHOR, a statistical born in the United Kingdom or Republic of Ireland with self-reported
inference approach to estimating cross-population similarity in causal white British and/or Irish (WBI) ethnicity were: British–Irish (BI), 94.9%;
effect sizes using admixed individuals, complementing approaches Dutch, 1.35%; Swiss, 0.79%; Norwegian, 0.49%; Polish, 0.29%; and Dan-
including POPCORN10 and XPASS27 that only apply to nonadmixed ish, 0.19%. For participants whose self-reported ethnicity is ‘other white
individuals (‘Discussion’). Notably, ANCHOR requires no prior assump- background’, the BI proportion drops to 25.5%. For individuals born in
tions about the underlying effect size distribution. As in ref. 14 and other the Republic of Ireland, inferred Irish ancestry averages 74.2%, within
recent studies11–13, ANCHOR decomposes ancestry at fine scales along 98.4% BI ancestry, demonstrating the pipeline’s accuracy in capturing
the genome. It assigns mutations to specific ancestries to quantify the geographic and ethnicity information (Extended Data Fig. 1).
contributions of gene–environment interactions on differences of The average ancestry proportion from 22 British Isles regions
predictive power for individuals with different ancestries. Application (Methods) varies by birthplace (Fig. 2a). These ancestry proportions are
of ANCHOR to 8,003 UKB mixed-ancestry individuals yields different based on an out-of-sample dataset of UK individuals with grandparents
findings from recent studies, which we discuss along with implications. born within an ~80-km radius in each region23. Mean BI ancestry associ-
ated with a region decreases with geographic distance from that region,
Results indicating that DNA information is informative for birthplace (Fig. 1b
A statistical pipeline to infer precise individual ancestry and Extended Data Fig. 1). Some 41.5% of UK-born individuals have more
We developed an approach able to decompose the fine-scale ancestry of than 50% of their ancestry from a single region, matching their birth-
a genome into a mix of 127 regions, including 23 in the United Kingdom place 59.2% of the time, rising to 82.7% after expanding to neighboring
and Ireland (Supplementary Table 1). The UKB dataset comprises many regions (Supplementary Table 2a). There is a strong correspondence
individuals of mainly British ancestry, alongside a substantial fraction between self-identified ethnicity and birthplace for non-UK ancestries
with ancestry from elsewhere. Our approach identified 105 regions (Fig. 2b and Extended Data Fig. 2). However, regional variation exists:
present in at least 5 individuals for at least 10% of their ancestry. Our ancestry localization is weaker in southern and eastern England23, and
pipeline leverages data from previous studies23,28–31 of human genetic stronger in Scotland, Wales, Northern England and South West England.
variation, and uses methods25,32 to impute these data on a common set An entropy-based statistic (Methods) shows regional mixing, with
of variants for high-quality imputation. This generates a unified ‘refer- London having the highest entropy and the highest average fraction of
ence panel’ of haplotypes (Methods and Fig. 1a), phased using SHAPEIT2 ancestry outside the United Kingdom. Other major cities and South East
(ref. 32) to ensure consistent data quality. Reference haplotypes are England also show strong mixing (Extended Data Fig. 3). Non-British
labeled through semi-supervised clustering with ChromoPainter33 and ancestries also vary geographically, with higher Irish ancestry (>10%)
fineSTRUCTURE33, combined with geographic and ethnicity labels, to seen in Liverpool, Birmingham, Manchester and London; Dutch ances-
produce a painting reference panel. try found in the south of England and around Bristol (3.5%); and Polish
a
Ancestral inference pipeline
Genotype
b
North Yorkshire South Yorkshire South East England
0.6 0.30
0.35
0.5 0.25
0.30
0.20 0.15
0.3
0.15
0.2 0.10
0.10
0.1 0.05
0.05
Fig. 1 | Schematic diagram of the ancestral inference pipeline and the fitting stage, non-negative coefficients summing to one and representing the
performance in the UKB BI individuals. a, Main steps in the ancestral inference proportions of ancestries from the labeled groups in the reference panel are
pipeline. The pipeline accepts individual genotype data (microarray or inferred. b, Geographical average proportion of DNA in BI individuals, positioned
sequencing data) as input. The genotype data are phased and imputed against according to their birthplaces (Methods), inferred to come from three regional
a phasing and imputation reference panel in first ‘Phasing’ and ‘Imputation’ groupings: North Yorkshire, South Yorkshire and South East England (which
stages, and painted against the painting reference panel including preclustered correspond to the excess ancestry locations colored red). Pop., population; A,
groupings (5 groups in this example; 127 in our actual analysis). In a final mixture proportion of ancestries from each population.
ancestry peaking near Wrexham in Wales, the site of the second highest PCs from these 127 ACs, and conversely predicted 16 common BI ACs
concentration of Polish burials in the United Kingdom35. from the first 140 UKB PCs (Methods, Fig. 3a,b and Supplementary
For non-UK-born individuals, we again achieve fine-scale ancestry Figs. 1 and 2). We found a good prediction of most PCs from ACs, but
resolution (Fig. 2b, Extended Data Fig. 2 and Supplementary Table 2b). not the converse, indicating that ACs capture additional information
Although some countries (for example, Philippines and Japan) show (Supplementary Fig. 1). PC-based prediction was also poor for non-UK
genetic homogeneity, perhaps because of small sample sizes, most regions (Supplementary Fig. 2). Second, we performed GWAS on 104
exhibit diverse ancestry patterns. Because non-UK-born individuals UKB quantitative traits with more than 10,000 unrelated white British
reflect people who made their home in the United Kingdom, rather than individuals (Methods). We compared the effectiveness of ACs versus
unbiased samples from birthplace countries, we observe widespread BI PCs for correcting population stratification by using LD-score regres-
ancestry. Other patterns are present, such as the over-representation sion (LDSC; Extended Data Fig. 4)37, which measures systematic inflation
of Gujarat ancestry among UKB individuals born in Uganda and Kenya, because of uncorrected stratification. An intercept estimate close to
which might be explained by the post-colonial exodus of South Asians 1 indicates effective correction, although intercepts slightly above 1
from East Africa. can occur for highly heritable traits like height, or large sample sizes
in practice37,38.
ACs aid GWAS stratification for geographically linked traits Birth location is often used as a control phenotype to test for
To avoid false-positive GWAS associations, correcting for population stratification39 because few SNPs are likely to be causally associated
stratification is crucial. PCs are widely used for this purpose4,5,36, either with it. For latitude (Fig. 3c), PC-based correction or BOLT-LMM analysis
alone or with mixed-model analysis3,34. However, selecting the number (Fig. 3c, Supplementary Fig. 3a and Supplementary Table 3a,b) yielded
of PCs is nontrivial; too few can result in false positives, whereas too many apparent independent hits, and substantial genome-wide infla-
many can cause overcorrection because of extensive LD (for example, tion (LDSC intercept of 1.6608; Extended Data Fig. 4a,b), even with 100
beyond the first 40 publicly released UKB PCs36). We compared the PCs (Supplementary Fig. 3b). By contrast, AC-based correction reduced
use of 127 ACs with PCs in GWAS. First, we predicted the first 16 UKB the number of association signals from 470 to 7 (Fig. 3c; with five shared
a
34,484 n = 92
4,699 2,585
2,123 2,357
83,091 42,765
2,556 12,408
1,029 50,244
39,597 25,636
34,808 3,062
5,201 6,051
619 5,751
344 58,585
6,635
East Anglia South East England Devon Lincolnshire Cyprus Switzerland Syria
South Central England Merseyside North West Wales Cumbria Germany Balkans Denmark
Central England South England South Yorkshire North West Scotland Lithuania Portugal
North Central England South East Wales Cornwall North Ireland and South Scotland Norway Netherlands
North Yorkshire North East Scotland Cornwall Tip Ireland Poland Italy
Northumberland South West Wales Orkney France Spain Tuscany
b Turkey (153) Iraq (325) Iran (520) Afghanistan (101) Pakistan (1412) Nepal (134) Japan (261)
Oth Whi Asi Oth Whi Asi Oth Whi Asi Oth Pa Asi Oth Asi Oth
Lebanon (72) China (408)
Ind Bri
India (3921) Asi Oth
Uganda (608) Malaysia (666)
Worldwide
Africa
North Europe Ind Bri
Ind Bri Afr Chi Bri
Sri Lanka (730) Myanmar (Burma)
Tanzania (421) Indonesia (62)
South Europe
West Europe
Pacific America Asi Oth Asi Oth Mix Bri
Ind Bri
Chi Oth Bri Whi
Malawi (109) Mauritius (384) Bangladesh (24) Thailand (99) Singapore (496) Australia (919) New Zealand (680)
Ind Mix Bri Afr Ind Asi Oth Ind Oth Mix Bri Asi Chi Oth Bri Chi Bri Bri Whi Bri Whi
Fig. 2 | Ancestry inference for UKB individuals born in the United Kingdom or African countries. Colors are as shown on the map, with colors for ancestry from
Ireland and worldwide. a, Ancestry inference stratified by birthplace region for additional regions given in the legend. White lines on the map delineate the
UKB WBI individuals; for each regionally labeled bar plot, each column shows borders of different countries. Self-reported ethnicity labels are shown below
ancestry decomposition for a single individual, with colors representing regions each bar plot. Color legends differ in a and b. Self-reported ethnicity: Afr, African;
shown on the map and numbers representing counts of individuals from each Asi, Asian; Bri, British; Chi, Chinese; Ind, Indian; Ire, Irish; Mix, mixed; Other,
area. b, As a, but showing decomposition for Asian, Oceanian and selected East other ethnic group; Pa, Pakistani (Asia); Whi, white (Europe).
a b
0.8 0.5 0.05 1.0 1.0 1.0
PC2 PC3 PC4 South 0.8 North West 0.8 Northumber- Ireland
0.6 PC1 1.0 0.8
0.3 0.1 –0.05 0.8 Yorkshire 0.6 England 0.6 land 0.6
0.4 0.6 0.4 0.4 0.4
0.1 0 0.4 0.2 0.2 0.2
0.2 –0.15 0.2
0 0 0 0
–0.1
R2 = 0.999 –0.1 0.996 0.987 0.93 R2 = 0.526 –0.2 0.438 –0.2 0.423 –0.2 0.734
0 –0.25 –0.4
0
0.2
0.4
0.6
0.8
–0.1
0.1
0.3
0.5
–0.1
0
0.1
–0.25
–0.15
–0.05
0.05
–0.2
0
0.2
0.4
0.6
0.8
–0.2
0
0.2
0.4
0.6
0.8
1.0
0
0.2
0.4
0.6
0.8
1.0
1.0
–0.4
–0.2
1.0
0.8
0.6
0.4
0.2
0
0.05 0.05 1.0 North Central 1.0 South Central 1.0 North Ireland 1.0 Nigeria
PC5 PC6 PC7 0.05 PC8
0.05 0.8 England 0.8 England 0.8 0.8
0.03 –0.10
–0.05 0.6 0.6 0.6 0.6
0.01 –0.25 –0.05 0.4 0.4 0.4 0.4
–0.01 –0.40 –0.15 0.2 0.2 0.2 0.2
–0.15 0 0 0
–0.03 0.883 –0.55 0.851 0.681 –0.25 0.747 0.197 0.228 0.303 0 0.972
–0.2 –0.2
–0.03
–0.01
0.01
0.03
0.05
–0.55
–0.40
–0.25
–0.10
0.05
–0.15
–0.05
0.05
–0.25
–0.15
–0.05
0.05
–0.2
0
0.2
0.4
0.6
0.8
1.0
–0.2
0
0.2
0.4
0.6
0.8
1.0
0
0.2
0.4
0.6
0.8
1.0
0
0.2
0.4
0.6
0.8
1.0
0.06 1.0 1.0 1.0 1.0
0.10 South East South East North West North
0.04 PC9 PC10 0.10 PC11 0.05 PC12 0.8 England 0.8 Wales 0.8 0.8
0.05 Scotland Yorkshire
0.6 0.6 0.6 0.6
0.02 0 0.05 –0.05 0.4 0.4 0.4 0.4
0 –0.05 0.2 0.2 0.2 0.2 0.062
0
0 0
PC predicted by AC
AC predicted by PC
–0.02 0.79 –0.10 0.722 0.713 –0.15 0.677 0.171 0 0.247 0.394 0
0
0.2
0.4
0.6
0.8
1.0
0
0.2
0.4
0.6
0.8
0
0.2
0.4
0.6
0.8
1.0
0
0.2
0.4
0.6
0.8
1.0
–0.02
0
0.02
0.04
0.06
–0.10
–0.05
0
0.05
0.10
0.05
0.10
–0.15
–0.05
0.05
1.0
0.2 1.0 1.0 1.0 1.0
0.02 South West
PC13 PC14 0.10 PC15 PC16 0.8
Cumbria Central
0.8 England 0.8
Anglia
0.8
0.1 0.01 0.02 Wales
0.05 0.6 0.6 0.6 0.6
0 0 0 0.4 0.4 0.4 0.4
–0.01 0.117 0.2 0.102 0.084 0.2
–0.05 –0.02 0.2 0.2
–0.1 –0.02 0
0.722 0.577 0.631 0.47 0 0 0 0.629
–0.10
–0.1
0.1
0.2
–0.02
–0.01
0.01
0.02
–0.10
–0.05
0
0.05
0.10
–0.02
0
0.02
0
0.2
0.4
0.6
0.8
1.0
0
0.2
0.4
0.6
0.8
1.0
0
0.2
0.4
0.6
0.8
1.0
0
0.2
0.4
0.6
0.8
1.0
UKB PC UKB AC
c d e
Birth place North/South Employment score England Waist circumference
25 25 25
20 20 20
160
15 15 15
120
80
10 10 10 40
40 80 120 160
–log10(PPC)
5 5 5
0 0 0
0 5 10 15 20 25 0 5 10 15 20 25 0 5 10 15 20 25
–log10(PAC)
Allergy Cognitive function Blood pressure Other medical condition Autoimmune Education attainment Anthropometric
No GWAS hits Lipids Blood counts Lifestyle Population density Socioeconomic status Not queried
Fig. 3 | Comparison between AC-corrected and PC-corrected GWAS. a,b, and waist circumference (e). All plots are colored according to the legend shown
Predictions are based on ‘linear’ combinations of ACs or PCs. a, Prediction of first at the bottom, indicating earlier evidence from GWAS for each SNP in particular
16 UKB PCs (x axes) using a linear model-based prediction from the 127 ACs (y phenotypic categories (gray: SNPs show no prior GWAS evidence, perhaps
axes) shows strong correlations (R2 values). b, As a, but now predicting 16 UK and consistent with likely false-positive associations). The horizontal and vertical
worldwide ACs from 140 PCs, often showing poor prediction. c–e, Comparison dark blue lines indicate the genome-wide P-value threshold (P = 5 × 10⁻⁸) in a −
of AC-corrected (x axis) and PC-corrected (y axis) −log10(P values) for SNPs in log10 scale, while the light blue line represents y = x. In each plot, the points show
three exemplar GWAS for labeled traits: birthplace (c), employment score (d) only independent SNPs with P < 5 × 10−8 for one or both approaches.
hits), implying that fine-scale ancestry information can effectively association. Because the protective allele against hay fever is more com-
remove stratification impacts. Similar results were observed for mon in southern England where hay fever is most prevalent (Extended
home location (Extended Data Fig. 4a,b and Supplementary Fig. 3c,d). Data Fig. 5), such geographic differentiation44 might reflect selective
Although any residual birthplace hits may reflect inadequate adjust- migration of people carrying this variant and/or past natural selection.
ment for stratification, the few association signals identified using ACs We also performed a GWAS for ‘employment score England’, a trait
show a modest enrichment in SNPs showing previous GWAS evidence defined based on the region in which a person lives. GWAS associations
(Supplementary Table 4; odds ratio (OR) = 5.9, P = 0.039 compared with such traits have been observed6, but such associations can be con-
with 12% PC-corrected hits with previous GWAS evidence). Specifically, founded by regional stratification. Indeed (Fig. 3d), ACs and PCs shared
the strongest remaining signal (P < 10−15) after using ACs surrounds a number of hits, but some signals were only observed in the PC-based
rs5743618 on chromosome 4, linked to toll-like-receptor genes involved analysis, and AC-based showed a stronger enrichment of previous
in innate immunity40,41, including TLR1 and TLR10, one of the strongest GWAS signals (Supplementary Table 4; OR = 11, P = 0.0013) compared
hay fever hits in European GWAS cohorts42,43, with weaker asthma risk with PCs. In total, 18% of PC-only associations overlapped with previous
GWAS signals, similar to the null trait of latitude (OR = 1.6, P = 0.44), and European-like segments, yielding ‘African PGS’ (APGS’ and ‘Euro-
compared with 71% for AC-based associations (OR = 18, P = 1.8 × 10−10 pean PGS’ (EPGS) scores for each individual, each ‘mean-centered’
versus latitude). Moreover, many AC-based signals were significant in by ancestry-dependent mean SNP frequencies, a critical step both in
previous GWAS for educational attainment or socioeconomic status theory (Supplementary Note 2) and in simulations. The phenotype
(Fig. 3d and Supplementary Table 3c). These trait classes were fur- is regressed against APGS and EPGS, including other covariates, to
ther enriched relative to PC-only hits (OR = 16, P = 0.005), suggesting estimate coefficients βAf and βEu, which quantify ancestry-specific PGS
PC-only hits are likely false positives (Fig. 3d). These findings suggest predictive power as the respective average trait increases in admixed
that previous GWAS for ‘regionally defined’ traits can produce false individuals (Supplementary Note 2) per unit APGS or EPGS increase.
positives, and ACs help correct for this. Finally, βEu is then compared with the corresponding βObs.Eu estimate
Among the other 99 nonregionally defined traits, PC-based and from BI individuals to estimate ρ, as βEu/βObs.Eu (Supplementary Note
AC-based correction provided similar P values and LDSC intercepts, 2). Importantly, the validity of the ρ estimation requires no assump-
with subtle improvements for AC (Extended Data Fig. 4a,b). However, tions regarding underlying effect size distribution, causal mutation
differences included five hits unique to the AC-corrected analysis frequencies, LD patterns or selection. Under a local effect only (‘null’)
(Methods, Fig. 3e and Supplementary Table 3e), some in strong LD model, causal effect sizes are identical in all European-ancestry regions
with known GWAS hits for the same traits43,45–47, and thus unlikely to and ρ = 1. If nonlocal ancestry-specific (gene–gene or gene–environ-
be false positives. For traits including waist circumference (Fig. 3e and ment) interactions occur, ρ ≠ 1, and if the variance of underlying genetic
Supplementary Table 3e), AC-specific hits occurred at SNPs in regions effect sizes is the same in each group, then ρ < 1 (Supplementary Note
of strong LD with high regional SNP loadings for particular PCs (Sup- 2). Therefore, testing for ρ ≠ 1 provides a test for differing effect sizes
plementary Table 5a–e and Extended Data Fig. 6). Therefore, applying across groups. Assuming effect sizes scale linearly with genome-wide
ACs likely removed false negatives caused by PC overcorrection from ancestry, ANCHOR can predict ρ for 100% European or African ances-
PCs strongly correlated with genotypes in particular genomic regions. try without sampling (Methods and Supplementary Note 2). Pooling
ρ values among traits helps reduce overall uncertainty, and confi-
Causal effects are similar across ancestries dence intervals (CIs) for all coefficients are obtained by bootstrapping
To study PGS portability across UKB groups, we created PGS using (Methods).
343,047 white British individuals for 53 heritable (estimated herit- We verified HAPMIX’s48 ability to accurately infer ancestry and
ability >5%) UKB-measured quantitative traits, correcting for ancestry construct ancestry-specific PGS by comparing it with trios with known
using either ACs or PCs, and tested their performance in independent phase (Extended Data Fig. 7 and Supplementary Fig. 7). We then tested
samples representing different ancestries; that is, with more than 50% ANCHOR by simulating 24 quantitative traits with various settings of
inferred ancestry (by ACs) from seven respective labeled regions: South heritabilities, clustering of causal mutations and causal marker fre-
Central England, Northumberland, Republic of Ireland, Poland, India, quency spectra, using genetic data from the 8,003 African-ancestry
China and West Africa. AC-based versus PC-based correction yielded individuals (Methods and Supplementary Note 1). We performed GWAS
different group-specific PGS means, but strong within-group corre- and downstream analyses exactly as for the real phenotypes. In nonad-
lations (92.7% to 99.9%) (Supplementary Fig. 4 and Supplementary mixed populations, mean-centering of genotypes has no impact on the
Table 6) so we focus on AC-corrected PGS for the remaining analyses. PGS predictive power, but is crucial for ANCHOR’s validity in admixed
By regressing the PGS against actual traits in each group, we quanti- populations (Supplementary Note 2). Under the null (ρ = 1), across sim-
fied the increase in trait per unit PGS increase (that is, the regression ulations ρ is correctly estimated from mean-centered EPGS and APGS,
slope) and denoted it as β, and denoted the variance explained by the but estimates are strongly downward biased without mean-centering
PGS as ΔR2, after regressing out covariates (Methods). Both ΔR2 and β (Extended Data Fig. 8 and Supplementary Figs. 8 and 9), even when
decreased with increasing genetic distance from the British ancestry masking short or uncertain segments (Supplementary Figs. 8a and 9).
groups (Supplementary Figs. 5 and 6), particularly for sub-Saharan Therefore, we use mean-centering for all ANCHOR analyses. We also
African ancestry, with a >2.2-fold reduction in ΔR2. Fine-scale ances- observe significantly reduced ρ estimates for real quantitative traits
try showed this occurs even between BI regional ancestries for some without mean-centering, fully consistent with the simulation findings
traits: for standing height, forced expiration volume in 1 s (FEV1) and (Supplementary Figs. 10–12).
apolipoprotein B, ΔR2 for Northumberland differs from that for Ireland If ρ = 1, 95% bootstrapped CIs for 96% of individual traits con-
(P < 0.02, P < 0.005 and P < 0.02, respectively). tained the true value (ρ = 1), with robustness to different GWAS P-value
To partition the drop-off in PGS performance across ances- thresholds (0.05 versus 0.0001) (Supplementary Fig. 13). Averaging
tries into local effects (causal variant tagging) and nonlocal effects ρ estimated by ANCHOR across traits for groups of individuals with
(ancestry-specific gene–gene or gene–environment interactions), we similar ancestry levels also yielded ρ = 1 (Extended Data Fig. 9), indicat-
developed ANCHOR, which leverages variation in local ancestry along ing good performance under the null across both traits and ancestry.
the genome and between admixed individuals (Fig. 4a). ANCHOR takes In simulations in which ρ declines, ANCHOR estimates of ρ remain well
PGS coefficients from an independent sample as input, and analyzes calibrated (Fig. 4b, Extended Data Fig. 9 and Supplementary Fig. 14). In
quantitative phenotype and genotype data for a group of admixed indi- all cases we observe βAf/βObs.Eu < 1 as expected because of local effects
viduals. It produces an estimate of ρ, the mean trait increase in admixed (Supplementary Note 2), so African-ancestry segments are less predic-
individuals per unit increase in a perfect PGS for nonadmixed (for tive of traits, but βAf values varied similarly to βEu (Extended Data Fig. 9
example, European) individuals, constructed using their (unknown) and Supplementary Fig. 14).
true effect sizes. ρ equals the correlation in true effect sizes between We applied ANCHOR to 53 UKB quantitative phenotypes and found
populations under reasonable assumptions (Supplementary Note strikingly constant average ρ = βEu/βObs.Eu estimates across genome-wide
2). For our analysis, we generated PGS coefficients from 343,047 UKB ancestry bins (Fig. 4c and Methods), overlapping the null value ρ = 1
white British samples, and analyzed 8,003 ‘African ancestry’ individuals for all bins. This indicates that European-ancestry segments retain
with varying (mean 83.6%) inferred sub-Saharan African ancestry and predictive power similar to that in European-ancestry individuals.
BI + Europe (mean 11.5%) ancestry. Thus, a ‘true’ PGS from Europeans predicts similar trait increase, on
ANCHOR estimates local ancestry along the phased genome, for average, per unit PGS increase, in African-ancestry individuals. This
example, using HAPMIX48 here, masking uncertain and short segments implies either conserved effect sizes between groups at shared causal
to ensure ancestry segments extend further than LD (Methods and SNPs, across the broad range of human molecular and quantitative
Supplementary Note 2). It calculates separate PGSs for African-like phenotypes we examined or—less parsimoniously—systematically
a b βEu/βObs.Eu
All
1.2
[0.10, 0.35] n = 373
European African 1.1 [0.35, 0.55] 560
[0.55, 0.78] 475
1.0
[0.78, 0.95] 2,556
0.9
0.7
0.6 n = 8,003
0.5
0.4
0.3
0.2
Causal SNP 0
1.0
3
05
01
0.
0.
0.
0.
0.
00
0.
True effect size correlation
0.
=
P
=
(simulation)
P
[0.00, 0.10] n = 20,107 [0.35, 0.55] 560 [0.78, 0.95] 2,556 53 UKB traits
c 1.2
[0.10, 0.35] 373 [0.55, 0.78] 475 [0.95, 1.00] 4,039 d
Standing height
1.1
0.5 n = 8,003
0.9
0.4 βObs.Eu All
Impact of local LD
n = 1,491
βEu
5,355 1,502
0.8
13,261
0.7 0.3
4,438
0.6 1,266
Eu
0.5 0.2 βAf
7,104
All Af
0.4 African βAf βAf
0.3 0.1
0.2
pe
Ire nd
be nd
Local ancestry
i
a/ in
0
d
0 21 46 71 89 99
la
la
ro
rla
a
In
um gl
All Af Eu
th n
Global ancestry
N tra
fr
en
tA
/C
es
th
W
u
So
Fig. 4 | Separation of local and nonlocal factors influencing portability. a, Test ANCHOR for 53 UKB traits across varying African ancestry binned as shown (x
principles: in UKB samples with European (blue) and African (red) ancestries, a axis; colored regions). For each bin, mean estimates across traits of ratios
causal variant contributing to a trait is captured by a tag SNP whose predictive βEu /βObs.Eu (blue) and βAf /βObs.Eu (red) are shown. Also shown are ratio estimates
power (pink arrow thickness) varies by ‘local’ ancestry (upper versus lower for individuals of ~100% European (leftmost point at y = 1) or ~100% African (red
chromosomes), or nonlocal factors captured by genome-wide ‘global’ ancestry horizontal bar) ancestry. CIs crossing y = 1 are consistent with identical effects to
(left versus right individuals); ANCHOR separates these contributions to PGS ~100% European-ancestry individuals, and similarly for red points or bar. d, Mean
portability. b–d, βij values refer to the mean increase in phenotype per PGS unit increase in standing height per PGS unit increase across populations (seven
increase for local ancestry j and global ancestry i (see Methods for further left-hand columns); alongside corresponding ANCHOR estimates for height
details). b, ANCHOR performance for 24 simulated traits and 53 UKB quantitative (final six columns) labeled by global or local ancestry combinations. Data are
traits with PGSs constructed using different P-value thresholds (P = 0.05 and presented as (weighted) means (b,c) or as estimated values (d) with 95% central
P = 0.0001; right). True effect size correlations ρ (x axis) between African and bootstrapped CIs. Error bars indicate 95% bootstrapped CIs, Af, African; Eu,
European ancestries are compared with the ANCHOR estimator βAll Eu
/βObs.Eu (y European.
axis). Colors denote African ancestry bins, as defined in c. c, Application of
larger average effects across all these phenotypes in African-ancestry ancestry or African ancestry genome-wide (Figs. 4d and 5). For stand-
individuals, in such a manner as to coincidentally balance the impact ing height (Fig. 4d), we show βAf and βEu estimates alongside βObs.Eu, and
of incomplete correlation. β estimates for various AC-defined UKB cohorts. β estimates decline
For individual traits, as in the simulations we combine all individu- with increasing genetic distance because of LD changes, reducing PGS
als to estimate a trait-specific ρ, and extrapolated to estimate effect predictive power. However, ANCHOR shows that European-ancestry
sizes for European segments in individuals with almost 100% European segments in African-ancestry genomes have strong predictive power,
0 0.5 1.0 1.5 2.0 0 0.5 1.0 1.5 2.0 0 0.5 1.0 1.5 2.0
Fig. 5 | ANCHOR results for 53 UKB traits. Data are presented as estimated values African ancestry and (as expected, reduced) predictive power for African-
of ratio of true effect sizes with 95% central bootstrapped CIs. βij values refer to ancestry segments. From top to bottom, the rows above and below the first
mean increase in phenotype per PGS unit increase for local ancestry j and global horizontal dashed line represent non-molecular and molecular traits and rows
ancestry i (see Methods for further details). Colors of βij : blue, European; purple, above and below the second dashed line represent individual traits and their
projected to 100% African ancestry; red, African ancestry. Black rows represent weighted average estimation. Vertical dotted lines: grey lines indicate ρ = 0.5 (left
individual UKB traits; the first standing height row uses an existing PGS16; the of the red dotted lines) and ρ = 1.5 (right of the red dotted lines); red lines indicate
dark green rows show combined estimates. Columns (left to right) estimate ρ for ρ = 1. ALP, alkaline phosphatase; FVC, forced vital capacity; HDL, high-density
‘all’ 8,003 African-ancestry individuals, ρ for individuals of 100% projected lipoprotein; IGF1, Insulin-like growth factor-1; LDL, low-density lipoprotein.
similar to wholly European-ancestry individuals (blue bars), and does Bonferroni correction. In future, larger sample sizes may better detect
not identify any significant effect sizes changes across the range of heterogeneity of effect sizes for individual traits.
genome-wide ancestry. African-ancestry segments (red) show much
lower predictive power, explaining nonportability because of local LD Discussion
and allele frequency differences, rather than gene–gene or gene–envi- We introduce new approaches to understand human ancestry and its
ronment interactions. connections to GWAS and PGS prediction. Decomposing UKB individu-
Across all 53 quantitative and molecular phenotypes (Fig. 5), ρ als’ ancestry into ACs at a subnational level, improves confounding
estimates (first column) were almost indistinguishable from 1, even for correction by capturing information missing from PCs, reducing likely
individuals with the highest African ancestry (second column), indicat- false positives and likely false negatives. ACs offer better control for
ing embedded European segments have similar predictive power to geographic effects than current methods, within the UKB at least. Many
nonadmixed Europeans, but with strongly reduced overall PGS power traits show similar P values using ACs versus other popular approaches,
(third column). The joint bootstrap yielded an overall ρ = 0.98 ± 0.07 but ACs uniquely avoid likely false positives in ‘regional traits’ defined
for these traits, extremely close to 1. Results were consistent using on groups of individuals6. Together with observations of likely false
varying P-value thresholds (P value of 0.05 and 0.0001) from the initial negatives in GWAS using PCs, and potential larger sample sizes of future
GWAS (Supplementary Fig. 15), and correction methods (ACs versus studies, incorporating ACs into GWAS may improve stratification cor-
PCs) in the initial GWAS (Supplementary Fig. 16). Results for standing rection by reducing overcorrecting linked to local genomic regions, and
height show little change using a previously published, alternative, avoids the issue of deciding the number of PCs in GWAS. The complex
PGS16 (Fig. 5). Two lung-function traits: FEV1 and forced vital capacity, patterns of ancestral mixing within UK regions or between, for exam-
and the correlated trait49,50 of sitting height, showed ρ < 1 at nominal ple, cities and rural areas, suggest particular care is needed for traits
significance (P < 0.05) (Fig. 5, Supplementary Fig. 15), with white blood showing similar regional variation such as educational attainment24. We
cell count, red blood cell count and albumin showing ρ > 1 (P < 0.05). note that the achievable granularity and interpretation of ACs depends
Only the white blood cell count remained significant (P < 0.05) with a on the size and diversity of the ancestry reference panel used, as well
GWAS P-value threshold of 0.0001, and no traits were significant after as the related number of identifiable predefined ‘ancestry groups’. A
limitation of our approach is, therefore, that current ACs are some- Our results do not imply that gene–environment, or even gene–
what United-Kingdom-specific, but region-specific ACs for non-UK gene, interactions are absent across the traits we studied. Such interac-
GWAS could offer similar benefit. Until then, ACs can complement tions likely cause variation in effect size across individuals with African
PCs, though even the best ACs may not completely correct for subtle ancestry but must largely be shared with other ancestries to avoid
population stratification. differences in overall (mean) effect sizes across populations. Previous
To compare causal effect sizes across human populations, we work9 has shown effect sizes variation within UKB individuals of Brit-
consider the mean effect of a causal mutation on a trait12,51 and compare ish ancestry, based on age, gender and socioeconomic status. We also
this across groups. We note that the common and often convenient observe differences between males and females in mean effect sizes
scaling of genotypes by their population standard deviations also (Extended Data Fig. 10), and subtle differences among UK groups strati-
scales effect sizes differently across populations, so to avoid down- fied by ACs (Supplementary Fig. 6), likely reflecting gene–environment
ward biases in estimating causal effect correlations, we avoid such interactions. However, the strong lack of portability in African-ancestry
scaling14, and instead define identical impacts as a mutation causing individuals seems not driven by these interactions, apart from specific
the same average increase in phenotype in each group. This biologi- traits like FEV1. This result encourages joint fine-mapping56 efforts
cally natural approach provides an interpretable scale for practical and suggests that improving causal variant tagging is key to applying
applications. genetic findings across groups, simplifying the process by avoiding
ANCHOR leverages local ancestry inference to estimate the ratio the need of re-estimating effect sizes.
of underlying effect sizes between an admixed group and a reference The UKB resource contains diverse ancestries, but collects homog-
group similar to that used for the initial GWAS and PGS construction, by enous data for individuals within a single country, minimizing trait
assessing the predictive power of PGS among groups in terms of their definition differences and environmental effects to better isolate
relative ‘mean trait increase per unit PGS increase’. This ratio is 1 if effect underlying biological impacts. Our results likely rule out differential
sizes are identical, and estimates the correlation between true effect gene–gene interactions as a major driver of nonportability, even in
sizes in the two groups (Supplementary Note 2). Overall our results indi- other settings, because these would still operate strongly within the
cate clearly that—within the UK at least—using effect sizes from 100% United Kingdom. However, effect size differences might be stronger
European individuals yields near-identical performance in individuals between countries with greater environmental differences or differing
of African ancestry for various quantitative phenotypes. This suggests trait definitions. In future, ANCHOR might be applied to groups outside
PGS utility across populations, at least for causal mutations not private the United Kingdom, for example, African Americans14, to analyze
to one group. Although an underlying correlation between effect sizes various traits, or extended to analyze binary disease traits. As GWAS
ρ < 1 is possible (Supplementary Note 2), if coincidentally counterbal- sample sizes in admixed and other populations grow, methods includ-
anced by larger average effect sizes in African-ancestry individuals, ing ANCHOR will likely uncover variable effect sizes across countries
this seems unlikely across diverse quantitative traits a priori. If instead or cohorts, whereas other approaches10,52,54 enable comparisons of
ρ ≈ 1, most causal effect sizes are very similar between individuals of groups of similar ancestry.
African ancestry and European ancestry, across the range of quantita-
tive phenotypes we studied. Of interest for further study, a few traits Online content
such as FEV1 do suggest evidence of differences. Any methods, additional references, Nature Portfolio reporting sum-
Gene–gene and gene–environment interactions likely, therefore, maries, source data, extended data, supplementary information,
do not drive the lack of PGS portability in the UKB. Population structure acknowledgements, peer review information; details of author contri-
is also an unlikely cause, given consistent effect size estimation between butions and competing interests; and statements of data and code avail-
PGSs constructed by AC-corrected and PC-corrected GWAS. Instead, ability are available at https://doi.org/10.1038/s41588-024-02035-8.
local LD and SNP frequency differences appear to be the main factors.
African-ancestry segments show significantly reduced predictive References
power (about a threefold reduction) compared to European ancestry 1. Yengo, L. et al. A saturated map of common genetic variants
segments (Fig. 5). Our (ρ = 1) simulations confirm that PGS coeffi- associated with human height. Nature 610, 704–712 (2022).
cients match real data for European segments, but show only a modest 2. Yang, J., Lee, S. H., Goddard, M. E. & Visscher, P. M. GCTA: a tool
reduction (less than twofold for traits with >100 causal markers) for for genome-wide complex trait analysis. Am. J. Hum. Genet. 88,
African-ancestry segments. This suggests reduced tagging of causal 76–82 (2011).
variants by the PGS in African-ancestry regions potentially because of 3. Yang, J. et al. Common SNPs explain a large proportion of the
selection against trait-impacting variants52,53 causing larger frequency heritability for human height. Nat. Genet. 42, 565–569
differences for true causal variants between populations than the ran- (2010).
domly selected variants in our simulations, and the observed strong 4. Price, A. L. et al. Principal components analysis corrects for
drop-off between groups. Stronger stratification correction by ACs stratification in genome-wide association studies. Nat. Genet. 38,
could help investigate such selection in future. 904–909 (2006).
Our results differ from previous findings51,54,55 of often considerably 5. Novembre, J. & Stephens, M. Interpreting principal component
lower correlations between European-ancestry and African-ancestry analyses of spatial population genetic variation. Nat. Genet. 40,
samples using methods that model genetic variance components and 646–649 (2008).
local LD differences. For example, one study51 finds different effects for 6. Abdellaoui, A. et al. Genetic correlates of social stratification in
body mass index in UKB, another55 finds different effects for height, and Great Britain. Nat. Hum. Behav. 3, 1332–1342 (2019).
an estimated correlation of only ~50% across 26 traits in UKB. Because 7. Abdellaoui, A., Dolan, C. V., Verweij, K. J. & Nivard, M. G. Gene–
ANCHOR focuses on PGS prediction with minimal assumptions, it is environment correlations across geographic regions affect
challenging to attribute our near-parity estimates to confounding. genome-wide association studies. Nat. Genet. 54, 1345–1354
These differences might result from how genotypes are scaled and cen- (2022).
tered, as shown by the importance of appropriate centering in ANCHOR 8. Duncan, L. et al. Analysis of polygenic risk score usage and
(Extended Data Fig. 8 and Supplementary Figs. 8 and 9). Alternatively, performance in diverse human populations. Nat. Commun. 10,
if causal variants have unique evolutionary histories because of natural 3328 (2019).
selection, this might impact some methods more than others. Further 9. Mostafavi, H. et al. Variable prediction accuracy of polygenic
study is needed to understand these discrepancies. scores within an ancestry group. eLife 9, e48376 (2020).
10. Brown, B. C., Ye, C. J., Price, A. L. & Zaitlen, N. Transethnic 33. Lawson, D. J., Hellenthal, G., Myers, S. & Falush, D. Inference of
genetic-correlation estimates from summary statistics. Am. J. population structure using dense haplotype data. PLoS Genet. 8,
Hum. Genet. 99, 76–88 (2016). e1002453 (2012).
11. Bitarello, B. D. & Mathieson, I. Polygenic scores for height in 34. Loh, P.-R. et al. Efficient Bayesian mixed-model analysis increases
admixed populations. G3 (Bethesda) 10, 4027–4036 association power in large cohorts. Nat. Genet. 47, 284–290
(2020). (2015).
12. Patel, R. A. et al. Genetic interactions drive heterogeneity in 35. Krausova, A. & Vargas-Silva, C. Wales: census profile.
causal variant effect sizes for gene expression and complex traits. Migration Observatory at the University of Oxford Briefing
Am. J. Hum. Genet. 109, 1286–1297 (2022). https://migrationobservatory.ox.ac.uk/resources/briefings/
13. Marnetto, D. et al. Ancestry deconvolution and partial polygenic wales-census-profile/ (2013).
score can improve susceptibility predictions in recently admixed 36. Privé, F., Luu, K., Blum, M. G., McGrath, J. J. & Vilhjálmsson, B.
individuals. Nat. Commun. 11, 1628 (2020). J. Efficient toolkit implementing best practices for principal
14. Hou, H. et al. Causal effects on complex traits are similar for component analysis of population genetic data. Bioinformatics
common variants across segments of different continental 36, 4449–4457 (2020).
ancestries within admixed individuals. Nat. Genet. 55, 549–558 37. Bulik-Sullivan, B. K. et al. LD Score regression distinguishes
(2023). confounding from polygenicity in genome-wide association
15. Weissbrod, O. et al. Leveraging fine-mapping and multipopulation studies. Nat. Genet. 47, 291–295 (2015).
training data to improve cross-population polygenic risk scores. 38. Loh, P.-R., Kichaev, G., Gazal, S., Schoech, A. P. & Price, A. L.
Nat. Genet. 54, 450–458 (2022). Mixed-model association for biobank-scale datasets. Nat. Genet.
16. Privé, F. et al. Portability of 245 polygenic scores when derived 50, 906–908 (2018).
from the UK Biobank and applied to 9 ancestry groups from the 39. Cook, J. P., Mahajan, A. & Morris, A. P. Fine-scale population
same cohort. Am. J. Hum. Genet. 109, 12–23 (2022). structure in the UK Biobank: implications for genome-wide
17. Márquez‐Luna, C., Loh, P. R., South Asian Type 2 Diabetes (SAT2D) association studies. Hum. Mol. Genet. 29, 2803–2811 (2020).
Consortium, SIGMA Type 2 Diabetes Consortium & Price, A. 40. Kawasaki, T. & Kawai, T. Toll-like receptor signaling pathways.
L. Multiethnic polygenic risk scores improve risk prediction in Front. Immunol. 5, 461 (2014).
diverse populations. Genet. Epidemiol. 41, 811–823 (2017). 41. Mikacenic, C., Reiner, A. P., Holden, T. D., Nickerson, D. A. & Wurfel,
18. Martin, A. R. et al. Current clinical use of polygenic scores will M. M. Variation in the TLR10/TLR1/TLR6 locus is the major genetic
risk exacerbating health disparities. Nat. Genet. 51, 584–591 determinant of interindividual difference in TLR1/2-mediated
(2019). responses. Genes Immun. 14, 52–57 (2013).
19. Wang, Y. et al. Theoretical and empirical quantification of the 42. Waage, J. et al. Genome-wide association and HLA fine-mapping
accuracy of polygenic scores in ancestry divergent populations. studies identify risk loci and genetic pathways underlying allergic
Nat. Commun. 11, 3865 (2020). rhinitis. Nat. Genet. 50, 1072–1080 (2018).
20. Carlson, C. S. et al. Generalization and dilution of association 43. Churchhouse, C. et al. Rapid GWAS of thousands of
results from European GWAS in populations of non-European phenotypes for 337,000 samples in the UK Biobank.
ancestry: the PAGE study. PLoS Biol. 11, e1001661 (2013). Neale Lab Blog www.nealelab.is/blog/2017/7/19/
21. Atkinson, E. G. Estimation of cross-ancestry genetic correlations rapid-gwas-of-thousands-of-phenotypes-for-
within ancestry tracts of admixed samples. Nat. Genet. 55, 337000-samples-in-the-uk-biobank (2019).
527–529 (2023). 44. Bycroft, C. et al. Patterns of genetic differentiation and the
22. Hellenthal, G. et al. A genetic atlas of human admixture history. footprints of historical migrations in the Iberian Peninsula. Nat.
Science 343, 747–751 (2014). Commun. 10, 551 (2019).
23. Leslie, S. et al. The fine-scale genetic structure of the British 45. Martin, S. et al. Genetic evidence for different adiposity
population. Nature 519, 309–314 (2015). phenotypes and their opposing influences on ectopic fat and risk
24. Haworth, S. et al. Apparent latent structure within the UK Biobank of cardiometabolic disease. Diabetes 70, 1843–1856 (2021).
sample has implications for epidemiological analysis. Nat. 46. Christakoudi, S., Evangelou, E., Riboli, E. & Tsilidis, K. K. GWAS
Commun. 10, 333 (2019). of allometric body-shape indices in UK Biobank identifies loci
25. Bycroft, C. et al. The UK Biobank resource with deep phenotyping suggesting associations with morphogenesis, organogenesis,
and genomic data. Nature 562, 203–209 (2018). adrenal cell renewal and cancer. Sci. Rep. 11, 10688 (2021).
26. Patterson, N., Price, A. L. & Reich, D. Population structure and 47. Graff, M. et al. Genome-wide physical activity interactions in
eigenanalysis. PLoS Genet. 2, e190 (2006). adiposity―A meta-analysis of 200,452 adults. PLoS Genet. 13,
27. Cai, M. et al. A unified framework for cross-population trait e1006528 (2017).
prediction by leveraging the genetic correlation of polygenic 48. Price, A. L. et al. Sensitive detection of chromosomal segments
traits. Am. J. Hum. Genet. 108, 632–655 (2021). of distinct ancestry in admixed populations. PLoS Genet. 5,
28. Cann, H. M. et al. A human genome diversity cell line panel. e1000519 (2009).
Science 296, 261–262 (2002). 49. Torres, L. A., Martinez, F. E. & Manço, J. C. Correlation between
29. Schlebusch, C. M. et al. Genomic variation in seven Khoe-San standing height, sitting height, and arm span as an index of
groups reveals adaptation and complex African history. Science pulmonary function in 6–10‐year‐old children. Pediatr. Pulmonol.
338, 374–379 (2012). 36, 202–208 (2003).
30. Pagani, L. et al. Ethiopian genetic diversity reveals linguistic 50. Lum, S. et al. Lung function in children in relation to ethnicity, phy
stratification and complex influences on the Ethiopian gene pool. sique and socioeconomic factors. Eur. Respir. J. 46, 1662–1671 (2015).
Am. J. Hum. Genet. 91, 83–96 (2012). 51. Momin, M. M. et al. A method for an unbiased estimate of
31. Novembre, J. et al. Genes mirror geography within Europe. Nature cross-ancestry genetic correlation using individual-level data.
456, 98–101 (2008). Nat. Commun. 14, 722 (2023).
32. Delaneau, O., Zagury, J.-F. & Marchini, J. Improved 52. Shi, H. et al. Population-specific causal disease effect sizes in
whole-chromosome phasing for disease and population genetic functionally important regions impacted by selection. Nat.
studies. Nat. Methods 10, 5–6 (2012). Commun. 12, 1098 (2021).
53. Yair, S. & Coop, G. Population differentiation of polygenic score Open Access This article is licensed under a Creative Commons
predictions under stabilizing selection. Philos. Trans. R. Soc. Lond. Attribution 4.0 International License, which permits use, sharing,
B Biol. Sci. 377, 20200416 (2022). adaptation, distribution and reproduction in any medium or format,
54. Galinsky, K. J. et al. Estimating cross‐population genetic as long as you give appropriate credit to the original author(s) and the
correlations of causal effect sizes. Genet. Epidemiol. 43, 180–188 source, provide a link to the Creative Commons licence, and indicate
(2019). if changes were made. The images or other third party material in this
55. Guo, J. et al. Quantifying genetic heterogeneity between article are included in the article’s Creative Commons licence, unless
continental populations for human height and body mass index. indicated otherwise in a credit line to the material. If material is not
Sci. Rep. 11, 5240 (2021). included in the article’s Creative Commons licence and your intended
56. LaPierre, N. et al. Identifying causal variants by fine mapping use is not permitted by statutory regulation or exceeds the permitted
across multiple studies. PLoS Genet. 17, e1009733 (2021). use, you will need to obtain permission directly from the copyright
holder. To view a copy of this licence, visit http://creativecommons.
Publisher’s note Springer Nature remains neutral with regard org/licenses/by/4.0/.
to jurisdictional claims in published maps and institutional
affiliations. © The Author(s) 2025
Nature Genetics
Article https://doi.org/10.1038/s41588-024-02035-8
Nature Genetics
Article https://doi.org/10.1038/s41588-024-02035-8
PGS was calculated as a linear sum over SNPs and genotypes: is each of the four possibilities, ordering the two chromosomes arbi-
n
trarily (for example, ‘EE’ refers to both chromosomes possessing Euro-
PGSAC = ∑ γˆ
AC
ji
Gj (6) pean ancestry). Given genotype Gij at site j for individual i, we estimate
j=1 allele frequencies f Ej , f Aj for European and African background, respec-
tively, by fitting the model
n
PGSPC = ∑ γˆ
PC EE EA AA EA
ji
Gj (7) Gij = f Ej (2Pij +Pij + PAE
ij
) + fAj (2Pij +Pij + PAE
ij
) + εi (9)
j=1
Nature Genetics
Article https://doi.org/10.1038/s41588-024-02035-8
in this case, conditional on the ratio βEu /(βEu + βAf ) the model is again distributed along the genome. Effects sizes of the causal variants are
linear, so we use a grid search (1,000 values) over this ratio from 0 to 1, drawn independently either from a N(0,1) distribution, or following a
fitting the linear model for each possible value and then minimizing LDAK model64 where the effect size of variant j is a draw from a standard
the achieved sum of squares. normal multiplied by a factor [2pj (1 − pj)−0.25], where pj is that variant’s
Parameters βEu and βAf measure the increase in the phenotype per frequency, resulting in larger effects for rarer variants. This generates
unit increase in the respective PGS, indicating the scores’ predictive 3 × 2 × 2 = 12 scenarios; we simulated two different heritabilities 0.3,
power. Local factors mean that we expect βEu > βAf (Supplementary 0.6 resulting in 24 simulations in total.
Note 2; Extended Data Fig. 9 shows this via simulation). If no addi- Finally, to generate simulated phenotypes Yi for individual i = 1,
tional nonlocal factors contribute to nonportability, then βEu and βAf 2, … N, we apply the following additive model which leverages the
should remain constant across ancestry bins, that is varying θi, with βEu actual UKB genotypes and so matches properties of these data, includ-
also shared between African-ancestry individuals and 100% European ing in particular population stratification:
ancestry individuals (Supplementary Note 2). With nonlocal factors
J
acting, this no longer holds. To investigate this setting, we allow the Yi = ∑ γj gij + εi (16)
predictive power of the two scores varies with genome-wide European j=1
Nature Genetics
Article https://doi.org/10.1038/s41588-024-02035-8
genetic data used in constructing the ancestry pipeline is provided 66. Hu, S., Lawson, D. & Myers, S. fuopen/UKB_anc: Source code of
by third parties and is available for use by others. Variant Frequency analysis for reproducing the main results/figures in ANCHOR
information for every SNP in each genetic grouping is available at paper. v0.2 edn. Zenodo https://doi.org/10.5281/zenodo.14026928
the University of Bristol data repository, data.bris, at https://doi. (2024).
org/10.5523/bris.3g5oatl682kz82as80jakjrq91. All other resources
were downloaded from their respective websites without registration Acknowledgements
requirements. The following files have been returned to UK Biobank This research has been conducted using the UK Biobank Resource
so that they might be made available to other researchers: (1) ACs under application number 27960. We thank all the participants of
on all UK Biobank participants; (2) group-specific allele frequency the UK Biobank project. POPRES was accessed under application
estimates for 25 M variants; (3) (Mean-centered) European/African number 16508: ‘Admixture and Selection for Fine-Scale Population
genotypes and local ancestry of 8,003 UK Biobank African ancestry Genetics of Europe’. This work was supported by the Wellcome Trust
individuals (including the variants annotation information); (4) Euro- grant no. 200186/Z/15/Z awarded to J.M., S.R.M. and G.H., grant no.
pean/African PGS for 8,003 African ancestry individuals across 53 UK 224575/Z/21/Z awarded to G.H. and grant no. 212284/Z/18/Z awarded
Biobank phenotypes. GWAS summary statistics files with population to S.R.M. L.A.F.F. is supported by the Wellcome Trust grant no.
structure corrected by ACs and PCs are available in GWAS catalog 222334/Z/21/Z. Computations used the high-performance computing
(https://www.ebi.ac.uk/gwas/, GCST90310137–GCST90310200 and facilities at the Department of Statistics, University of Oxford and
GCST90429571–GCST90429610). Oxford Biomedical Research Computing facility, a joint development
between the Wellcome Centre for Human Genetics and the Big Data
Code availability Institute supported by Health Data Research UK and the NIHR Oxford
The ANCHOR software package is available via GitHub at https://github. Biomedical Research Centre. This work was carried out using the
com/MyersGroup/ANCHOR and Zenodo at https://doi.org/10.5281/ computational facilities of the Advanced Computing Research Centre,
zenodo.13847648 (ref. 65). Analysis scripts can be downloaded via University of Bristol—http://www.bris.ac.uk/acrc.
GitHub at http://github.com/fuopen/UKB_anc and Zenodo at https://
doi.org/10.5281/zenodo.14026928 (ref. 66). External software/pack- Author contributions
ages used in this study are available via GitHub at https://github.com/ S.R.M., D.J.L., J.M. and G.H. designed the study. S.H., G.H., J.M., D.J.L.
danjlawson/pcapred.largedata (‘pcapred.largedata’) and https:// and S.R.M. developed the methods. S.H., G.H., J.M., D.J.L. and S.R.M.
github.com/danjlawson/pcapred (‘pcapred’). performed the main analyses with contributions for specific analyses
from L.A.F.F. and S.S. S.H., G.H., J.M., D.J.L. and S.R.M. wrote the
References manuscript.
57. Chang, C. C. et al. Second-generation PLINK: rising to the
challenge of larger and richer datasets. Gigascience 4, s13742- Competing interests
015-0047-8 (2015). S.H. became a full-time employee of Novo Nordisk Ltd during the
58. Walter, K. et al. The UK10K project identifies rare variants in health drafting of this manuscript. J.M. is a current employee and stockholder
and disease. Nature 526, 82–90 (2015). of Regeneron Pharmaceuticals. The other authors declare no
59. Abraham, G., Qiu, Y. & Inouye, M. FlashPCA2: principal competing interests.
component analysis of Biobank-scale genotype datasets.
Bioinformatics 33, 2776–2778 (2017). Additional information
60. Mountjoy, E. et al. An open approach to systematically prioritize Extended data is available for this paper at https://doi.org/10.1038/
causal variants and genes at all published human GWAS s41588-024-02035-8.
trait-associated loci. Nat. Genet. 53, 1527–1533 (2021).
61. Band, G. & Marchini, J. BGEN: a binary file format for imputed Supplementary information The online version contains
genotype and haplotype data. Preprint at BioRxiv https://doi. supplementary material available at https://doi.org/10.1038/s41588-
org/10.1101/308296 (2018). 024-02035-8.
62. International HeatMap 3 Consortium. Integrating common and
rare genetic variation in diverse human populations. Nature 467, Correspondence and requests for materials should be addressed to
52–58 (2010). Sile Hu, Daniel J. Lawson or Simon R. Myers.
63. 1000 Genomes Project Consortium. A global reference for human
genetic variation. Nature 526, 68–74 (2015). Peer review information Nature Genetics thanks Hakhamanesh
64. Speed, D. et al. Reevaluation of SNP heritability in complex Mostafavi and Benjamin Neale for their contribution to the peer review
human traits. Nat. Genet. 49, 986–992 (2017). of this work. Peer reviewer reports are available.
65. Hu, S. & Myers, S. MyersGroup/ANCHOR: ANCHOR v0.1-beta.1.
v0.1 edn. Zenodo https://doi.org/10.5281/zenodo.13847648 Reprints and permissions information is available at
(2024). www.nature.com/reprints.
Nature Genetics
Article https://doi.org/10.1038/s41588-024-02035-8
Extended Data Fig. 1 | Heatmap of regional mean ancestry proportions of 30 UK+ EU ancestries based on birthplace among UK-born and Ireland-born UKB
participants. One panel per ancestry, brightest red colour on maps indicates highest ancestry, darkest blue lowest ancestry.
Nature Genetics
Article https://doi.org/10.1038/s41588-024-02035-8
c. Africa d. Americas
Extended Data Fig. 2 | World ancestries in the UKB inferred by the ancestry Whi = White, Ire = Irish, Chi = Chinese, Asi = Asian, Pa = Pakistani, Ind = Indian,
pipeline. UKB participants are mapped to their self-reported country of birth. Mix = Mixed, Car = Caribbean, and Oth = Other Ethnic Group. a). Ancestries in
World countries are coloured by different colours if they are present in the Central Asia; b). Ancestries in Europe; c). Ancestries in Africa; d). Ancestries in the
pipeline. Ethnic groups are abbreviated as follows in the bar plots: Bri = British, Americas. White lines on the map delineate the borders of different countries.
Nature Genetics
Article https://doi.org/10.1038/s41588-024-02035-8
1.8
Aberdeen (n=833)
Fife
1.6
Angus
Perthshire and Kinross
Dundee (2,649)
Glasgow(12,132)
Edinburgh (8,022)
Hartlepool
Stockon-on-Tees
West Yorkshire(30,120) Middlesbrough (1,894)
Liverpool (14,683) East Riding of Yorkshire 1.2
Manchester(31,359) North East Lincolnshire
(1,439)
Birmingham (12,434)
Ceredigion
Carmarthenshire 0.8
London (41,325)
(935)
Cardiff (5,578)
Oxfordshire (7,366)
Plymouth (944)
Isle of Wight (258)
Bristol (7,487)
Extended Data Fig. 3 | Heatmap of regional mean entropy statistic across and areas with low entropy are coloured by blue. Selected regions of high or low
the UK and Republic of Ireland. Regional mean entropy indicates varying entropy are highlighted by the boundaries, with the barplots detailing ancestry
degrees of admixture in the UK. Areas with high entropy are coloured by red decomposition for each such region.
Nature Genetics
Article https://doi.org/10.1038/s41588-024-02035-8
a b
1.0
Place of birth (North/South)
Place of birth (East/West)
1.6 Place of birthe (North/South)
Place of birth (East/West)
Home location(East/West)
0.8
Home location (North/South)
0.6
Home location (North/South)
0.4
1.2
0.2
Employment score (England)
Education score (England)
Housing score (England)
1.0
0.0
1.0 1.2 1.4 1.6 0.0 0.2 0.4 0.6 0.8 1.0
LDSC Intercept(AC) LDSC attenuation ratio(AC)
Extended Data Fig. 4 | Comparison of inflation between ACs-corrected corrected GWAS >0.015; b). Comparison of the LDSC attenuation ratio between
GWAS and PC-corrected GWAS across 104 quantitative traits by running LD AC-corrected GWAS (x-axis) and PC-correct GWAS (y-axis), where red dots
score regression (LDSC). a). Comparison of the LDSC intercept between AC- indicate phenotypes with difference of attenuation ratio between PC-corrected
corrected GWAS (x-axis) and PC-corrected GWAS (y-axis) where red dots indicate GWAS and AC-corrected GWAS >0.02.
phenotypes with intercept difference between PC-corrected GWAS and AC-
Nature Genetics