Modeling Intercellular Communication in Tissues Using Spatial Graphs of Cells
Modeling Intercellular Communication in Tissues Using Spatial Graphs of Cells
https://doi.org/10.1038/s41587-022-01467-z
Models of intercellular communication in tissues are based on 69 µm across the six datasets (Fig. 1c), showing that cell–cell depen-
molecular profiles of dissociated cells, are limited to recep- dencies appear on length scales characteristic of molecular mecha-
tor–ligand signaling and ignore spatial proximity in situ. We nisms of cell communication. NCEMs outperformed nonspatial
present node-centric expression modeling, a method based baseline models consistently by an average ΔR2 (Online Methods)
on graph neural networks that estimates the effects of niche of 0.016 (Fig. 1c). As expected, the ΔR2 is small compared to the
composition on gene expression in an unbiased manner from baseline model R2 that characterizes between-cell-type variance
spatial molecular profiling data. We recover signatures of (0.39–0.79) because cell type identity accounts for a large fraction
molecular processes known to underlie cell communication. of variance in single-cell gene expression assays17. The inferred
Cells interact on multiple length-scales through direct contact of length scales were robust to data downsampling (Extended Data
surface-bound receptors and ligands, tight junctions and mechani- Fig. 3a), out-of-domain data from an unseen genetic knockout con-
cal effects, and through indirect mechanisms, including soluble dition (Extended Data Fig. 3b–e), to simulated segmentation errors
factors. Usually, these communication events cannot be directly (Extended Data Fig. 3f,g) and to removal of the interaction terms
observed but are critical to understand emergent phenomena in tis- from the linear model (Supplementary Fig. 2). The spatial effect on
sue niches1. Molecular signatures of sender and receiver cell types model performance varies between target cell types, suggesting that
are used to infer latent cell communication events in a tissue through cell-type-specific molecular mediators of cell–cell dependency are
co-occurrence of ligand and receptor expression across putatively captured (Supplementary Fig. 3).
communicating cell types2,3 and through gene expression signatures NCEMs can be extended to spot transcriptomics if
in the receiving cell2,4. Here we propose node-centric expression within-cell-type variation can be recovered from spot transcrip-
models (NCEM) to improve cell communication inference through tomics datasets in deconvolution analyses18,19. Here NCEMs model
the use of spatial graphs of cells to constrain axes of cellular com- the expression variation within cell types across spots as a function
munication. We infer cell communication from image-structured of the inferred spot composition (Fig. 1d and Methods). We con-
molecular profiling assays of RNA or proteins with subcellular reso- sidered data from lymph nodes18,19 (Extended Data Fig. 4a,b) for
lution (Fig. 1a). We defined an NCEM as a graph neural network which a deconvolution was previously demonstrated with cell2loca-
that predicts a cell’s observed gene expression vector from its cell tion19. Linear NCEMs were substantially better at predicting gene
type label and its niche5 (Fig. 1b and Methods). Cell–cell dependen- expression states of cell types in particular spots than a nonspatial
cies may be caused by diverse molecular mechanisms not limited baseline model, both globally (Fig. 1e) and for each cell type (Fig. 1f).
to ligand–receptor-based communication. Therefore, we consider The inferred couplings were stable to moderate subsampling of the
the effects of niche composition on all genes in an unbiased man- transcriptomics spots in the training data (Extended Data Fig. 4d).
ner. Previous mathematical models of cell–cell interactions in spa- We found putatively interacting ligand–receptor pairs for almost all
tial data differed in at least one out of the following central design type couplings in CellPhoneDB3 and NicheNet2 analyses of matched
choices that constitute an NCEM (Supplementary Table 1): they did single-cell RNA sequencing (scRNA-seq) data, thus demonstrat-
not represent statistical dependencies of gene expression6,7, did not ing the need for a quantitative description of statistical couplings
model cell communication events6–8, did not work on targeted pro- in niches (Extended Data Fig. 4e–g). We also identified spatial
tocols with limited ligand and receptor gene capture8–10 or relied on dependencies between entire niches when modeling spot graphs
leave-one-gene hold-outs4,9, which can result in false discoveries of (Supplementary Fig. 4).
dependencies (Extended Data Fig. 1 and Methods). Next we interpreted the spatial dependencies in the MERFISH
We demonstrate cell communication inference with NCEMs on brain data. We found that L2/3 intratelencephalic (IT) cells depend
six datasets measured with MERFISH11,12, CODEX13, MIBI-TOF14, on the presence of Sncg expressing cells, vascular leptomenin-
MELC15 and chip cytometry16 (Extended Data Fig. 2 and Methods). geal cells, and L4/5 cells in their niche (Extended Data Fig. 5 and
On average, intracell-type variance accounted for 40.6% of the total Supplementary Fig. 5). These associations are reproduced by
variance (Supplementary Fig. 1 and Methods). We defined the CellPhoneDB (Supplementary Fig. 6a,b). The L2/3 IT cell subclus-
screened neighborhood sizes such that they cover the range of aver- ters are spatially localized in distinct areas of the primary motor
age node degrees of the given dataset (Extended Data Fig. 2b). We cortex12. Indeed, the relative performance of NCEM is spatially
fit a linear model of gene expression based on a niche represented structured (Extended Data Fig. 5f and Supplementary Fig. 5e). We
as interaction terms between the receiver cell type and the presence quantified these dependencies between cell types as ‘cell type cou-
of each (sender) cell type in the neighborhood (Methods). Linear plings’, the number of significant gene-wise coefficients of the cell
NCEMs were most predictive on an intermediate length scale of type pair in an NCEM fit (Extended Data Fig. 5g and Methods).
Institute of Computational Biology, Helmholtz Zentrum München, Neuherberg, Germany. 2TUM School of Life Sciences Weihenstephan, Technical
1
University of Munich, Freising, Germany. 3Department of Mathematics, Technical University of Munich, Garching bei München, Germany. 4These authors
contributed equally: David S. Fischer, Anna C. Schaar. ✉e-mail: [email protected]
b
4 Self Niche Repression
2
Gene
f( 1 , ( , , )) =
expression 3
1
cell type
Sender
f( 2 , ( , , )) =
f
NCEM f
Linear NCEM Effect
f f( 3 , ( , , )) = summary by
2 4 cell-type pair
Cell types f( 4 , ( , , )) = Receiver
and spatial cell type
context 3
1
c
Fetal liver wild Colon
Brain (MERFISH) Cancer (MIBI-TOF) Tonsils (MELC) Cancer (CODEX)
type (MERFISH) (chip cytometry)
0.460 0.630 Example
0.760 cell radius
Variance explained (R 2)
0.565 0.525
0.805
0.440 0.628 Model:
0.520
0.560 0.750 Baseline
0.420
0.515 0.800 NCEM
0.626
0.555 Split:
0.510 0.740 0.400
0.795 0.624 1
0.550 0.505
0.380 2
0.730 3
10
50
200
1,000
0
10
200
2,500
10
50
200
1,000
0
10
200
6,000
10
50
200
1,000
0
5
10
50
200
* * * * * *
Resolution (μm)
d e f
es
yp
Baseline
Spots Cell types genes 0.800
lt
Variance explained (R 2 )
el
C
NCEM
Spot i
0.700
i
Spots
Spots
0.600
NCEM
B cells
T Treg
NKT
Macrophages
DC
CD8 T cells
CD4 T cells
FDC
T TfR
T TIM
ILC
Monocytes
NK
VSMC
Mast
Endo
Fig. 1 | Node-centric expression models capture statistical dependencies between cells in space. a, Spatial graphs of cells are based on segmentation
masks of cells in spatial molecular profiling data. Resolution is the radius of neighborhood used to define a niche. Numbers label cells and are used
in Fig. 1b. b, NCEMs describe the gene expression observation of a cell as a function (f) of its spatial neighborhood (niche). c, Linear models capture
neighborhood dependencies in spatially resolved single-cell data. Shown are the R2 values between predicted and observed expression vectors on held-out
test cells by resolution for six datasets. Green line, 10 µm; baseline (blue points with cross-validation split indicated as shape), a nonspatial linear model;
bracket (*), significant difference in paired t-test. d, Variation in deconvoluted expression vectors over spots for a given cell type can be attributed to spot
composition with a linear NCEM. A, spot adjacency matrix. e,f, NCEM performance on deconvoluted data. Shown are the R2 values between predicted
and observed gene expression vectors for held-out test spots of a linear NCEM in comparison to a baseline model that does not use the spot composition
information. The performance is shown across the entire test set (e) and split by cell type (f) (n = 3 cross-validation splits). For each box in (e,f), the
centerline defines the median, the height of the box is given by the interquartile range (IQR), the whiskers are given by 1.5 × IQR and outliers are given as
points beyond the minimum or maximum whisker.
a b
Endo
0 0 0
Subcluster
FD
1
DC
2
1 1 lls
C
ce
UMAP2
2 2
3 T
4 3 3 8
CD
IL
C
UMAP1 Ma
cro s
B cells pha cell
ges 4T
CD4 T cells 6 CD
CD8 T cells 8
DC 8
Abundance in neighborhood
5
Endo
FDC Mast B cells
ILC 6 6 4
Macrophages
Mast
3
Monocytes 4 4 VS
tes MC
NK ocy
2 Mon
NKT
T TIM 2 2
T TfR
NK
T
1
Tr
T Treg
eg
VSMC
TT
T
NK
0 1 2 3 4 0 1 2 3 0 1 2 3
fR
T TIM
B cell subclusters FDC subclusters Mast subclusters
c d e f Correlation
Fold change Fold change
–0.10 –0.15 0 0.05 0.10 –0.4 –0.2 0 0.2 0.4 0 0.5 1.0
14
LTF LTF
CXCL13
FTL FTL
12 B cells
–log10 FDR-corrected P values
Sender
CXCL13 CXCL13 IGHG3
NKT
EEF1A1 EEF1A1 Monocytes
IGFBP7 IGFBP7 6 FDC
RPSA RPSA DC
CD52 CD52 Macrophages
RPS27 RPS27 4 Endo
FDCSP FDCSP IGHA1 VSMC
IGLC3 IGLC3 Mast
IGHA1 IGHA1 2
T TIM
IGHG1 IGHG1
IGHG3
Macrophages
CD4 T cells
CD8 T cells
Monocytes
IGHG3 0
B cells
T Treg
VSMC
T TIM
T TfR
Endo
Mast
NKT
FDC
ILC
DC
NK
T TIM
T TfR
T Treg
B cells
ILC
VSMC
CD4 T cells
CD8 T cells
DC
Endo
Macrophages
Mast
Monocytes
NK
NKT
Endo
FDC
Mast
Monocytes
NKT
T TIM
T TfR
T Treg
VSMC
Macrophages
ILC
DC
NK
i 0.800
i
f( , )= Ŷ
0.700
0.715
0.950
0.800
0.710
0.900
Graph 0.705 0.795
attention 0.850
Receptor Ligand network Decoder 0.700 0.790
genes Cells genes 0.800
All genes
G = (A, YR , YL) Ŷ
0
5
10
50
200
0
5
10
50
200
0
5
10
50
200
Cell i Cell i
i i
* * *
Cells
YR YL
Cells
A Y Resolution (μm)
We discovered a dependency of CD8 T cells on multiple other cell compared to categorical cell type labels. We observed differential
types in the chip cytometry colon data (Extended Data Fig. 6) and receptor signaling as differential latent neuron activation of Kit11 in
a dependency of CD8 T cells on proximity to the tumor–immune sinusoidal endothelial cells (SECs) depending on their proximity to
boundary14 in colorectal cancer (Extended Data Fig. 7). arterial endothelial cells (AECs) (Supplementary Data 2).
Similarly, we interpreted NCEM fits on the deconvoluted Visium NCEMs are linear and nonlinear graph neural networks and
lymph node data. We identified a bidirectional dependency of B CVAEs that model cell communication events in spatial omics
cells and follicular dendritic cells (FDCs) that is indicative of posi- assays (Supplementary Table 2). We identified statistical depen-
tive feedback between both cell types in germinal center organiza- dencies between cells on physiologically relevant length scales and
tion20 (Fig. 2a,b and Extended Data Fig. 4c). Similarly, we found interpreted fits based on model parameters. The statistical identifi-
evidence for a dependency of mast cells on B cells (Fig. 2b) and ability of cell type couplings may improve with increased capture
a mast cell subcluster associated with niches enriched in B cells of niche heterogeneity, through the inclusion of three-dimensional
(Fig. 2a). The FDC subcluster associated with niches enriched in data, by increasing the number of cells measured and by increas-
B cells (cluster 3) showed increased expression of Cxcl13, a key ing the variation in the training data through perturbations of
chemokine for germinal centers20 (Extended Data Fig. 4c), support- niche structure. Uncertainty in segmentation of cells or nuclei can
ing the association of these couplings with germinal centers. We be improved on the level of the spatial measurement22 or may be
further dissected these couplings based on the gene-wise effects of addressed in model extensions. We found that linear NCEMs per-
all senders on one receiver type (‘receiver effect analysis’, Fig. 2c) form well in the presented tasks and are promising candidates for
and of one sender on all receivers (‘sender effect analysis’, Fig. 2d), cell communication inference. The complexity of the graph neural
which contextualizes differential expression results of the FDC–B network used in the NCEM defines the complexity of the motifs of
cell axis (Fig. 2e and Supplementary Data 1). Multiple T cell clusters cell communication that can be discovered and may be expanded
had a similar effect on B cells in a ‘sender similarity analysis’ given more complex datasets. CVAE–NCEMs may be used to model
(Fig. 2f), in which we correlated the coefficient vectors of sender cell-intrinsic variation together with niche effects. Similarly, the
cell types that correspond to B cell receivers, which demonstrates graph kernels tailored to ligand–receptor signaling presented here
conservation of cell type identity in the sender profile. provide constrained latent variables that explain extrinsic variation
In contrast to linear NCEMs, nonlinear NCEMs can account for and could be used together with variables for cell-intrinsic varia-
weighted or higher-order interactions between cell types (Extended tion. Nonlinear NCEMs learn a cellular representation within the
Data Fig. 8a). As for linear NCEMs, we found resolution-dependent spatial graph23, and we demonstrated that these representations
prediction performance profiles in nonlinear NCEMs (Extended can model niches and may be exploited for unsupervised analysis
Data Fig. 8b and Methods) and a dependency between L4/5 IT and of tissue structures.
L2/3 IT cells (Extended Data Fig. 8c and Methods). Notably, the
nonlinear models did not outperform linear models in gene expres- Online content
sion prediction, which suggests that the spatial dependencies in the Any methods, additional references, Nature Research report-
given datasets are well described by linear models (Extended Data ing summaries, source data, extended data, supplementary infor-
Fig. 8b). Next, we considered a conditional variational autoencoder mation, acknowledgements, peer review information; details of
(CVAE) version of an NCEM to model cell-intrinsic latent states. author contributions and competing interests; and statements of
We conditioned the distribution over node expression states on a data and code availability are available at https://doi.org/10.1038/
graph embedding of the niche and the cell type (Extended Data s41587-022-01467-z.
Fig. 9a). Even though CVAE–NCEM attained much higher predictive
performance in reconstruction tasks (Extended Data Fig. 9b and Received: 1 July 2021; Accepted: 11 August 2022;
Supplementary Fig. 7a), these models did not consistently capture Published online: 27 October 2022
spatial dependencies because niche states were represented in latent
variables (Extended Data Fig. 9c–f and Supplementary Fig. 7b–e). References
The interpretation of spatial dependencies inferred on targeted 1. Palla, G., Fischer, D. S., Regev, A. & Theis, F. J. Spatial components of
spatial molecular profiling assays is constrained by the limited molecular tissue biology. Nat. Biotechnol. 40, 308–318 (2022).
2. Browaeys, R., Saelens, W. & Saeys, Y. NicheNet: modeling intercellular
capture of ligand–receptor pairs. We imputed the cell-wise gene communication by linking ligands to target genes. Nat. Methods 17,
expression in the MERFISH fetal liver data using corresponding 159–162 (2020).
scRNA-seq data21 (Extended Data Fig. 10). We designed a graph 3. Efremova, M., Vento-Tormo, M., Teichmann, S. A. & Vento-Tormo, R.
kernel of cell-wise receptor activity to directly model ligand–recep- CellPhoneDB: inferring cell–cell communication from combined
tor interactions between neighboring cells (Fig. 2g). The peak expression of multi-subunit ligand–receptor complexes. Nat. Protoc. 15,
1484–1506 (2020).
predictive performance of the ligand–receptor nonlinear NCEM 4. Arnol, D., Schapiro, D., Bodenmiller, B., Saez-Rodriguez, J. & Stegle, O.
was much higher compared to the nonlinear NCEM (R2 of 0.799 Modeling cell-cell interactions from spatial molecular data with spatial
and 0.947), demonstrating the increased complexity of the input variance component analysis. Cell Rep. 29, 202–211 (2019).
subclasses of GABAergic cells. We removed cells labeled as ‘other’ from the dataset. N J 2 N J 2 N J 2
We used an identifier for the respective mouse as domain information. = yi,j − ȳk(i),j + ȳk(i),j − ȳj + ȳj − ȳ
i j i j i j
Colon (chip cytometry). Jarosch et al.16 measured an inflamed colon with chip intracell− type variance intercell− type variance gene variance
cytometry in two images from one patient, with 19 genes observed in 11,321 (1)
cells. We used the cell types originally annotated by Jarosch et al.: B cells, CD4
T cells, CD8 T cells, GATA3+ epithelial, Ki67 high epithelial, Ki67 low epithelial, where yi,j is the gene expression of cell i out of N and gene j out of J, x̄k,j is the mean
lamina propria cells, macrophages, monocytes, PD-L1+ cells, intraepithelial expression of each gene j in each cell type k, k(i) is the cell type of cell i, ȳj is the
lymphocytes, muscular cells and other lymphocytes (Extended Data Fig. 2a). We mean expression of each gene j and ȳ is the mean of the dataset.
coarsened the cell type annotation by combining Ki67 high epithelial and Ki67 low
epithelial to a joined annotation of Ki67 epithelial. We log-transformed the gene Simulations. Segmentation errors. We simulated segmentation errors in which
expression values for use in the analyses presented here to mitigate count noise the segment boundary between two neighboring cells is misplaced (Extended
(Supplementary Fig. 8b). Data Fig. 3f,g). We selected a fraction of cells (10% or 50%) in the MERFISH fetal
liver data at random, selecting one neighbor at random for each selected cell, and
Cancer (MIBI-TOF). Hartmann et al.14 measured colorectal carcinoma and healthy transferred a fraction of the total molecular abundance vector of the selected cell to
adjacent tissue with MIBI-TOF in 58 images across four individuals, with 36 genes its neighbor.
observed in 63,747 cells. We used the cell types originally annotated by Hartmann
et al.: endothelial, epithelial, fibroblast, CD11c myeloid, CD68 myeloid, CD4 Spatial dependencies. We simulated single-cell resolved spatial data from scratch
T cells, CD8 T cells and other immune cells (Extended Data Fig. 2a). The cohort by using the cell graph from the chip cytometry colon data (Extended Data Fig. 2)
in this dataset includes two patients with colorectal carcinoma and two healthy and the simulated node-wise gene expression vectors. We modeled cell types using
controls. We scaled the model outputs by cell-wise size factors. the number of genes originally defined in the dataset and drew a mean expression
value for each gene from a uniform distribution between 0 and 10. We considered
Tonsils (MELC). Pascual-Reguant et al.15 measured tonsils from patients two scenarios of dependencies between cells: (1) a dataset without spatial
undergoing tonsillectomy with multiepitope ligand cartography (MELC), an dependencies in which all cells are drawn from one cell type and are independent
immunohistochemistry approach, in one image across one patient, with 51 and identically distributed and (2) a dataset with spatial dependencies in which
genes observed in 9,512 cells. We used the cell types originally annotated by cells belong to either one of two cell types, where we introduced dependencies on
Pascual-Reguant et al.: B cells, endothelial cells, innate lymphoid cell (ILC), the presence of the respective other cell type in the neighborhood in 50% of the
monocytes/macrophages/dendritic cells (DC), natural killer (NK) cells, plasma genes, with strong effect sizes drawn from a uniform distribution between 4 and 6.
cells, T cytotoxic cells, T helper cells (Extended Data Fig. 2a). We removed cells We fitted NCEMs, Misty9 and SVCA4 on both simulated datasets (Supplementary
labeled as ‘other’ from the dataset. Fig. 2). The simulated images contained over 4,500 cells per image. To reduce the
runtime for SVCA for these samples we cropped both images in the lower-right
Cancer (CODEX). Schürch et al.13 measured advanced-stage colorectal cancer corner to create images with approximately 850 cells each.
with CODEX in 140 images across 35 patients, with 57 genes observed in 272,266
cells. We used the cell types originally annotated by Schürch et al.: B cells, CD11b+ Models. The inputs to NCEMs are (1) a gene expression matrix Y ∈ RN×J where
monocytes, CD11c+ dendritic cells, CD11b+ CD68+ macrophages, CD163+ N is the number of cells and J is the number of genes, (2) a matrix of observed
macrophages, CD68+ macrophages, CD68+ macrophages GzmB+, CD68+ CD163+ cell types Xl ∈ RN×L where L is the number of unique cell type labels and (3)
macrophages, CD3+ T cells, CD4+ T cells, CD4+ T cells CD45RO+, CD4+ T cells a matrix specifying the batch assignments Xc ∈ RN×C of C distinct batches or
GATA3+, CD8+ T cells, NK cells, T regs, adipocytes, dirt, granulocytes, immune domains, such as images or patients. We denote the adjacency matrix of connected
cells, immune cells/vasculature, lymphatics, nerves, plasma cells, smooth muscle, cells as A ∈ RN×N , which is calculated based on the spatial proximity of cells
stroma, tumor cells, tumor cells/immune cells and vasculature (Extended Data Fig. per image. For linear models and models with an indicator aggregator, we used
2a). We removed cells with an annotation of dirt or an undefined label from the a binary adjacency matrix Aij = 1 if d(xa , xb ) ≤ δ max where d(·, ·) describes the
dataset. We merged the macrophage subclusters (CD11b+ CD68+ macrophages, euclidean distance between nodes a, b ∈ N in space and δmax is the neighborhood
CD163+ macrophages, CD68+ macrophages, CD68+ macrophages GzmB+ and size (resolution), and Aij = 0 otherwise. For models using graph convolutions,
CD68+ CD163+ macrophages) and the CD4+ T cell subclusters (CD4+ T cells, CD4+ we normalized A such that all rows sum to one: D−1A where D is the diagonal
T cells CD45RO+ and CD4+ T cells GATA3+). We scaled model outputs by the node degree matrix. The output of NCEMs is Ŷ ∈ RN×J , a reconstruction of the
node size in the respective output layer of each model class to mitigate count noise gene expression matrix. In selected datasets, we applied∑size factor scaling to
J
y
(Supplementary Fig. 8e). the network output Y using the size factors sfi = 1
∑ N ∑J
j ij
. The global data
y′ ′
N i ′ j′ i j
patients as the test set. From the remaining nodes, 10% of all nodes are selected as σj is the predicted standard deviation of a gene (Supplementary Fig. 8). The loss
the validation set. function of CVAE model is the negative data log-likelihood in addition to the
Data availability Peer review information Nature Biotechnology thanks Qing Nie, Tommaso Biancalani
The MERFISH fetal liver11, MERFISH brain12, MIBI TOF cancer14, MELC tonsils15, and the other, anonymous, reviewer(s) for their contribution to the peer review of
CODEX cancer13, chip cytometry colon16 and Visium lymph node19 datasets are this work.
publicly available (Methods). Reprints and permissions information is available at www.nature.com/reprints.
Extended Data Fig. 1 | Benchmarking cell–cell dependency inference on simulated data. Shown are fits of NCEMs (a, b), MistyR (c, d), and SVCA (e,
f) on simulated data with simulated dependencies between cells (a, c, e) and without simulated dependencies between cells (b, d, f). The simulation is
described in the Methods.
Extended Data Fig. 2 | Cell-type centric summary statistics of the considered datasets. (a) Cell-type frequencies by dataset. Shown is a barplot with the
number of cells in each cell-type for MERFISH – brain data, chip cytometry – colon data, MIBI TOF – cancer data, MELC – tonsils data, CODEX – cancer
data, MERFISH - fetal liver wild type and MERFISH - fetal liver Tet2−/−. (b) Mean node degree (number of neighbors) by resolution in µm and dataset
for MERFISH – brain data (n = 284,098 cells), chip cytometry – colon data (n = 11.321 cells), MIBI TOF – cancer data (n = 63,747 cells), MELC – tonsils
data (n = 9,512 cells), CODEX – cancer data (n = 272,266 cells), MERFISH - fetal liver wild type (n = 40,864 cells) and MERFISH - fetal liver Tet2−/−
(n = 54,970 cells). For each box in (b), the centerline defines the median, the height of the box is given by the interquartile range (IQR), the whiskers are
given by 1.5 * IQR and outliers are given as points beyond the minimum or maximum whisker.
Extended Data Fig. 4 | Robustness of cell communication inference on deconvoluted spot transcriptomics data. (a) 10x Visium slide of a lymph node
with the spot-wise abundance of B cells, follicular dendritic cells (FDC), and mast cells inferred with cell2location superimposed. (b) UMAP of cells in a
matched scRNA-seq data set of human lymph nodes, spleen and tonsils with cell types superimposed. (c) Type coupling heatmap of the Visium – lymph
node dataset, with edge width proportional to the number of differentially expressed genes at a false-discovery-rate-corrected p-value threshold of
0.05 for each pair of sender and receiver cell types. Only edges with at least 200 genes are shown. (d) Violin plot of Cxcl13 expression per cell for FDC
subclusters in the Visium - lymph node data. (e) Robustness of type coupling analysis in a Visium slide on human lymph nodes. Shown are R2 between
inferred cell type coupling vectors of randomly subsampled spots and the complete data for subsampling ratios of 5 %, 25 %, 50% and 75 % in three cross
validations (n = 256 type couplings in each boxplot). For each box, the centerline defines the median, the height of the box is given by the interquartile
range (IQR), the whiskers are given by 1.5 * IQR and outliers are given as points beyond the minimum or maximum whisker. (f-h) Correlation of measures
of cell communication events between pairs of cell types compared with type coupling scores from NCEM on the tabular sapiens lymph node dataset.
Shown are CellphoneDB permutation test results with the number of ligand-receptor pairs with positive mean expression (f), number of ligand-receptor
pairs with a FDR-corrected p-value below a threshold of 0.05 (g) and the number of ligands associated with a pair of cell types as identified by NicheNet
(h) (Methods). Each point is one pair of cell types. The vertical line indicates the threshold for showing edges in Fig. 2b.
Extended Data Fig. 6 | Attributing cell heterogeneity to niche composition in inflamed colon. (a) Area 1 of chip cytometry – colon dataset with cell-types
superimposed. (b) UMAPs of molecular embedding of CD8 T cells only with molecular subclustering superimposed (colors as in c). (c) Distribution of
cell-wise difference of R2 between spatial model non nonspatial baseline model by molecular sub-cluster (CD8 T cells 0: n = 74, CD8 T cells 1: n = 58,
CD8 T cells 2: n = 41, CD8 T cells 3: n = 37, CD8 T cells 4: n = 24). The centerline of the boxplots defines the median, the height of the box is given by the
interquartile range (IQR), the whiskers are given by 1.5 * IQR and outliers are given as points beyond the minimum or maximum whisker. (d) UMAPs of
molecular embedding of all CD8 T cells in area 1 (n = 234 cells) showing if a given cell-type is present in the neighborhood. The underlying neighborhoods
were defined at the best performing resolution identified in Fig. 1c (40 µm). (e) Heatmap of fold change and false-discovery rate corrected p-values
of cluster enrichment of binary neighborhood labels, where fold changes are the ratio between the relative neighboring source cell-type frequencies
per subtype cluster and the overall source cell-type frequency in the image. (f) Area 1 of colon in the chip cytometry – colon dataset with CD8 T cell
sub-states (left) and the difference of R2 between the NCEM interaction model at resolution of 40 µm and the best nonspatial baseline model (right). (g)
Type coupling analysis, showing the number of differentially expressed genes at a false-discovery-rate-corrected p-value threshold of 0.05 for each pair of
sender and receiver cell types.
Extended Data Fig. 7 | Attributing cell heterogeneity to niche composition in colorectal cancer. (a) Field of view 16 of MIBI TOF – cancer dataset with
the spatial allocation of all cell-types superimposed. (b) UMAPs of molecular embedding of CD8 T cells only with molecular sub-clustering superimposed
(colors as in c). (c) Distribution of cell-wise difference of R2 between spatial model non-spatial baseline model by molecular sub-cluster (CD8 T cells
0: n = 304, CD8 T cells 1: n = 293, CD8 T cells 2: n = 278, CD8 T cells 3: n = 247, CD8 T cells 4: n = 207). The centerline of the boxplots defines the
median, the height of the box is given by the interquartile range (IQR), the whiskers are given by 1.5 * IQR and outliers are given as points beyond the
minimum or maximum whisker. (d) UMAPs of molecular embedding of all CD8 T cells in area 1 (n = 1,329 cells) showing if a given cell type is present
in the neighborhood. The underlying neighborhoods were defined at the optimal resolution identified in Fig. 1d (13 µm). (e) Heatmap of fold change
and false-discovery rate corrected p-values of cluster enrichment of binary neighborhood labels, where fold changes are the ratio between the relative
neighboring source cell-type frequencies per subtype cluster and the overall source cell-type frequency in the image. (f) Field of view 1, 5, 8 and 16 of colon
in the MIBI TOF – cancer dataset with CD8 T cell sub-states, cell type assignments to epithelial and T cells, and the difference of R2 between the NCEM
interaction model at a resolution of 13 µm and the best nonspatial baseline model (scale bar 50 µm). (g) Type coupling analysis, showing the number of
differentially expressed genes at a false-discovery-rate-corrected p-value threshold of 0.05 for each pair of sender and receiver cell types.
, where L is the number of distinct cell types in the model. Non-normalized saliencies will show a pattern similar to the contact frequency matrix as cell
types with frequent connections will skew the learned importance of cell connections. Therefore, we normalized the saliencies by the absolute number
nab of occurrences of each cell type pair: SALSnorm
ab
= n1 SALSab. For each box in (c), the centerline defines the median, the width of the box is given by the
ab
interquartile range (IQR), the whiskers are given by 1.5 * IQR and outliers are given as points beyond the minimum or maximum whisker.
Extended Data Fig. 9 | Modeling intrinsic and extrinsic variation in deep latent variable models. (a) A node generative network (CVAE–NCEM) is a
conditional variational autoencoder in which the condition is not a constant but a graph embedding, which is also learned. The forward pass for a cell i
through the model is shown. (b) Latent variable models improve reconstructive performance. Shown are the R2 values of held-out test data based on the
forward pass model evaluation from chip cytometry – colon data for linear models, encoder–decoder models, and variational autoencoders for both NCEM
and nonspatial models (n=3 cross-validation splits). baseline: a nonspatial linear model of gene expression per cell-type; NCEM interactions: linear model
with interaction effects; NL: nonlinear model; IND: the graph kernel is an indicator function across cell-types in the neighborhood; GCN: the graph kernel is
a graph convolution, a linear embedding of the cell-types in the neighborhood. (c) Neighborhood transfer performance of NCEM and nonspatial models.
Shown is the R2 over cells in the test set for models trained on predicting muscular cells and Lamina propria cells for both CVAE and CVAE–NCEMs trained
on neighborhoods with different radii with optimization algorithm as color (n=3 cross-validation splits). Plain: normal CVAE training; aggressive: aggressive
encoder training. For each box in (b, c), the centerline defines the median, the height of the box is given by the interquartile range (IQR), the whiskers
are given by 1.5 * IQR and outliers are given as points beyond the minimum or maximum whisker. (d–f) Latent variables of CVAE–NCEM are confounded
with neighborhood conditions. (d) UMAP of molecular embedding in the CVAE–NCEM IND latent space of muscular cells in an example image (n = 1,149
cells) with molecular sub-clustering superimposed (muscle 0: n = 315, muscle 1: n = 287, muscle 2: n = 238, muscle 3: n = 183, muscle 4: n = 126). (e)
UMAPs of molecular embedding in the CVAE–NCEM IND latent space of all muscle cells in the same image with superimposed binary label of presence
of a given cell-type, as defined in the title, in the neighborhood. The underlying neighborhoods were defined at a resolution of 100 µm. (f) Heatmap of fold
change and false-discovery corrected p-values of cluster enrichment of binary neighborhood labels, where fold changes are the ratio between the relative
neighboring source cell-type frequencies per subtype cluster and the overall source cell-type frequency in the image.
Extended Data Fig. 10 | Modeling ligand–receptor signaling with NCEM. (a) UMAP of cells in MERFISH – fetal liver data. (b) Imputation of MERFISH data
with scRNA-seq increases the number of genes that can be modeled with NCEMs, including receptor genes, ligand genes, and ligand–receptor pairs. (c)
Distribution of selected marker genes that are both observed in scRNA-seq and in MERFISH over cell types.