Tag: Lymphocyte differentiation

  • A spatial human thymus cell atlas mapped to a continuous tissue axis

    A spatial human thymus cell atlas mapped to a continuous tissue axis

    [ad_1]

    Data generation by institute

    Metadata about scRNA-seq and CITE-seq samples, including information on source study, cell enrichment and donor age, are provided in Supplementary Table 1. Information about spatial data, including Visium, IBEX, RareCyte and RNAscope, is provided in Supplementary Table 2. In brief, all CITE-seq data were generated at Ghent University and mapped to the human genome (GRCh38) at the Wellcome Sanger Institute (WSI). All other original scRNA-seq data and 10x Visium data were generated and mapped to the human genome (GRCh38) at WSI. All IBEX imaging was performed at the National Institute of Allergy and Infectious Diseases (NIAID), NIH. All non-IBEX imaging datasets (RareCyte, RNAscope, Visium H&E) were generated at WSI. No fetal work was performed at the NIH and at Ghent University.

    Human fetal and paediatric data from several previous studies3,7,15,16 were included and reanalysed from raw fastq files. Details on sample processing, ethics and funding are available in the respective publications; details on the origin of each sample are provided in Fig. 1c and Supplementary Tables 1 and 2.

    For samples processed at WSI, paediatric samples were obtained from cardiac corrective surgeries and provided by Newcastle University collected under REC approved study 18/EM/0314 and Great Ormond Street Hospital under REC approved study 07/Q0508/43. Human embryonic and fetal material was provided by the joint MRC & Wellcome Trust (grant MR/006237/1) Human Developmental Biology Resource (http://www.hdbr.org) with written consent and approval from the Newcastle and North Tyneside NHS Health Authority Joint Ethics Committee (08/H0906/21+5). Samples processed at Ghent University were obtained according to and used with the approval of the Medical Ethical Commission of Ghent University Hospital, Belgium (EC/2019-0826) through the haematopoietic cell biobank (EC-Bio/1-2018). Samples processed at the NIH were obtained under NIAID MTA 2016-250 from the pathology department of the Children’s National Medical Center in Washington, DC, following cardiothoracic surgery from children with congenital heart disease. Use of these thymus samples for this study was determined to be exempt from review by the NIH Institutional Review Board in accordance with the guidelines issued by the Office of Human Research Protections. Informed consent was obtained from all donors or their legal guardians.

    Sample processing and library preparation for 10x scRNA-seq at WSI

    Surgically removed paediatric thymi were directly moved to HypoThermosol (Sigma-Aldrich, H4416-100ML), shipped by courier with ice packs, and processed within 24 h from time of surgery. For scRNA-seq experiments, we performed single-cell dissociation as described in the protocol available online (https://doi.org/10.17504/protocols.io.bx8sprwe). In brief, thymic tissue was finely minced and cell dissociation was performed using a mixture of liberase TH (Roche, 05401135001) and DNase I (Roche, 4716728001) for ~30–60 min in two rounds. Digested tissue was filtered through a 70 μm strainer and digestion was stopped with 2% FBS in RPMI medium. Red blood cell lysis was then performed on the cell pellet using RBC lysis buffer (eBioscience, 00-4333-57), after which cells were washed and counted. Magnetic and/or FACS sorting were performed to enrich stromal populations. Magnetic sorting to enrich for EPCAM+ or to deplete CD45+ or CD3+ cells was performed for U09, U48 and Z11 samples (Supplementary Table 1) using a magnetic sorting kit from Miltenyi Biotec including LS Columns (130-042-401) and the following bead-tagged antibodies: human CD45 MicroBeads (130-045-801), human CD326 (EPCAM) MicroBeads (130-061-101), human CD3 MicroBeads (130-050-101). Z11 and U40 samples were FACS-sorted to enrich for CD45 stromal cells (Z11) or to obtain both CD45 cells and total TECs (U40) (Supplementary Table 1). To perform FACS sorting, cells were resuspended in FACS buffer (0.5% FBS and 2 mM EDTA in PBS), blocked with TruStainFcX (BioLegend, 422302) for 10 min and were stained with antibodies against EPCAM (anti-CD326 PE, 9C4, BioLegend, 324206), CD45 (anti-CD45 BV785, HI30, BioLegend, 304048), CD205 (anti-CD205 (DEC-205) APC, BioLegend, 342207) and CD3 (anti-CD3 FITC, OKT3, BioLegend, 317306) and DAPI for 30 min. All antibodies were diluted 1:50 for staining. After staining, cells were washed and sorted using Sony SH800 or Sony MA900 sorters with a 130 μm nozzle. Samples were first gated to remove debris and dead cells, but no singlet gate was applied to ensure large TECs would not be excluded. For samples stained with anti-CD45, total CD45 cells were sorted to enrich for stroma. For EPCAM/CD205-stained cells, EPCAM+CD205 and EPCAMCD205+ cells were sorted to obtain the total TEC fraction including cortical and medullary epithelial cells but exclude autofluorescent cells (Supplementary Fig. 19a). FCS Express v.7.18.0025 was used for data analysis. After sorting/enrichment, cells were resuspended in collection buffer to the recommended concentration (106 cells per ml) and loaded to capture around 8,000–10,000 cells on the 10x Genomics chromium controller to generate an emulsion of cells in droplets using Chromium Next GEM Single Cell 5′ Kit v2 (1000263). GEX and TCR-seq libraries were further prepared using the Library Construction Kit (1000190) and Chromium Single Cell Human TCR Amplification Kit (1000252) according to the manufacturer’s instructions. Sequencing was performed on the NovaSeq 6000 sequencer (Illumina). Additional details are provided in Supplementary Table 1.

    Curation of published datasets

    Data from several previous studies3,7,15,16 are included in this Article. For public datasets deposited on ArrayExpress, paired-end fastq files were downloaded from ENA, and the .sdhf file was used to determine the type of experiment (3′/5′ and the version of 10x Genomics kit). For public datasets deposited on GEO, if the data were deposited as a Cell Ranger .bam file, URLs for the bam files were obtained using srapath v.2.11.0. The .bam files were downloaded and converted to .fastq files using 10x bamtofastq v.1.3.2. If GEO data were deposited as paired-end fastq files, sra files were located using the search utility from NCBI entrez-direct v.15.6, downloaded and converted to fastq files using fastq-dump v.2.11.0. Sample metadata were curated from the abstracts deposited on GEO. Finally, published datasets that had been generated at WSI were downloaded from iRODs v.4.2.7 in the form of cram files and converted to fastq files using samtools v.1.12. using the command ‘samtools collate -O -u -@16 $CRAM $TAG.tmp | samtools fastq -N -F 0×900 -@16 −1 $TAG.R1.fastq.gz −2 $TAG.R2.fastq.gz -’. Sample metadata were obtained using the imeta command from iRODs.

    Processing of published and newly generated scRNA-seq and TCR-seq datasets at WSI

    After fastq file generation, 10x Genomics scRNA-seq experiments were processed using the STARsolo pipeline detailed on GitHub (https://github.com/cellgeni/STARsolo). A STAR human genome reference matching Cell Ranger GRCh38-2020-A was prepared as per instructions from 10x Genomics. Using STAR v.2.7.9a and the previously collected data about sample type (3′/5′, 10x Genomics kit version), we applied the STARsolo command to specify UMI collapsing, barcode collapsing, and read clipping algorithms to generate results maximally similar to the default parameters of the “cellranger count” command in Cell Ranger v.6: “–soloUMIdedup 1MM_CR –soloCBmatchWLtype 1MM_multi_Nbase_pseudocounts –soloUMIfiltering MultiGeneUMI_CR –clipAdapterType CellRanger4 –outFilterScoreMin 30”. For cell filtering, the EmptyDrops algorithm of Cell Ranger v.4 and above was invoked using “–soloCellFilter EmptyDrops_CR”. The option “–soloFeatures Gene GeneFull Velocyto” was used to generate both exon-only and full length (pre-mRNA) gene counts, as well as RNA velocity output matrices. TCR-seq samples were processed using Cell Ranger v.6.1.1 with VDJ reference vdj-GRCh38-alts-5.0.0. The default settings of the reference-based “cellranger vdj” command were used. Fastq files were converted to _S1_L001_R1_001.fastq.gz format to be compatible with Cell Ranger.

    scRNA-seq quality control, data integration and annotation

    Jupyter notebooks used for data quality control, preprocessing, integration and annotations are available in the GitHub repository for this manuscript (Code availability). Scanpy v.1.9.1 with anndata v.0.10.7 and the statistics and plotting libraries pandas v.2.2.2, numpy v.1.26.4, scipy v.1.13.0, seaborn v.0.13.2 and matplotlib v.3.8.4 were used for data analysis and visualization. Mapped libraries were subjected to computational removal of ambient RNA using CellBender42 v.0.1.0. Next, all datasets underwent cell quality-control filtering and cells with <400 or > 6500 genes, >6% mitochondrial reads or <5% ribosomal counts were removed. Doublets were annotated using Scrublet43 v.0.2.3. Next, datasets were integrated with scVI from scvi-tools44 v.0.19.0, for which mitochondrial, TCR and cell cycle genes were removed, and cells were annotated into major lineages (cell_type_level_0: T_DN, T_DP, T_SP, Epithelial, Stroma, Myeloid, RBC, B and Schwann) by Leiden clustering. Individual cell lineages were then separated and integrated with scVI to perform fine-grained annotation and remove remaining doublets picked up by manual annotation. Cell annotations were assigned based on four sequential steps: (1) high-resolution Leiden clustering was performed to find all potential cell clusters. (2) Annotations of new cells generated in this study were predicted (i) based on KNN graph majority voting of neighbouring cells with annotations from previous studies by adaptation of the weighted KNN transfer solution from scArches45, or (ii) automatic label transfer using CellTypist46 v.1.6.2 with the Developing_Human_Thymus or Pan_Fetal_Human models. (3) Calling the cell type annotation for a given cluster was then informed by a combination of the predicted labels and by marker genes reported in the literature. (4) Additional QC was performed and newly detected doublet clusters were removed where applicable. Steps 1–4 were repeated until final, fine-grained annotations were reached for the highest resolution (cell_type_level_4). Importantly, cell clusters that passed quality control but that we could not confidently assign to a defined cell type either in the literature or by cell markers by the strategy described above, were kept in the integrated object as ‘cell_type_level_4_explore’, which we recommend for future exploration and validation of these cell states. Annotations in ‘cell_type_level_4’ were grouped into five hierarchical levels from the finest (cell_type_level_4) to the broadest (cell_type_level_0) (Supplementary Table 7).

    Single-cell TCR-sequencing data were processed using Dandelion47 v.0.3.1, and a detailed notebook can be found in the GitHub repository for this Article (see Code availability). In brief, a Dandelion class object with n_obs was constructed from all combined TCR libraries. Cells were then further subset to only include DP and SP subtypes that contained V and J rearrangements for both TRB and TRA loci. Next, milo neighbourhoods were constructed based on scVI neighbourhood graphs (n_neighbors = 100) and VDJ genes and frequency feature space was calculated.

    mTEC trajectory analysis

    The scFates package48 v.1.0.7 was used for trajectory analysis on the combined mTECII and mTECIII cells. mTECII and mTECIII cells from the paediatric scRNA-seq dataset were reintegrated and batch-corrected using scVI. Next, the object was preprocessed according to recommendations from Palantir49 and the scVI latent embedding was used as an input to ‘palantir.utils.run_diffusion_maps’ (Palantir v.1.3.3). Tree learning was performed using scf.tl.tree by using multiscale diffusion space from Palantir as recommended by scFates. Next, the node that was characterized by expression of some mTECI markers (ASCL1, CCL21) together with mTECII markers (AIRE, FEZF2) was used as a root to compute pseudotime using scf.tl.pseudotime. Finally, milestones obtained after running the pseudotime were used to adjust the annotations. To derive differentially expressed genes across pseudotime branches, we applied the scf.tl.test_fork function, followed by scf.tl.branch-specific as described in the scFates article48. In brief, it fits a generalized additive model for each gene using pseudotime, branch and interaction between pseudotime and branch as covariates; two-sided P values were extracted for the interaction term (pseudotime:branch) and corrected using FDR to obtain significant differentially expressed genes. Next, these genes were tested for upregulation in each branch and assigned to different branches based on a cut-off of 1.3-fold upregulation.

    mcTEC differentiation potential analysis

    The STEMNET package50 (v.0.1) was used to determine the differentiation potential of mcTEC progenitor cells. For this purpose, only mTEC, cTEC and mcTEC(-Prolif) cells were retained in the dataset. mTECI/II/III and cTECI/II/III were set as maturation endpoints and the probability of each cell to adopt any of these six possible fates was calculated. To identify priming of mcTEC(-Prolif) towards mTECI versus cTECI fate, we derived a priming score by calculating the difference between the posterior probabilities for these two fates for each cell. Cells with priming score <−0.5 or >0.5 were labelled as mTECI-primed and cTECI-primed, respectively, whereas all other cells with comparable mTECI and cTECI potential were deemed to be unprimed.

    Formulation of the CMA with OrganAxis

    OrganAxis is a mathematical model aimed to derive the relative, signed position of a point in space in respect to two morphological landmarks. The OrganAxis base function H is highly flexible and tuneable with respect to the research question, spatial resolution and sampling frequency. In Supplementary Notes 1 and 2, we provide a detailed guide on tissue annotations with TissueTag v.0.1.1 and the details of OrganAxis derivation. The CMA is an extrapolation of OrganAxis, which is defined by a weighted linear combination of two H functions (CMA = 0.2 × H(edge-to-cortex) + 0.8 × H(cortex-to-medulla)). All IBEX and Visium images were annotated with TissueTag v.0.1.1 at a resolution of 2 um per pixel (ppm = 0.5). Then, annotations were transferred to a quasi-hexagonal grid that was generated by placing points with r-microns spacing in the x and y directions and staggering every other row by r/2. Throughout this study we used r = 15. L2 distances to broad level annotations (annotation_level_0) and the corresponding CMA values were calculated with k = 10 for all spots in the hexagonal grid. L2 distances to fine annotations (annotation_level_1) were calculated with k = 1. CMA and L2 distances were then transferred to spots in Visium datasets or nuclei segmentations for IBEX by nearest-neighbour mapping and were therefore spatially homogeneous for these two spatial technologies.

    To provide a common reference, we also binned the axis to a sequential discrete space for the entire thymus by ten levels (capsular, subcapsular, cortical level 1, cortical level 2, cortical level 3, cortical CMJ, medullary CMJ, medullary level 1, medullary level 2, medullary level 3; details are provided in Supplementary Note 2 and Supplementary Table 8).

    10x Visium spatial transcriptomics sample processing and sequencing

    Resected fetal and paediatric thymi were directly moved to HypoThermosol (Sigma-Aldrich, H4416-100ML) (paediatric samples) or cold PBS (fetal samples), shipped by courier with ice packs and processed within 24 h from time of surgery. For embedding, fetal or paediatric thymus tissue was first transferred to PBS and then placed onto ice for a few minutes to clear away any excess medium and preservation liquid (such as HypoThermosol). Next, as much liquid as possible was removed from the sample and, if necessary, the tissue was trimmed to fit into a cryomold. The sample was placed into a cryomold (Tissue-Tek, AGG4581) filled with OCT (Leica biosystems, 14020108926) and positioned according to the desired orientation. The cryomold was then placed in isopentane that had been equilibrated to −60 °C and left to fully freeze for 2 min. The sample was then rested on dry ice to allow draining of the isopentane. Finally, the cryomolds were wrapped in foil and stored at −80 °C. On the day of sectioning, the samples were removed 1 h before sectioning and placed into the cryostat (Leica biosystems) at −18 °C to equilibrate. Tissue was sectioned (section thicknesses are provided in Supplementary Table 2) and sections were placed onto Visium slides according to the manufacturer’s protocol. The sections were stained with H&E and imaged at ×20 magnification (Hamamatsu Nanozoomer 2.0 HT). Libraries were further processed according to the manufacturer’s protocol (Visium Spatial Gene Expression Slide & Reagent Kit, 10x Genomics, PN-1000184) (permeabilization times are shown in Supplementary Table 2). The samples were sequenced on the NovaSeq 6000 sequencer (Illumina) and the obtained fastq files were mapped with Space Ranger (10x Genomics; version numbers are provided in Supplementary Table 2).

    Visium preprocessing, image registration and annotation

    To process the Visium histology image data in higher resolution than the SpaceRanger defaults, we built a custom pipeline to extract an additional layer of image resolution at up to 5,000 pixels (hires5K), which we found to be more suitable for morphological analysis. We also developed our own fiducial image registration pipeline for increased accuracy where the fiducials are detected with cellpose v.2.1.1 and RANSAC from scikit-image v.0.22.0 is used for affine registration of reference fiducial frame (information provided by 10x Genomics). Lastly, for flexible tissue detection, we used Otsu thresholding with an adjustable threshold.

    We subsequently used TissueTag v.0.1.1 for semiautomated image annotation (Supplementary Note 1). Cortical and medullary pixels were predicted with a pixel random forest classifier by generating training annotations based on spots with the highest gene expression of AIRE (for medulla) and ARPP21 (for cortex). Automatic cortex/medulla annotations were then adjusted manually where necessary. Moreover, we manually annotated individual thymic lobes and specific structures, such as capsule/edge, freezing/sectioning artefacts, HCs, PVS and fetal thymus-associated lymphoid aggregates (as defined previously16). The morphological annotation and evaluations were done in consultation with expert human thymic pathologists. A full example of the Visium processing pipeline and annotation is provided on the GitHub repository for this Article (see Code availability).

    Spatial Visium mapping with cell2location

    To ensure the best possible matching between Visium and single-cell profiles, we performed spatial mapping using cell2location51 v.0.1.3 separately for fetal and paediatric datasets. We therefore subset our single-cell reference datasets according to either fetal or paediatric stage and removed rare cell types that were predominantly found in one of these stages. We then further removed cell types that showed stress signatures (which we believed to originate from technical factors) and cell types with the total number of cells < 40. A list of cell types that were excluded and the exclusion criteria are provided in Supplementary Table 9. Before cell2location deconvolution, we removed cell cycle genes (Supplementary Table 10) and mitochondrial genes, as well as TCR genes using the regex expression ‘^TR[AB][VDJ]|^IG[HKL][VDJC]’. We then further calculated highly variable genes and used relevant metadata cofactors (sample, chemistry, study, donor, age) to correct for batch effects in the cell2location model.

    Quality control and batch correction of Visium data

    After deconvolution, all Visium data were subjected to filtering based on read coverage and predicted cell abundance. Spots with fewer than 1,000 genes per spot or fewer than 25 predicted cells were omitted. Furthermore, annotated tissue artefacts and areas not assigned to a specific structure were removed. Next, to generate a common embedding we performed scVI integration after removing cell cycle, mitochondrial and TCR genes from the highly variable gene selection. scVI training was performed with ‘SampleID’ as the batch key, ‘SlideID, Spaceranger, section_thickness(um)’ as categorical covariates, and ‘Age(numeric), n_genes_by_counts’ as continuous covariates.

    Before performing any association analysis with the CMA, we further removed lobules (based on ‘annotations_lobules_0’) that had no or small medullar or cortical  regions, as we expected our CMA model to be less accurate in these cases.

    PCA cumulative contribution of CMA

    To estimate the dependency of the axis on spot gene variance across samples, we first normalized to a target sum of 2,500 counts and performed log transformation followed by combat regression (https://scanpy.readthedocs.io/en/stable/api/generated/scanpy.pp.combat.html) by sample to adjust for the batch effect of individual samples. We then computed the PCA for batch-corrected gene expression and calculated the Spearman correlation between the first ten PCs and CMA or the number of genes detected per spot (n_gene_by_counts). Note that number of genes per spot, in our hands, was mostly influenced by inconsistent permeabilization during Visium library preparation and constituted the largest technical source of within-sample variance that we found in both fetal and paediatric Visium samples. To estimate the cumulative contribution of either CMA or the number of genes per spot, we multiplied the Spearman’s R with the percentage cumulative explained variance of the first 10 PCs.

    Cytokine clustering by expression along the binned axis in Visium data

    To analyse cytokine gradients based on the spatial distribution across CMA bins, we first selected a group of 65 cytokines that were broadly expressed from the CellphoneDB database (v.4.1.0, genes annotated as ‘Cytokine’, ‘growthfactor | cytokine’, ‘cytokine’, ‘cytokine | hormone’). We excluded cytokines that were expressed in less than 5% of the spots in all CMA bins. We then performed hierarchical clustering on gene expression batch-corrected (combat) fetal Visium samples of the standardized mean expression of genes across bins using the Ward linkage method with the linkage function from the scipy.cluster.hierarchy module. A heat map was generated using the matrixplot function from the scanpy package52.

    Cytokine two-way ANOVA analysis and cosine similarity on binned axis Visium data

    To compare the distribution of cytokines across developmental groups (fetal versus paediatric) and identify differentially distributed genes, we implemented a two-way ANOVA approach. We initially log-normalized Visium gene expression, then removed lobules for which not a single cortex or medulla Visium spot was detected to increase CMA confidence in both datasets. Data were grouped by mean expression per sample and CMA bin, such that each sample had a single datapoint per CMA bin; n = 16 (paediatric) and n = 12 (fetal) samples. Cosine similarity was calculated based on the median values of the pooled sample bins between fetal and paediatric gene profiles with sklearn.metrics.pairwise.cosine_similarity from scikit-learn v.0.22.0. Two-way ANOVA for age group (fetal versus paediatric) and CMA bin was calculated with statsmodels.api.stats.anova_lm(model, type=2). P values for main and interaction effects were Bonferroni corrected with statsmodels.stats.multitest.multipletests (pvals, alpha=0.05, method=‘bonferroni’). For the full report of the results refer to Supplementary Table 5.

    Cell- and gene-weighted mean CMA location on Visium data

    To estimate the average position of a cell or gene distribution along CMA and HC axes (L2 distance to the nearest HC), spots with low gene expression were filtered out by using appropriate thresholds (0.2 for scVI corrected gene expression and 0.5 for predicted cell abundances). The position of a gene or cell was then calculated according to the following formula: for every gene/cell and axis positions, the weighted mean was calculated as a dot product of spot cell abundance values and CMA position divided by the sum of the cell abundance values across spots.

    Identification of cell-type-specific SGs

    To identify genes exclusively expressed in a specific cell type or subset thereof (‘specialization genes’, SGs), we developed custom Python functions. Starting from raw read count, gene expression was scaled with scipy.stats.zscore(). Cells that showed expression below a cut-off of 0.05 and genes that had expression below 1.5 mean counts were excluded from further steps. Next, a quantile threshold (>95%) was used to select cells with the highest expression level of a specific gene. A χ2 test (scipy chi2_contingency) was performed per gene to identify if the selected cells were over-represented in a specific cell type (cell_type_level_4_explore), indicating the gene to be a marker gene. Genes that were predicted to be expressed only in a single cell type (χ2 α = 1 × 10−50) were considered to be SGs and used for further analyses.

    IBEX clinical cohort details and sample preparation

    Human thymus samples were obtained from the pathology department of the Children’s National Medical Center in Washington, DC, after cardiothoracic surgery from children with congenital heart disease, as thymic tissue is routinely removed and discarded to gain adequate exposure of the retrosternal operative field. Use of these thymus samples for this study was determined to be exempt from review by the NIH Institutional Review Board in accordance with the guidelines issued by the Office of Human Research Protections. There were no genetic concerns for the patients in this cohort. Details about the cohort can be found in Supplementary Table 2. Human thymi were placed in PBS on receipt and processed within 24 h after surgery. Excess fat and connective tissue were trimmed and sectioned into <5 mm cubes. For IBEX imaging, human thymi were fixed with BD CytoFix/CytoPerm (BD Biosciences) diluted in PBS (1:4) for 2 days. After fixation, all tissues were washed briefly (5 min per wash) in PBS and incubated in 30% sucrose for 2 days before embedding in OCT compound (Tissue-Tek) as described previously17,53.

    IBEX sample imaging preparation

    IBEX imaging was performed on fixed frozen sections as described previously17,53. In brief, 20 μm sections were cut on a CM1950 cryostat (Leica) and adhered to two-well chambered cover glasses (Lab-tek) coated with 15 μl of chrome alum gelatin (Newcomer Supply) per well. Frozen sections were permeabilized, blocked and stained in PBS containing 0.3% Triton X-100 (Sigma-Aldrich), 1% bovine serum albumin (Sigma-Aldrich) and 1% human Fc block (BD Biosciences). Immunolabelling was performed with the PELCO BioWave Pro 36500-230 microwave equipped with a PELCO SteadyTemp Pro 50062 Thermoelectric Recirculating Chiller (Ted Pella) using a 2-1-2-1-2-1-2-1-2 program. The IBEX thymus antibody panel can be found in Supplementary Table 3 and has been formatted as an Organ Mapping Antibody Panel54 (OMAP-17) accessible online (https://humanatlas.io/omap). Custom antibodies were purchased from BioLegend or conjugated in house using labelling kits for Lumican (AAT Bioquest, 1230) and LYVE-1 (Thermo Fisher Scientific, A20182). A biotin avidin kit (Abcam, ab64212) was used to block endogenous avidin, biotin and biotin-binding proteins before streptavidin application. Cell nuclei were visualized with Hoechst 33342 (Biotium) and the sections were mounted using Fluoromount G (Southern Biotech). Mounting medium was thoroughly removed by washing with PBS after image acquisition and before chemical bleaching of fluorophores. After each staining and imaging cycle, the samples were treated with two 15 min treatments of 1 mg ml−1 of LiBH4 (STREM Chemicals) prepared in deionized H2O to bleach all fluorophores except Hoechst.

    IBEX image acquisition and alignment

    Representative sections from different tissues were acquired using the inverted Leica TCS SP8 X confocal microscope with a ×40 objective (NA 1.3), 4 HyD and 1 PMT detectors, a white-light laser that produces a continuous spectral output between 470 and 670 nm as well as 405, 685 and 730 nm lasers. Panels consisted of antibodies conjugated to the following fluorophores and dyes: Hoechst, Alexa Fluor (AF)488, FITC, AF532, phycoerythrin (PE), eF570, AF555, iFluor 594, AF647, eF660, AF680 and AF700. All images were captured at an 8-bit depth, with a line average of 3 and 1024 × 1024 format with the following pixel dimensions: x (0.284 μm), y (0.284 μm) and z (1 μm). Images were tiled and merged using the LAS X Navigator software (LAS X v.3.5.7.23225). For IBEX tissue imaging, multiple tissue sections were examined before selecting a representative tissue section that contained several distinct lobules with multiple functional units, often resulting in an unusually shaped region of interest. Fluorophore emission was collected on separate detectors with sequential laser excitation of compatible fluorophores (3–4 per sequential) used to minimize spectral spillover. The Channel Dye Separation module within LAS X v.3.5.7.23225 (Leica Microsystems) was then used to correct for any residual spillover. For publication-quality images, Gaussian filters, brightness/contrast adjustments and channel masks were applied uniformly to all images. Image alignment of all IBEX panels was performed as described previously using SimpleITK55,56. Additional details on antibodies, protocols and software can be found on the IBEX Knowledge-Base (https://doi.org/10.5281/zenodo.7693279).

    IBEX 3D single nuclei segmentation with cellpose

    IBEX images were converted from .ims format (Imaris, Oxford Instruments, v.9.5.0 and v.10.0.0) to 3D stacks (TIFF) by individual channels with FIJI v.1.54j. We then applied a custom pipeline for 3D single nuclei segmentation with cellpose v.2.1.1:

    1. (1)

      Image preparation: individual TIFF images were separated by z plane and channel and the sample channel metadata were extracted as a .csv file. The nuclear staining channel (Hoechst) was located, and a set of tiled 3D image arrays was generated.

    2. (2)

      Segmentation: the image arrays were sequentially segmented with cellpose using specific parameters, for example, diameter, resampling, anisotropy, thresholds and batch size. We used tiles to overcome restrictions on GPU and RAM resources.

    3. (3)

      Restitching and formatting: after segmentation, tiles were stitched back together, and all segmentation masks were stored in a uint32-bit image format to hold more than 65,535 mask labels, as a typical sample produces 200,000–700,000 cell masks.

    4. (4)

      Handling high nuclear density: the high nuclear density, especially in the thymic cortex, produced scenarios in which segmentation masks were often bordered by neighbouring cells, resulting in substantial inter-cell signal bleed. To overcome this issue, we excluded pixels in the interface between two cells such that the mask boundary was preserved if no neighbouring cell was present.

    5. (5)

      Mask filtering: we removed small cell masks to avoid noise and cell fragments using skimage.morphology.remove_small_objects.

    6. (6)

      Signal intensity extraction: for each cell mask post-filtering, we extracted the mean and maximum signal intensity for each channel in the IBEX image. This produced a cellXprotein file for each sample.

    7. (7)

      Data merging: once all sample single-cell segmentations were collected, we merged all samples (n = 8) and stored metadata, channel and spatial information in a unified AnnData object, totalling more than 1.1 million cells.

    Label transfer to IBEX cells from scRNA-seq and CITE-seq reference atlas with ISS patcher

    We used a KNN algorithm to compare the annotated cell types in our scRNA-seq reference atlas with the IBEX single-cell segmentations. For this purpose, the protein expression in the IBEX query cells was matched with the RNA expression of the corresponding genes in the scRNA-seq reference. Protein and gene names were matched according to Supplementary Table 11. Batch effects were removed from IBEX with scanpy.pp.combat(ibex_gene, key=‘sample’, inplace=True). We next subsetted each IBEX sample and ran the KNN prediction algorithm per sample with k = 30, including the following steps:

    1. (1)

      The shared feature space between the two objects was identified, log-normalized and z-scored on a per-object basis.

    2. (2)

      The KNN of the low-dimensional observations (IBEX) in the high-dimensional space (GEX) were identified. The counts of absent features were imputed as the mean of the high-dimensional neighbours.

    3. (3)

      On the basis of majority voting, the most frequent cell annotation in the GEX reference was assigned to the IBEX query cell. The proportion of KNNs that contributed to the majority voting was recorded as the evidence fraction (KNNf). For example, if out of the 30 nearest neighbours, 13 were labelled as A, 10 as B and 7 as C in GEX, then the IBEX cell received the label A. The fraction in this case refers to the proportion of the 30 nearest neighbours that contributed to the majority label, which would be KNNf = 13/30 = 0.43 for label A.

    In some of our IBEX samples AIRE staining in particular produced a relatively high level of non-specific signal and low signal-to-noise ratio. As a consequence, we identify some predicted AIRE+ mTECII cells that are not in the medulla and have capsular/cortical localization (Fig. 4g). We had cases of individual samples for which a specific antibody did not perform well or was missing (for example, for IBEX_01 the KRT10 antibody was out of stock). However, as ISS patcher was run on each sample separately, this did not affect proper scaling of that marker and its use of it for mapping in other samples. Moreover, by selecting cells with a higher proportion of matched KNN cells from the single-cell data (KKNf) these effects are reconciled through the removal of cells with low-confidence mapping. We would like to flag this point for future researchers who want to reuse our datasets.

    The general code for the ISS patcher KNN mapping algorithm can be found in the dedicated GitHub repository (https://github.com/Teichlab/iss_patcher/tree/main) and the full example for KNN mapping using IBEX is reported in the GitHub repository for this Article (see Code availability).

    To annotate T cell types in the IBEX data with high accuracy, we applied the ISS patcher with the CITE-seq T lineage data as a reference as follows: we first applied the KNN algorithm to IBEX data using the scRNA-seq reference atlas to identify and subset IBEX T lineage cells. We then used the CITE-seq data as a reference to repeat the KNN-based annotation on these selected cells. This KNN-based reannotation was performed on a hybrid RNA/protein reference, which included protein measurements for the 19 markers assayed in both CITE-seq and IBEX in addition to RNA measurements for the remaining 23 genes as described in Supplementary Table 11. We used the same KNN implementation as described above but with k = 7, while also imputing CD4 and CD8 pseudotime.

    Human thymus FFPE processing

    Human paediatric samples were obtained from cardiac corrective surgeries. Removed thymi were directly moved to HypoThermosol (Sigma-Aldrich, H4416-100ML), shipped with a courier with ice packs and processed in under 24 h after surgery. After arrival, for FFPE processing, the samples were cut to approximately 1 cm2 pieces with sharp scissors in 1× DPBS. Tissue pieces were then rinsed in clean DPBS to remove any excess HypoThermosol, patted dry with wipes (Kimtech) and placed into 10% formalin (cellpath, BAF-6000-08A) for 16–24 h at room temperature. The next day, tissues were dehydrated and embedded in wax, then kept at 4 °C.

    RareCyte immunostaining and 14-plex imaging

    Multiplex immunofluorescence and single round imaging was performed as described previously57. All steps were performed at room temperature unless stated otherwise. In brief, FFPE blocks were sectioned using a microtome (Leica, RM2235) at 3.5–5 µm thickness and placed onto a superfrost slide (Thermo Fisher Scientific, 12312148). Slides were dried at 60 °C for 60 min to ensure that tissue sections had adhered to the slides. Tissue sections were deparaffinized and subjected to antigen retrieval using the BioGenex EZ-Retriever system (95 °C for 5 min followed by 107 °C 5 min). For OCT sections, 7 µm sections were taken using a cryostat (Leica CM3050S), placed onto SuperFrost Plus slides (VWR) and immediately submerged in 10% buffered saline formalin (Cellpath, BAF-6000-08A) for 1 h at room temperature. The samples were then subjected to the following steps similarly to the FFPE samples. To remove autofluorescence, slides were bleached with AF quench buffer (4.5% H2O2, 24 mM NaOH in PBS). The slides were quenched for 60 min using the ‘high’ setting with a strong white-light exposure followed by further quenching for 30 min using the 365 nm ‘high’ setting using a UV transilluminator. The slides were rinsed with 1× PBS and incubated in 300 µl of Image-iT FX Signal Enhancer (Thermo Fisher Scientific, I36933) for 15 min. The slides were rinsed again and 300 µl of labelled primary antibody staining cocktail was added to the tissue, which was subsequently incubated for 120 min in the dark within a humidity tray. All antibodies were prediluted according to company recommendations and were not adjusted further (Supplementary Table 4). The slides were washed with a surfactant wash buffer and 300 µl of nuclear staining in goat diluent was added to the slide. The slides were then incubated in the dark for 30 min in a humidity tray. The slides were then washed and placed in 1× PBS. Finally the slides were coverslipped using ArgoFluor mount medium and left in the dark at room temperature overnight to dry. The slides were imaged the next day using the RareCyte Orion microscope with a ×20 objective and relevant acquisition settings were applied using the software Artemis v.4.

    RNAscope processing and imaging

    For RNAscope analysis, thymus tissue was processed as described above for Visium sectioning. Sections were cut from the fresh frozen OCT-embedded (OCT, Leica) samples at a thickness of 10 μm using a cryostat (Leica, CM3050S) and placed onto SuperFrost Plus slides (VWR). Sections were stored at −80 °C until staining. The sections were removed from the −80 °C storage and submerged in chilled (4 °C) 4% PFA for 15 min, then acclimatized to room temperature 4% PFA over 120 min. The sections were then briefly washed in 1× PBS to remove any remaining OCT. Then, the sections were dehydrated in a series of 50%, 70%, 100% and 100% ethanol (5 min each) and air-dried before performing automated 4-plex RNAscope.

    Using the automated Leica BOND RX, RNAscope staining was performed on the fresh frozen sections using the RNAscope LS multiplex fluorescent Reagent Kit v2 Assay and RNAscope LS 4-Plex Ancillary Kit for LS Multiplex Fluorescent (Advanced Cell Diagnostics (ACD), Bio-Techne) according to the manufacturer’s instructions. All of the sections were subjected to 15 min of protease III treatment before staining protocols were performed. Before running RNAscope probe panels, the RNA quality of fresh frozen samples was assessed using multiplex positive (RNAscope LS 2.5 4-plex Positive Control Probe, ACD Bio-Techne, 321808) and negative (RNAscope 4-plex LS Multiplex Negative Control Probe, ACD Bio-Techne, 321838) controls.

    The probes were labelled using Opal 520, 570 and 650 fluorophores (Akoya Biosciences, 1:1,000) and one probe channel was labelled using Atto 425-streptavidin fluorophore (Sigma-Aldrich, 1:500), which was first incubated with TSA–biotin (Akoya Biosciences, 1:400). The following RNAscope 2.5 LS probes were used for this study: Hs-AIRE (ACD Bio-Techne, 551248), Hs-LY75-C2 (ACD Bio-Techne, 481438-C2), Hs-CAMP-C3 (ACD Bio-Techne, 446248-C3), Hs-EPCAM-C4 (ACD Bio-Techne, 310288-C4), Hs-IGFBP6-C1 (ACD Bio-Techne, 496068) and Hs-DLK2-C3 (ACD Bio-Techne, 425088-C3). All nuclei were DAPI stained (Life Technologies, D1306). Details are provided in Supplementary Table 12.

    Confocal imaging was performed on the Perkin Elmer Operetta CLS High Content Analysis System using a ×20 (NA 0.16, 0.299 μm px−1) water-immersion objective with 9-11 z-stacks with 2 µm step. Channels: DAPI (excitation, 355–385 nm; emission, 430–500 nm), Atto 425 (excitation, 435–460 nm; emission, 470–515 nm), Opal 520 (excitation, 460–490 nm; emission, 500–550 nm), Opal 570 (excitation, 530–560 nm; emission, 570–620 nm), Opal 650 (excitation, 615–645 nm; emission, 655–760 nm). Confocal image stacks were stitched as individual z stacks using proprietary Acapella scripts provided by Perkin Elmer, and visualized using OMERO Plus (Glencoe Software).

    The contrast used for Extended Data Fig. 6e was as follows: DLK2 (magenta 150–500), IGFBP6 (yellow 200–1500), LY75 (green 200–4000), EPCAM (red 300–2500).

    CITE-seq tissue processing

    Paediatric thymus samples from children undergoing cardiac surgery were obtained according to and used with the approval of the Medical Ethical Commission of Ghent University Hospital, Belgium (EC/2019-0826) through the haematopoietic cell biobank (EC-Bio/1-2018). Thymus tissue was cut into small pieces using scalpels and digested with 1.6 mg ml−1 collagenase (Gibco, 17104-019) in IMDM medium for 30 min at 37 °C with regular agitation to generate a single-cell suspension. The reaction was quenched with 10% FBS and the thymocyte suspension was passed through a 70 μm strainer to remove undigested tissue. Cells were frozen in FBS containing 10% DMSO and stored in liquid nitrogen until needed.

    CITE-seq antibody preparation

    The TotalSeq-C Human Universal Cocktail 1.0 (BioLegend, 399905) was resuspended according to the manufacturer’s instructions. In brief, the lyophilized cocktail was equilibrated to room temperature for 5 min and then centrifuged at 10,000g for 30 s. Then, 27.5 µl cell staining buffer (BioLegend, 420201) was added and the tube was vortexed for 10 s, incubated for 5 min at room temperature and vortexed again. The resuspended cocktail was centrifuged for 30 s at room temperature at 10,000g and the entire volume was transferred to a low-bind tube. Finally, the tube was centrifuged again at 14,000g for 10 min at 4 °C and 25 µl of the supernatant was used per sample (2 × 106 cells in 200 µl).

    In total, 13 TotalSeq-C antibodies (BioLegend) were titrated individually by flow cytometry using PE-conjugated versions of the same antibody clone as recommended by BioLegend. After choosing a suitable concentration for each antibody, a master mix was prepared for cell staining. For this, all antibodies were initially diluted in the cell staining buffer to obtain a concentration 100-fold higher than the desired final staining concentration. Then, 2 µl of each diluted antibody substock was combined in a master mix, which was added to the cells for labelling in a total volume of 200 µl. Details on the TotalSeq-C antibodies are provided in Supplementary Table 6.

    CITE-seq sample preparation

    Cells were thawed slowly by gradually adding 15 volumes of pre-warmed IMDM medium and pelleted at 1,700 rpm for 6 min at 4 °C. After resuspending in PBS, cells were passed through a 70 μm strainer to remove clumps. Enrichment for viable cells was achieved using a magnetic bead-based dead cell removal kit (Miltenyi, 130-090-101). For this, cells were pelleted as before, washed with 1× binding buffer (part of the kit, prepared with sterile distilled water) and resuspended in dead cell removal microbeads (part of the kit) at a concentration of 107 cells per 100 µl beads. After incubation at room temperature for 15 min, cells were applied to an LS column (Miltenyi, 130-122-729), which was prerinsed with 3 ml 1× binding buffer. The column was washed four times with 3 ml binding buffer and the flow-through containing viable cells was collected. Cells in the flow-through were pelleted and viability was confirmed using trypan blue. A total of 2 × 106 viable cells was used for TotalSeq-C and anti-CD3-PE antibody staining. For this purpose, cells were washed with cell staining buffer (BioLegend, 420201), pelleted at 600g for 10 min at 4 °C and resuspended in 90 µl cell staining buffer. Then, 10 µl Human TruStain FcX blocking solution (BioLegend, 422301) was added and cells were incubated for 10 min at 4 °C. The TotalSeq-C Human Universal Cocktail 1.0 (BioLegend, 399905) was resuspended as described above, centrifuged at 14,000g for 10 min at 4 °C and 25 µl of the supernatant was added to the blocked cells. Individual TotalSeq-C antibodies were prepared as described above and 26 µl of the master mix was added to each sample. To facilitate enrichment of immature and mature thymocytes by FACS, 10 µl anti-CD3-PE (SK7, BioLegend, 344805) was added and the samples were topped up with 40 µl cell staining buffer resulting in a total staining volume of 200 µl. The samples were incubated for 30 min at 4 °C in the dark. To wash off unbound antibodies, cell staining buffer was added to the samples, and cells were pelleted for 10 min at 600g at 4 °C. All supernatant was removed, cells were resuspended in cell staining buffer, transferred to a new tube and pelleted as before. Cells were again resuspended in cell staining buffer and pelleted and this wash step was repeated once more before cells were resuspended in 200 µl MACS buffer (2% FCS, 2 mM EDTA in PBS) in preparation for sorting. Then, 1 µl propidium iodide (Invitrogen, 230111) was added for detection of dead cells and samples were sorted on the BD FACSAria III or BD FACSAria Fusion cell sorter using a 100 μm nozzle and a maximum flow rate of 4 (FACSDiva v.8.0.2, reanalysis with FlowJo v.10.8.2). Cells were gated using forward/side scatter to remove doublets and debris, then dead cells were excluded based on PI staining. CD3 and CD3+ cells were collected separately in cooled IMDM + 50% FCS (Supplementary Fig. 19b). After completion of the sort, collection tubes were topped up with DPBS and cells were pelleted at 400g for 10 min at 4 °C. The supernatant was removed and cells were resuspended at an estimated concentration of 1,500 cells per µl in PBS + 0.04% BSA (Miltenyi, 130-091-376), of which 16.5 µl was used in the GEM generation step. The Next GEM Single Cell 5′ Kit v2 (10x Genomics, 1000265) was used to prepare the reaction master mix, and load cells, gel beads and partitioning oil on a Chip K (10x Genomics, 1000286) according to the manufacturer’s protocol CG000330 Rev A. GEMs were generated using a Chromium Controller (10x Genomics), transferred to a tube strip and reverse transcription was performed in a BioRad C1000 Touch Thermal Cycler according to the protocol. The samples were stored at 4 °C overnight and the library preparation was carried out the next day.

    CITE-seq library preparation and sequencing

    Feature barcode (FB), gene expression (GEX) and TCR libraries were prepared according to protocol CG000330 Rev A (10x Genomics) using the Chromium Next GEM Single Cell 5′GEM Kit v2 (10x Genomics, 1000244), Library Construction Kit (10x Genomics, 1000190), 5′ Feature Barcode Kit (10x Genomics, 1000256), Human TCR Amplification Kit (10x Genomics, 1000252), Dual Index Kit TT set A (10x Genomics, 1000215) and Dual Index Kit TN set A (10x Genomics, 1000250). The protocol version for >6,000 cells was followed and libraries were amplified for 13 cycles (cDNA), 14 cycles (GEX), 8 cycles (FB) or 12 + 10 cycles (TCR libraries). Library quality and quantity were checked on the Bioanalyzer instrument (Agilent) using a high-sensitivity DNA assay. Libraries were pooled and sequenced on the NovaSeq 6000 instrument (Illumina) to a minimum of 25,000 reads per cell for GEX, 10,000 reads per cell for FB and 5,000 reads per cell for TCR libraries.

    CITE-seq quality control and denoising

    CITE-seq data were processed using the R packages Seurat58 (v.4.3.0), SeuratObject (v.4.1.4), SeuratDisk (v.0.0.0.9021), SingleCellExperiment (v.1.24.0), Matrix (v.1.6-4), matrixStats (v.1.2.0), dplyr (v.1.1.4), tidyr (v.1.3.1), reshape2 (v.1.4.4), BiocNeighbors (v.1.20.2), BiocParallel (v.1.36.0), stringr (V.1.5.1), reticulate (v.1.35.0) and sceasy (v.0.0.7). Data were visualized using ggplot2 (v.3.5.0), ggrastr (v.1.0.2), ggridges (v.0.5.6) and RColorBrewer (v.1.1-3).

    Fastq files from FB libraries were mapped with Cell Ranger v.7.0.0. GEX libraries were mapped with STARsolo as described above for scRNA-seq data. CITE-seq data were first subjected to quality-control processing based on RNA properties as described above. For the retained high-quality cells, ADT data were then denoised using dsb (v.1.0.3)59. For this purpose, empty droplets were identified in the unfiltered mapping output and used as a reference for estimating noise and antibody background levels. Approximately 1.4 million droplets were selected on the basis of RNA count < 240 reads, ADT count between 120 and 350 reads, and <5% mitochondrial reads to ensure that damaged cells were not included in the subset. During denoising and normalization with DSBNormalizeProtein, ADT data from droplets were used for background correction and the seven isotype control antibodies included in the TotalSeq-C Human Universal Cocktail were used to determine technical variation. Data were scaled based on the background by subtracting the mean and then dividing by the standard deviation of the empty droplets. Negative values after denoising, which correspond to very low expression, were set to zero to improve interpretation and visualization. In an additional quality-control step, cells in which less than 100 of the antibodies were detected were removed from the dataset as technical artefact. Moreover, a subset of cells affected by aggregates of the antibodies against TCRγδ, CD199 (CCR9), CD370, CD357 and XCR1 was removed from the dataset due to unreliable surface-staining properties.

    CITE-seq annotation

    Annotation of the CITE-seq data was carried out on integrated RNA and denoised ADT modalities. For this purpose, both data modalities were log-normalized and the top 2,000 highly variable genes were identified, followed by PCA on the scaled highly variable genes using the standard functions in the Seurat package. MNN correction was applied to the PCA loadings matrix using the reducedMNN function from the Batchelor package60 (v.1.18.1) to reduce batch and donor effects between samples. To integrate both modalities, multimodal neighbours and modality weights were identified and a weighted nearest neighbours (WNN) graph58 was constructed using FindMultimodalNeighbors based on the PCA. The number of PCs to be used was determined based on the difference in variation of consecutive PCs being higher than 0.1%. A UMAP visualization was generated based on the WNN graph to represent the weighted combination of both modalities. Furthermore, a supervised PCA (sPCA) was performed on transcriptome data using the RunSPCA function of the Seurat package to obtain dimensionality reduction for the RNA modality that represents the structure of the WNN graph58.

    To identify cell types and developmental stages, we performed Leiden clustering at a low resolution based on the sPCA using the Seurat functions FindNeighbors and FindClusters. The obtained clusters were then subsetted and analysed individually starting with the most immature cluster, which was identified based on high expression of the surface marker CD34. For each subset, normalization and scaling for RNA and ADT data were repeated as described above and a new WNN UMAP and sPCA were constructed. A combination of Leiden clustering on the sPCA and thresholding of protein levels was used to identify cell types. Moreover, to identify proliferating cells, cell cycle scoring was performed using the G2M and S phase markers supplied in the Seurat package. The FindMarkers function (Seurat) and the package singleCellHaystack61 (v.1.0.0) were used to identify differentially expressed genes and surface markers in a cluster-based and cluster-independent manner, respectively. Distinction of CD4 and CD8 lineage maturation stages was based on CD45RA, CD45RO and CD1a and identical expression cut-offs were used for both lineages to ensure that subsets would be directly comparable.

    Paired TCR-seq data were processed with Dandelion47 v.0.3.1 and information about productive or non-productive TRA and TRB rearrangements was extracted for each cell to validate cell type annotations after clustering.

    CITE-seq pseudotime analysis

    To carry out trajectory inference for αβT lineage cells, CITE-seq data were subsetted to contain DP_pos_sel, DP_4hi8lo, SP_CD4_immature, SP_CD4_semi-mature, SP_CD4_mature, SP_CD8_immature, SP_CD8_semi-mature and SP_CD8_mature cells. A new WNN UMAP was constructed based on surface protein and RNA. Slingshot62 (v.2.6.0) was used to establish a minimum spanning tree on the WNN UMAP using the getLineages function based on mutual nearest neighbour-based distance with DP_pos_sel set as start point and SP_CD4_mature and SP_CD8_mature specified as end points. Smooth lineages were obtained using the getCurves function and the derived pseudotime orderings were used to assess transcript and surface marker expression throughout differentiation.

    CITE-seq cell2location mapping, integration and processing

    For the focused multimodal analysis, we subsetted the full scRNA-seq dataset to include only paediatric data and further removed T lineage cells without CITE-seq protein information. We then performed cell2location as described above to obtain deconvolved cell type mapping based on CITE-seq annotations. In addition, as cell2location deconvolves spots by unique pseudobulk expression profiles from a single-cell reference based on discrete cell annotations, the mapping resolution is limited to that of the annotated cell subsets. To investigate the continuous spatiotemporal nature of CD4/CD8 cell lineages at increased resolution, we performed high-resolution Leiden clustering on the CITE-seq data using scanpy (resolution=35), resulting in 567 cell clusters. These cell clusters were then mapped to our paediatric Visium data with cell2location as described above. To measure the position of each cell cluster across the CMA, we selected Visium spots with the highest cluster label abundance (above percentile 95%) and calculated the weighted mean CMA values for these spots. This value was then assigned this to the cells comprising the associated cluster in the single-cell object. Further details are provided in the GitHub repository for this Article (see Code availability).

    Reporting summary

    Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.

    [ad_2]

    Source link

  • Fate induction in CD8 CAR T cells through asymmetric cell division

    [ad_1]

    Lentiviral constructs and production, in vitro transcription, cell lines and cell culture

    All cells in this study were cultured in RPMI 1640 supplemented with 10% fetal bovine serum (FBS), 10 mM HEPES and 1% penicillin/streptomycin at 37 °C in fully humidified environment with 5% CO2 unless otherwise indicated. Cell lines were evaluated for mycoplasma contamination.

    We used single chain variable fragment (scFv)-based CARs against human CD19 (clone FMC63) and the human TCR δ chain (clone 5A6.E9) with an appended N-terminal pentaglycine tag in a third generation lentiviral backbone (Extended Data Fig. 1). Both CAR constructs used scFvs in the light-chain-heavy-chain configuration followed by a CD8α hinge and transmembrane domain and CD137 and CD3ζ cytoplasmic domains. Furthermore, CARs were cloned with an IgG4 hinge instead of CD8α hinge domains (Extended Data Fig. 6c) with standard cloning methods.

    To create a human CD19-sortase construct, we cloned human CD19 (C-terminal truncation) into a third generation lentiviral backbone that provided an IgH signal peptide followed by a flag-tag, low-affinity sortase mutant15 and a linker peptide. Similarly, we cloned a γδ TCR (clone PEER59) into this vector (both chains as a single transcript separated by a P2A site). Constructs were obtained as double-stranded DNA (dsDNA) fragments from IDT, digested and ligated into the lentiviral backbones using standard cloning techniques. We cloned extra TCR chains into lentiviral backbones (CD3γ-P2A-CD3δ and CD3ζ-P2A-CD3ε plasmids) to facilitate expression of the γδ TCR in non-T cell lines. Nucleotide and amino acid sequences of all constructs used in this study are listed in Supplementary Table 1. Lentivirus was produced as previously described using Lenti-X 293T cells60,61.

    For in vitro messenger RNA (mRNA) transcription, CAR constructs were cloned into a pDrive.150 poly(A) backbone62 using standard cloning techniques and linear mRNA was transcribed using the T7 mScript Standard mRNA Production System (Cellscript). Linear dsDNA templates were generated by digesting with either ClaI or SpeI, and mRNA was synthesized following the manufacturer’s recommendations for a Cap 1-mRNA and roughly 150 base-long poly(A)-tail. mRNA was purified with an RNeasy Mini Kit (Qiagen), eluted in RNase-free water at 1 μg μl−1 and aliquoted and stored at −80 °C.

    To disrupt the endogenous CD19 locus in Nalm6 cells (provided by M. Milone, originally obtained from DSMZ) and to create a cell line only expressing CD19-sortase or γδ TCR-sortase, two guide RNAs targeting the human CD19 locus were obtained (IDT, sequences in Supplementary Table 2) and 5 × 106 Nalm6 cells were electroporated with a total of 50 pM ribonucleoprotein (consisting of Cas9 (IDT) and single-guide RNA (sgRNA)) in a total volume of 20 μl of Lonza P3 buffer (P3 primary cell 4D-Nucleofector X kit S) with a Lonza 4D-Nucleofector Core Unit (pulse protocol EO115) according to the manufacturer’s protocol. After CD19 disruption, Nalm6 cells were cultured at 0.2–1 × 106 cells per ml in standard medium for 14 days before sorting CD19 negative cells by FACS (BD Biosciences AriaII). Initial disruption efficiency was greater than 90%, which increased to more than 99% after sorting. CD19 negative Nalm6 cells were transduced with target proteins (CD3γ, CD3δ, CD3ζ, CD3ε, γδ TCR-sortase or CD19-sortase) and positive cells were enriched by FACS.

    All target cells in this study express green fluorescent protein (GFP)-click beetle green luciferase. Target cells were sorted on a regular basis to ensure persistence of the luciferase and surface antigen expression over several passages.

    Bulk and CD8 CART production

    Bulk or CD8 only primary human T cells from healthy donors were obtained from the University of Pennsylvania Human Immunology Core, stimulated with anti-CD3/anti-CD28 beads (Dynabeads, Thermo Fisher) for 24 h before transduction with lentiviral CAR constructs. Anti-CD3/anti-CD28 magnetic beads were removed on day 4 after transduction and the IL-2 concentration was gradually lowered from 100 to 25 IU ml−1 by day 8 after activation and 0 IU ml−1 by day 10 after activation. Cell medium replacement and quantification of cell size and number (Coulter Multisizer 4e) occurred every 2–3 days. CAR transduction efficiency was determined by flow using a polyclonal antimurine Fab antibody conjugated to biotin (Jackson ImmunoResearch) and streptavidin-PE. Non-transduced T cells from the same donor were stained under identical conditions and used as negative control.

    To generate naive or effector CARTs, naive and effector T cells were isolated from bulk primary human T cells and were electroporated with mRNA encoding CARs. Naive T cells were isolated either with the Naive Pan T Cell Isolation Kit (Miltenyi Biotec, catalogue no. 130-097-095) or with a positive selection of CD62L and subsequent negative selection for CD45RA+ cells. For the latter approach, cells were stained with anti-CD62L-PE (BioLegend, DREG-56, catalogue no. 304840) and enriched with the anti-PE MultiSort kit (Miltenyi Biotec, catalogue no. 130-090-757) and LS column (Miltenyi Biotec, catalogue no. 130-042-401), with the flowthrough reserved for the isolation of effector T cells described below. CD62L+ cells were flushed out and separated from MultiSort MicroBeads using the MultiSort Release Reagent and centrifugation. CD45RA+CD62L+ cells were subsequently isolated by negative isolation using CD45RO MicroBeads (Miltenyi Biotec, catalogue no. 130-046-001) and two columns (Miltenyi LS). To isolate effector T cells of the same donor as the naive T cells, flowthrough from the first CD62L selection was added to the column (Miltenyi LD) for negative selection of CD62L cells. More than 95% population purity (determined by flow cytometry) was used in the presented studies. Following isolation, naive and effector cells were electroporated with 10 μg mRNA/1 × 107 T cells encoding the CARs using Lonza 4D-Nucleofector Core Unit (pulse code EH115) according to the manufacturer’s protocol.

    To disrupt the endogenous TCR, T cells were cotransduced with lentiviral CAR constructs and pCAT003, a lentivirus transfer plasmid encoding sgRNA targeting TRAC and gift from J. Doudna (Addgene plasmid no. 171628)63. Immediately following debeading, up to 4 × 106 T cells were electroporated with 50 pM of Cas9 as described above with the modification of pulse code EO115. TCR cells were negatively selected using CD3 MicroBeads (Miltenyi Biotec, catalogue no. 130-097-043) and LD column according to the manufacturer’s protocol.

    To genetically disrupt IKZF1, 1 × 106 T cells were electroporated immediately following debeading with 50 pM Cas9 and 100 nM guide RNA (IDT, Supplementary Table 2) as described above, with the modification of pulse code EH115. Genomic DNA was isolated using DNeasy Blood & Tissue Kit (Qiagen, catalogue no. 69504) according to the manufacturer’s protocol. The targeted IKZF1 locus was amplified using indicated primers (Supplementary Table 3). Quantification of genetic editing efficiencies were estimated using tracking of indels by decomposition64. Western blots were performed and stained using rabbit antiIkaros (IKZF1) monoclonal antibody (Cell Signaling, 9034S; dilution 1:1,000), digital antirabbit-HRP (Kindle Biosciences, LLC, R1006; dilution 1:1,000), mouse anti-β-actin monoclonal antibody (Cell Signaling, 3700S; dilution 1:3,000) and digital antimouse-HRP (Kindle Biosciences, LLC, R1005; dilution 1:3,000). Uncropped images of blots are provided in Supplementary Fig. 1. Pharmacologic depletion of IKZF1 was performed using 0.1 μM lenalidomide (MedChemExpress, catalogue no. HY-A0003).

    LIPSTIC assay

    Biotinylated LPETG peptide (biotin-aminohexanoic acidLPETGS, C-terminal amide, 95% purity)15 was purchased from LifeTein (custom synthesis), reconstituted in PBS at 10 mM and stored at −80 °C.

    To label target cells, Nalm6 cells (expressing sortase-tethered target molecules) were incubated with biotinylated LPETG peptide (100 μM, LifeTein) for 30 min at 37 °C in RPMI/10%FBS, followed by washing three times to remove excess soluble peptide. Sortase-bound LPETG was then labelled with fluorescent streptavidin (PE, AF647 or APC; 10 μg ml−1; BioLegend) for 30 min at 37 °C. Cells were then washed three times and resuspended at 1 × 106 cells per ml.

    All LIPSTIC assays were performed using fully rested T cells that had not demonstrated cell number increases in roughly 2 days. For the CARTs in the presented studies, the transduction efficiencies were between 20 and 85%, and the cell sizes of rested T cells were between 200 and 260 fl, which was achieved 12–15 days after activation. T cells were stained with CellTrace Violet following the manufacturer’s recommendations with the following modifications: T cells were stained at a concentration of 1 × 107 cells per ml for 10 min at 37 °C. Target cells and CARTs were mixed in a six-well plate well in a total volume of 5.5–7 ml, and CART to target ratios ranged from 0.3:1 to 4.25:1. Cells were incubated for 72 h before cell sorting (BD Biosciences AriaII) and subsequent analysis of first-division daughter cells.

    A second target encounter LIPSTIC assay was performed using sorted first-division LPETG+ or LPETG cells. Target cells labelled with a second colour fluorescent streptavidin and CARTs were mixed in a 96-well plate in a total volume of 200 μl and a 1:1 CART:target ratio. Second-division daughter cells were sorted 24 h after coincubation.

    Sorting gates were established for live single cells that were negative for GFP (excluding target cells), positive for Cell Trace Violet (CTV) and positive or negative for LPETG. LPETG positivity was determined relative to untransduced T cells, CARTs incubated without target cells or irrelevant CARTs incubated with target cells (threshold for LPETG positivity was generally the same for all controls).

    Multiparametric flow cytometry analysis of T cells

    Unless otherwise specified, antibodies were purchased from BioLegend.

    In vitro and in vivo LIPSTIC assay populations were sorted and subsequently phenotyped by staining with 1:200 CD8-APCH7 (SK1, BD Biosciences, catalogue no. 561423), 5:400 CD4-BUV805 (SK3, BD Biosciences, catalogue no. 612887), 1:160 CD45RA-BUV395 (HI100, BD Biosciences, catalogue no. 740298), 5:400 CD45RO (UCHL1, catalogue no. 304234), 1:40 CD25-BV711 (M-A251, catalogue no. 356138) and/or 1:250 CD62L-PE (DREG-56, catalogue no. 304840).

    For in vivo studies, samples were stained with CD8-APC/Cy5.5 (RFT8, SouthernBiotech, catalogue no. 9536-18), CD4-PE/Cy5.5 (RFT4, SouthernBiotech, catalogue no. 9522-16), CD3-BV605 (OKT3, catalogue no. 317322), CD19-APC (HIB19, catalogue no. 302212), CD45RA-BUV395 (HI100, catalogue no. 740298), CD45RO-BV785 (UCHL1, catalogue no. 304234), CD62L-PE (DREG-56, catalogue no. 304840), TCR-alpha/beta-BV421 (IP26, catalogue no. 306722) and CD45-PECy7 (QA17A19, catalogue no. 393408) at 1:100 dilution. Whole blood was stained in Trucount tubes (BD Biosciences) and fixed with FacsLyse solution (BD Biosciences) according to the manufacturer’s recommendations. Single-cell suspensions from spleen samples were produced by homogenization of the tissue through a 70 μm mesh followed by treatment with Red Blood Cell Lysis Buffer (BioLegend) according to the manufacturer’s recommendations and stained in PBS, 1% FBS and cell numbers were quantified with CountBright Plus Absolute Counting Beads (Thermo Fisher).

    Samples were analysed on an LSRII, LSR Fortessa or FACSymphony A3 Cell Analyzer (BD Biosciences). The population of interest was gated based on forward- versus side-scatter characteristics followed by singlet gating. Data were analysed with FlowJo v.10 (Tree Star). Graphs and statistical analyses were generated using GraphPad Prism v.9.4.0.

    Live-cell microscopy

    For live-cell imaging to capture CARTs undergoing the first cell division, LPETG-positive CARTs before the first cell division were isolated using fluorescence activated cell sorting after 48 h of coincubation with target cells and then imaged in a humidified incubation chamber at 37 °C in 5% CO2 on a Zeiss Observer 7 equipped with a Zeiss Axiocam 702 monochrome CMOS camera, a Zeiss Axiocam 503 colour CCD camera and a Colibri 7 LED light source in Definite Focus mode in a 35 mm glass bottom dish (Ibidi) every 3 min.

    To image the transfer of LPETG peptide from target to CART cells, CTV-labelled T cells were incubated with LPETG peptide-loaded target cells at an effector to target cell ratio of 1:5. The excess of target cells in this context increases the frequency of observing the interaction between CARTs and targets. Cells were placed in a 35 mm glass bottom dish (Ibidi) and analysed in a humidified incubation chamber at 37 °C in 5% CO2. Photographs were acquired in the GFP, CTV and AF647 channels in Definite Focus mode on a Zeiss Observer 7 every 45 s for 50 min. Images were acquired with ×40 objective using Zen (Blue edition) software (v.2.5, Zeiss). Videos were created with Fiji-ImageJ.

    Metabolic analysis

    T cell metabolic profiles were assessed using the Seahorse mitochondrial stress test (Agilent Technologies). Individual wells of an XF96 cell-culture microplate were coated with CellTak as per the manufacturer’s instructions. The matrix was adsorbed overnight at 37 °C, aspirated, air-dried and stored at 4 °C until use. Mitochondrial function was assessed on day 0 or day 1 after sorting proximal or distal or undivided cells. T cells were resuspended in non-buffered RPMI 1640 medium containing 5.5 mM glucose, 2 mM l-glutamine and 1 mM sodium pyruvate and seeded at 1.5 × 105 cells per well. The microplate was centrifuged at 1,000g for 5 min and incubated in standard culture conditions for 60 min. During instrument calibration (30 min), the cells were switched to a CO2-free 37 °C incubator. XF96 assay cartridges were calibrated according to the manufacturer’s instructions. Cellular OCRs and ECARs were measured under basal conditions and following treatment with 1.5 μM oligomycin, 1.5 μM FCCP and 40 nM rotenone, with 1 μM antimycin A (XF Cell Mito Stress kit, Agilent). OCR/ECAR ratios are calculated using the mean OCR and ECAR of 3–5 replicates for each population.

    In vivo studies

    Immunodeficient NSG (NOD.Cg-Prkdcscid Il2rgtm1Wjl/SzJ) mice were bred in house under an approved Institutional Animal Care and Use Committee (IACUC) protocol and maintained under pathogen-free conditions. To facilitate engraftment of T cells, bulk (CD4+ and CD8+) T cells were used in in vivo studies19. Sample sizes were not predetermined based on statistical methods but were chosen based on preliminary data and previously published results. For all in vivo experiments, treatment groups were randomly selected by the cage number. In vivo injections were performed in a blinded fashion by a member of the Ellebrecht or Payne laboratories or a staff member of the Human Stem Cell and Xenograft Core of the University of Pennsylvania.

    For in vivo LIPSTIC, LPETG-labelled target cells were injected intraperitoneally immediately followed by CTV-labelled CARTs. A 1:1 target:CART ratio was maintained, with a total of 1 × 107 total T cells injected. Mice were euthanized 2 days following injection. Cells were collected using peritoneal wash three times using 5 ml of ice-cold 2% FBS and PBS. First-division daughter cells were sorted and assessed by flow cytometry as described above.

    For functional longevity studies, 2.5 × 105 anti-CD19 CAR proximal- or distal-daughter cells, non-activated resting anti-CD19 CARTs or non-transduced T cells were intravenously injected into NSG mice on day 0. After 35 days, mice were challenged with 1 × 106 Nalm6 cells. Leukaemia burden was determined by bioluminescence imaging. Bioluminescence was quantified with an IVIS Lumina III (PerkinElmer) 2–3 times per week after Nalm6 injection as previously described3.

    In the stress-test model, NSG mice were injected with 1 × 106 Nalm6 cells on day 0. Engraftment of Nalm6 was confirmed on day 3 by bioluminescence imaging. On day 4, mice were treated with 2.5 × 105 proximal or distal-daughter anti-CD19 CARTs; 2.5 × 106 non-activated resting anti-CD19 CARTs or non-transduced T cells; or 1.3 × 106 bulk restimulated anti-CD19 CARTs (unsorted) by intravenous injection. Leukaemia burden was determined with bioluminescence imaging as above. Mice were euthanized when they had reached a total bioluminescence flux of at least 1 × 1011 photons per second, demonstrating loss of leukaemia control.

    Peripheral blood was obtained by retro-orbital bleeding. Mice were euthanized for organ harvest according to local IACUC guidelines, and spleen and blood samples were assessed by flow cytometry as described above. In accordance with the approved IACUC protocol for these studies, humane endpoints for euthanizing mice were established and not exceeded in this study. Mice were monitored at least twice weekly for symptoms. If severe lethargy or weakness, hunching, emaciated body condition (body condition score of 1 out of 5) or loss of 20% or more body weight, were observed, mice were euthanized. If a body condition score of 2 out of 5 was observed and accompanied by lethargy, mice were euthanized. Furthermore, mice were euthanized when their total bioluminescence flux exceeded 1 × 1011. All studies involving animals were performed under a protocol approved by the University of Pennsylvania IACUC.

    Luciferase-based in vitro cytotoxicity assay

    Cytotoxicity assays were performed either on day 1 or 4 after first cell division as previously described3. Click beetle green luciferase-expressing target Nalm6 cells were cocultured with proximal, distal or resting CARTs or donor-matched non-transduced T cells at indicated E:T ratios. To test TCR-mediated cytotoxicity of mRNA-electroporated naive and effector CARTs, K562 cells positive for CD64 (FcγRI) were incubated with 100 μg ml−1 anti-human CD3 (OKT3, Invitrogen, catalogue no. 16-0037-85) for 30 min on ice and washed twice were used as target cells.

    Single-cell multiomic analysis

    First-division proximal, first-division distal, activated-undivided and resting CD8 CARTs (1.5 × 105 cells each), sorted as described above from the LIPSTIC assay, were separately incubated in flow cytometry staining buffer (BioLegend) with a custom TotalSeq-C antibody cocktail (Supplementary Table 4, BioLegend 900000114, lot B311489) in 100 μl for 30 min at 4 °C before washing three times. Cell concentration was adjusted to 1.5 × 106 live cells per ml. 10,000 live CD8 T cells from each LIPSTIC population were loaded onto NextGem K chips (10X Genomics) and processed in a 10X Chromium device according to the manufacturer’s recommendations. A biological replicate was performed with CART from a separate donor. Library preparation was performed according to the 10 × 5′ V2 protocol for antibody-derived tags (ADT), gene expression (GEX) and paired alpha and beta TCR chains. Complementary DNA and subsequent library intermediates were checked for correct size, appropriate quantity and quality with a DNA high-sensitivity kit on a Bioanalyzer 2100 (Agilent). Libraries were sequenced in paired-end dual-index mode for 150 × 2 cycles on a NovaSeq 6000 sequencer (Illumina, one lane of a S4 cartridge). All cells in each experiment were sorted and stained on the same day and libraries were processed in parallel and sequenced in the same lane to minimize batch effects. Counts for demultiplexed GEX, ADT and TCR libraries were obtained with the STAR method of the Cell Ranger multi pipeline (10X Genomics, Cell Ranger v.6.1.2) using the human GENCODE v.32/Ensembl 98 GRCh38 reference (detailed version by 10X Genomics: Human (GRCh38) 2020-A, Human (GRCh38) v.5.0.0), which then were aggregated with the Cell Ranger aggr pipeline with read depth normalization to further reduce batch effects across libraries. Downstream analysis was performed with the Seurat V4 R package22. To remove doublets and low cells, cells with more than 25% mitochondrial gene transcripts, less than 7.5% ribosomal gene transcripts, transcript counts less than 500 or greater than 40,000, or a minimum number of detected genes of less than 500 were excluded. Counts were single-cell transformed using the sctransform V2 and glmGamPoi packages65. Dimensionality reduction was performed based on ADT counts with subsequent analysis of genes and surface proteins of interest and differentially expressed genes or surface proteins for TN-, TCM-, TEM– and TRM-like subsets. RNA velocity analysis was performed by counting spliced and unspliced transcripts in Cell Ranger binary alignment map output files with the velocyto package31 using the same transcriptome reference gene transfer format file (refdata-gex-GRCh38-2020-A) that was used for the initial Cell Ranger run. Output loom files were then used in scvelo after export of TN, TCM, TEM and TRM expression matrices containing proximal and distal first-division daughter cells from Seurat and conversion to SCANPY/ANNDATA objects66. Global velocity vectors and velocities of genes of interest were computed and visualized in stochastic or dynamic mode with scvelo30. Regulon analysis (gene modules co-expressed with transcription factors and with correct cis-regulatory upstream motif for a respective transcription factor) was performed using a list of 1,390 human transcription factors (https://github.com/aertslab/pySCENIC/blob/master/resources/hs_hgnc_curated_tfs.txt) with the Python version of SCENIC (that is, pySCENIC v.0.11.2)32,67 after importing expression matrices from Seurat to SCANPY (v.1.7.2). Gene-set enrichment analysis68,69 was performed on each T cell subsets with the GSEA function of clusterProfiler70 in R. T cell clonal analysis was performed with scRepertoire71 in R using the ‘strict’ setting, which requires identical V, D (if applicable), J and C genes in addition to identical CDR3 nucleotide sequence of both TCR chains to identify T cells belonging to the same clonotype.

    Seurat analysis was performed in R v.4.3.1, velocity analysis was performed in Python v.3.10.4 and regulon analysis was performed in Python v.3.7.12 in accordance with the respective pipeline requirements.

    Statistical analysis

    Statistical significance was determined with two-sided tests unless otherwise indicated. Where appropriate and as indicated, P values were adjusted for multiple testing (Benjamini–Hochberg). Q values were calculated with clusterprofiler in R. Whenever individual data points are presented, a horizontal line represents the mean value of the group. Survival in in vivo experiments was defined as time until the predetermined bioluminescence threshold was reached. Kaplan–Meier statistical analysis was used to compare survival between groups. Unless otherwise indicated, asterisks depict the following significance values: *P < 0.05, **P < 0.01, ***P < 0.001. P values less than 0.05 were considered statistically significant. The mean fluorescence intensity in flow cytometry plots is labelled with a cross in each gate. Statistical analysis was performed with the respective pipelines as mentioned above or with GraphPad Prism v.9.4.0.

    Reporting summary

    Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.

    [ad_2]

    Source link

  • Immune microniches shape intestinal Treg function

    [ad_1]

    Mice

    C57BL/6 J (B6) and Foxp3GFP, Rag1−/−, CD2DsRed, Foxp3CD2Il10GFP, UbPA-GFP and Nur77GFP mice were bred and maintained under specific pathogen free conditions in an accredited animal facility at the University of Oxford. Lta−/− mice were purchased from Jackson Laboratories. HH7-2tg TCR transgenic mice, referred to as TCRHh here, were provided by D. R. Littman. TCRHh mice were bred with CD2DsRed mice and either Foxp3CD2Il10GFP or Nur77GFP mice to generate TCRHhCD2DsRedIl10GFP or TCRHhCD2DsRedNur77GFP, respectively. TCRHh mice were bred to Rag1−/− to generate TCRHhRag1−/− mice. UBPA-GFP mice were bred to CD2DsRed and TCRHh mice to generate CD2DsRedUbPA-GFP and TCRHhCD2DsRedUbPA-GFP, respectively.

    Mice were free of Helicobacter spp. and other known intestinal pathogens, were age- and sex-matched and between 6 and 12 weeks old. Animals were randomly assigned to experimental group, and cages contained mice of all different experimental groups. All experiments were conducted in accordance with the UK Scientific Procedures Act of 1986, and by persons holding a personal license. The project licence governing the mouse studies was reviewed by the University of Oxford’s Animal Welfare and Ethical Review Board and approved by the Home Office of His Majesty’s Government.

    No statistical methods were used to predetermine sample size. Sample sizes were based on previous similarly designed experiments from our research group. The spatial transcriptomics experiment included four mice per group to balance statistical power with cost. For other experiments we aimed for a minimum of five mice per experimental group. Exact mouse numbers for each experiment are included in the figure legends. Mice were assigned to different experimental groups at random. Mice were co-housed and littermate when possible. Each cage contained all treatment conditions. Animal studies were not blinded. Histopathology scoring was conducted by two independent assessors, one of whom was blinded.

    Flow cytometry

    Mouse cells were stained with combinations of the following monoclonal antibodies, all purchased from Biolegend, Invitrogen, or eBioscience: CD4 (RM4-5), TCRβ (H57-957), CD45.1 (A20), CD45.2 (104), CD11c (N418), CD11b (M1/70), anti-human CD2 (TS1/8), CXCR5 (L138D7), PD-1 (J43), FOXP3 (FJK-16s), RORγt (Q31-378), Ki-67 (SolA15). Dead cells were excluded using efluor 780 fixable viability dye (eBioscience). For transcription factor staining, cells were stained with surface markers prior to fixation and permeabilization using the FOXP3 staining buffer kit (eBioscience) according to manufacturer instructions.

    Immunofluorescence staining

    Swiss-rolled caecum tissues were fixed overnight at 4 °C in PLP buffer (1% paraformaldehyde, l-lysine 0.2 M pH 7.4 and 32 mg NaIO4). Then, tissues were dehydrated in 20% sucrose for at least 4 h at 4 °C and embedded in OCT compound (Avantor). Seven-micrometre cryosections were rehydrated, blocked and permeabilized with PBS, 1% goat serum, 1% BSA, 0.3 M glycine, 0.3% Triton X-100 for 1 h at room temperature. Sections were stained with the following antibodies: Alexa Fluor 488 anti-mouse CD172a (SIRPα) (clone P84, 5 μg ml−1 Biolegend), Alexa Fluor647 anti-mouse CD11c (clone N418, 5 μg ml−1 Biolegend) or FITC/Alexa Fluor 594 anti-mouse CD206 (clone C068C2, 5 μg ml−1 Biolegend) and anti-mouse CD64 (clone X54-5/7.1, 4 μg ml−1 Biolegend). Sections were stained overnight at 4 °C. Before imaging, nuclei were counterstained with Hoechst. Images were acquired using Zen Blue software on a ZEISS 980 Airyscan inverted microscope equipped with a motorized stage. Diode laser lines were used for excitation: violet (405 nm), blue (488 nm), yellow (514 nm) and red (639 nm). All images were acquired with a 25× (NA 0.8) LD LCI Plan-Apochromat oil-immersion objective.

    Colon tissue was embedded in OCT (Tissue-Tek) as Swiss rolls and sectioned at 7 μm. Slides were fixed with 3.7% formalin (Merck) and blocked with 10% donkey serum (Sigma Aldrich) and 1% Fc block (eBioscience) in permeabilization buffer (Foxp3/Transcription factor staining buffer set, eBioscience). B220 (RA3-6B2), CD4 (RM4-5), MHC Class II (M5/114.15.2), gp38 (8.1.1), IgD (11-26 c.2a) and BCL6 (IG191E/A8) (all Biolegend) were stained overnight in blocking serum.

    Isolation of lymphocytes from spleen, lymph node and intestinal tissue

    Intestinal tissues were washed twice in RPMI (Sigma Aldrich)/10%FCS/5 mM EDTA at 37 °C with agitation for 25 min to remove epithelial cells. CP and OLS were removed under 40× bright-field microscopy using a scalpel and a 16G needle and syringe, respectively. Remaining colon and caecum tissue, OLS and CP were digested for 40 min at 37 °C with agitation in RPMI/10% FCS/15 mM Hepes with 100 U ml−1 collagenase VIII (Sigma Aldrich) and 20 mg ml−1 DNase I (Sigma Aldrich). Leukocytes from colon and caecum tissue were recovered at the interface of a 40/70% Percoll gradient (Fisher Scientific).

    Spleens and MLNs were mechanically disrupted, and splenic red blood cells were lysed with ACK lysis buffer.

    Peripheral blood was collected by cardiac puncture and red cells were lysed with ACK lysis buffer.

    Hh culture and oral gavage

    Hh NCI-Frederick isolate 1 A (strain 51449) was grown on blood agar plates containing 7.5% laked horse blood (Thermo Scientific) and Skirrow Campylobacter supplement (Oxoid) under microaerophilic conditions at 37 °C with agitation. Cultures were expanded for 48 h in Tryptone Soy Broth (TSB, Fisher) containing 10% FCS (Gibco) and the above antibiotics. The concentration of bacteria was determined by optical density (OD) analysis at 600 nm. Mice were fed 1 × 108 colony-forming units of Hh (equivalent to 1 OD unit) by oral gavage using a curved 22G needle for a total of 2 doses 24 h apart.

    Lymph node lymphocyte egress blocking experiment

    Host mice were treated every 24 h with 1 mg kg−1 of the sphingosine-1-phosphate antagonist, Fingolimod (FTY720, Sigma Aldrich) via intraperitoneal injection at the indicated timepoints after naive TCRHh cell transfer.

    Sorting and adoptive transfer of naive TCRHh T cells

    Naive T cells were isolated from TCRHh mice splenocytes and sorted by flow cytometry as CD45+CD3+CD4+CD44lowCD62LhiVβ6+ (Extended Data Fig. 1d), with up to 2% contamination with nTreg cells. All monoclonal antibodies were purchased from Biolegend or eBioscience: CD3 (145-2C11), CD11b (M1/70), CD11c (N418), B220 (RA3-6B2), CD62L (MEL-14), TCRVβ6 (RR4-7), CD44 (IM7), CD45 (30-F11), CD4 (RM4-5). Sorted cells (2 × 103 or 5 × 104) were injected by intravenous injection into the tail vein for flow cytometric or in vivo live imaging respectively.

    In vitro co-culture

    Bone marrow stem cells were extracted from wild-type mouse femurs and cultured for 7 days in RPMI (Sigma) supplemented with 1% penicillin-streptomycin (Sigma), 10% FCS (Life Technologies), 1% Glutamax (Invitrogen) and 20 ng ml−1 mouse GM-CSF (Peprotech). Bone marrow-derived dendritic cells (BMDCs) were plated at a density of 1 × 104 cells per well overnight. Hh peptide (1 mg ml−1, Genscript) was added 1 h prior to plating 1 × 105 sorted naive TCRHhNur77GFP T cells in RPMI/10% FCS/1% Glutamax/1% penicillin-streptomycin and 50 mM β-mercaptoethanol (Life Technologies). Anti-mouse I-A/I-E antibody was added at 4, 12, 24, 48, 72 and 96 h after the plating of TCRHhNur77GFP T cells.

    Two-photon microscopy

    Mice were anaesthetized with isoflurane, the caecum exposed and immobilized with a suctioning imaging window46. Samples were illuminated with 910 nm <70 fs pulsed light from Mai-Tai laser and collected using a 20× water-dipping lens and the spectral detector of a Zeiss 880 multiphoton microscope (Carl Zeiss). Images were linearly unmixed using the Zen software (Carl Zeiss) to separate autofluorescence, collagen, eGFP, DsRed and Texas-red dextran based on single-colour controls. All in vivo live images of the LA and LP were performed in the caecum due to ease of access.

    Intravital videos were drift-corrected based on mucus or collagen signal. Images were smoothed using a Gaussian filter for display.

    Image analysis

    Intravital microscopy images were analysed using Imaris 9 (Bitplane). Following unmixing, autofluorescence was subtracted from DsRed and GFP channels. GFP+ and DsRed+ cells were marked using the Surface Creation Wizard, and their co-expression and location within LP and LA compartments were recorded.

    Immunofluorescence images were analysed using Imaris 10.0 (Bitplane). LAs were defined based on DAPI staining showing nuclear density and surrounding epithelial morphology, allowing for unbiased region selection. LP was chosen based on location and cellular density based on DAPI stain so that each region of interest collected and analysed for LA contained at least six LP surfaces of matching size. Surfaces for each cell type of interest were created using CD11c+ for BMDCs, CD206+ for CD206+ macrophages using the Surface Creation Wizard, which was applied to all images collected with the same conditions. CD11c+ surfaces were further subdivided based on median SIRPα staining levels within the surface.

    Photo-activation

    CD2DsRedUbPA-GFP hosts were colonized with Hh on day 0. Seven days later 50,000 naive TCRHhCD2DsRedUbPA-GFP cells were transferred into the Hh-colonized CD2DsRedUbPA-GFP hosts so both host and donor cells were photo-activatable. On day 21 the MLN and caecum were removed from 6 mice. The caecum was shaken at 37 C in RPMI + BSA + EDTA for 40 min and 20 min to remove the epithelium. The CP was separated from the rest of the tissue, and the remaining tissue was divided into one third without photo-activation as a negative control for FACS gating, one third for LA photo-activation, and one third for LP photo-activation.

    Samples were maintained in RPMI + BSA+Hepes on ice in the dark for the duration of the experiment with cold media flowed over the tissue during photo-activation. A Zeiss 880 upright multiphoton microscope (Carl Zeiss, Germany) fitted with two tunable lasers (Mai-Tai tuneable BB laser 710–990 nm, pulse width <80 fs and Mai-Tai tuneable 690–1040 nm, pulse width <70 fs) and a 20× water-dipping objective was used for tissue photo-activation. The microscope was set to dynamically unmix GFP, DsRed, and collagen based on pre-collected single-colour controls. Samples were imaged with 910 nm light to identify regions of interest based on CD2DsRed. MLN and CP T cell zones were defined as the densest T cell regions without gaps to exclude B cell zones. LA regions were defined as a cluster of cells with a diameter of at least ten CD2DsRed cells. LP region was defined as a region containing CD2DsRed cells distal to LA. After ROI definition, the second laser was turned on at 740 nm while imaging live. GFP photo-activation was observed dynamically to ensure sufficient photo-activation without toxicity. For each mouse 3–10 regions were photo-activated for each tissue microniche. Each photo-activation region comprised approximately 40,000 µm3 of tissue (70 µm diameter × 10 µm depth).

    After photo-activation, tissues were minced and digested for 30 min in RPMI+Hepes with 100 U ml−1 collagenase VIII (Sigma Aldrich) and 20 mg ml−1 DNase I (Sigma Aldrich). The digested tissue from the six mice was pooled per tissue microniche into one sample. Isolated cells were stained with efluor 780 fixable viability dye (eBioscience). Photo-activated GFP+ cells were sorted using a four laser BD FACSAria III based on gates defined by unactivated samples from the same mice (Extended Data Fig. 5b).

    Single-cell RNA library construction and sequencing

    For scRNA-seq experiments, the Chromium Single Cell 5′ version 2 reagent kit and Chromium Single Cell Mouse TCR Amplification Kit (10x Genomics) were used. Sorted cells were loaded onto each channel of the Chromium Chip K following the manufacturer’s instructions and the chip was inserted in the Chromium Controller for droplet encapsulation. cDNA synthesis, amplification, gene expression (GEX) and targeted TCR was performed on single cells, according to the manufacturer’s protocol (CG000331). Sequencing was performed on the Illumina Novaseq 6000 system. Gene expression libraries were sequenced at a targeted depth of 50,000 reads per cell, using the following parameters: Read1: 28 bp i7:10 bp, i5: 10 bp, Read2: 98 bp. TCR libraries were pooled at a ratio of 1:10 with the GEX libraries and sequenced at a target depth of 5,000 reads per cell.

    scRNA-seq analysis

    Pre-processing of 10x Genomics scRNA-seq and scTCR-seq data

    scRNA-seq raw sequencing data were processed using the CellRanger “multi” software (version 6.1.1, 10x Genomics) with the mm10 2020-A mouse reference genome (official 10X mouse pre-built reference). Single-cell TCR-sequencing (scTCR-seq) data were aligned and quantified using the CellRanger ‘multi’ software (v.6.6.1) and the reference vdj_GRCm38_alts_ensembl-5.0.0 was used with default settings.

    Quality control and processing of scRNA-seq data

    Data pre-processing was performed using the ScanPy workflow (v. 1.8.1)47. ScanPy (v. 1.7.1), Anndata (v. 0.7.5), Pandas (v.1.2.3), NumPy (v.1.20.1), and Python (v.3) were used to pool single-cell counts and conduct downstream analysis. For each run, SoupX algorithm48 was run with default parameters to remove ambient mRNA from the count matrix. Doublet detection was performed using the Scrublet algorithm (https://github.com/AllonKleinLab/scrublet49) with percolation step, as previously described50. Additional doublet exclusion was performed throughout downstream processing based on unexpected co-expression of canonical markers, such as Cd3d (T cells) and Cd19 (B cells). Cells with fewer than 1,000 unique molecular identifier (UMI) counts, fewer than 600 detected genes and more than 15% mitochondrial reads were excluded from downstream analysis. Genes were filtered out for expression in less than three cells. Gene expression for each cell was normalized (scanpy.pp.normalize_total, scaling factor 10,000) and log-transformed (scanpy.pp.log1p). Downstream analyses included variable gene detection (scanpy.pp.highly_variable_genes) and data feature scaling (sc.pp.scale). Cell cycle score was calculated using the expression of the cell cycle genes in Supplementary Table 1. Cell cycle score, UMI counts, the percentage of mitochondrial reads, and the percentage of Ig reads (calculated based on the genes provided in Supplementary Table 1) were regressed out during scaling the data. Dimensionality reduction (scanpy.tl.pca, based on highly variable genes) and Leiden-graph-based clustering (scanpy.tl.leiden, with clustering resolution manually adjusted, 0.3–1.5) were carried out. Cell lineages were annotated on the basimarker gene expression for each cluster (sc.tl.rank_genes_groups, method = ‘wilcoxon’).

    Cell-type annotation with CellTypist

    CellTypist is a cell-type database, server and pipeline for automatic annotation of scRNA-seq developed at Teichman lab (https://github.com/Teichlab/celltypist, https://www.celltypist.org). To assemble a mouse intestinal reference dataset, scRNA-seq data were collected from eight publications covering different cell lineages from small and large intestine as well as one additional dataset of sorted B cells from spleen, to cover in detail the annotation of germinal centre B cell populations (Supplementary Data 2).

    For each dataset, the raw count matrix was downloaded along with the accompanying cell meta information. After removing trivial cell types annotated by the original studies (for example, ‘doublets’ and ‘unresolved’), a total of 171,271 cells were obtained representing all major cell populations in the mouse gut. Cell-type names from different datasets were next standardized to achieve a common scheme of nomenclature. Specifically, the similarity of transcriptomes between each pair of cell types across datasets was examined and the two cell types were merged only if they corresponded to a single cell type (for example, ‘enteroendocrine cell’ from the Tabula Muris was renamed to ‘enteroendocrine’ as was used in Biton et al.30). Finally, after in-depth inspection, 126 cell types were harmonized from the eight datasets. A CellTypist model then was created based on logistic regression classifiers, as described in detailed48. The model is publicly available at https://celltypist.cog.sanger.ac.uk/models/Mouse_Gut_Casado/v2/Adult_Mouse_Gut.pkl. Cell identities were predicted using the resulting model, followed by manual curation. Cells from each lineage were further subclustered and Leiden clustering was repeated for fine-grained annotation of the cell types and states. A description of cell-type annotations for each lineage is provided in Extended Data Fig. 5c and Supplementary Data 3 and 4. The differentially expressed genes for the cell types in each lineage can be found in the Supplementary Table 1.

    For prediction on cycling regulatory T cells (prolif. Treg cells), eTreg and cTreg cells were used as a training reference. The model was built applying default parameters, and prediction was performed without majority voting.

    scTCR-seq downstream analysis

    The Python package scirpy (v. 0.12.2)51 was used to extract the V(D)J sequence information from the CellRanger output file filtered_contig_annotations.csv. Productive TCRαβ chains were determined using the scirpy.tl.chain_qc function, and cells without V(D)J data or with two pairs of productive TCRαβ chains were removed from the analysis. Clonotypes were defined with the function scirpy.tl.define_clonotypes based on the CDR3 nucleotide sequence identity and the V-gene usage for any of the TCR chains (either VJ or V(D)J need to match). For cells with dual TCRα or TCRβ chains, any of the primary or secondary receptor chain matching was considered for the clonotype definition. Clonotype networks were constructed using the pp.ir_dist function to compute distances between CDR3 nucleotide sequences (using identity as a metric) and tl.define_clonotype_clusters function to designate the clonotype clusters, removing the clonotypes with less than two cells. The TCR metadata were combined with the transcriptome data for downstream analysis and comparison of different T cell populations. Hh-specific TCR data were retrieved from Xu et al.15, and TCR sequences were obtained from https://www.ncbi.nlm.nih.gov/nuccore and mapped using the IMGT/V-QUEST alignment tool52. Hh.7-2 transgenic TCR (tgTCR) clonotypes were identified by expression of TRAV9-1/TRBV19 gene segments and the TCRβ CDR3 amino acid sequence, including those clonotypes with missing TCRα chain.

    RNA velocity analysis

    RNA velocity analysis53 was performed using the scVelo (v.0.2.4) package [10]. RNA velocity was estimated by distinguishing unspliced and spliced mRNAs using the velocyto package (v.0.17) (https://velocyto.org/velocyto.py/54). Data were subclustered on Treg cell, filtering out the subsets with fewer than ten cells per gut region (that is, excluding eTregs_MLN, eTregs_CP, cTregs_LA, cTregs_LP and Prolif-Tregs_CP). The dataset was then merged with the velocyto output (merged loom files) and pre-processed for detection of minimum number of counts, filtering and normalization (scvelo.pp.filter_and_normalize). The functions scvelo.pp.moments, scvelo.tl.velocity and scvelo.tl.velocity_graph were used to compute velocities using the stochastic mode in scVelo. The function scvelo.pl.velocity_embedding_stream was used to project the velocity information onto the UMAP. To test which genes have cluster-specific differential velocity expression and visualize selected genes, the functions scvelo.tl.rank_velocity_genes and scvelo.pl.velocity were applied. Velocity pseudotime was calculate with the function scvelo.tl.velocity_pseudotime.

    Cell-type scoring

    A list of mouse genes involved in the MHCII complex (Supplementary Table 2) was used for surface MHCII scoring. Cells were scored using the scanpy.tl.score_genes function according to the expression values of all genes.

    Cell–cell communication analysis

    The CellPhoneDB55,56 Python package (v.3 .0) was used to infer putative cell–cell interactions. The scRNA-seq dataset was split by gut region and cell types with <30 cells in a given region were filtered out. Human-mouse orthologue genes were retrieved using the biomaRt package57, and only one-to-one orthologous genes were considered. CellPhoneDB was applied on the normalized raw counts and fine cell-type annotations of myeloid, T cells and ILCs from LA and LP (separately for each gut region), using default parameters. To identify the most relevant interactions, specific interactions of Treg cells with myeloid cells and T cells/ILCs were selected and filtered for the ligand–receptor pairs that were significant (P ≤ 0.01) and ‘curated’. The selected interactions were plotted as expression of both ligands and receptors in relevant cell types. The ktplots R package (https://github.com/zktuong/ktplots/tree/plot_cpdb3; https://doi.org/10.5281/zenodo.5717923) was used to visualize the significant interactions per cell-type pair using a chord diagram.

    Fresh frozen Visium sample preparation

    Caecum and proximal colon tissue from Hh-infected and Hh/anti-IL10R treated mice were removed, cut longitudinally and cleaned of stool with cold phosphate buffered saline (PBS). The tissue was positioned luminal side up, and rolled into a Swiss roll from the caecum to the proximal colon. The tissue was placed into a histology plastic cassette and snap frozen for 1 min in dry-ice-cooled isopentane. The frozen tissue was embedded in OCT on dry ice and stored at −80 °C. The samples were selected based on tissue morphology and orientation (H&E-stained sections) and RNA integrity number, obtained using High sensitivity RNA ScreenTape system, (Agilent 2200 TapeStation). OCT blocks were sectioned at 10 μm thickness in a −20 °C cryostat (Leica CX3050S) at 10 μm, and transferred onto a 6 mm2 capture area on a Visium 10X Genomics slide. Visium spatial tissue optimization was performed, and an optimal permeabilization time of 24 min was selected. The Visium slides were processed according to manufacturer’s instructions, before fixing and staining H&E for imaging. H&E-stained slides were imaged at 40× on Hamamatsu NanoZoomer S60. After transcript capture, sequencing libraries were prepared according to the 10X Genomics Visium Spatial Transcriptomic protocol and sequenced on the Illumina Novaseq 600 system.

    Visium spatial transcriptomics data analysis

    10x Genomics Visium sequencing data processing

    After sequencing, 10x Genomics Visium spatial samples were aligned to the mouse transcriptome mm10 2020-A reference (as the scRNA-seq samples) using 10x Genomics SpaceRanger version 2.0.0. and exonic reads were used to produce mRNA count matrices for each sample. SpaceRanger was also used to align paired histology images with mRNA capture spot positions on the Visium slides. A custom image-processing pipeline was used for alignment of Visium slides and identification of the spots contained in the tissue, as described in ref. 58. Spots with fewer than 500 UMI counts, and more than 15% mitochondrial genes were removed from the analysis. Data from different samples were concatenated and SCVI was used for batch correction59.

    Spatial mapping of cell types using cell2location

    To spatially map intestinal cell types defined by single-cell transcriptomics in the Visium data we used cell2location41. First, to obtain a complete single-cell reference of cell types and cell states in the mouse intestine we integrated our NICHE-seq data with 3 publicly available datasets of intestinal epithelial cells30, immune cells15 and enteric nervous system27 (Fig. 5a). Redundant cell annotations across different datasets were harmonized and curated manually. This scRNA-seq reference (untransformed and unnormalized mRNA counts) was then used in the cell2location pipeline, as described in detail previously43. In brief, reference signatures of cell states (63 cell populations) were estimated using a negative binomial regression model provided in the cell2ocation package. The inferred reference cell-state signatures were used for cell2location mapping that estimates the abundance of each cell state in each Visium spot by decomposing spot mRNA counts. The cell2location spatial mapping was done separately for Hh and Hh/anti-IL10R sections. The paired H&E images were used to determine the average number of cells per spot in the tissue (set to 30) and used as a hyperparameter in the cell2location pipeline. Cell-state proportions in each Visium spot were calculated based on the estimated cell-state abundances.

    Two methods were used to identify cellular microenvironments in the tissue: manual annotation and conventional NMF analysis. Regions for manual annotation were defined based on H and E images. LA were defined by cellular density, whereas LP regions included histologically distinct areas both proximal and distal to the LA. NMF, implemented in the cell2location pipeline, was performed on cell abundance results by cell2location on each condition separately (Hh and Hh/anti-IL10R). The NMF model was trained for a range of factors and tissue zones (number of factors: n_fact) N = {5,…,30} and the decomposition into 18 factors was selected as a balance between segmenting relevant tissue zones (muscle compartment, lymphoid structures, lymphatics) and over-separating known zones into several distinct factors (Extended Data Fig. 8c).

    Cell-state spatial enrichment analysis

    Spots containing lymphoid aggregates and adjacent LP were manually annotated using the paired histology images of the spatial data in the 10x Genomics Loupe software. Cell-state proportions per spot were calculated based on the estimated abundances from cell2location and cell-state enrichments (odds ratio) in each manually annotated region were calculated as described60. In brief, the odds of target cell-state proportions were divided by the odds of the other cell-state proportions. Odds of cell proportions were calculated as the ratio of cell proportion in the spots of a structure of interest to that in the other spots. Statistical significance was obtained by chi-square analysis (scipy.stats.chi2_contingency) and the P value was corrected with the Benjamini–Hochberg method.

    Spatial co-occurrence analysis of cell types

    We quantified the degree of co-occurrence between cell types on the basis of their proportions inferred by cell2location. Specifically, since each cell type had an estimated abundance distribution across spatial spots, we calculated the Pearson correlation coefficient for any two given cell types as their co-occurrence rate. This calculation was conducted for each sample separately. Next, the log2 ratio between four Hh-only (control) samples and four Hh/anti-IL10R samples was defined as their fold change in co-occurrence rate and the significance (that is, P value) was assessed by a two-sided Student’s t-test.

    Spatial ligand–receptor analysis

    Ligand–receptor analysis on Visium data was performed using the Cell2location cell-type abundances and the stLearn package61 (https://github.com/BiomedicalMachineLearning/stLearn). In short, the connectomeDB2020_lit database for mouse was used as a reference of candidate ligand–receptor pairs. The st.tl.cci.run function was used to calculate the significant spots of ligand–receptor interactions within spot mode (distance = None), filtering out any ligand–receptor pairs with no scores for less than 20 spots, and using 10,000 random pairs (n_pairs). P values were corrected with the st.tl.cci.adj_pvals function using false discovery rate, Benjamini–Hochberg (adj_method = ‘fdr_bh’) adjusting by the number of spots tested per ligand–receptor pair (correct_axis = ‘spot’). Spot P values were displayed for particular ligand–receptor pairs (‘Ccl8_Ccr2’ and ‘Cxcl9_Cxcr3’) in the spatial context using the function st.pl.lr_result_plot.

    Statistical analysis

    Statistical analysis was performed using Prism 8 (GraphPad). t-Tests were used to compare two unpaired samples. For more than two groups, the ordinary one-way ANOVA was used. No samples were excluded from analysis. Mean with standard deviation shown unless otherwise indicated. Differences were considered statistically significant when P ≤ 0.05. Significance is indicated as *P ≤ 0.05, **P ≤ 0.01, ***P ≤ 0.001 and ****P ≤ 0.0001. Technical replicates were processed and analyses on the same day. Biological replicates are from independent experiments.

    Reporting summary

    Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.

    [ad_2]

    Source link