Bacterial strains and growth conditions
Cultivation of E. coli EC100 (Lucigen), E. coli K-12 MG165542, E. coli K-12 BW2511343 and all other E. coli strains used in this study were carried out in lysogeny broth (LB) at 37 °C with shaking. Overnight cultures were inoculated from single bacterial colonies. Wherever applicable, media were supplemented with chloramphenicol at 12.5 μg ml−1 (for cosmids) or 25 μg ml−1 (for plasmids), spectinomycin at 50 μg ml−1, kanamycin at 50 μg ml−1, ampicillin or carbenicillin at 100 μg ml−1, and/or tetracycline at 5 μg ml−1 to ensure cosmid or plasmid maintenance. E. coli Keio knockout strains were obtained from the Coli Genetic Stock Center at Yale University44. The type I-E CRISPR interference strain E. coli K-12 MG1655 ACT-01 was a gift from C. A. Voight45. Miniprepped plasmids (prepared by QIAprep Spin Miniprep Kit, QIAGEN, 27106) were cloned into chemically competent E. coli EC100 cells (Lucigen), electrocompetent E. coli EC100 cells (Lucigen) or rubidium chloride (RbCl2) chemically competent E. coli K-12 MG1655 cells. For E. coli K-12 BW25113 and Keio knockout strains, protein purification strains, and strains with two plasmid combinations, existing strains were first made electrocompetent and then transformed with plasmid through electroporation (1 mm Bio-Rad Gene Pulser cuvette at 1.8 kV). The list of strains used in this study are available in Supplementary Data File 2.
Plasmid construction
For plasmid construction, refer to Supplementary Data File 2.
Gibson assembly
For Gibson assemblies46, 25–100 ng of the largest dsDNA fragment was combined with equimolar volumes of the smaller fragment(s) in a total volume of 5 μl in nuclease-free water. Reaction mixtures were prepared on ice and mixed with 15 μl of Gibson assembly master mix, pipette mixed and incubated at 50 °C for 1 h in a thermal cycler. Gibson reactions were transformed into chemically competent E. coli EC100 cells (Lucigen) or RbCl2 chemically competent E. coli K-12 MG1655 cells by mixing 5 μl of Gibson reaction with 50 μl cells and following a standard transformation protocol for chemically competent cells.
Oligonucleotide cloning
Oligonucleotide cloning was used to create a repeat-spacer-repeat CRISPR array with a desired spacer following a previously described protocol47. In brief, we used a BsaI restriction digest cloning approach. Parent type II-A CRISPR array-containing plasmids with a repeat-spacer-repeat carried a 30 bp spacer sequence with two BsaI cut sites at either end (pCas9)47. To set up the BsaI plasmid digest, we mixed 42 μl of the parent CRISPR plasmid (40–60 ng μl−1) with 6 μl BsaI-HF (NEB, R3535L), 6 μl NEB CutSmart buffer and 6 μl nuclease-free water. The restriction digest reaction was incubated at 37 °C for approximately 6 h. Two IDT oligonucleotides comprised the type II-A CRISPR spacer to be inserted into the BsaI cut plasmid CRISPR array: a ‘top’ strand oligonucleotide with sequence 5′-AAAC-(30 bp spacer)-G-3′ and a ‘bottom’ strand oligonucleotide with sequence 5′-AAAAC-(30 bp spacer reverse complement)−3′. For oligonucleotide cloning of type I-E spacers into pACYC184-TypeIEspcNT, the top strand oligonucleotide had sequence 5′-ACCG-(32 bp spacer)−3′ and the bottom strand oligonucleotide had sequence 5′-ACTC-(32 bp spacer reverse complement)−3′. The two oligonucleotides were phosphorylated with T4 polynucleotide kinase (NEB, M0201S) in a 50 μl reaction: 1.5 μl 100 μM top oligonucleotide, 1.5 μl 100 μM bottom oligonucleotide, 41 μl nuclease-free water, 5 μl T4 DNA ligase reaction buffer (NEB, B0202S), 1 μl T4 polynucleotide kinase (NEB, M0201S). The reaction was incubated at 37 °C for 1 h in a thermal cycler. After phosphorylation, oligonucleotides were annealed by adding 2.5 μl of 1 M sodium chloride (Fisher Scientific, S271-3) solution to the 50 μl reaction and incubating for 5 min at 98 °C and then allowing the reaction to gradually cool to room temperature (approximately 2 h). The annealed oligonucleotides were diluted 1:10 in nuclease-free water and ligated into the BsaI-digested plasmid in a 20 μl reaction: 10 μl BsaI-digested plasmid, 6 μl nuclease-free water, 1 μl 1:10 diluted annealed oligonucleotides, 5 μl T4 DNA ligase reaction buffer (NEB, B0202S), 1 μl T4 DNA ligase (NEB, M0202M). The ligation reaction was performed at room temperature overnight. The next day, 5 μl of the ligation reaction was transformed into 50 μl of chemically competent E. coli EC100 cells (Lucigen) or electrocompetent ACT-01 cells following dialysis, and colonies were confirmed by PCR the next day. The list of CRISPR spacers generated by BsaI cloning and used in this study are available in Supplementary Data File 2.
Strain construction
λ Red recombineering was used to generate the E. coli K-12 BW25113 ΔxthA Δnfo strain. An overnight culture of the E. coli K-12 BW25113 Δnfo Keio strain carrying the pAM38(red) plasmid with chloramphenicol resistance was diluted and grown to OD600 ~ 0.3 and then induced with 0.2% l-arabinose until OD600 ~ 1–1.2. Cells were made electrocompetent by washing twice with cold water and electroporated (1 mm Bio-Rad Gene Pulser cuvette at 1.8 kV) with a PCR product carrying a xthA:tetR gene replacement matching the xthA:kanR gene replacement found in the E. coli K-12 BW25113 ΔxthA Keio strain, with ~50 bp homology upstream and downstream of the xthA locus in the PCR product. After ~2 h of recovery, cells were plated on LB agar plates with kanamycin at 50 μg ml−1 and tetracycline at 5 μg ml−1 to select for double mutants. Double knockouts were confirmed by PCR. After confirmation, strains were grown overnight in LB with kanamycin at 50 μg ml−1 and tetracycline at 5 μg ml−1 (but no chloramphenicol which selects for the plasmid) and with 0.2% l-arabinose induction. Without antibiotic selection, induced plasmid was rapidly lost due to toxicity from λ Red overexpression. Strains were frozen at −80 °C (900 μl culture + 100 μl DMSO) and struck out on appropriate antibiotic plates to confirm both double knockouts and loss of the recombineering plasmid.
Preparation of phage stocks
λvir, T4 and T7 were gifts from B. Levin. T2, T3, T5 and T6 phages were purchased from ATCC. Phages were first grown up in 10 ml cultures of exponentially growing E. coli K-12 MG1655 or EC100 cells at OD600 ~ 0.3. The phage-added cultures were incubated at 37 °C with shaking overnight. Tubes were then spun down at 15,000g for 10 min at 4 °C. Phage-containing supernatants were filtered using Acrodisc 13 mm SUPOR 0.45 μm syringe filters (Pall, 4604) into 15 ml conical tubes and supernatants frozen down as phage stocks at −80 °C (900 μl filtered supernatant + 100 μl DMSO). To grow up a phage stock for plaquing assays and other experiments, a pipette tip was used to scrape off a tiny portion of a frozen phage stock, which was then resuspended in 20 μl LB medium. Serial dilutions were prepared from the resuspended phage and spotted on a fresh LB top agar (LB broth Lennox base, 0.5% agar) lawn of E. coli EC100 in LB agar. The plate was incubated at 37 °C overnight after drying at room temperature for 25 min. The next day a single phage plaque was picked from the top agar lawn using a P20 pipette set to 15 μl and resuspended in a 10 ml culture of exponentially growing E. coli EC100 at OD600 ~ 0.3. The phage-added culture was incubated at 37 °C with shaking overnight. The tube was spun down the next day at 15,000g for 10 min at 4 °C. The phage-containing supernatant was filtered using an Acrodisc 13 mm SUPOR 0.45 μm syringe filter (Pall, 4604) into a 15 ml conical tube. All final phage stocks were titred on top agar lawns of E. coli EC100 and stored at 4 °C.
To grow phage stocks of Brig1 escaper phages, single plaques formed by T4 or T6 phages on lawns of pBrig1-carrying EC100 cells were picked using a P20 pipette and resuspended in 20 μl LB medium. Serial dilutions were prepared from the resuspended phage and spotted on a fresh LB top agar lawn of E. coli EC100 carrying pBrig1 to maintain selection of the escaper phage. The plate was incubated at 37 °C overnight after drying at room temperature for 25 min. The next day a single phage plaque was picked from the top agar lawn using a P20 pipette set to 15 μl and resuspended in a 10 ml culture of exponentially growing OD600 ~ 0.3 E. coli EC100 carrying pBrig1 for continued selection. The phage-added culture was incubated at 37 °C with shaking overnight and filtered the next day as described earlier to generate the escaper phage stock. Final phage stocks were titred on top agar lawns of E. coli EC100 and stored at 4 °C. The list of phages used in this study are available in Supplementary Data File 2.
Generation of mutant phage stocks
T4 and T6 phage stocks were used to construct T4 Δa-gt, T4 Δb-gt, T4 Δalc ΔdenB Δgp56, T4(C) and T6 Δba-gt mutant phage stocks. In each case, a culture of E. coli EC100 cells carrying a recombinant pUT18C-based plasmid was grown overnight at 37 °C with shaking in 10 ml LB supplemented with 100 μg ml−1 carbenicillin. The pUT18C plasmid contained a cloned segment of phage T4 or T6 DNA with the desired gene deleted and ~750–1000 bp homology arms flanking the deleted genic region on either side. The overnight culture was diluted 1:50 in 10 ml LB medium supplemented with 100 μg ml−1 carbenicillin. After approximately 1 h of culture growth, OD600 was measured for the culture and confirmed to be between 0.2 and 0.4. The 10 ml culture was then infected with 2 μl of T4 or T6 phage stock and grown overnight at 37 °C with shaking to allow wild-type phages to recombine with the plasmid. The next day, the tube was spun down at 15,000g for 10 min at 4 °C. The phage-containing supernatant was filtered using an Acrodisc 13 mm SUPOR 0.45 μm syringe filter (Pall, 4604) into a 15 ml conical tube.
Serial dilutions of recombinant phage were prepared and spotted on a fresh top agar lawn of E. coli EC100 containing a pCas9 plasmid in LB agar supplemented with 25 μg ml−1 chloramphenicol. The pCas9 plasmid carried a type II-A CRISPR spacer targeting the phage gene that was deleted to select specifically for recombinant phage with the desired deletion. Top agar plates were incubated at 37 °C overnight after drying at room temperature for 25 min. The next day multiple phage plaques were picked from the top agar lawn using a P20 pipette set to 15 μl and resuspended in 20 μl LB medium. Five microlitres of the resuspend phage plaques were boiled in 15 μl colony lysis buffer48 at 98 °C for 15 min and then PCR checked to confirm that the desired gene was deleted, either with the deletion carried on the pUT18C recombinant plasmid or a de novo CRISPR-generated deletion that eliminated the appropriate gene. Serial dilutions were prepared for 1–2 correct phage plaques, which were then replaqued onto top agar lawns of pCas9 selection strains and incubated overnight at 37 °C for stringent selection. The next day, a single phage plaque was picked from the top agar lawn using a P20 pipette set to 15 μl and pipetted directly into an OD600 ~ 0.2–0.4 exponentially growing culture that maintained the same selection for the mutant phage. The phage-infected culture was grown overnight at 37 °C with shaking. The next day, the tube was spun down at 15,000g for 10 min at 4 °C. The phage-containing supernatant was filtered using an Acrodisc 13 mm SUPOR 0.45 μm syringe filter (Pall, 4604) into a 15 ml conical tube. In some cases, an arabinose-inducible type I-E CRISPR–Cas expressing E. coli MG1655 strain, ACT-01, with a pACYC184-based plasmid expressing an arabinose-inducible type I-E CRISPR spacer was used to select for the recombinant phage. In these instances, 0.2% l-arabinose was included in all media for proper phage selection through type I-E CRISPR–Cas targeting. To make the T4 Δa-gt phage, instead of CRISPR selection, E. coli EC100/pBrig1 was used to select for the pUT18C-recombined phage. PCR and Sanger sequencing confirmed the desired in-frame deletion of a-gt in the mutant phage, matching the exact deletion carried on the pUT18C-da-gt recombination plasmid.
To make the T4(+β-GT) phage, which is T4 phage carrying a higher-than-normal fraction of β-glucosyl-hmC nucleobases, wild-type T4 was passaged through E. coli EC100 carrying the plasmid p(b-gt), which overexpresses T4 β-GT under 1 mM isopropyl-β-d-thiogalactopyranoside (IPTG) induction. An overnight culture of E. coli EC100/p(b-gt) was diluted 1:50 in 10 ml LB medium supplemented with 50 μg ml−1 spectinomycin and 1 mM IPTG. After approximately 1 h 15 min of culture growth, OD600 was measured for the culture and confirmed to be between 0.2-0.4. The 10 ml culture was then infected with 2 μl of wild-type T4 phage stock and grown overnight at 37 °C with shaking. The next day, the tube was spun down at 15,000g for 10 min at 4 °C. The phage-containing supernatant was filtered using an Acrodisc 13 mm SUPOR 0.45 μm syringe filter (Pall, 4604) into a 15 ml conical tube.
Please refer to Supplementary Data File 2 for the plasmids used to generate each mutant phage. All final phage stocks were titred on top agar lawns of E. coli EC100 and stored at 4 °C. The list of phages used in this study are also available in Supplementary Data File 2.
Plaque assays and efficiency of plaquing analysis
Overnight cultures were launched from single colonies in 3 ml of LB medium supplemented with appropriate antibiotic(s). Top agar lawns of E. coli were prepared by mixing 100 μl of overnight culture with 6 ml of LB top agar (LB broth Lennox base, 0.5% agar) supplemented with appropriate antibiotic(s). Top agar mixtures were poured over LB agar in 10 cm plates supplemented with appropriate antibiotic(s). Where necessary, 0.2% l-arabinose was included in the overnight media as well as in the LB top agar and the LB agar plate. Plates were dried at room temperature, partially open by a sterilizing flame, for 25 min for the top agar to solidify. Serial dilutions of phage stock were prepared and spotted on the top agar after drying. For imaging of plaque assays, 2.5 μl of each phage dilution was spotted on top agar using a multichannel pipette. For quantification of phage titres, efficiency of plaquing, and isolation of single phage plaques for phage DNA sequencing, 3–3.5 μl of each phage dilution was spotted on top agar using a multichannel pipette and the plate was tilted to allow phage spots to drip down the plate for easier quantification and isolation of single plaques. In all cases, plates were incubated at 37 °C overnight after drying at room temperature for 25 min or until the plates were completely dry. Overnight plaque assays were imaged the next day (~16–24 h after infection) using the FluorChem HD2 system (ProteinSimple). Plaque assay images were all auto-contrasted using Adobe Photoshop to give clearer images. In some cases, image brightness was enhanced further using Adobe Photoshop for better visualization of phage spots. Plaque assays with BASEL phages reported in Extended Data Fig. 9e were performed in larger 15 cm plates of LB agar supplemented with 12.5 μg ml−1 chloramphenicol, to allow for plaquing of up to ten different phages on a single lawn. Here, the protocol was performed exactly as above, except with scaled up volumes: 300 μl of overnight culture was mixed with 15 ml of LB top agar supplemented with 12.5 μg ml−1 chloramphenicol. As before, 2.5 μl of each phage dilution was spotted on top agar using a multichannel pipette.
In Extended Data Fig. 2i, efficiency of plaquing was quantified as the number of plaques formed by the phage on an E. coli EC100/pBrig1 (targeting) lawn divided by the number of plaques formed by the same phage on an E. coli EC100/pWEB-TNC (control) lawn. In Extended Data Fig. 9c, to quantify phage plaques of T6 and T6 Δba-gt formed on E. coli EC100/pBrig1 (targeting) lawns, infections were spread out across the entire top agar lawns to accurately count individual plaque-forming units (PFUs). To this end, 100 μl of phage stock normalized to ~1 × 106 PFU μl−1 (so ~108 PFUs total) was mixed with 100 μl of overnight culture and then mixed with 6 ml LB top agar (with 12.5 μg ml−1 chloramphenicol) and subsequently poured over an LB agar plate, supplemented with 12.5 μg ml−1 chloramphenicol. Top agar plates were incubated at 37 °C overnight after drying at room temperature for 25 min. The next day, single plaques were counted across the entire top agar lawn. To accurately determine the total PFUs added of each phage, plaquing of serial dilutions of the ~1 × 106 PFU μl−1 normalized phage stocks was performed following the standard procedure of a plaque assay outlined above, using 3.5 μl drips of each phage dilution to facilitate more precise quantification of phage titres. Efficiency of plaquing was quantified as the total number of plaques formed by the phage across an entire E. coli EC100/pBrig1 (targeting) lawn divided by the experimentally estimated total number of PFUs added.
Functional selection of a T4-resistant clone in the AZ52 soil DNA library in E. coli
The DNA library we used was generated in an earlier study using DNA extracted from an arid soil sample collected in Arizona11. The library, AZ52, is comprised of large ~40 kb DNA fragments from soil microorganisms cloned into a pWEB-TNC cosmid. The insert-carrying cosmids were transformed into E. coli EC100 cells (Lucigen), generating a soil DNA library with approximately 20 million clones, divided into megapools carrying roughly 1.25 million clones each.
Each clone within the library houses a cosmid with a soil DNA insert, which carries genes from soil-derived microorganisms. Soil-derived genes can therefore be expressed heterologously in our library system. We performed our functional screen using the coliphage T4. To grow up libraries, we scraped frozen library stocks of E. coli EC100 carrying megapools 3–16 of the AZ52 DNA library into separate tubes with 10 ml LB supplemented with 12.5 μg ml−1 chloramphenicol and grew cultures overnight at 37 °C with shaking. The next day, we infected E. coli EC100 overnight cultures with T4 at a multiplicity of infection (MOI) of 10, high enough to kill almost all clones without bona fide immunity. Infections were performed in 6 ml LB top agar with 500 μl of overnight stationary culture mixed with phage at MOI 10 on LB agar plates, supplemented with 12.5 μg ml−1 chloramphenicol. We incubated plates at 37 °C for 36–48 h and then inspected surviving colonies within top agar infections. We found that only megapool 4 showed an increased number of surviving colonies upon T4 infection compared to an infection of E. coli EC100 cells carrying an empty pWEB-TNC cosmid (control).
As cells may survive T4 infection due to mutations within the E. coli host that prevent phage infection and not due to immunity genes carried within the soil DNA cosmids, we wanted to enrich for true immunity genes carried on cosmids. To eliminate false positive clones, we extracted pooled cosmid DNA from the surviving colonies on the enriched plate. To do this, we scraped top agar with surviving colonies into a 50 ml conical tube, melted the top agar in a 98 °C heating block for 10–15 min until the top agar was completely melted, and then centrifuged the tube at ~4,000g for 5 min at room temperature to collect a cell pellet from which surviving cosmids were isolated using the QIAprep Spin Miniprep Kit (QIAGEN, 27106). The miniprepped cosmid pool was then transformed into 50 μl of electrocompetent E. coli EC100 cells (Lucigen) through electroporation (1 mm Bio-Rad Gene Pulser cuvette at 1.8 kV) and recovered in 1 ml SOC medium. After 1.5 h of recovery, cells were assayed for transformation efficiency by pipetting tenfold serial dilutions of the transformation culture on to an LB agar plate supplemented with 12.5 μg ml−1 chloramphenicol. While the plate was grown overnight at 37 °C, the remaining transformation culture was stored overnight at 4 °C. The next day, based on the calculated transformation efficiency, the transformation culture was spread onto ten 15 cm LB agar plates supplemented with 12.5 μg ml−1 chloramphenicol, plating for ~30,000 colonies on each plate, for a total of ~300,000 colonies. Plates were incubated overnight at 37 °C and the next day colonies from all ten plates were scraped into 20 ml LB, vortexed and inverted to mix, and then diluted to OD600 = 10. The OD600 = 10 colony mixture was then mixed 1:1 with 50% glycerol to make a −80 °C freezer stock of a 1× phage-enriched DNA library for AZ52 megapool 4. This library was then grown up for re-infection with T4 and the steps described above were repeated two more times to generate a freezer stock of a 3× phage-enriched DNA library for AZ52 megapool 4.
We sampled colonies from the 3×-enriched library for anti-phage immunity by streaking the library to single colonies on an LB agar plate supplemented with 12.5 μg ml−1 chloramphenicol. Sixteen single colonies were grown overnight in LB supplemented with 12.5 μg ml−1 chloramphenicol at 37 °C with shaking. Colonies were assayed for anti-phage immunity using plaque assays (described above) with phages λvir and T4. Of the sixteen colonies, twelve were found to carry immunity against T4 and none against λvir. Cosmids were isolated from the twelve T4-resistant clones using the QIAprep Spin Miniprep Kit (QIAGEN, 27106) and sent for Sanger sequencing by Genewiz/Azenta using the universal primers T7 and M13F40, which flank the metagenomic DNA insert within the pWEB-TNC cosmid. Sequencing the T4-resistant cosmids revealed they all contained the same metagenomic DNA insert, suggesting that they all originated from the same T4-resistant library clone. One of the twelve T4-resistant clones was frozen at −80 °C (900 μl culture + 100 μl DMSO) for use in future experiments.
Cosmid sequencing, assembly and gene annotation
Cosmid DNA was extracted using the QIAprep Spin Miniprep Kit (QIAGEN, 27106). DNA was sequenced using the Nextera XT DNA Library Preparation Kit (Illumina, FC-131-1024). Paired-end 2 × 75 bp sequencing was conducted using the 150-cycle MiSeq Reagent Kit v3 (Illumina, MS-102-3001) on the Illumina MiSeq platform. Geneious Prime was used to assemble the cosmid genome, using the Geneious assembler (medium sensitivity/fast) on 100,000 paired-end DNA sequencing reads. The sequence of the cosmid harbouring Brig1 was deposited on GenBank, accession number OR880862. SnapGene was used to predict ORFs with ATG or GTG start codons (minimum length: 50 amino acids) within the metagenomic DNA insert of the assembled cosmid genome. Predicted ORFs were then run through NCBI PSI-BLAST (https://blast.ncbi.nlm.nih.gov/Blast.cgi?PAGE=Proteins) and HHpred49,50 (https://toolkit.tuebingen.mpg.de/tools/hhpred) to ascertain protein function where possible. Side-by-side genome annotation was also performed using the Bacterial and Viral Bioinformatics Resource Center (BV-BRC) Genome Annotation Service (https://www.bv-brc.org/app/Annotation), with the annotation recipe for Bacteria/Archaea and the taxonomy name set to Nocardiodes, taxonomy ID 1839. Defence genes and systems were identified using DefenseFinder51,52 (https://defense-finder.mdmparis-lab.com/) and the Prokaryotic Antiviral Defence LOCator (PADLOC)53,54 (https://padloc.otago.ac.nz/padloc/).
Subcloning of the T4-resistant cosmid to identify the T4 anti-phage system
To identify the immunity gene(s) in our cosmid, we subcloned four DNA fragments (A–D) that span the entire length of the metagenomic insert sequence. DNA fragments were amplified using 10 ng of cosmid DNA as template for PCR amplification using Phusion High-Fidelity DNA Polymerase (Thermo Scientific, F530L) with 1 M betaine (Sigma-Aldrich, B0300) and 1 μl DMSO in a 50 μl PCR reaction. Fragments were cloned into PCR-amplified pWEB-TNC cosmid backbones using NEBuilder HiFi DNA Assembly Master Mix (NEB, E2621L). NEBuilder HiFi DNA assembly was carried out at 50 °C in a thermal cycler for 4 h, and then 5 μl of the assembly reaction was transformed into 50 μl of chemically competent E. coli EC100 cells (Lucigen). Cells were incubated on ice for 30 min, heat shocked in a 42 °C water bath for 30 s, placed back on ice for 2 min and then recovered in 250 μl SOC for 2 h. Cells were then plated on LB agar supplemented with 12.5 μg ml−1 chloramphenicol and incubated overnight at 37 °C. The next day, 8 colonies were picked, grown overnight in LB supplemented with 12.5 μg ml−1 chloramphenicol and their cosmids miniprepped the next day using the QIAprep Spin Miniprep Kit (QIAGEN, 27106). Miniprepped cosmids were sent for Sanger sequencing by Genewiz/Azenta using the universal primers T7 and M13F40, which flank the subcloned DNA fragment inserted into the pWEB-TNC cosmid backbone. Colonies that harboured cosmids with correct insert fragments were then assayed for immunity against phage T4 using plaque assays (see above). Plaque assays identified Fragment D as the fragment harbouring anti-T4 immunity. Fragment D was further subdivided into Fragments D1, D2 and D3, cloned and tested for immunity as described above. Fragment D3, containing a three-gene operon, was identified as the minimal DNA fragment carrying anti-T4 immunity. To determine the gene or genes responsible within the Fragment D3 operon, we generated six cosmid constructs (D3-1 to D3-6) containing different numbers and combinations of the three genes within the operon, each time being driven by the same promoter upstream of the first gene within the operon. These constructs were then tested using plaque assays to identify the gene within the operon that conveyed anti-T4 immunity.
NCBI blastn of the T4-resistant metagenomic DNA sequence
To identify possible organisms that our metagenomic DNA comes from, we performed a nucleotide BLAST on NCBI using the algorithm for somewhat similar sequences (blastn) (https://blast.ncbi.nlm.nih.gov/Blast.cgi?PROGRAM=blastn&BLAST_SPEC=GeoBlast&PAGE_TYPE=BlastSearch). We performed blastn on the DNA sequences of Fragments C and D (see above and Extended Data Fig. 1c).
T4 phage adsorption assay
Overnight cultures of E. coli EC100 cells carrying pWEB-TNC or pBrig1 were diluted 1:50 in 10 ml LB medium supplemented with 12.5 μg ml−1 chloramphenicol. After 1 h 15 min of culture growth, OD600 was measured for each culture and normalized to OD600 = 0.3. Cultures were then infected with T4 at MOI 0.01 and incubated at 37 °C with shaking for 50 min. A 10 ml bacteria-free, media-only control (LB + 12.5 μg ml−1 chloramphenicol) was mixed with the same volume of T4 and incubated alongside the cultures at 37 °C with shaking for 50 min. Phage-infected cultures were sampled at the following time points: 0-, 10-, 20-, 30-, 40- and 50-min post-infection. At each time point, 400 μl was collected from each phage-infected culture and spun down in 1.5 ml Eppendorf tubes at 15,000 rpm for 2 min in a tabletop microcentrifuge at 4 °C. Phage-containing supernatants were filtered using Acrodisc 13 mm SUPOR 0.45 μM syringe filters (Pall, 4604) into fresh 1.5 ml Eppendorf tubes. Filtered phage supernatants were used to prepare two sets of serial dilutions to estimate phage titres on fresh top agar lawns of E. coli K-12 MG1655. Top agar plates were incubated at 37 °C. The next day, phage plaques were counted to determine phage titres at each time point. The experiment was repeated another two times in this manner for three independent biological replicates.
qPCR of phage DNA replication
To quantify phage DNA replication within an infected E. coli cell, overnight cultures of E. coli EC100 cells carrying pWEB-TNC or pBrig1 were diluted 1:50 in 50 ml of LB medium supplemented with 12.5 μg ml−1 chloramphenicol. After 1 h 15 min of growth, OD600 was measured, and the culture was normalized to OD600 = 0.3. 700 μl of culture was dispensed between multiple 1.5 ml Eppendorf tubes, corresponding to three replicates and multiple time points for each infection being monitored. These 700 μl cultures were infected with phage T4 at MOI 1 and incubated at 37 °C with shaking for specified time points. At each time point, samples were removed from the incubator and tubes spun down at 15,000 rpm for 1 min in a tabletop microcentrifuge at 4 °C. Supernatants were removed and cell pellets immediately frozen down at −80 °C for DNA extraction later. Additionally, 1–3 uninfected tubes for cells carrying pWEB-TNC or pBrig1 were also prepared for DNA extraction as no-phage controls for qPCR.
Total DNA was extracted from frozen E. coli cell pellets using the Promega Wizard Genomic DNA Purification Kit (Promega, A1125) following the protocol for Gram-negative bacteria. Extracted DNA was quantified using the Qubit dsDNA HS Assay Kit and each sample was normalized to 4 ng μl−1. A total of 32 ng DNA was used as input for qPCR, performed using Fast SYBR Green Master Mix (Applied Biosystems, 4385612) and the QuantStudio 3 Real-Time PCR System (Applied Biosystems) with primer pairs AA870/AA871 (T4 gp43 target), AA872/AA873 (T4 gp34 target) and AA387/AA388 (E. coli K-12 MG1655 dxs control). For qPCR data analysis, ΔΔCt values were calculated for the two T4 qPCR targets for each replicate at each time point. Fold-change values were then calculated for each replicate relative to the mean ΔΔCt value for cells carrying pWEB-TNC infected with T4 phage at the earliest time point post-infection for a given experiment. The mean fold change of three biological replicates was plotted for each time point post-infection.
Next-generation sequencing of phage DNA in T4-infected E. coli cells
Overnight cultures of E. coli EC100 cells carrying pWEB-TNC or pBrig1 were diluted 1:50 in 10 ml of LB medium supplemented with 12.5 μg ml−1 chloramphenicol. After 1 h 15 min of growth, OD600 was measured, and cultures were normalized to OD600 = 0.3. Cultures were then infected at MOI 5 with T4 or T4 escaper1 for 8 min at 37 °C with shaking, prior to centrifugation at 15,000g for 5 min at 4 °C and subsequent freezing of cell pellets at −80 °C. All cell pellets were stored at −80 °C at least overnight, until ready for genomic DNA purification using the Promega Wizard Genomic DNA Purification Kit (Promega, A1125) following the protocol for Gram-negative bacteria. Purified genomic DNA was sheared using a pre-split snap-cap 6×16 mm Covaris microTUBE (Covaris, 520045) in a Covaris S220 focused-ultrasonicator and prepared for next-generation sequencing using the Illumina TruSeq Nano DNA LT kit (Illumina, 20015964). Paired-end 2 × 75 bp sequencing was conducted using the 150-cycle MiSeq Reagent Kit v3 (Illumina, MS-102-3001) on the Illumina MiSeq platform. Illumina paired-end sequencing reads were aligned to phage genomes using a custom Python script, where the recorded number of phage-derived sequencing reads at a specific base pair position within the phage genome was normalized to the total sequencing reads for each sample.
Phage DNA extraction
Phage genomic DNA was extracted from capsids using a previously described protocol55. In brief, three tubes of 450 μl of a phage stock were first treated with DNase I (Invitrogen, 18068015) and RNase A (Promega, A7973) in DNase I buffer (20 mM Tris-HCl, pH 8, 2 mM MgCl2), the reaction stopped with EDTA (Invitrogen, AM9260G), then capsids digested with Proteinase K (NEB, P8107S), and finally phage genomic DNA extracted using the DNeasy Blood & Tissue kit (QIAGEN, 69504). DNA was quantified using the Qubit dsDNA HS Assay Kit and assessed for quality using a nanodrop spectrophotometer.
T4 and T4 escaper1 genome sequencing and assembly
Phage genomic DNA was sequenced using the Nextera XT DNA Library Preparation Kit (Illumina, FC-131-1024). Paired-end 2 × 75 bp sequencing was conducted using the 150-cycle MiSeq Reagent Kit v3 (Illumina, MS-102-3001) on the Illumina MiSeq platform. Reads were quality-trimmed using Sickle (https://github.com/najoshi/sickle) and assembled into contigs using ABySS (https://github.com/bcgsc/abyss). Finally, contigs were mapped to a reference phage T4 genome (GenBank: AF158101.6) using Medusa (http://combo.dbe.unifi.it/medusa). Automated genome annotation was performed using SnapGene and a reference phage T4 genome from NCBI (GenBank: AF158101.6). Alignment of the T4 and T4 escaper1 genomes to the reference T4 genome revealed differential mutations between the two assembled phage genomes.
Sanger sequencing of bacteriophage escapers
T4 or T6 phage plaques on lawns of E. coli EC100 cells carrying pBrig1 were isolated and resuspended in 20 μl of LB medium. Serial dilutions were prepared from the resuspended phage and spotted on a fresh LB top agar lawn of E. coli EC100 carrying pBrig1 to maintain selection of the escaper phage. The plate was incubated at 37 °C overnight. The next day a single phage plaque was picked from the top agar lawn using a P20 pipette set to 15 μl and resuspended in 20 μl of colony lysis buffer48. Resuspended phage mixtures were boiled at 98 °C for 15 min in a thermal cycler, and 1 μl of the boiled phage mixture was then used as template for PCR amplification using Phusion High-Fidelity DNA Polymerase (Thermo Scientific, F530L) with primers AA681/AA682 to amplify T4 a-gt and primers AA1115/AA1116 to amplify T6 a-gt. PCR products were submitted to Sanger sequencing by Genewiz/Azenta to identify mutations in a-gt. Wild-type T4 and T6 phage stocks were also PCR-amplified at a-gt loci and sent for Sanger sequencing to provide reference sequences for comparison. Snapgene was used to align Sanger sequencing products of the escaper phages to wild-type a-gt sequences to identify escape mutations.
Brig1 structural predictions using AlphaFold2
The structure of the intact (261 amino acid) Brig1 protein was predicted using the colab implementation of AlphaFold217,18 (https://colab.research.google.com/github/sokrypton/ColabFold/blob/main/AlphaFold2.ipynb) using default settings (except that the amber option was turned on to improve side chain rotamers). The highest ranked PDB structure produced by ColabFold (ptm = 0.86) was then visualized using PyMOL (The PyMOL Molecular Graphics System, Version 2.5.5, Schrödinger, LLC; www.pymol.org/pymol.html). Protein structure predictions of the Brig1 homologues from Nocardioides zhouii and Nocardiodes anomalus were performed in the same way. Cavities and pockets were visualized in PyMOL using default settings for surface calculation and the ‘cavities and pockets only’ option for display.
Purification of Brig1
The brig1 gene was recloned into the Nde1 and XhoI sites of pET21a using PCR primers that destroyed the XhoI site and added a His6 tag immediately after the native C-terminal glycine of Brig1. The insert was verified by DNA sequencing. E. coli strain Rosetta(DE3)pLysS was used for protein expression. Cells were grown in LB medium with 100 μg ml−1 ampicillin at 37 °C. 0.5 mM IPTG was added to induce protein expression when OD600 ~ 0.7, followed by further growth at 37 °C for 2 h. Cell pellets were resuspended in Ni column buffer A (50 mM phosphate, 1 M NaCl, 5% glycerol, 1 mM DTT, pH 7.5) with complete mini protease inhibitor cocktail (Roche), one tablet per litre culture. After adding lysozyme to a final concentration of 200 mg ml−1, the mixture was sonicated 3 times for 1 min each, then centrifuged at 20,000 rpm in an SS-34 rotor for 1 h. The supernatant was filtered and loaded onto a Ni column (Cytiva, HisTrap HP, 17-5248-02), and eluted with a 30-minute gradient of 0 to 100% buffer B (Ni buffer A plus 500 mM imidazole, pH 7.5). Brig1-containing fractions were pooled and diluted with heparin column buffer A (25 mM MES, 0.5 mM EDTA, 5% glycerol, 1 mM DTT, pH 6), loaded on a heparin column (Cytiva, HiTrap Heparin HP, 17-0406-01) and eluted with a gradient from 10% to 70% heparin buffer B (heparin column buffer A + 2 M NaCl, pH 6) over 90 min. The purest fractions were pooled and concentrated, then dialysed into storage buffer (20 mM Tris, 0.5 mM EDTA, 200 mM NaCl, 20% glycerol, 2 mM DTT, pH 8) and flash-frozen in small aliquots.
Purification of T4 α-glucosyltransferase
A pET15b derivative encoding N-terminally His6 tagged phage T4 α-GT was used for protein expression. Rosetta(DE3)pLysS cells harbouring this plasmid were grown and induced as for Brig1, but after induction grown at 20 °C overnight rather than for 2 h at 37 °C. The same purification protocol as for Brig1 was followed except for a change in the pH of the heparin column buffers (A = 25 mM HEPES, 0.5 mM EDTA, 5% glycerol, 1 mM DTT, pH 7 and B = A + 2 M NaCl, pH 7). The purest fractions were pooled and concentrated, then dialysed into storage buffer (20 mM Tris, 0.5 mM EDTA, 200 mM NaCl, 20% glycerol, 2 mM DTT, pH 8) and flash-frozen in small aliquots.
Purification of Brig1(Y121A/E147A) mutant
The Brig1(Y121A/E147A) mutant protein was purified according to a modified protocol. For consistency, wild-type Brig1 was purified according to this same protocol, side-by-side, and this batch of purified Brig1 protein was used only in experiments where Brig1(Y121A/E147A) was used. Both Brig1 and Brig1(Y121A/E147A) were cloned into a pET21a vector, with a His6 tag immediately after the native C-terminal glycine of Brig1. E. coli strain BL21(DE3) was used for protein expression. Cells were grown in LB medium with 100 μg ml−1 ampicillin at 37 °C overnight. The next day, a 1:100 dilution of the overnight was grown in 1 l of LB medium with 100 μg ml−1 ampicillin at 37 °C for 3-4 h. 0.5 mM IPTG was added to induce protein expression when OD600 ~ 0.7, followed by overnight growth (~16 h) at 18 °C. Cells were pelleted at 4500 rpm at 4 °C (Eppendorf Centrifuge 5810 R) for 15 min. Cell pellets were resuspended in 20 ml of lysis buffer (50 mM HEPES, pH 7.7, 150 mM NaCl, 10% glycerol, 1 mM TCEP, 30 mM imidazole, 2 Roche mini protease inhibitor tabs EDTA free, 0.5 mg ml−1 lysozyme) and incubated on ice for 1 h with shaking. The resuspended pellets were then sonicated using a Qsonica Q500 sonicator (70% amplitude with 10 s on, 30 s off for 2.5 min). The sonicated samples were spun down at 12,000 rpm at 4 °C (Eppendorf Centrifuge 5810 R) for 30 min and the supernatant run through a gravity column loaded with 3 ml of HisPur Ni-NTA Resin (Thermo Scientific, 88222). Before passing supernatant, the column was equilibrated with equilibration buffer (50 mM HEPES, pH 7.7, 150 mM NaCl, 10% glycerol, 1 mM TCEP, 30 mM imidazole). Then, the ~20 ml of sonicated cell pellet supernatant was passed through the column. The column was washed twice with 25 ml wash buffer (50 mM HEPES, pH 7.7, 500 mM NaCl, 10% glycerol, 1 mM TCEP, 30 mM imidazole) and then eluted with 20 ml elution buffer (50 mM HEPES, pH 7.7, 150 mM NaCl, 10% glycerol, 1 mM TCEP, 300 mM imidazole). The eluted protein was concentrated to <500 μl using an Amicon Ultra-4 Centrifugal Filter, 10 kDa MWCO (Millipore, UFC801024), with multiple rounds of centrifugation at 4,300g for 10 min at 4 °C (Eppendorf Centrifuge 5810 R), carefully resuspending the mixture between rounds of centrifugation via pipette mixing. The concentrated eluant was run on an ÄKTA pure chromatography system (Cytiva) fitted with a Superdex 75 Increase 10/300 GL column (Cytiva, 29148721) using storage buffer (50 mM HEPES, pH 7.7, 150 mM NaCl, 10% glycerol, 1 mM TCEP). Two peaks, corresponding to fractions 17–20 and 22–27, were collected and separately pooled. Pooled fractions were concentrated to <500 μl using an Amicon Ultra-0.5 Centrifugal Filter, 10 kDa MWCO (Millipore, UFC501096), with multiple rounds of centrifugation at 13,000g for 5 min in a tabletop microcentrifuge at 4 °C, carefully resuspending the mixture between rounds of centrifugation via pipette mixing. For both Brig1 and Brig1(Y121A/E147A), the second peak (fractions 22–26 for Brig1 and fractions 23–27 for the mutant) was determined to be free of nucleic acid contamination via nanodrop and found to contain pure protein (~29 kDa) by a Coomassie gel. Concentrated protein was flash-frozen in small aliquots and stored at −80 °C for future use.
Annealing of ssDNA oligonucleotides
To generate dsDNA substrates for MfeI digestion and for DNA glycosylase assays, complementary ssDNA oligonucleotides were annealed. In brief, 1:1 molar ratios of top and bottom strand complementary ssDNA oligonucleotides (25–50 μM each) were mixed in a 60 μl reaction containing NaCl to a final concentration of 100 mM. The reaction was heated at 80 °C for 20 min in a water bath or thermal cycler and then allowed to cool very slowly to room temperature. Annealed oligonucleotides were purified using an oligonucleotide cleanup kit (Zymo Research, Oligo Clean & Concentrator Kit, D4061) according to the manufacturer’s instructions.
Generation of glucosylated ssDNA and dsDNA oligonucleotides
We tested the activity of α-GT on both single- and double-stranded DNA as previous studies only tested dsDNA substrates56. The ssDNA substrates were hmdC_18, hmdC_60_MfeI and hmdC_60_MfeI_Bot, which are 18mer and 60mer oligonucleotides, each containing a single hmC residue (Supplementary Data File 2). The dsDNA substrate was hmdC_60_MfeI annealed to Bot_MfeI_60 (Supplementary Data File 2). Substrate DNAs (100 μM for ssDNA and 50 μM for dsDNA) were mixed at a 1:1 molar ratio with α-GT in 1× NEBuffer 4 (50 mM potassium acetate, 20 mM Tris-acetate, 10 mM magnesium acetate, 1 mM DTT, pH 7.9) supplemented with 2 mM UDP-glucose (NEB, supplied with NEB T4 β-GT). All samples were incubated at 37 °C overnight, then purified with an oligonucleotide cleanup kit (Zymo Research, Oligo Clean & Concentrator Kit, D4061) according to the manufacturer’s instructions. A subset of the ss- and dsDNA substrates were also treated with β-GT (NEB, M0357S) following the supplier’s instructions and purified as described above.
Modification by α-GT (or β-GT) was monitored by digestion with MfeI-HF (NEB, R3589S), which is blocked by the presence of glucosylated hmC but not by hmC (the modified C in hmdC_60_MfeI and in hmdC_60_MfeI_Bot is within an MfeI site, Supplementary Data File 2). Before digestion, single-stranded α-GT- or β-GT-treated hmdC_60_MfeI oligonucleotides were annealed to Bot_MfeI_60 or to untreated or α-GT-treated hmdC_60_MfeI_Bot. Approximately 1.5 μg (Extended Data Fig. 3e) or 500 ng (Extended Data Fig. 5c) of each sample was digested with MfeI-HF (NEB, R3589S) for 1 h, then electrophoresed on a 10% TBE gel (Invitrogen, EC6275BOX, 10-well or EC62755BOX, 15-well) at 140 V for 35 min. Gels were stained with 2 μg ml−1 ethidium bromide for 20 min, extensively rinsed with distilled water (3× for 10 min each), and then scanned using a ChemiDoc MP imager (Bio-Rad) set to UV trans illumination and the machine’s 605/50 filter to detect ethidium bromide (Extended Data Fig. 3e) or using the Amersham ImageQuant 800 set to UV fluorescence (Extended Data Fig. 5c).
DNA glycosylase assays with ssDNA oligonucleotides
Detection of the abasic site with an aldehyde-reactive probe
We used an aldehyde-reactive fluorescent probe, AZDye 488 Hydroxylamine, (fluoroprobes.com) to detect removal of a base from the phosphodiester backbone in the absence of DNA cleavage. The dye was dissolved in distilled water to form a 10 μg μl−1 stock solution.
DNA glycosylase reactions were carried out in a reaction buffer containing 45 mM HEPES, pH 7.5, 0.4 mM EDTA, 2% glycerol, 1 mM DTT and 50 mM KCl, in a total reaction volume of 50 μl. The final DNA concentrations were 2 μM. Brig1 was added to single-stranded α-GT- and β-GT-treated hmdC_60_MfeI to a final concentration of 35 μM, while 2 μl (10 units; 5 units per μl) of SMUG1 (NEB, M0336S) was added to dU_60 as a positive control. Reactions were incubated overnight at 37 °C, after which 2 μl AZDye 488 dye was added, followed by incubation at 37 °C for 30 min. 1/10 volume of 10% SDS was then added and incubated for another 30 min and purified by phenol/chloroform extraction. Samples were then treated with an oligonucleotide cleanup kit (Zymo Research, Oligo Clean & Concentrator Kit, D4061) according to the manufacturer’s instructions, eluted with 15 μl nuclease-free water, mixed with loading dye and electrophoresed for 45 min at 180 V on a 10% TBE gel (Invitrogen, EC62755BOX). The gel was stained with 2 μg ml−1 ethidium bromide for 20 min, extensively rinsed with distilled water (3× for 10 min each), then scanned using a ChemiDoc MP imager (Bio-Rad) set to UV trans illumination and the machine’s 605/50 filter to detect ethidium bromide and then using blue epi illumination with the 530/28 filter for the AZDye 488 fluorescent probe.
Detection by NaOH- or endonuclease IV-mediated cleavage of the abasic site
DNA glycosylase reactions were carried out in a reaction buffer containing 45 mM HEPES, pH 7.5, 0.4 mM EDTA, 2% glycerol, 1 mM DTT and 50 mM KCl, in a total reaction volume of 50 μl. The final ssDNA or dsDNA concentrations were 1 μM. Brig1 or Brig1(Y121A/E147A) was added to a final concentration of 1 μM (unless stated otherwise—for example, range of 50–1,600 nM in Extended Data Fig. 7b), while 1 μl (5 units) of SMUG1 (NEB, M0336S) was added as a positive control. Reactions were incubated at 37 °C overnight, unless stated otherwise (for example, 30 min in Extended Data Fig. 7b). Following enzymatic incubation, one set of samples was directly processed with an oligonucleotide cleanup kit (Zymo Research, Oligo Clean & Concentrator Kit, D4061) according to the manufacturer’s instructions. A second matched set of samples was treated with NaOH before cleanup: 25 μl of 0.5 M NaOH was added to each 50 μl sample and then heated at 90 °C for 30 min before purification with the oligonucleotide cleanup kit (Zymo Research, Oligo Clean & Concentrator Kit, D4061) according to the manufacturer’s instructions. All samples were eluted from the cleanup columns in 15 μl nuclease-free water. Five microlitres of each was mixed with loading dye and loaded onto a 10% TBE gel (Invitrogen, EC62755BOX) and electrophoresed at 140 V for 35 min. Gels were stained with 2 μg ml−1 ethidium bromide for 20 min, extensively rinsed with distilled water (3× for 10 min each), and then scanned using a ChemiDoc MP imager (Bio-Rad) set to UV trans illumination and the machine’s 605/50 filter to detect ethidium bromide (Extended Data Fig. 3h) or using the Amersham ImageQuant 800 set to UV fluorescence (all other relevant figures). For Urea-PAGE gels, eluted samples were first denatured by mixing 5 μl of purified sample with 5 μl of 2× TBE-Urea Sample Buffer (Invitrogen, LC6876) and then heated at 70 °C for 3 min. Denatured samples were loaded onto a 6% TBE-Urea gel (Invitrogen, EC68655BOX) and electrophoresed at 140 V for 35 min. Gels were soaked in ethidium bromide and rinsed with distilled water as described above, before imaging with the Amersham ImageQuant 800 set to UV fluorescence. For all gels, DNA ladders were made by mixing 20-, 40- and 60-bp ssDNA or dsDNA oligonucleotides (Supplementary Data File 2) and loading them onto their corresponding gels at ~100 ng each oligonucleotide per load.
For abasic site detection by NEB endonuclease IV (Endo IV), DNA glycosylase reactions were set up as described above and incubated with Brig1 overnight. Three matched sets of reactions were set up. After overnight incubation, one matched set of samples was treated with NaOH as described above and purified using the Zymo Research Oligo Clean & Concentrator Kit (D4061) according to the manufacturer’s instructions. The remaining two matched sets of samples were processed directly using the oligonucleotide cleanup kit. The purified samples were then incubated at 37 °C for 4 h in a 50 μl reaction with 1× NEBuffer 3 (100 mM NaCl, 50 mM Tris-HCl, 10 mM MgCl2, 1 mM DTT, pH 7.9), with or without 50 units of NEB Endo IV (5 μl; 10 units per μl; M0304S). After 4 h, reactions were purified using the Zymo Research Oligo Clean & Concentrator Kit according to the manufacturer’s instructions. All the purified samples were then loaded onto a 10% TBE gel (Invitrogen, EC62755BOX), electrophoresed at 140 V for 35 min, stained with ethidium bromide as described above and imaged with the Amersham ImageQuant 800 set to UV fluorescence.
High-resolution mass spectrometry of SMUG1- and Brig1-treated ssDNA oligonucleotides
DNA glycosylase reactions were carried out in a reaction buffer containing 45 mM HEPES, pH 7.5, 0.4 mM EDTA, 2% glycerol, 1 mM DTT and 50 mM KCl, in a total reaction volume of 50 μl. Reactions were performed with 18mer ssDNA oligonucleotides: dU_18, hmdC_18 and α-GT-treated hmdC_18 (Supplementary Data File 2). The final ssDNA concentration in each reaction was 2 μM. Brig1 was added to a final concentration of 2 μM, while 2 μl (10 units) of SMUG1 (NEB, M0336S) was added as a positive control. A no-enzyme reaction was used as a negative control. 2 ×50 μl reactions were set up for each reaction condition with the dU_18 oligonucleotide, while 8 ×50 μl reactions were set up for each reaction condition with hmdC_18 and α-GT-treated hmdC_18. Reactions were incubated overnight at 37 °C. After overnight incubation, all matched samples were pooled and processed with an oligonucleotide cleanup kit (Zymo Research, Oligo Clean & Concentrator Kit, D4061) according to the manufacturer’s instructions.
For mass spectrometry, purified oligonucleotide samples were dried using vacuum centrifugation and dissolved in 50/50 water/acetonitrile with 0.001% triethylammonium bicarbonate. The pH of the solution was found to be comparable to that of deionized water. The samples were introduced to the mass spectrometer by manual injection using a Hamilton syringe applying pressure by hand at approximately 10 μl min−1. Samples were analysed using an orbitrap Ascend tribrid mass spectrometer (Thermo Scientific) operating in negative mode. Spectra were recorded in the mass range 600–1,300 m/z at 120,000 resolution. A blank injection was introduced after each sample to eliminate carryover.
Raw data was inspected using the Xcalibur Quality Browser (Thermo Scientific) and spectra were summed as necessary to provide representative spectra with a sufficient signal-to-noise ratio (S/N). Spectra were further processed using UniDec deconvolution software57 with the following parameters: sampling resolution and peak FWHM were both set to 0.1, adduct mass was defined as −1.007276 Da, and charge states were defined 4–12 based on observations from the raw data. The m/z range was adjusted to fit the data and to exclude singly charged noise. Apart from the mass of the oligonucleotides, additional masses from metal adducts were also observed.
DNA glycosylase assays with phage and cosmid DNA
All reactions were performed in 50 μl reaction volumes in a reaction buffer containing 45 mM HEPES, pH 7.5, 0.4 mM EDTA, 2% glycerol, 1 mM DTT and 50 mM KCl. Assays were performed by incubating 50–500 ng of extracted phage genomic DNA from capsids or miniprepped pWEB-TNC cosmid DNA with varying concentrations (2–800 nM) of purified Brig1 or Brig1(Y121A/E147A) or with 10 units of NEB SMUG1 (NEB, M0336S) as a negative control. Reactions were incubated in a thermal cycler at 37 °C for 30 min (or at 37 °C for 30 min plus an additional 20 min at 65 °C in Extended Data Fig. 6c, to cleave DNA at abasic sites and denature the glycosylase prior to gel electrophoresis). Reactions were then mixed with 10 μl of purple 6× loading dye with no SDS (NEB, B7025S) and the entire reaction volume was loaded onto a 1% agarose gel containing ethidium bromide. Unless stated otherwise, the gel was run for 70 min at 85 V at room temperature and then imaged using a UV gel imager (Amersham ImageQuant 800 set to UV fluorescence). Where SDS was used for protein denaturation (Extended Data Fig. 6d), all steps were carried out as described above except, before gel loading, a purple 6× gel loading dye containing a final 1× concentration of 0.08% SDS (NEB, B7024S) was used instead of loading dye without SDS.
For the gel in Extended Data Fig. 6a,b, samples were loaded onto the 1% agarose gel in a cold room at 4 °C and run at 40 V for 3 h and then imaged using a UV gel imager (Amersham ImageQuant 800 set to UV fluorescence). The same gel was then allowed to equilibrate to room temperature for 30 min and then run longer by electrophoresis, this time under high voltage and at room temperature (in a room temperature gel box), first at 85 V for 25 min, then at 150 V for 8 min and finally at 200 V for another 8 min before final imaging using the Amersham ImageQuant 800 set to UV fluorescence.
Brig1 multiple sequence alignment and phylogenetic tree construction
Brig1 homologues were obtained using the NCBI PSI-BLAST protein homology search. Homologues were then subjected to a multiple sequence alignment using MUSCLE v5 with 16 maximum iterations via the Geneious Prime software. A tree was built with the alignment output file via IQ-TREE 1.6.1258 using the LG4M model with 1,000 bootstrap alignments. The online tool ITOL59 was used for visualization of the resulting tree.
Brig1 gene neighbourhood analysis
Gene neighbourhoods of the brig1 homologues from above (10 genes upstream and 10 genes downstream of each homologue) were constructed using a custom Python script. In brief, the script parses a blastp result XML file for accession numbers of each of the hits. For each hit accession, the script obtains the corresponding nucleotide accession from which the protein accession is derived. Finally, all annotated features within the nucleotide accession that are labelled as ‘CDS’ or ‘tRNA’ are built into a list, including their position within the nucleotide entry and their feature name. From this list, neighbours of the initial protein hit (10 genes upstream and 10 genes downstream) are extracted and built into a TSV file for subsequent analysis.
Statistical analysis
Statistical analyses were performed using GraphPad Prism version 10.1.0. Error bars and number of replicates for each phage experiment are defined in the figure legends. Statistical significance in Extended Data Fig. 2i was determined using a two-tailed Student’s t-test (an unpaired parametric test assuming Gaussian distribution and that both populations have the same standard deviation).
Reporting summary
Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.