Presented here is an assay to quantify somatic hypermutation within the immunoglobulin heavy chain gene locus using germinal center B cells from mouse Peyer’s patches.
Within the germinal centers of lymphoid organs, mature B cells alter their expressed immunoglobulin (Ig) by introducing untemplated mutations into the variable coding exons of the Ig heavy and light chain gene loci. This process of somatic hypermutation (SHM) requires the enzyme activation-induced cytidine deaminase (AID), which converts deoxycytidines (C), into deoxyuridines (U). Processing the AID-generated U:G mismatches into mutations by the base excision and mismatch repair pathways introduces new Ig coding sequences that may produce a higher affinity Ig. Mutations in AID or DNA repair genes can block or significantly alter the types of mutations observed in the Ig loci. We describe a protocol to quantify JH4 intron mutations that uses fluorescence activated cell sorting (FACS), PCR, and Sanger sequencing. Although this assay does not directly measure Ig affinity maturation, it is indicative of mutations in Ig variable coding sequences. Additionally, these methods utilize common molecular biology techniques which analyze mutations in Ig sequences of multiple B cell clones. Thus, this assay is an invaluable tool in the study of SHM and Ig diversification.
B cells, members of the adaptive immune system, recognize and eliminate antigens by producing antibodies, also known as immunoglobulins (Ig). Each Ig is composed of two heavy (IgH) and two light (IgL) chain polypeptides, which are held together by disulfide bonds to form the characteristic “Y” shape structure of the Ig1. The N-termini of IgH and IgL comprise the variable (V) region of each polypeptide and together they form the antigen binding site of the Ig, whereas the constant region of IgH imparts the effector function of the Ig. Developing B cells in the bone marrow rearrange the V coding exons of IgH and IgL in a process known as V(D)J recombination2,3,4. Transcription of the recombined V exons, coupled with the respective constant region exons, forms the mRNA that is translated into the Ig.
Mature B cells expressing a membrane bound Ig, also known as a B cell receptor (BCR), circulate to secondary lymphoid organs, such as the spleen, lymph node, or Peyer’s patches, where they survey the environment for antigens and interact with other cells of the immune system1. Within the germinal centers (GC) of secondary lymphoid organs, B cells that recognize antigen through the BCR become activated. Aided by follicular dendritic cells and follicular helper T cells, activated B cells can then proliferate and differentiate into plasma and memory cells, which are important effectors of a robust immune response5,6,7,8,9. Additionally, these activated B cells can undergo secondary Ig gene diversification processes – class switch recombination (CSR) and somatic hypermutation (SHM). During CSR, B cells exchange the default μ constant region of the IgH polypeptide with another constant region (γ, α, ε) through a DNA deletional-recombination reaction (Figure 1). This allows for the expression of a different constant exon and translation of a new Ig. The B cell will switch from expressing IgM to another isotype (IgG, IgA, IgE). CSR changes the effector function of the Ig without altering its antigen specificity10,11,12. However, during SHM, B cells mutate the V coding regions of IgH and IgL to enable the production and selection of higher affinity Igs, which can more effectively eliminate an antigen13,14,15 (Figure 1). Importantly, both CSR and SHM depend on the function of one enzyme: activation-induced cytidine deaminase (AID)16,17,18. Humans and mice deficient in AID cannot complete CSR or SHM and present with elevated IgM serum titers or Hyper-IgM17,19.
In CSR, AID deaminates deoxycytidines (C) in the repetitive switch regions that precede each constant coding exons, converting them into deoxyuridines (U)20,21, which creates mismatched base pairing between deoxyuridines and deoxyguanosines (U:G). These U:G mismatches are converted into the double-stranded DNA breaks, which are required for DNA recombination, by either the base excision repair (BER) or mismatch repair (MMR) pathway22,23,24,25,26,27,28,29. In SHM, AID deaminates C within the V coding exons. Replication across the U:G mismatch generates C:G to T:A transition mutations, whereas removal of the uracil base by the BER protein, uracil DNA glycosylase (UNG), prior to DNA replication produces both transition and transversion mutations16. Null mutations in UNG significantly increase C:G to T:A transition mutations21,22. Similar to CSR, SHM requires the complementary roles of MMR and BER. During SHM, MMR generates mutations at A:T base pairs. Inactivating mutations in MutS homology 2 (MSH2) or DNA polymerase η (Polη) significantly reduces mutations at A:T bases and compound mutations in MSH2 and Polη virtually abolishes mutations at A:T bases21,30,31. Consistent with the critical role for BER and MMR in converting AID-generated U into transition or transversion mutations, mice deficient for both MSH2 and UNG (MSH2-/-UNG-/-) display only C:G to T:A transition mutations resulting from replication across the U:G mismatch21.
The analysis of SHM in V coding regions remains complicated because developing B cells can recombine any of the V(D)J coding exons in the IgH and IgL loci1,2,4. Accurate analysis of these uniquely recombined and somatically mutated V regions requires the identification and isolation of clones of B cells or the Ig mRNA11,13. The JH4 intron, which is 3’ of the last J coding exon in the IgH locus, harbors somatic mutations due to the spreading of mutations 3’ of the V promoter32,33,34 and therefore is frequently used as a surrogate marker for SHM in V regions31,35 (Figure 1). To experimentally elucidate how specific genes or genetic mutations alter SHM patterns or rates, the JH4 intron can be sequenced from Peyer’s patches (PP) germinal center B cells (GCBCs), which undergo high rates of SHM36,37,38. GCBCs can be readily identified and isolated with fluorescently conjugated antibodies against cell surface markers (B220+PNAHI)17,39.
A detailed protocol is presented to characterize JH4 intron mutations in PP GCBCs from mice using a combination of FACS (fluorescence activated cell sorting), PCR, and Sanger sequencing (Figure 2).
All mutant mice were maintained on a C57BL/6 background. Age-matched (2-5 months old) male and female mice were used for all experiments. Husbandry of and experiments with mice were conducted according to protocols approved by The City College of New York Institutional Animal Care and Use Committee.
1. Dissection of Peyer’s patches
2. Cell isolation for FACS
3. Staining GCBCs for FACS
4. DNA extraction from GCBCs
5. JH4 intron sequence amplification and analysis
Flow cytometry
Mature B cells circulate to germinal centers where they undergo affinity maturation, clonal expansion, and differentiation into plasma or memory cells40,41,42,43,44. These GCBCs can be identified by numerous cell surface markers, including high expression of the CD45R/B220 receptor and binding of peanut agglutinin (PNA)45,46. To isolate activated GCBCs, PP cells were stained with anti-B220 antibodies conjugated to phycoerythrin (PE) and biotinylated-PNA, followed by streptavidin conjugated to APC-eFluor780. Dead cells were eliminated using the fluorescent 4',6-Diamidino-2-Phenylindole (DAPI) dye, which stains the nucleic acid of dying or dead cells47,48. The stained cells were subsequently analyzed and sorted via flow cytometry. The PPs consisted of ~80% B220+ cells49,50. WT PPs contain on an average 4 x 106 cells per mouse (Figure 3A). Approximately 8% of the WT PP cells were B220+PNAHI, which is half the number observed in AID-/ – (Figure 3B). Thus, 0.3-0.6 x 106 B220+PNAHI GCBCs were obtained after sorting, which were sufficient to analyze mutations in the JH4 intron.
JH4 Sequence Analysis
The JH4 intron was amplified by a nested PCR using common VHJ558 family primers (J558FR3Fw and VHJ558.2) followed by JH4 intron spanning primers VHJ558.3 and VHJ558.435,37. Of the 105 unique sequences obtained from WT GCBCs, a total of 226 mutations were found (Figure 5A). Analysis of the GCBC mutation spectrum in the WT mice showed a range of transitions and transversions at a rate of 4 x 10-3 mutations/bp, which was calculated by dividing the total number of mutated bases by the total number of bases that were sequenced32,36,37,38. Additionally, each JH4 PCR product from WT GCBCs contained 1-25 mutations (Figure 5B), where multiple mutations were frequently found on one sequence33,36. Only two mutations were identified in 113 AID-/- sequences (Figure 5A). AID-/- B cells exhibited 1.66 x 10-5 mutations/bp, which was significantly lower than WT B cells (p <0.05)36 and compares to the error rate of the high fidelity polymerase (5.3 x 10-7 sub/base/doubling)51,52. Thus, AID-/- B cells served as a useful negative control for this assay.
Figure 1: Schematic of the IgH gene locus and the regions targeted by AID during CSR and SHM. The red bar indicates the 580 bp JH4 intron that is 3’ of VDJH4 rearrangements and is analyzed in this protocol. In CSR, AID-dependent deamination of intronic switch regions (Sμ and Sε) promotes DSB formation that allows for deletional-recombination and the expression of a new antibody isotype (IgM to IgE). During SHM, V regions (grey boxes) accumulate mutations (blue lines) that may lead to higher affinity Ig. Please click here to view a larger version of this figure.
Figure 2: Workflow to analyze SHM of the JH4 intron in GCBCs isolated from PPs. Please click here to view a larger version of this figure.
Figure 3: Characterization of PP GCBCs. (A) Total number of PP cells from WT and AID-/- mice (n = 4 per genotype). Error bars represent standard deviation from the mean. (B) Percentage of B220+PNAHI GCBCs obtained from PPs of WT and AID-/- mice (n = 4 per genotype)36. Error bars represent standard deviation from the mean, *p<0.05 using student’s t-test. (C) Representative FACS plots to sort B220+PNAHI GCBCs from PPs. Please click here to view a larger version of this figure.
Figure 4: Analysis of JH4 Sanger sequence data. (A) Sample sequence alignments of Sanger sequence data of the JH4 PCR product from WT (top) and AID-/- (bottom) GCBCs to the reference genomic sequence (NG_005838), which is the sequence immediately below the numbered tick marks. Alignments were generated using Clustal Omega. (B) Electropherogram of high-quality Sanger sequence data, which displayed distinct peaks for each base. (C) Electropherogram of low-quality sequence data, which showed ambiguous peaks and unspecified bases (N). The nucleotide shown in red must be manually annotated in the sequence text file. Please click here to view a larger version of this figure.
Figure 5: Analysis of mutations in the JH4 intron in WT and AID-/- GCBCs. (A) The total number of transition (red) and transversion (blue) mutations at A, C, G, and T bases for each genotype is summarized in the tables. The total number of sequences analyzed is indicated below the table. (B) The number of mutations per PCR amplicon for each genotype is depicted in the pie charts. This figure has been modified from Choi et al.36 Copyright 2020. The American Association of Immunologists, Inc. Please click here to view a larger version of this figure.
Staining Cocktail for GCBCs | Volume: 500 μL | ||
Antibody or Dye | Fluorophore | Dilution | μL |
B220 | PE | 1000 | 0.5 |
Streptavidin | APC-eFluor780 | 500 | 1 |
DAPI | N/A | 500 | 1 |
Table 1: Staining cocktails for GCBCs. Cocktail of the indicated antibodies or dye (indicated in italics) at the specified dilutions were used to stain PP cells in 500 μL for flow cytometry.
Single Stains for Compensation | Volume: 500 μL | ||
Antibody or Dye | Fluorophore | Dilution | μL |
B220 | PE | 1000 | 0.5 |
B220 | APC-eFluor780 | 750 | 0.67 |
DAPI | N/A | 500 | 1 |
Table 2: Single stain controls for compensation. B220 antibodies conjugated to the indicated fluorophores were used for single stain controls to compensate for spectral overlap.
PCR #1 | ||||
Reagent | Volume | Thermocycler Conditions | ||
5x Buffer | 4 μL | 1 | 95 °C | 3 min |
10 mM dNTP | 2 μL | 2 | 94 °C | 30 sec |
10 μM J558FR3Fw | 1 μL | 3 | 55 °C | 30 sec |
10 μM VHJ558.2 | 1 μL | 4 | 72 °C | 1:30 min |
High Fidelity DNA polymerase | 0.25 μL | Cycle 2-4 9x | ||
DNA | x (standardize to least concentrated sample) | |||
H2O | to 20 μL | 5 | 72 °C | 5 min |
Dilute PCR product 1:5 in H2O before proceeding to PCR #2 |
Table 3: Nested PCR of the JH4 intron. PCR components and thermocycler conditions for the first amplification reaction. Dilute the first PCR product 1:5 with water and use 1 μL of this dilution for the second PCR.
PCR #2 | ||||
Reagent | Volume | Thermocycler Conditions #2 | ||
5x Buffer | 4 μL | 1 | 94 °C | 3 min |
10 mM dNTP | 2 μL | 2 | 94 °C | 30 sec |
10 μM VHJ558.3 | 1 μL | 3 | 55 °C | 30 sec |
10 μM VHJ558.4 | 1 μL | 4 | 72 °C | 30 sec |
High Fidelity DNA polymerase | 0.25 μL | Cycle 2-4 21x | ||
Diluted PCR#1 | 1 μL | |||
H2O | to 20 μL | 5 | 72 °C | 5 min |
Table 4: PCR components and thermocycler conditions for the second PCR.
Reagent | Volume |
2x Buffer | 10 μL |
Purified PCR | x (standardize to least concentrated sample) |
Plasmid with blunt ends | 1 μL |
T4 DNA Ligase | 1 μL |
H2O | to 20 μL |
Incubate at room temp for 5 min or overnight at 16ºC |
Table 5: Ligation reaction. Components for the ligation of the purified JH4 intron PCR product into the plasmid.
FACS Buffer |
Heat inactivate FBS at 56 ˚C for one hour prior to use. Supplement PBS, pH 7.4 (Gibco, #10010049) with 2.5% (v/v) of heat-inactivated FBS. Store at 4˚C. |
DNA Extraction Buffer (100 mM Tris pH 8.0, 0.1 M EDTA, 0.5% (w/v) SDS) |
Add 50 mL of 1 M Tris pH 8.0, 100mL of 0.5 M EDTA, and 12.5 mL of 20% SDS. Add distilled water to 500 mL. Store at room temperature. |
TE Buffer (10 mM Tris pH 8.0, 1 mM EDTA) |
Add 2.5 mL of 1 M Tris pH 8.0, and 500 mL of 0.5 M EDTA. Add distilled water to 250 mL. Store at room temperature. |
Table 6: Buffer recipes.
Oligonucleotides List | ||
J558FR3Fw | 5’-GCCTGACATCTGAGGACTCTGC-3’ | |
VHJ558.2 | 5’-CTGGACTTTCGGTTTGGTG-3’ | |
VHJ558.3 | 5’-GGTCAAGGAACCTCAGTCA-3’ | |
VHJ558.4 | 5’-TCTCTAGACAGCAACTAC-3’ |
Table 7: Oligonucleotides used in the assay.
Supplementary Figure 1: Representative agarose gel image after completion of step 5.4. The JH4 intron nested PCR product was resolved on a 1.5% agarose gel and the 580 bp amplicon was excised. WT PP indicates that WT PP GCBC genomic DNA was used as a template for the first PCR and AID PP indicates that AID-/- PP GCBC genomic DNA was used as a template for the first PCR. ɸ indicates the no template PCR control and – indicates nothing was loaded into the well of the agarose gel. The last lane shows a 100 bp DNA ladder. Please click here to download this figure.
Characterizing SHM within the IgH and IgL V coding sequences of a heterogenous B cell population presents a challenge, given that each B cell uniquely reorganizes V coding segments during V(D)J recombination34. In this paper, we describe a method to identify mutations in the JH4 intron of GCBCs. The JH4 intron, which is located 3’ of the last J coding segment in the IgH locus, is used as a surrogate for SHM of V regions (Figure 1)31,33,34,35. To catalog these JH4 intron mutations and assess how specific genes affect the production or pattern of mutations, PP GCBCs are specifically analyzed. These cells accumulate JH4 intron mutations as a result of chronic stimulation by intestinal microbiota53. Furthermore, the B220+PNAHI GCBCs from the PPs of unimmunized mice have a mutation spectra that compares to splenic GCBCs from immunized animals54,55. However, mutations in the JH4 intron cannot be correlated to Ig affinity maturation because these mutations are non-coding.
To determine whether SHM alters Ig affinity, mice should be immunized intraperitoneally with an antigen, such as NP (4-hydroxy-3-nitrophenylacetyl) conjugated to CGG (chicken gamma globulin) or KLH (keyhole limpet hemocyanin)56. Subsequently, mRNA can be purified from splenic B220+PNAHI GCBCs to examine SHM within VH186.2, the V coding exon that most frequently recognizes NP and is mutated following NP-CGG or NP-KLH immunization31,57,58,59,60. Mutation of tryptophan-33 to a leucine in VH186.2 has been characterized to increase Ig affinity up to 10-fold59,60 and is, therefore, one indicator that SHM and clonal selection has generated high affinity Ig. Measuring NP7- and NP20-specific serum Ig titers by ELISA and calculating the Ig-specific NP7/NP20 ratio during the course of the immunization also documents Ig affinity maturation resulting from SHM of V regions17,21,36. Both these assays can be used to correlate SHM within the VH186.2 coding sequences with changes in NP-specific Ig affinity maturation.
Whether immunized or unimmunized animals are used to analyze SHM of VH186.2 or the JH4 intron, GCBCs must be accurately identified. We present a FACS based approach to isolate B220+PNAHI GCBCs. Alternatively, Fas and non-sulfated α2-6-sialyl-LacNAc antigen, which is recognized by the GL7 antibody61,62,63,64, can also be used to isolate GCBCs, which are identified as B220+Fas+GL7+65 or CD19+Fas+GL7+37. GL7 expression closely mirrors PNA in activated GCBCs of the lymph nodes64,65,66. In addition to using antibody markers specific for GCBCs, staining cocktails should maximize the excitation of a fluorophore and detection of a biomarker while minimizing spectral overlap of fluorescence emission. Antigens expressed at low levels should be detected with an antibody that is conjugated to a fluorophore with a robust emission fluorescence67. The recommended staining protocol was optimized for analysis on a cell sorter equipped with four lasers (405nm, 488nm, 561nm, 633nm) and 12 filters; however, filter configurations and laser availability vary between cytometers. To amend the protocol according to reagent and equipment availability, the reader is referred to additional resources, online spectrum viewers and published literature67,68,69,70,71,72,73. The multi-color staining protocol described herein requires compensation of spectral overlap to ensure that the sorted cell populations are GCBCs rather than inaccurate detection of fluorescence emission. B220 serves as a useful staining control for the described FACS (Table 1B) because PPs will have distinctive B220 negative and positive populations (Figure 3C), which allows for appropriate compensation of spectral overlap. The gating strategy presented in Figure 3C should be used as a guideline. The flow cytometry plots may vary depending on the staining conditions and cytometer settings. Nevertheless, 4-10% of the live cells should be B220+PNAHI 35, 52.
All mutations within the JH4 intron of PP GCBCs must be validated to ensure that the observed mutations are truly reflective of SHM and not an artefact of PCR or sequencing. AID-/- B cells can serve as a useful negative control when examining the SHM phenotype in other mutant mouse models because these cells cannot complete SHM17,19. The JH4 intron mutation rate in the of AID-/- GCBCs (1.66×10-5 mutations/bp)20,21,36,37,38,50,74 is comparable to the error rate of the high fidelity polymerase (5.3×10-7 sub/base/doubling)51,52 that is used to amplify the DNA in the nested PCR. If AID-/- mice are not available, compare the observed mutation pattern and frequency to the published literature. Ig V regions accumulate 10-3-10-4 mutations per base pair division, which is approximately 106-fold higher than the mutation rate of other gene loci73,75. Results may vary with the age of the animal76. Alternatively, B220+PNALO cells, which marks non-GCBCs, may be used as a negative control in the absence of AID-/- mice52. If the mutation frequency in WT GCBCs is lower than expected, the WT germline JH4 intronic sequence may be disproportionately represented. In this case, ensure that GCBCs were stained and sorted appropriately and PCRs are free from WT germline JH4 intron contamination. Additionally, raw sequencing data in electropherograms should be analyzed thoroughly to ensure that mutations in the sequence text data are not artefacts of sequencing errors. For example, poor Sanger sequencing results may reduce the reliability of the sequence data (Figure 4). This quality control of the Sanger sequence data will increase the accuracy and reproducibility of the JH4 intron mutation analysis.
The authors have nothing to disclose.
We thank Tasuku Honjo for the AID-/- mice. This work was supported by The National Institute on Minority Health and Health Disparities (5G12MD007603), The National Cancer Institute (2U54CA132378), and The National Institute of General Medical Sciences (1SC1GM132035-01).
0.2 ml PCR 8-tube FLEX-FREE strip, attached clear flat caps, mixed | USA Scientific | 1402-4708 | |
Ampicillin sodium salt | Fisher | BP1760-5 | |
APC-eFluor780 anti-CD45R/B220 | eBioscience | 47-0452-80 | clone RA3-6B2 |
BD FACSAria II | BD | 643186 | four lasers (405nm, 488nm, 561nm, 633nm) and 12 filters (PacBlue (450/50), AmCyan (502LP; 530/30), SSC (488/10), FITC (502LP; 530/30), PerCP-Cy5.5 (655LP; 695/40), PE (585/15), PE-Texas Red (600LP; 610/20), PE-Cy5 (630LP; 670/14), PE-Cy7 (735LP; 780/60), APC (660/20), Alexa700 (710LP; 730/45), APC-Cy7 (755LP; 780/60)) |
BD slip tip 1mL syringe | Fisher | 14-823-434 | sterile |
Biotinylated peanut agglutinin (PNA) | Vector Labs | B-1075-5 | |
C57BL/6J mice | Jackson Laboratories | 664 | |
Corning Falcon test tube with cell strainer snap cap | Fisher | 08-771-23 | |
DAPI (4',6-Diamidino-2-Phenylindole, dihydrochloride) | Fisher | D1306 | 0.5 mg/ml |
dNTP | NEB | N0447L | 10 mM |
ElectroMAX DH10B competent cells | Fisher | 18-290-015 | |
Falcon cell strainer 40mm | Fisher | 08-771-1 | |
Falcon round-bottom polystyrene tubes (FACS tubes) | Fisher | 14-959-5 | |
Falcon round-bottom polystyrene tubes (capped) | Fisher | 149591A | |
Fetal bovine serum | R&D Systems (Atlanta Biologicals) | S11150 | |
Gibco phosphate buffered saline PBS pH 7.4 | Fisher | 10-010-049 | |
Glycogen | Sigma | 10901393001 | |
Lasergene Molecular Biology (MegAlign Pro) | DNA Star | version 15 | |
PE anti-CD45R/B220 | BD | 553090 | clone RA3-6B2 |
Proteinase K | Fisher | BP1700-100 | |
Q5 High-Fidelity DNA Polymerase | NEB | M0491L | |
QIAquick Gel Extraction Kit | Qiagen | 28706 | |
Seal-Rite 1.5mL microcentrifuge tubes | USA Scientific | 1615-5500 | |
Streptavidin APC-eFluor 780 Conjugate | eBioscience | 47-4317-82 | |
T4 DNA ligase | NEB | M020L | |
Thermo Scientific CloneJET PCR Cloning Kit | ThermoFisher | FERK1231 | |
Tissue culture plate 6 well | Fisher | 08-772-1B | sterile |
Unlabeled anti-mouse CD16/CD32 (Fc block), BD | Fisher | BDB553142 | Clone 2.4G2 |