Genetic associations often remain unexplained at a functional level. This method aims to assess the effect of phenotype-associated genetic markers on gene expression by analyzing cells heterozygous for transcribed SNPs. The technology allows accurate measurement by MALDI-TOF mass spectrometry to quantify allele-specific primer extension products.
The number of significant genetic associations with common complex traits is constantly increasing. However, most of these associations have not been understood at molecular level. One of the mechanisms mediating the effect of DNA variants on phenotypes is gene expression, which has been shown to be particularly relevant for complex traits1.
This method tests in a cellular context the effect of specific DNA sequences on gene expression. The principle is to measure the relative abundance of transcripts arising from the two alleles of a gene, analysing cells which carry one copy of the DNA sequences associated with disease (the risk variants)2,3. Therefore, the cells used for this method should meet two fundamental genotypic requirements: they have to be heterozygous both for DNA risk variants and for DNA markers, typically coding polymorphisms, which can distinguish transcripts based on their chromosomal origin (Figure 1). DNA risk variants and DNA markers do not need to have the same allele frequency but the phase (haplotypic) relationship of the genetic markers needs to be understood. It is also important to choose cell types which express the gene of interest. This protocol refers specifically to the procedure adopted to extract nucleic acids from fibroblasts but the method is equally applicable to other cells types including primary cells.
DNA and RNA are extracted from the selected cell lines and cDNA is generated. DNA and cDNA are analysed with a primer extension assay, designed to target the coding DNA markers4. The primer extension assay is carried out using the MassARRAY (Sequenom)5 platform according to the manufacturer’s specifications. Primer extension products are then analysed by matrix-assisted laser desorption/ionization time of-flight mass spectrometry (MALDI-TOF/MS). Because the selected markers are heterozygous they will generate two peaks on the MS profiles. The area of each peak is proportional to the transcript abundance and can be measured with a function of the MassARRAY Typer software to generate an allelic ratio (allele 1: allele 2) calculation. The allelic ratio obtained for cDNA is normalized using that measured from genomic DNA, where the allelic ratio is expected to be 1:1 to correct for technical artifacts. Markers with a normalised allelic ratio significantly different to 1 indicate that the amount of transcript generated from the two chromosomes in the same cell is different, suggesting that the DNA variants associated with the phenotype have an effect on gene expression. Experimental controls should be used to confirm the results.
1) Cell Culture
2) DNA Extraction
3) RNA Extraction
4) Nucleic Acid Sample Preparation
5) PCR and Primer Extension Assay
6) Statistical Analysis
7) Experimental Controls
8) Representative Results
An example of allele-specific expression analysis is shown in Figure 3 with examples of mass spectrometry traces. The figure shows the results from the analysis of 4 allele-specific primer extension products carried out on 4 different templates. The top graphs represent the results obtained from the analysis of genomic DNA of two different cell lines, which are both heterozygous for a transcribed coding polymorphism. However, only cell line A is heterozygous for the risk DNA variant (Figure 1). The lower graphs represent the results obtained from the analysis of the cDNA generated from the same cell lines. As a result of experimental artifact the first peak is always higher than the second peak even in the genomic analysis. Therefore, the results from the genomic analysis are used to normalize the cDNA data. After normalization, the allele ratio is significantly different from 1 in cell line A (where the second peak appears lower compared to the other spectra), while it is very close to 1 in cell line B. The data suggest that cell line A carries a DNA variant, in phase with the allele measured by the second peak, which reduces the expression of the gene under analysis. Cell line B provides a very convenient negative control.
Figure 1. Cell line selection strategy. Cell lines A and B are heterozygous for a transcribed coding marker (triangle) but only cell line A is heterozygous for risk DNA variant (circle). The blue allele of the risk variant is associated (either with a direct effect or because in close correlation with the actual functional variant) with a lower level of transcription.
Figure 2. Allele-specific primer extension assay. This figure show the work-flow of the experimental procedure carried for both cDNA (left) and genomic DNA (right) templates. PCR primers (grey rectangles) are designed to amplify a heterozygous coding polymorphism (triangle). To avoid genomic contamination PCR primers anneal sequences placed in different exons (light blue bars) when amplification is carried out on cDNA template. The PCR product is then used as template for a primer extension reaction carried out with an extension primer, annealing one base next to the polymorphism, and the appropriate mix of three dideoxynucleotide (ddNTPs) terminators (squares) and one deoxynucleotide (circle) to extend the primer of one or two basis respectively. The resulting primer extension products have different masses which allow separation and quantification by MALDI-TOF mass spectrometry. The quantity of product corresponding to the blue allele of the DNA marker is relatively lower to the red allele.
Figure 3. Results of an allele-specific expression assay. The figure show the mass spectrometry results from the analysis carried out on the genomic DNA and the cDNA of cell line A and cell line B (Figure 1).
This method enables evaluation of the effect of disease-associated DNA variants on gene expression by assessing in vivo allele-specific difference in transcript level. In this protocol relative transcript abundance is measured by MALDI-TOF but other technologies allowing allele-specific quantification, such as TaqMan6,7, can be used. The major limitation of this approach is the availability of transcribed coding markers. Methods based on the principle described here, but taking advantage of different classes of polymorphisms, have been described. The haploChip assay measures allele-specific expression using markers located within 1 kb of the transcriptional start or end site of a gene which would be present in chromatin immunoprecipitated material isolated with antibodies specific to RNA polymerase II8. Alternatively, intronic SNPs can be used when the template is heteronuclear RNA9.
The specific protocol described here, in combination with the haploChip assay, has been used successfully to identify a candidate gene for dyslexia (or reading disability). Initially a genetic association study identified a DNA sequence associated with dyslexia and which was spanning three genes10. The allele-specific gene expression assay showed that in the presence of the dyslexia-associated DNA variants expression variation was observed for only one of the three genes but not the other two11.
The authors have nothing to disclose.
This work was funded by the Wellcome Trust through grants to A.P.M [076566/Z/05/Z], J.C.K. [074318] and a core award to the Wellcome Trust Centre for Human Genetics [075491/Z/04].
Material Name | Type | Company | Catalogue Number | Comment |
---|---|---|---|---|
PBS | Sigma | P-4417-100TAB | ||
SDS | Biorad | 161-0416 | ||
NaCl | VWR | 27788.366 | ||
Tris | Sigma | T1503-1KG | ||
EDTA | Sigma | E5134-500G | ||
RNase A | Qiagen | 19101 | ||
Proteinase K | Sigma | P2308-500MG | ||
Phenol: chloroform | Sigma | 77617 | ||
Chloroform | VWR | 100777C | ||
Ethanol | Sigma | 32221-2.5L | ||
Trizol | Invitrogen | 15596-018 | ||
RNeasy kit | Qiagen | 74104 | ||
SuperScript III Reverse Transcriptase | Invitrogen | 18080-044 | ||
Immolase Taq | Bioline | BIO-21048 | ||
dNTP | Sigma | DNTP100A | ||
Exonuclease 1 | NEB | M0293S | ||
Shrimp Alkaline Phosphatase | GE Healthcare | E70092Z | ||
SpectroCLEAN resin | Sequenom | 10053 | ||
MassEXTEND ddNTP/dNTP mix | Sequenom | Varies according to assay design | ||
MassEXTEND thermosequenase | Sequenom | 10052 | ||
Equipment | ||||
Chilled microcentrifuge | Eppendorf | |||
NanoDrop Spectrophotometer | Thermo Scientific | |||
SpectroPOINT nanolitre dispenser | Sequenom | |||
SpectroREADER mass spectrometer | Sequenom |