2016 inbre sequencing update and site-directed mutagenesis
TRANSCRIPT
AcknowledgementsWe would like to thank the PSU Research Advisory Council and Student Research Advisory Council for funding support. Research reported in this poster was also supported by an Institutional Development Award (IDeA) from the National Institute of General Medical Sciences of the National Institutes of Health under grant number P20GM103506. We would like to thank the UNH Hubbard Center for Genome Studies and Dartmouth College Molecular Biology Shared Resources Lab for sequencing. We would also like to thank Jon Bairam, Kevin Chesmore, Joel Dufour, Evyn Grimes, Ethan Johnson, Kathryn Kahrhoff, Lauren Oakes, Stacy Peterson, Ellen Rounds, Harlie Shaul, Kate-Lyn Skribiski, Amed Torres, Zoe White, and Alycia Wiggins for their contributions.
Conclusions
Department of Biological Sciences at Plymouth State University, Plymouth NH
References 1. Ahmed MS, Oie E, et al. (2004) Connective tissue growth factor--a novel mediator of angiotensin II-stimulated cardiac fibroblast activation in heart failure
in rats. J Mol Cell Cardiol 36(3): 393-404.2. Arnott, J. A., Lambi, A. G., Mundy, C. M., Hendesi, H., Pixley, R. A., Owen, T. (2011). The Role of Connective Tissue Growth Factor (CTGF/CCN2) in
Skeletogenesis. Critical Reviews in Eukaryotic Gene Expression, 21(1), 43–69.3. Bhangale TR, Stephens M, Nickerson DA (2006) Automating resequencing -based detection of insertion-deletion polymorphisms. Nat Genet 38:1457-
1462.4. Broughton G, Janis JE, Attinger CE (2006) The Basic Science of Wound Healing. Plastic and Reconstructive Surgery 117: 12S-34S.5. Chen MM, Lam A, et al. (2000) CTGF expression is induced by TGF- beta in cardiac fibroblasts and cardiac myocytes: a potential role in heart fibrosis. J
Mol Cell Cardiol 32(10): 1805-1819.6. Chuva De Sousa Lopes SM, Feijen A, et al. (2004) Connective tissue growth factor expression and Smad signaling during mouse heart development and
myocardial infarction. Developmental dynamics 231(3): 542-550.7. Dean RG, Balding LC, et al. (2005) Connective tissue growth factor and cardiac fibrosis after myocardial infarction. Journal of Histochemistry and
Cytochemistry 53: 1245-1256.8. Diegelmann RF, Evans MC (2004) Wound healing: An overview of acute, fibrotic and delayed healing. Frontiers in Bioscience 9: 283-289.9. Doherty, H (2010) The Role of Quantitative Variations in Connective Tissue Growth Factor Gene Expression in Cardiac Hypertrophy and Fibrosis. Chapel
Hill :11-12.10. Ensembl Genome Browser. (n.d.). Retrieved from http://www.ensembl.org/index.htm11. Fonseca C, Lindahl GE, et al. (2007) A polymorphism in the CTGF promoter region associated with systemic sclerosis. New England Journal of Medicine
357: 1210-1220.12. Frazier K, Williams S, et al. (1996) Stimulation of fibroblast cell growth, matrix production, and granulation tissue formation by connective tissue growth
factor. Journal of Investigative Dermatology 107: 404-411.13. "Heart Disease Facts." Centers for Disease Control and Prevention. Centers for Disease Control and Prevention, 10 Aug. 2015. Web. 03 May 2016. 14. Igarashi, A., Okochi, H., Bradham, D., & Grotendorst, G. 1993. Regulation of Connective Tissue Growth Factor Gene Expression in Human Skin Fibroblasts
and During Wound Repair. Molecular Biology of the Cell, 4: 637-645. 15. Jones E, Oliphant E, Peterson P, et al. 2001. SciPy: Open Source Scientific Tools for Python.16. Kelley LA et al (2015) The Phyre2 web portal for protein modeling, prediction and analysis17. Lasky J, Ortiz L, et al. (1998) Connective tissue growth factor mRNA expression is upregulated in bleomycin-induced lung fibrosis. American Journal of
Physiology: 275(2 Pt 1): L365-371. 18. Leask A, Abraham DJ (2003) The role of connective tissue growth factor, a multifunctional matricellular protein, in fibroblast biology. Biochemistry and
Cell Biology 81(6): 355-363.19. Matsui Y, Sadoshima J (2004) Rapid upregulation of CTGF in cardiac myocytes by hypertrophic stimuli: implication for cardiac fibrosis and hypertrophy.
Journal of molecular and cellular cardiology 37(2): 477-481.20. Milne, B., Moffitt, T., Crump, R., Poulton, R., Rutter, M., Sears, M., Taylor, A., and Caspi, A. 2008. How should we construct psychiatric family history
scores? A comparison of alternative approaches from the Dunedin Family History Health History Study. Psychological Medicine: 38(12): 1793-1802. 21. Mori T, Kawara S, et al. (1999) Role and interaction of connective tissue growth factor with transforming growth factor β in persistent fibrosis: A mouse ‐
fibrosis model. Journal of cellular physiology 181: 153-159.22. Nature Protocols 10: 845-858.23. "National Center for Biotechnology Information." National Center for Biotechnology Information. U.S. National Library of Medicine, n.d. Web.
<http://www.ncbi.nlm.nih.gov/>.24. Nickerson DA, Tobe VO, Taylor SL (1997) PolyPhred: automating the detection and genotyping of single nucleotide substitutions using fluorescence-
based resequencing. Nucleic acids research 25(14):2745-2751.25. Ohnishi, H., Okay, T., Kusachi, S., Nakanishi, T., Takeda, K., Nakahama, M., Doi, M., Murakami, T., Ninomiya, Y., Takigawa, M., & Tsuju, T. 1998. Increased
expression of connective tissue growth factor in the infact zone of experimentally induced myocardial infarction in rats. Journal of Molecular and Cellular Cardiology, 30: 2411-2422.
26. Paradis V, Dargere D, et al. (1999) Expression of connective tissue growth factor in experimental rat and human liver fibrosis. Hepatology 30: 968-976.27. Porter KE, Turner NA (2009) Cardiac fibroblasts: at the heart of myocardial remodeling. Pharmacology & therapeutics 123(2): 255-278.28. Ramensky V, Bork P, Sunyaev S (2002) Human non-synonymous SNPs: server and survey. Nucleic Acids Res 30(17):3894-3900.29. Shi-Wen X, Leask A, Abraham D (2008) Regulation and function of connective tissue growth factor/CCN2 in tissue repair, scarring and fibrosis. Cytokine &
growth factor reviews 19: 133-144.30. Sonnylal S, Shi-Wen X, et al. (2010) Selective expression of connective tissue growth factor in fibroblasts in vivo promotes systemic tissue fibrosis.
Arthritis & Rheumatism 62: 1523-1532.31. Sunyaev SR, Eisenhaber F, et al. (1999) PSIC: profile extraction from sequence alignments with position-specific counts of independent observations.
Protein Eng 12(5):387-394.32. Sun Y, Zhang JQ, et al. (2000) Cardiac remodeling by fibrous tissue after infarction in rats. J Lab Clin Med 135(4): 316-323.33. Van Rossum, G. 2007. Python programming language. In USENIX Annual Technical Conference.34. Wilson, Peter WF, et al (1998) Prediction of coronary heart disease using risk factor categories. Circulation 97(18): 1837-1847.35. Wynn TA (2008) Cellular and molecular mechanisms of fibrosis. Journal of Pathology 214: 199-210.
Further sequencing to identify SNPs within CTGF in a more diverse population
Insertion of additional variants into the CTGF vector
Investigate CTGF variants in a cell culture model of wounding to determine whether they impact fibrosis-related phenotypes
SNPs published in Ensembl are not fully representative of the larger human population
2 newly discovered SNPs may impact CTGF structure and function because they are nonsynonymous
Nonsynonymous SNPs in exon 2 do not appear to be associated with cardiovascular disease risk factors
The nonsynonymous SNP in exon 3 may be protective against cardiovascular disease, but a larger sample size is needed to make a definitive conclusion
Determination of CTGF SNP Frequencies and Vector Design by Site-Directed Mutagenesis
KM Jesseman, LL Smith, HA Gentile, ZM Stevens, AE Kennedy, and HE Doherty, PhD
Future Directions
Connective Tissue Growth Factor
Figure 2: Chromatograms from Sanger fluorescent sequencing generated by PolyPhred demonstrate the contrast between A) homozygous and B) heterozygous SNPs. Homozygous locations produce a single peak. Heterozygous alignments exhibit a primary peak and a secondary peak underneath, representing the two different nucleotide bases at the locus. Secondary peaks must reach at least 50% the height of primary peaks after subtraction of background fluorescence to be considered a true heterozygote.
Figure 1: Sequences obtained from the CTGF gene include the 5 exons, as well as a portion of the 5’ and 3’ untranslated regions. Each box represents an exon. Blue arrows indicate the amplicons used for sequencing and the direction in which they are sequenced. Each exon was amplified separately, except exons 1 and 2, which were amplified together.
PolyPhred Used to Identify SNPs
Cardiovascular disease is the leading cause of death in the United States. Following a heart attack, myocardial tissue becomes damaged which can lead to scarring of the heart known as fibrosis. Connective Tissue Growth Factor (CTGF), a gene involved in tissue repair, is upregulated and expressed in fibroblast cells when tissue damage occurs. Fibrosis develops due to continued remodeling of myocardial tissue after a heart attack. As a result, the heart’s ability to contract is compromised, reducing its elasticity which can lead to sudden cardiac death. Previous studies conducted in animal models suggest that increased expression of CTGF can lead to increased severity of fibrosis. Research in our lab focuses on identifying single nucleotide polymorphisms (SNPs) within CTGF that may alter its structure, function, and may potentially impact fibrotic phenotypes. In order to identify these SNPs, cheek swabs were collected from volunteers at PSU. DNA was extracted from the cheek cells, amplified via PCR, purified, and sent for sequencing at the Dartmouth or UNH sequencing facilities. Sequences were analyzed for homozygous and heterozygous SNPs using PolyPhred (University of Washington). Familial background information from volunteers was also gathered to see if SNPs correlated with a family history of obesity, cardiovascular disease, diabetes, high blood pressure, heart attack, stroke or a combination of these factors. Mutant CTGF alleles of interest were inserted into a CTGF vector via site directed mutagenesis. Future research includes testing CTGF vectors containing SNPs in a cell tissue culture model of wounding to determine whether they impact fibrosis-related phenotypes. By identifying SNPs that impact fibrosis risk, individualized treatment plans can be implemented for patients following a heart attack.
Methods
Introduction
Following a heart attack, myocardial tissue becomes damaged and invokes a robust healing response. However, excess healing of cardiac tissue can cause scarring of the heart known as fibrosis. The dense fibrotic tissue to build up reduces the heart’s ability to contract efficiently and the resulting dysfunction can lead to sudden cardiac death. Connective Tissue Growth Factor (CTGF) is a gene involved in tissue repair and remodeling. Research in our lab focuses on identifying Single Nucleotide Polymorphisms (SNPs) that may potentially alter the structure and/or function of CTGF. To do so, cheek cell swabs were collected from volunteers at PSU, amplified via PCR, purified, and sent for sequencing at Dartmouth Molecular Biology Core sequencing facility. Once sequences were returned, they were analyzed for SNPs using PolyPhred software (University of Washington). To date, 9 SNPs have been detected; 4 novel and 5 previously identified. Of the 9 variants, 4 are nonsynonymous mutations. Family history of cardiovascular disease and risk factors were recorded by survey at the time of cheek cell collection. A Kendall’s Tau-b test was used to detect any significant associations between family history and SNPs in exon 2 and 3, but none were detected. Variants of interest were inserted in a vector via site-directed mutagenesis. CTGF variants will be further examined in a model of wounding to determine if they impact fibrosis-related phenotypes. By doing so, individualized treatment plans could be developed for patients after suffering a heart attack.
Abstract No Correlation Between SNPs and Family History Scores
Figure 3: A) Diagram of pUC19 vector containing WT CTGF, the BamHI and XbaI cut sites used to insert it, an ampicillin resistance gene, and the location of the T482C base change. B) Chromatograms showing alignment of WT vector and vector following mutagenesis with a base change of T to C.
Results: Site Directed Mutagenesis generated a successful base change from T to C at cDNA location 482 in the CTGF gene. The base change will result in a V94A amino acid change in the protein. Efforts continue for C57Y, H83D, and V174M mutagenesis.
Successful Insertion of SNP into Vector
Table 2: Kendall’s Tau-b values were calculated for each risk factor and a combined risk of all factors. Values for b range from -1 to 1 with values < 0 suggesting a negative correlation, values > 1 suggesting positive correlation, and values close to 0 suggesting no correlation. p < 0.05 was considered statistically significant. Risk factors were tested for association to two nonsynonymous SNPs in exon 2 and one in exon 3. Exon 2 samples with variants contained both G371A and T482C (n=5) and the WT sample size was n=39. The exon 3 variant tested was G721A (n=1) and the WT sample size was n=73.
Results: No correlation was observed between individuals with the exon 2 variants (G371A and T482C) and family history score, suggesting this subset of variants does not impact cardiovascular disease risk factors. No statistically significant correlation was observed between the G721A variant in exon 3 and family history score for individual risk factors. Combined risk showed a Tau-b value of -0.127 and a p-value of 0.108, suggesting there may be a negative correlation between the SNP and combined risk. In order to determine if the exon 3 SNP results in protection against cardiovascular disease risk, a larger sample size needs to be collected.
DNA Preparation• Cheek cell samples were obtained from Plymouth State University student volunteers. • DNA was extracted from the cells.• Exons 1 and 2 of CTGF were PCR amplified together using Phusion polymerase (Thermo-Fisher).• Exons 3 through 5 of CTGF were PCR amplified separately for each sample using Dream Taq
polymerase (Thermo-Fisher). • Gel electrophoresis was used to confirm PCR amplification.• Successfully amplified samples were purified and sent out for fluorescent Sanger sequencing at
UNH Hubbard Center for Genome Sciences or Dartmouth College Molecular Biology Core Facility.SNP Detection• PolyPhred (v 6.18) software assigns a quality score to each fluorescent peak based on peak height
and location while filtering out background fluorescence. Base changes are then identified through alignment to the published human sequence (Ensembl.org Build 38, Release 85).
• Homozygous SNPs were detected by the presence of primary peaks only. • Heterozygous SNPs were identified by detecting secondary peaks at the same locus which are at
least 50% the height of the corresponding primary peak. • Population frequencies were calculated for each SNP and, for those previously detected, compared
to published frequencies in Ensembl.Site Directed Mutagenesis• The four nonsynonymous SNPs observed in the PSU population were introduced into a previously
designed CTGF vector with a pUC19 base and the Wild Type (WT) human CTGF gene.• The WT allele was determined for our population by CTGF resequencing and includes three
SNPs not present in the Ensembl published sequence.• Synonymous SNPs at cDNA positions A420C and A441C• Nonsynonymous SNP at cDNA position C448G, causing an H83D amino acid change
• Q5 Site Directed Mutagenesis Kit (NEB) was used to insert G371A, G448C, T482C, or G721A• Samples were sent for sequencing at the Dartmouth Molecular Biology Core Facility to determine if
the desired mutation had occurred.Family History Association • A survey of family history for cardiovascular disease was collected along with cheek cell samples. • For each incidence of disease in the individual or 1st degree relative, a value of 1 was assigned. For
2nd degree relatives, incidence of disease was assigned a value of 0.5. To determine combined disease risk all factors were summed (similar to Milne et al 2008).
• Variant scores were assigned as 0 for no copies, 1 for one copy, and 2 for two copies. • A custom program in Python using a statistical package from SciPy was used to calculate p-values
and Kendall’s Tau-b values (Van Rossum, 2007) (Jones et al., 2001)• Analysis was run for nonsynonymous SNPs in exons 2 and 3
9 SNPs Detected in PSU Sample
Table 1: Summary of SNPs identified in CTGF including exon, Ensembl location (Build 38, Release 85), published frequency, PSU sample frequency, nucleotide change, amino acid change, and sample size (chromosomes). Individuals studied are primarily of European descent.
Results: Overall, 9 SNPs were detected. 4 SNPs were novel and 5 were previously identified. SNP frequencies in our population are similar to frequencies previously published in Ensembl. 2 common variants in exon 2 were found that are not currently published. The remaining 2 novel SNPs were uncommon with a frequency of 3%, occurring in the exon 5 and the 3’ UTR of CTGF. G371A and T482C SNPs in exon 2 were always observed together, as well as C1224T and C1362T in exon 5 and the 3’ untranslated region. Of the 9 detected SNPs, 4 were nonsynonymous and, therefore, could potentially impact the structure and function of CTGF.
Exon Ensembl Location
cDNA Location
Published Frequency
PSU Sample Frequency
Nucleotide Change
Amino Acid Change
Sample Size
2 6:131950889 371 Novel 0.06 G/A C57Y
91
2 6:131950840 420 >0.99 0.99 A/C Syn
2 6:131950819 441 >0.99 0.99 A/C Syn
2 6:131950812 448 >0.99 0.99 C/G H83D
2 6:131950778 482 Novel 0.06 T/C V94A
3 6:131950313 721 <0.01 <0.01 G/A V174M 159
5 6:131949291 1224 Novel 0.03 C/T Syn221
5 6:131949279 1236 <0.01 <0.01 C/T Syn
3’ UTR 6:131949153 1362 Novel 0.03 C/T UTR 221
Risk FactorExon 2 Tau-b
Exon 2 p-value
Exon 3 Tau-b
Exon 3 p-value
Obesity -0.046 0.663 -0.078 0.325
Cardiovascular Disease -0.052 0.618 -0.078 0.325
Diabetes 0.087 0.404 -0.063 0.426
High Blood Pressure -0.039 0.705 -0.104 0.188
Heart Attack/Stroke <65 -0.088 0.402 -0.062 0.437
Heart Attack/Stroke >65 0.053 0.611 -0.067 0.396
Combined Risk -0.017 0.870 -0.127 0.108
AmprA
CTGF
pUC19 with CTGF insert
6934 bpXbaI
1
2000
6000
4000
BamHI
T482C
WT Vector
T482C Vector
B
A B