chromatinlectures 2010

83
Local and Global Effect of Chromatin

Upload: harru

Post on 11-Dec-2015

213 views

Category:

Documents


0 download

DESCRIPTION

romatinLectures 2012 gene expresion

TRANSCRIPT

Page 1: ChromatinLectures 2010

Local and Global Effect of Chromatin

Page 2: ChromatinLectures 2010

Reading from Molecular Biology of the Cell 5th edition

Pages 202-245

Page 3: ChromatinLectures 2010

Chromosomes- Structure composed of a very long DNA molecule and associated proteins that carries the genetic (and epigenetic) information

- Especially evident in plant and animal cells undergoing mitosis or meiosis, where each chromosome becomes condensed into a compact rod-like structure that is visible by light microscopy

Chromatin- Complex of DNA, histones and non-histone proteins that collectively make up chromosomes

- DNA, histones and non-histone proteins are subject to post-translational/replication modifications that form the basis of an epigenetic code

Chromosomes and Chromatin

Page 4: ChromatinLectures 2010

A. Interphase chromatin B. A replicated chromosome at mitosis

Compaction of chromatin is cell-stage dependent

Big, basic question: What is the relationship between structure of chromatin and gene expression ?

Page 5: ChromatinLectures 2010

Most interphase chromatin is condensed into 30nm coils.

Interphase chromatin ~500 fold compaction end-to-end

Mitotic chromatin 20X fold compaction end-to-end over interphase

levels of Chromatin compaction

Page 6: ChromatinLectures 2010

A) 30 nm fibersB) beads on a string-nucleosome from interphase nucleus

The nucleosome is a basic unit of chromatin 1974

Page 7: ChromatinLectures 2010

Nucleosome = a nucleosome core particle + linker DNA+ a linker histone

DNA length: 180-200 bp

Nucleosome core particle = histone octamer + 146 bp DNA

Nomenclature

Page 8: ChromatinLectures 2010

Histones- highly basic (+) proteins

Protein Molecular weight

Major basic Amino acids

H1 21 Lys++

H2a 13.8 Lys

H2b 13.8 Lys

H3 15.4 Arg/Lys

H4 11.4 Arg/Lys

Page 9: ChromatinLectures 2010

Histone fold-3 alpha helices and 2 folds

Page 10: ChromatinLectures 2010

Nucleosome core particle

2.8 A crystal structure of theMono-nucleosome

Page 11: ChromatinLectures 2010

1. DNA (146 bp) is wrapped in 1.75 left-handed superhelical turns

per nucleosome

2. One side of DNA is in contact with histone octamer, the other is

solvent exposed

3. DNA helical turns in a nucleosome have an average of 10.2 bases

per helical turn versus 10.5 for DNA in solution

How is nucleosomal DNA different from free DNA

Page 12: ChromatinLectures 2010

Translational positioning Rotational Positioning

linker

Core particle

Nucleosome Positioning

Page 13: ChromatinLectures 2010

Transcription factor binding to DNA is inhibited within nucleosomes

• Chromatin assembly inhibits transcription by all three RNA polymerases in vitro.

• In vivo genetic evidence links histones to repression (and activation as well).

• Affinity of transcription factors for its cognate DNA binding site is decreased when DNA is reconstituted into nucleosomes.

• The binding of TBP to the TATA box is very sensitive to chromatin assembly in vitro.

• Extent of inhibition is dependent on:

– Location of the binding site within the nucleosome

• binding sites at the edge are more accessible than the center

– The type of DNA binding domain

• Zn finger transcription factors bind to nucleosomal DNA more easily than bHLH domains.

Page 14: ChromatinLectures 2010

ArticleNature 442, 772-778 (17 August 2006)

A genomic code for nucleosome positioning

Eran Segal et al (Jonathan Widom)

Abstract

Eukaryotic genomes are packaged into nucleosome particles that occlude the DNA from interacting with most DNA binding proteins. Nucleosomes have higher affinity for particular DNA sequences, reflecting the ability of the sequence to bend sharply, as required by the nucleosome structure. However, it is not known whether these sequence preferences have a significant influence on nucleosome position in vivo, and thus regulate the access of other proteins to DNA. Here we isolated nucleosome-bound sequences at high resolution from yeast and used these sequences in a new computational approach to construct and validate experimentally a nucleosome–DNA interaction model, and to predict the genome-wide organization of nucleosomes. Our results demonstrate that genomes encode an intrinsic nucleosome organization and that this intrinsic organization can explain approx. 50% of the in vivo nucleosome positions. This nucleosome positioning code may facilitate specific chromosome functions including transcription factor binding, transcription initiation, and even remodelling of the nucleosomes themselves.

Page 15: ChromatinLectures 2010

Naked DNA

Chromatin

Lev

el o

f ac

cess

ibil

ity/

Tra

nsc

rip

tion

al a

ctiv

ity

Unavoidably High levels ofTranscription

Mechanisms for increased accessibility or activationMechanisms for

decreased accessibility or repression

High

low

Page 16: ChromatinLectures 2010

How to actively transcribe a gene embedded within repressive

chromatin?

• Cooperative binding of multiple factors.

• Utilize various chromatin remodeling activities to make chromatin compatible for transcription

TATA

Page 17: ChromatinLectures 2010

–Modulation by incorporation of histone variants

–Modulation by ATP-driven chromatin remodeling complexes

–Modulation by enzymes that post-translationally modify

histonesAcetylationMethylationUbiquitinationPhosphorylationADP-ribosylation

Histone association with DNA is dynamic ‘on its own’ 4 second average residencyDNA is exposed for 10 to 50 milliseconds

prior to reassociation

Mechanisms for chromatin remodeling

Page 18: ChromatinLectures 2010

Modulation by Histone Variants• In addition to the major histones H2A, H2B, H3 and H4, many

organisms also have distinct batteries of histone variants– H2AZ, H2AX and macroH2A H2AZ has been shown to associate with actively transcribed chromatin

regions H2AX has been shown to be crucial for chromatin decompaction during

DNA repair. Phosphorylation on its SQE/DØ sequence is one of the earliest events in response to double-strand DNA breaks

– H3.3 and CenH3s (CenpA in human) H3.3 is a replacement H3 variant CenH3s are centromere-specific H3 variants

• Histones H2B and H4 have very few variants

The chromatin structure and function can be different dependent on the presence or absence of histone variants which possess different amino acid sequences.

Page 19: ChromatinLectures 2010

• Yeast SWI/SNF– 10 proteins

– Needed for expression of genes involved in mating-type switching and sucrose metabolism (sucrose non-fermenting).

– Some suppressors of swi or snf mutants are mutations in genes encoding histones.

– Interacts with chromatin to activate a subset of yeast genes.

– Is an ATPase-containing complex

• Mammalian homologs: hSWI/SNF– The ATPase component is BRG1, related to Drosophila

Brahma

• Other ATP-dependent chromatin remodeling complexes have been discovered

Remodeling by ATP-driven chromatin remodeling complexes

Page 20: ChromatinLectures 2010

Involvement of SWI/SNF in transcription

• The SWI/SNF complex is required for transcriptional activation of 5% yeast genes.

Can be recruited directly through interaction with DNA binding transcription factors.

Can be recruited indirectly by interaction with other transcriptional coactivators or along with the RNA polymerase holoenzyme.

SWI/SNF complex exerts its major effect in transcriptional activation at a step subsequent to transcriptional activator-promoter recognition.

Dependent on the chromatin organization as well as the transcription factors involved, the SWI/SNF also contributes to transcriptional repression.

Page 21: ChromatinLectures 2010

How do chromatin remodeling complexes work?

• Structural alteration

• Nucleosome sliding

• Nucleosome eviction

The consequence of chromatin remodeling is dependent on the type of chromatin

remodeling complexes involved

A common route involves ATP hydrolysis to do the following:

Page 22: ChromatinLectures 2010

Nucleosome sliding induced by ISWI-containing complexes

• Make nucleosome mobile in the presence of ATP

• Also involved in nucleosome/chromatin assembly

• Has roles in both transcriptional activation and repression (occlude exposed

transcription factor binding site

NURF +ATP

Move nucleosome from its

stable, low energystate position

NURF +ATP

TF

NURF +ATP

Evenly spacenucleosomes thatwould otherwiseclump due toinfluence of DNAsequence

Expose transcription Factor binding site

Page 23: ChromatinLectures 2010
Page 24: ChromatinLectures 2010

Chromatin remodeling by covalent modification of

histones

Page 25: ChromatinLectures 2010

Multiple modifications provide for an enormous number of potential combinations

Page 26: ChromatinLectures 2010

Histone Acetyltransferases (HATs)

• Type A nuclear HATs: acetylate histones in chromatin.

• Type B cytoplasmic HATs: acetylate free histones prior to their assembly into chromatin.– Acetylate K5 and K12 in histone H4

Use acetyl-coA as donor coA

Page 27: ChromatinLectures 2010

Highly acetylated histones are associated with actively transcribed chromatin– Increasing histone acetylation can turn on some

genes– Chromatin immunoprecipitation (ChIP) of DNA with

antibodies against Ac-histones pulls down actively transcribed genes

– The acetylated chromatin is more “open”• Increased DNase sensitivity• Increased accessibility to transcription factors

and polymerases

Page 28: ChromatinLectures 2010

Some common coactivators are nuclear HATs

• Gcn5p is a yeast transcriptional activator of many genes in yeast that possesses HAT activity

• PCAF (p300/CBP associated factor) is the human homolog of yeast Gcn5p

• p300 and CBP are similar HATs that interact with many transcription factors (e.g. CREB, AP1 and MyoD)

• p300/CBP are needed for activation by many transcription factors, and thus are considered as general coactivators or

cointegrators

• p300/CBP has intrinsic HAT activity as well as binding to the HAT PCAF

Page 29: ChromatinLectures 2010

Roles of histone acetylation

• To increase access of transcription factors to DNA in nucleosomes.

• To decondense 30nm chromatin fibers• To serve as epigenetic marks for binding of

non-histone proteins (e.g. bromodomain proteins like Gcn5p/CBP/p300/PCAF) to chromatin

Page 30: ChromatinLectures 2010

Histone deacetylation is catalyzed by histone deacetylases and associated with

transcriptional repression

Histone deacetylases (HDACs):1. Three classes, about 20 identified members2. can be recruited by transcriptional repressors to

specific target genes and/or deacetylate histones in chromatin in a non-targeting, global fashion

3. Acetylation and deacetylation are very dynamic events

4. Aberrant histone deacetylation has been linked to cancer

Page 31: ChromatinLectures 2010

Mammalian HDACs have been classified into three classes

Class I (HDACs 1, 2, 3 & 8) homologs of yeast RPD3 and localize to the nucleus.

Class II (HDACs 4, 5, 6, 7, 9 & 10) are homologs of yeast Hda1 and are found in both the nucleus and cytoplasm.

Class III (Sirt1 - Sirt7) are homologs of yeast Sir2 and form a structurally distinct class of NAD-dependent enzymes found in both the nucleus and cytoplasm. 

Page 32: ChromatinLectures 2010
Page 33: ChromatinLectures 2010

Information about three classes of HDACs

• Class I HDACs are relatively small in size, abundant, ubiquitously expressed, mainly nuclear, sensitive to TSA and tend to associate with corepressor proteins to form large corepressor complexes.

• Class II HDACs are relatively larger in size, less abundant, shuffle between cytoplasm and nuclei, likely tissue-specific and sensitive to TSA.

• Class III HDACs are involved in silencing of rRNA genes , telomere silencing and polII-transcribed genes and not sensitive to TSA. They require NAD as a cofactor.

Page 34: ChromatinLectures 2010

Histone Methylation• Two chemical classes of HMTs: arginine specific-HMTs

and lysine-specific HMTs.• Both Arg and Lys can be mono-, di- or tri-methylated.• In contrast to histone acetylation, histone methylation is

believed to be stable until recently. This modification is still rather stable though. Turnover rate of histone methylation is similar to that of histone turnover

• Methylation does not neutralize positive charge on Lys and Arg.

• Methyl group is thought to be too small to have a direct effect on chromatin structure.

• The function of histone methylation is believed to be mediated through specific methylated histone binding proteins (as part of histone code hypothesis).

Page 35: ChromatinLectures 2010

Approaches for identification and characterization of HMTs

1. Biochemical purification of HMT activities using in vitro HMT assay.

2. Sequence similarity: testing the proteins containing a SET domain.

The first identified Arg-specific HMT is Carm1/PRMT4

The first identified Lys-specific HMT is SUV39h1 based on its similarity to a plant protein methyltransferase. The HMT activity resides in the SET domain

Page 36: ChromatinLectures 2010

HMTase Histones Sites

ySET2 H3 K36

G9a H3 K9

CARM1 H3 R2, R17, R26

PRMT1 H4 R3

Summary of Known HMTases and Their Target Sites

ySET1/mSET7 H3 K4

SUV39H1/Clr4/ESET H3 K9

DOT1 (no SET) H3 K79

SET8 H4 K20

Roles in transcription

elongation

silencing

activation

activation and elongation*

activation and elongation*

silencing/X-inactivation

activation

silencing, cell cycle

EZH1/EZH2 H3 K27 silencing/X-inactivation

Page 37: ChromatinLectures 2010

How histone methylation affects chromatin function?

• H4-R3 methylation facilitates acetylation of H4 by p300 (why it is associated with activation.

• H3-K9 methylation creates a binding site for HP1 and HP1 is known to associate with HDACs and involved in heterochromatin formation (why it is involved in repression)

• H3-K4 methylation prevents binding of NuRD• The underlining mechanism for many other

modifications is not clear

Page 38: ChromatinLectures 2010

Reversibility of histone methylation

Non-specific histone demethylation1. Histone tail clipping by an endoprotease.2. Histone replacement by unmodified histones or histone

variants.

Specific demethylation 1. Arginine demethylase: PAD4 (peptidylarginine

deiminase 4) 2. The amine oxidase family lysine demethylases LSD1

(lysine specific demethylase 1) H3K4Me and H3K9Me 3. The jmjc family demethylases

JHDM1: H3K36Me2JHDM2A and JHDM2B: H3K9Me2JHDM3A/JMJD2A: H3K9Me3/2 and H3K36Me3/2

Page 39: ChromatinLectures 2010

Histone phosphorylation• Phosphorylation on Ser-10 of H3 is involved in

both transcriptional activation and in chromosome condensation during mitosis

• Phosphorylation on Ser-10 of H3 may facilitate acetylation of histone H3 on Lys-9 and Lys-14

• Phosphorylation of histone H2AX variant is involved in DNA repair

• Phosphorylation of Ser14 of H2B is linked to apoptosis

• Many Ser and Thr sites in histone tails can be phosphorylated. In most cases the functional consequence is not clear

Page 40: ChromatinLectures 2010

Histone Ubiquitination• H2A (Lys-119) and H2B (Lys-123) can be mono-

ubiquitinated• Mono-ubiquitination is not associated with protein

degradation by the proteasome• H2B ubiquitination in yeast is catalyzed by Rad6

and is required for methylation on Lys-4 and Lys-79 of H3 (trans-histone effect). The underlining mechanism is unknown.

• Both ubiquitination and de-ubiquitination are involved in transcriptional regulation (Genes and Development 17: 2648-2663, 2003)

Page 41: ChromatinLectures 2010

5 8 12 163Ac-N-SGRGKGGKGLGKGGAKRHRKVLRDNIQGIT...Human H4

20

Me MeAc Ac Ac Ac

Human H3 N-ARTKQTARKSTGGKAPRKQLATKAARKSAP...Ac Ac Ac AcMe MeMe

4 9 14 18 23 27

p pMe MeMe

Me

Interplay Between Different Histone Modifications

1. At the level of modification2. At the level of function

P-S10 inhibits binding of HP1 to H3K9(Me)2/3.Fischle et al., Nature. 2005 Dec 22;438(7071):1116-22

1stPRMT1

2nd

p300

Page 42: ChromatinLectures 2010

The ‘Histone Code’

Histone code hypothesis: that multiple histone modifications, acting in combinatorial or sequential fashion on one or multiple histone tails, specify unique downstream functions.

How the histone code be read? Likely read by specific protein domains

Code Protein Motif

Ac-Lys Bromo – on SWI/SNF proteins

H3K9Me HP1 Chromo

H3K27Me Polycomb Chromo

Phos-H3 S10 ? Cell cycle/mitosis

Different combinations ?

H3K4Me WDR5 – Needed for Hox gene activation

Page 43: ChromatinLectures 2010

The functions of SWI/SNF and the SAGA complex are genetically

linked• Some genes require both complexes for activation.• Other genes require one or the other complex.• Many genes require neither – presumably they may utilize

different ATP-dependent complexes and/or HATs

Functional interplay among different chromatin remodeling factors

Page 44: ChromatinLectures 2010

Chromatin Structure and FunctionGlobal Level

Page 45: ChromatinLectures 2010

Interphase Nucleus: euchromatin vs heterochromatin

Page 46: ChromatinLectures 2010

46

Interphase chromatin exists in two general states

A. Euchromatin - Less-condensed state – open, dispersed and potentially active in transcription– often located near nuclear pores– acetylated histones, H3-K4 methylation, and low DNA

methylationB. Heterochromatin – more-condensed state

– Usually located at the periphery of the nucleus– this DNA in general is not transcribed– Hypoacetylated histones, low H3-K4 methylation,

methylation at H3-K9, H3-K27 and H4-K20, and DNA methylation

Page 47: ChromatinLectures 2010

22.228 lecutre 7 47

Constitutive vs facultative heterochromatin1. Constitutive heterochromatin, condensed at all times

– the centromere– the telomeres

2. Facultative heterochromatin, transient condensation, contains potentially active genes

– Inactive X chromosome known as the “Barr body”– facultative heterochromatin becomes more abundant in

cells as the organism matures from embryo to adult and aging, as cells specialize, and gene expression is restricted

Page 48: ChromatinLectures 2010

Epigenetics

Epigenetics is the study of heritable mechanisms that affect the transcriptional state of a gene which is not due to change in DNA sequence

Molecular mechanisms that mediate epigenetic phenomena include (but is not limited to): DNA methylation, histone modifications and RNAi machinery (specific chromatin structures that allow stable transcriptional activation or silencing)

Page 49: ChromatinLectures 2010

Position-effect variegation (PEV): euchromatic genes become subject to transcriptional silencing as a result of their placement adjacent to heterochromatin by chromosomal rearrangements. Gene silencing by heterochromatisation in PEV is clonally initiated in a variable number of cells resulting in the variegated phenotype.

Su(var) : genes identified in Drosophila genetic screens that contribute positively to heterochromatin formation. HP1, Su(var)3-9

E(var) : genes identified in Drosophila genetic screens that contribute negatively to heterochromatin formation Herman Muller 1938

Page 50: ChromatinLectures 2010

Figure 9-65. Position-effect variegation in Drosophila. (A) Heterochromatin (red) is normally prevented from spreading into adjacent regions of euchromatin (green) by special barrier sequences of unknown nature. In flies that inherit certain chromosomal translocations, however, this barrier is no longer present. (B) During the early development of such flies, the heterochromatin now spreads into neighboring chromosomal DNA, proceeding for different distances in different cells. The spreading soon stops, but the established pattern of heterochromatin is inherited, so that large clones of progeny cells are produced that have the same genes condensed into heterochromatin and thereby inactivated (hence the "variegated" appearance of some of these flies; see Figure 9-51B). This phenomenon shares many features with X-chromosome inactivation in mammals.

Page 51: ChromatinLectures 2010

H3K9 methylase

Positive feedback and heterochromatin spreading

Page 52: ChromatinLectures 2010

Insulator/Boundary element: elements that modulate interactions between other cis-acting sequences and separate chromatin domains with distinct condensation states. Thus, they are proposed to play an important role in the partitioning of the genome into discrete realms of expression.

Page 53: ChromatinLectures 2010

Chicken -globin domain

Insulator InsulatorCTCF binding site

Page 54: ChromatinLectures 2010

Imprinting: Genomic imprinting describes the preferential or exclusive expression of a gene from only one of the two parental alleles. The allele-specific expression of imprinted genes is based on allele-specific epigenetic modifications such as cytosine methylation and histone acetylation and methylation.

Imprinting Control Region

methylation

Read the paper that describeshow parental imprinting isrelated to GWAS studiesfor specific risk allelesingle nucleotide polymorphisms(SNPs)

Page 55: ChromatinLectures 2010

Chromatin domain: The organization of one or more genes into an i) expressed, ii) potentially expressed, or iii) silenced region defined by insulators and/or sites of attachment to the nuclear matrix (The activity state is often related to covalent modification of histones and/or DNA, in addition to the association of specific regulatory proteins).

i) Insulator/boundary element

ii) Scaffold attachment regions (SARs)

iii) Matrix attachment regions (MARs)

Page 56: ChromatinLectures 2010

Mapping higher orderChromatin interactions

Living cell

Formalin-Cross linkedchromatin

Ligate, thenReverse crosslinks and sequence

Page 57: ChromatinLectures 2010

Estrogen receptor- regulated genes form chromatin loops

Loop geneanchor gene anchor gene

ChIA-PET clones

ERIP

Page 58: ChromatinLectures 2010

For the ER ChIA-PET analysis:

Long range interactions are intrachromosomal

Loops vary between 100 kb to 1 Mb in distance

Stronger ER binding sites correlate with duplex/complexER chromatin loops

Anchor genes near interacting ER binding site loopsare expressed more strongly than genes near non-interacting ER binding sites

Page 59: ChromatinLectures 2010

Dosage Compensation

Males (XY) and females (XX) must generate equal amounts of most X-linked gene products

Three distinct mechanisms:Mammals: one of X chromosome is inactivated

D. melanogaster: Males double transcription from the single X chromosome to equal XX females

C. elegans: Females reduce transcription from both XX by half to equal XO males

Page 60: ChromatinLectures 2010

Dosage Compensation and X-chromosome Inactivation

Page 61: ChromatinLectures 2010

In mammals, Xist RNA is transcribed from the X inactivation center (XIC) and spreads to inactivate the X chromosome in cis. Xist transgenes inserted on autosomes also cause the spreading of transcriptional inactivation in cis.

In flies, there are about 35 chromatin entry sites (CES) on the X chromosome, including roX1 and roX2. Unlike the XIC in mammals, the entry sites in flies regulate twofold transcriptional activation.

1.7 kb ncRNA

Page 62: ChromatinLectures 2010
Page 63: ChromatinLectures 2010

Epigenomics - genome-wide study of epigenetic features on DNA

Epigenetics - refers to heritable changes in phenotype (appearance) or gene expression caused by mechanismsother than changes in the underlying DNA sequence

Omics - referring to totality of some sort

In the past: Studies of relationship between epigenetic marks on chromatin and gene expression focused on a single gene

Now/near future: Relationship between chromatin and gene expressioncan be understood across the genome

What new rules, patterns or statistics emerge from this effort?

Identification and analysis of functional elements in 1% of the human genome by the ENCODE pilot project Nature. 2007 June 14; 447(7146): 799–816. The ENCODE Project Consortium

200 experiments; 30 Mbases of DNA; 400 million data points

Page 64: ChromatinLectures 2010

How do we know that non-coding DNA isn’t junk DNA?

Conservation analysis:

~5% of human genome under purifying selection pressure based on all metazoan genomes

~20% of human and mouse genomes subject to purifying selection pressure

Only 1.2% of the human genome encodes for proteins (exons)

What is the significance of non-protein coding DNA?

Epigenomic studies might provide an answer to this question.

Page 65: ChromatinLectures 2010

Genomics - Primary DNA sequence - Evolutionary conservation of primary sequence - Shared synteny - positional co-localization of genes on chromosomes of different species – i.e., Hox cluster - Copy number variations, indels - SNPs – single nucleotide polymorphisms

Epigenomics – genome-wide mapping of: - DNA methylation of the genome - histone post translational modifications - DNA hypersensitive sites - DNA tethering to locus control regions - Transcription factor binding to DNA

Phenotype - GWAS – genome wide association studies (genes/loci associated with disease) - proteomics – spectrum of proteins expressed - manifestation of differentiated cell types and tissues

Transcriptome – transcriptional output – mRNA, miRNA, ncRNA

Page 66: ChromatinLectures 2010

Disease risk alleles

Personalized genomics is now an affordable reality

Illumina 550k SNP Chip array

Page 67: ChromatinLectures 2010

Tiling Array – much greaterCoverage than “expression arrays”up to 6 million probes – 545 timesbigger

Transcriptome profiling:

224K or tiling arrays usedto identify transcripts

rather than 11k expressionarrays based on RefSeqgene exons

Lots of transcriptional ‘dark matter’Non coding RNAs

Transcriptome profiling

Page 68: ChromatinLectures 2010

ChIP-CHIP mapping of the epigenome

Antibodies for acetylated or methylated histones, transcription factors,high mobility group factors, coactivators used for mapping the distributionof corresponding epigenetic features on the genome

Protein of interest

Page 69: ChromatinLectures 2010

Epigenomic identification of methylated DNA

Bisulfite sequencing of the genome – defines actual sites of DNA methylationbut is more costly

Page 70: ChromatinLectures 2010

Identification of DNAse hypersenstive sites – open chromatin

Also – Formaldehyde assisted isolation of regulatory elements (FAIRE) is usedisolates open instead of closed DNA

Page 71: ChromatinLectures 2010

Sample of ENCODE epigenomic anotation

Replicationtiming

DNAse hypersensitivesites

Regulatory factorBinding regions

http://genome.ucsc.edu/ENCODE/encode.hg18.html

Page 72: ChromatinLectures 2010

Looking at ENCODE data

ES

MEFH3K4me3

Page 73: ChromatinLectures 2010

A look at multiple histone marks in the mouse genome

H3K4me3 marks at 5’ end of genes

H3K36me3 – marks transcriptional elongationAox1 – expressed strongly in brain

Page 74: ChromatinLectures 2010

Developmentally poised promoters in ES cells

Activating and repressive marks both present in pluripotentCells – These represent genes with complex stage and tissueSpecific expression patterns

H3K27me3 – repressive mark – in differentiated cells it has theSame pattern as H3K9Me

Page 75: ChromatinLectures 2010

H3K36me3 – a mark for transcriptional elongation

Differentiation

Page 76: ChromatinLectures 2010

Proximal

DHS not near TSS

H3K4me1Marks insulatorDNA and enhancers

Page 77: ChromatinLectures 2010

Transcription factor and methylated histone proximity to transcription start sites (TSSs)

RFBRs – regulatory factor binding regions

repressive mark

Page 78: ChromatinLectures 2010

Evolutionary constraints in DNA sequence

Open chromatin,Transcription factorBinding and modifiedhistones

Page 79: ChromatinLectures 2010

Protein coding

Evolutionarily conserved DNA – relationship with coding sequence,exposed chromatin and RFBRs (transcription factors and methylated histones)

Ancient repeatsevo. neutral

contained within feature

Window surrounding feature

Page 80: ChromatinLectures 2010

Constrained sequences

Codingsequences

Ancient repeats

Open chromatin

Transcription factors

SN

P

Single nucleotide polymorphisms exist more frequently in regulatoryDNA sequences than in coding sequences

- more open to evolutionary change

Intraspecies constraint for small RNAs

This looks at differences between humans only

Page 81: ChromatinLectures 2010

H3K4me2 and H3K4me3 histone marks are biased towardCoding DNA sequence

Seila et al. Science 322, 1849 (2008)

Page 82: ChromatinLectures 2010

Cost and effort issues

# of epigenetic marks to look at (DNA methylation, H3K4me2, H3K4me3, H3K9me3, H4K20Me…

100

X

# of tissues and cell lines (NCI 60, others from ATCC, ES cells…)

100

X

# of biological conditions to assess (G1/S/G2/M, hormone treatment, growth factors…)

100 = 1,000,000 experiments

X the cost of a single ChIP-Seq or ChIP-CHIP run

$5000

= 100 x 100 x 100 x $5000 = $5,000,000,000 ($5 billion)

Page 83: ChromatinLectures 2010

The End