supplemental data - genes & developmentgenesdev.cshlp.org/.../supplemental_data.docx · web...

Supplemental Information

USP49 deubiquitinates histone H2B and regulates co-transcriptional pre-mRNA

splicing

Zhuo Zhang, Amanda Jones, Heui-Yun Joo, Dewang Zhou, Ying Cao, Shaoxia Chen,

Hediye Erdjument-Bromage, Matthew Renfrow, Hang He, Paul Tempst, Tim M.

Townes, Keith E Giles, Ligeng Ma, and Hengbin Wang

Supplemental Experimental Procedures

Purification of USP49 as a putative H2B deubiquitinase.

For conventional purification of USP49, Hela nuclear proteins were separated into

nuclear extracts and nuclear pellets using a previously described procedure (Dignam et al.

1983; Joo et al. 2007). Nuclear extract (6 g) was loaded onto a 700 ml P11 column

equilibrated with buffer C (20 mM Tris-HCl, pH 7.9, 0.2 mM EDTA, 1 mM DTT, 0.1

mM PMSF, 0.025 % NP-40, 10 % glycerol) containing 100 mM KCl (BC100). Proteins

that bound to the column were step eluted with BC300, BC500, and BC1000. The BC500

fraction was dialyzed against Buffer D (40 mM Hepes-KOH, pH 7.9, 0.2 mM EDTA, 1

mM DTT, 0.1 mM PMSF, 0.025 % NP-40, 10 % glycerol) containing 20 mM ammonium

sulfate (BD20) and then loaded onto a 200 ml DE52 column. Bound proteins were

stepwise eluted with BD350 and BD500. The BD350 fraction was then dialyzed against

BD20 and loaded onto a 45 ml HPLC-DEAE-5PW column (TOSOH Bioscience). Bound

proteins were eluted with an 8 column volume (cv) linear gradient from BD20 to BD500.

Fractions that contained H2B-deubiquitination activity were adjusted to BD600 with

saturated ammonium sulfate and loaded onto a 22 ml FPLC Phenyl Sepharose column

(Amersham Biosciences). Bound proteins were eluted with a 12 cv linear gradient from

BD600 to BD0. The active fractions were dialyzed against buffer P (5 mM Hepes-KOH,

pH 7.5, 0.04 M KCl, 0.01 mM CaCl2, 10 % glycerol, 1 mM DTT, 0.1 mM PMSF)

containing 10 mM potassium phosphate and loaded onto a 5 ml hydroxyapatite column

(Bio-rad). The column was eluted with a 20 cv linear gradient from BP10 to BP600.

Fractions that contained H2B-deubiquitination activity were dialyzed against BC100 and

loaded onto a MonoS column. The column was eluted with a 20 cv linear gradient from

BC100 to BC1000. Activity containing fractions were loaded onto a Superose 6 column

(Amersham Biosciences) that was equilibrated with BC500. The H2B deubiquitination

activity eluted between fraction # 60 to #63, corresponding to a molecular weight of 443

Purification of USP49 from sf9 cells and USP49 complex from HeLa stable cell line.

To purify USP49 from sf9 cells, full-length cDNA of USP49 was cloned into a

baculovirus vector with a Flag tag at the N-terminus and a 2 x Myc tag at the C-terminus.

USP49 was purified from sf9 cells infected with this baculovirus by sequential affinity

purification, first by anti-Flag immunoprecipitation and then by anti-Myc

immunoprecipitation (Joo et al. 2007). To reconstitute the recombinant USP49 complex

in vitro, baculovirus vectors encoding FLAG tagged USP49 orUSP49 catalytic mutant

and HA tagged RVB1 and SUG1 were individually transfected into sf9 cells and purified

by anti-Flag or anti-HA immunoprecipitation. Complex containing wild type USP49,

RVB1 and SUG1 or USP49 mutant, RVB1 and SUG1 were reconstituted in vitro by

dialyzing against BC50. To purify the USP49 complex, nuclear extracts (1.5 g) from

HeLa S3 stable cell lines expressing Flag-HA-USP49 were loaded onto a 220 ml P11

column and step eluted with BC100, BC300, BC500, and BC1000. BC500 fraction was

then loaded onto a 120 ml DE52 column and stepwise eluted with BD350 and BD500.

The BD350 fraction, which contains USP49, was loaded on a 45 ml DEAE5PW column

(TOSOH Bioscience) and bound proteins were eluted with an 8 cv gradient from BD20 to

BD500. Fractions that contain USP49 were pooled, dialyzed against BC-50 and used as

input for a 5 ml HiTrapQ column (GE Amersham). USP49 containing HiTrapQ fractions

were dialyzed against BC50 and used as input for anti-Flag immunoprecipitation. The

eluate from anti-Flag immunoprecipitation was subjected to a second

immunoprecipitation with anti-HA antibody. Elute from anti-HA immunoprecipitation

was dialyzed and used for silver staining, mass spectrometry identification, and histone

deubiquitination assays.

Mass spectrometry identification.

To identify all polypeptides in lane # 60 (Figure 1D, top panel), the entire gel lane was

divided into 27 pieces and subjected to mass spectrometry analysis with a high resolution

linear quadrupole ion trap Fourier transform ion cyclotron resonance mass spectrometer

(LTQ3 FT-ICR MS, ICR, Thermo Fisher Scientific). Briefly, individual gel slices

were reduced with 10 mm DTT at 37 °C for 45 min, alkylated with 50 mm iodoacetamide

at 37 °C for 45 min, and digested with trypsin overnight at 37 °C. Peptides were extracted

from the gel using 50% acetonitrile and concentrated in a speed vacuum. Tryptic digests

were loaded onto a 100-μm diameter, 11-cm pulled tip packed column with Jupiter 5-μm

C18 reversed-phase beads (Phenomenex) using a Micro AS autosampler and LC

nanopump (Eksigent). An acetonitrile gradient in 0.1% formic acid was run from 5 to

40% over 50 min at a flow rate of 650 nl/min. The eluting peptides were analyzed by

collision-induced dissociation fragmentation on a LTQ FT-ICR. The LTQ FT-ICR

parameters were set as described previously (Renfrow et al. 2007). Fully tryptic human

peptides were identified using TurboSequest version 27 (revision 12, Thermo Fisher

Scientific) (Eng et al. 1994) with a parent ion mass accuracy of 10.0 ppm from the

UniRef100 data base (06/2009).

To identify USP49 complex, purified protein complexes were resolved using

SDS-PAGE, followed by brief staining with Coomassie Blue and excision of the

separated protein bands. In gel trypsin digestion of polypeptides in each gel slice was

performed as described (Sebastiaan Winkler et al. 2002). The tryptic peptides were

purified using a 2-µl bed volume of Poros 50 R2 (Applied Biosystems, CA) reversed-

phase beads packed in Eppendorf gel-loading tips (Erdjument-Bromage et al. 1983). The

purified peptides were diluted to 0.1% formic acid and then subjected to nano-liquid

chromatography coupled to tandem mass spectrometry (nanoLC-MS/MS) analysis as

follows. Peptide mixtures (in 20 µl) were loaded onto a trapping guard column (0.3x5mm

Acclaim PepMap 100 C18 cartridge from LC Packings, Sunnyvale, CA) using an

Eksigent nano MDLC system (Eksigent Technologies, Inc. Dublin, CA) at a flow rate of

20 µl/min. After washing, the flow was reversed through the guard column and the

peptides eluted with a 5-45% acetonitrile gradient over 85 min at a flow rate of 200

nl/min, onto and over a 75-micron x 15-cm fused silica capillary PepMap 100 C18

column (LC Packings, Sunnyvale, CA). The eluent was directed to a 75-micron (with 10-

micron orifice) fused silica nano-electrospray needle (New Objective, Woburn, MA). The

electrospray ionization needle was set at 1800 V. A linear ion quadrupole trap-Orbitrap

hybrid analyzer (LTQ-Orbitrap, ThermoFisher, San Jose, CA) was operated in automatic,

data-dependent MS/MS acquisition mode with one MS full scan (450-2000 m/z) in the

Orbitrap analyzer at 60,000 mass resolution and up to five concurrent MS/MS scans in

the LTQ for the five most intense peaks selected from each survey scan. Survey scans

were acquired in profile mode and MS/MS scans were acquired in centroid mode. The

collision energy was automatically adjusted in accordance with the experimental mass

(m/z) value of the precursor ions selected for MS/MS. Minimum ion intensity of 2000

counts was required to trigger an MS/MS spectrum; dynamic exclusion duration was set

at 60 s.

Initial protein/peptide identifications from the LC-MS/MS data were performed

using the Mascot search engine (Matrix Science, version 2.3.02;

www.matrixscience.com) with the human segment of International Protein Index (IPI)

protein database (89,952 sequences; European Bioinformatics Institute, Hinxton, UK).

The search parameters were as follows: (i) two missed cleavage tryptic sites were

allowed; (ii) precursor ion mass tolerance = 10 ppm; (iii) fragment ion mass tolerance =

0.8Da; and (iv) variable protein modifications were allowed for methionine oxidation,

cysteine acrylamide derivatization and protein N-terminal acetylation. MudPit scoring

was typically applied using significance threshold score p<0.01. Decoy database search

was always activated and, in general, for merged LS-MS/MS analysis of a gel lane with

p<0.01, false discovery rate averaged around 1%.

Scaffold (Proteome Software Inc., Portland, OR), version 3_3_2 was used to

further validate and cross-tabulate the MS/MS-based peptide and protein identifications.

Protein and peptide probability was set at 95% with a minimum peptide requirement of 1.

RNA and ChIP sequencing and bioinformatics analysis.

mRNA samples were prepared as described in the Illumina mRNA sequencing sample

preparation guide. Briefly, mRNA was specifically enriched from total RNA using

oligo(dT) beads and sheared into small pieces. The fragments were then reverse-

transcribed into first-strand cDNA using random hexamer primers, followed by second

strand synthesis using DNA polymerase I. The short cDNA strands were ligated with 3’-

and 5’-adapters for amplification and sequencing. Chromatin immunoprecipitation was

prepared following a protocol provided by Dr. Zhibin Wang (Johns Hopkins University)

(Wang et al. 2009). Briefly, cells were cross-linked with 1% formaldehyde and chromatin

was disrupted into 100-250 bp fragments by sonication and precipitated using antibody as

indicated in the text and protein A beads. After elution from the beads and reverse cross-

linking, the DNA fragments in immunoprecipitates were extracted with

phenol/chloroform and recovered by ethanol precipitation. Immunoprecipitated DNA was

ligated with 3’- and 5’-adapters for amplification and sequencing by Illumina II.

The ChIP-seq reads were mapped to hg18 using Bowtie, and the RNA-seq reads

were mapped using TopHat (Langmead et al. 2009; Trapnell et al. 2009). Both

algorithms were run using the default parameters within Galaxy. PCR duplicates were

removed using Picard (http://picard.sourceforge.net). For visualization, the mapped reads

were converted into bedgraph format using BEDTools ‘genomeCoverageBed’ with the –

bg parameter and –split option as appropriate (Quinlan and Hall 2010). These datasets

were also normalized in the bedgraph format as reads per million total unique mapped

reads using the –scale parameter.

The mapped RNA-seq data (BAM format) was input into Cufflinks to assemble

transcripts (Roberts et al. 2011). Differentially expressed genes were identified with

Cuffdiff. The log2 fold change (Figure 4A) is a direct output from Cuffdiff. Exon

inclusion level was calculated for all internal exons with more than 10 spliced aligned

tags. We first calculated the A = number of spliced tags joining a single internal exon to

an upstream or downstream exon. We next calculated B = the number of spliced tags

linking an upstream exon to a downstream exon. Exon inclusion level was calculated as

A/(A+B), with a value of 0 representing complete skipping of the exon and a value of 1

indicative of complete inclusion. Splicing efficiency for each internal intron was

calculated as X/(X+.5(Y+Z) where X=RPKM of exon aligning tags, Y= RPKM of

upstream intron aligning tags and Z= RPKM of downstream intron aligning tags. The

uH2B, H3, H2B and USP49 ChIP-seq tags that aligned within these exons, genes, and 3’

and 5’ intron-exon junctions were calculated tags and the RPM for USP49KD and

Control cells were then compared. The meta-analyses were performed on intervals 500

base pairs downstream and 500 base pairs upstream of the 5’ and 3’ end of each

identified exon. The tag density at each position within the resulting 1000 base pairs per

exon were quantified using coverageBed –d . The average coverage at each position was

then calculated using groupBy –g 7 – c 8 – o mean option, all using BEDTools. The raw

data have been deposited in the NCBI database under GEO number GSE38101.

RNA immunoprecipitation

RNA immunoprecipitation was performed according to a previously published protocol

(Luco et al. 2010). Briefly, approximately 5X106 cells per IP were crosslinked with 1%

formaldehyde for 15 min at room temperature. After quick PBS-washing, cells were

harvested in 3 ml of ice-cold Lysis buffer (50 mM Tris-HCl pH 8, 100 mM NaCl, 5 mM

MgCl 2, 0.5% NP-40). Nuclei were pelleted at 2,000xg for 4 min at 4 °C. Pellets were

resuspended in 500 μl of IP buffer (50mM Tris-HCl pH 8, 140 mM NaCl, 1 mM EDTA,

1% Triton X-100, 0.1% Na-deoxycholate) for sonication (Fisher Scientific Sonic

Dismembrator Model 500, 14% Output for 21 times plus 21% output for 7 times with 10

seconds for each sonication and each interval). After spinning at 13200 rpm for 25

minutes to remove cell debris, the supernatants were incubated with 30 μl of protein A

beads for preclearing followed by incubation with U1A antibody (USbiological,

Swampscott, MA, Cat# U0005-60) for 2 hr and incubation with 35 μl protein A beads for

overnight at 4°C. Then the protein A beads were pelleted and washed sequentially in

RIPA buffer (50mM Tris-HCl pH 8, 150 mM NaCl, 1 mM EDTA, 1% Triton X-100,

0.1% SDS and 0.1% Na-deoxycholate) and high-salt RIPA (RIPA at 500 mM NaCl) for 5

min in a rotation wheel at 4 °C and then twice in TE buffer at room temperature for 2

min. Beads were eluted for 15 min at 65 °C in 200 µl of elution buffer (1% SDS and

100mM NaHCO3). Cross-linking was reversed by incubation of elution with 200 mM

NaCl (final concentration) for 4.5 h at 65 °C. 1 ml of Trizol (Invitrogen) and 200 µl of

chloroform were added and spun for 15 min at 12,000 g for separation of the phases. 1 ml

of cold isopropanol was added to precipitate the RNA which was then washed by 1 ml of

RNAse-free 75% ethanol. Then RNA pellet was dissolved in DEPC H2O and treated

with DNAse I (Invitrogen, Cat# AM1906M) to avoid DNA contamination. Then

immunoprecipitated and input RNA were reverse transcribed to cDNA using MMLV

reverse transcriptase (Promega, Cat# M1701) and random primer (Promega, Cat#

C1181). The cDNAs were analyzed by SYBR-Green real-time qPCR.

Chromatin bound and unbound fractionation

Fractionation of chromatin bound and unbound cell lysates were performed following a

published protocol (Fuchs et al. 2012) . Briefly, cells were lysed by Dounce

homogenizer on ice after being incubated with lysis buffer A (5mM HEPES pH7.9,

0.75mM MgCl2, 5mM KCl, 0.25mM dithiothreitol [DTT]) on ice for 30 min. After

centrifuging for 15 min at 3,300g at 4 °C, supernatant (S1) was collected, and the

pellet was resuspend in buffer B (buffer A supplemented with 0.34M sucrose, 10%

glycerol, 10mM NaF and 1mM Na3VO4) plus 0.5% NP40. Next, samples were incubated

on ice for another 30 min and centrifuged for 10 min at 3300g at 4 °C. Supernatant (S2)

was added to (S1) to make the “unbound” fraction. The pellet was resuspended in buffer

B to make the “bound” fraction.

Nucleoplasmic and chromatin bound RNA

The fractionation of nucleoplasmic and chromatin bound RNA was adapted from

Khodor, et al (Khodor et al. 2011). Inducible USP49 knockdown cell lines were grown

in 10 cm plates to confluence either with or without doxycycline. Cells were harvested

by scraping and washed twice in ice-cold PBS. Cells were resuspeneded in buffer (15

mM HEPESKOH at pH 7.6, 10 mM KCl, 5 mM MgOAc, 3 mM CaCl2, 300 mM sucrose,

0.1% Triton X-100, 1 mM DTT, and protease inhibitors) and lysed with a dounce

homogenizer (30 strokes, tight pestle). 0.5 mL aliquots of the cell lysate were layered

onto buffer B (15 mM HEPES-KOH at pH 7.6, 10 mM KCl, 5 mM MgOAc,3 mM

CaCl2, 1 M sucrose, 1 mM DTT, protease inhibitors), and centrifuged at 8000 rpm for 15

minutes at 4°C. The supernatant , containing nucleoplasmic RNA’s, was collected and

mRNA purified with TRIzol (Invitrogen) according to the manufacturer’s instructions.

The pellet, containing chromatin bound RNA’s, was resuspended with two strokes in a

dounce homogenizer in 5 volumes of nuclear lysis buffer (10 mM HEPES-KOH at pH

7.6, 100 mM KCl, 0.1 mM EDTA, 10% glycerol, 0.15 mM spermine, 0.5 mM

spermidine, 0.1 M NaF, 0.1 M Na3VO4, 0.1 mM ZnCl2,1 mM DTT, 13 Complete

protease inhibitors, 1 U/mL RNasin Plus [Promega]) . NUN buffer (25 mM HEPES-

KOH at pH 7.6, 300 mM NaCl, 1 M Urea, 1%NP-40, protease inhibitors) was added to

the nuclear suspension dropwise in a 1:1 ratio. After a 20 minute incubation on ice, the

sample was spun down at 14,000 rpm for 30 minutes at 4°C. The supernatant was RNA

was purified with TRIzol.

Sequences for primers and RNAi

The sequences of the primers for amplifying mRNAs are the following: HDM2: F:

TGTTGTGAAAGAAGCAGTAGCA, R: CCTGATCCAACCAATCACCT; PUMA: F:

GACGACCTCAACGCACAGTA, R: GTAAGGGCAGGAGTCCCAT; GAPDH: F:

GGTGGTCTCCTCTGACTTCAACA, R: GTTGCTGTAGCCAAATTCGTTGT. The

sequences of the primers for amplifying the introns are the following: RPS3 Intron 4: F:

TGTTTGCACACATTGAAGCA, R: CTCCAGGCCCTTTCACATTA; RPS6 Intron 4:

F: CTGAAAAGGCCTATGCTCCA, R: TGACAATTCCTGCCAATTCA; PUMA intron

3: F: TGTCCTGGATGAGGATGTGA, R: GCAGTTAGCAGGGGACTGAG;

HDM2 intron 8: F: AGGGAGAAGGCAGAAGGAAG, R:

GGCTAAATGACATGCCTGGT. Myc region 1: -287 to -179, F:

GAGATCCGGAGCGAATAGG, R: GCTGCTATGGGCAAAGTTTC; region 2: -115 to

+70, F: GAGGCTATTCTGCCCATTTG, R: GCATTCGACTCATCTCAGCA; region 3:

+51 to +185, F: TGCTGAGATGAGTCGAATGC, R: AGTCACTTTACCCCGATCCA.

GATAD2A Junction of Exon 4 and Intron 4: F: AAGTTGCGGCAGAGTCAAAT, R:

TGACATTGATGAGGGGATCA; SLMO2 Junction of Exon 2 and Intron 2 F:

GAAAGTTGCACAGCCACAGA, R: GGCTCCCACAACCTATTTGA; MAD2L1-

Junction of Exon 2 and Intron 2 F: CGGACTCACCTTGCTTGTAA, R:

GAGCGGAAAAATGACTGCTT; MAD2L1-Exon 2 F:

TCGGCATCAACAGCATTTTA, R: GAGTCCGTATTTCTGCACTCG; MAD2L1

Intron 2 F: ACTCTTCCCCCACCCACTAC, R: TGGCCACATTTACTGCAAAA

siRNAs for transient knockdown of endogenous USP49 in human cells and the scrambled

control siRNAs were purchased from Invitrogen (Grand Island, NY). The sequences of

the USP49 siRNAs are: #1: UAUACAUGUGACCUGACUGAGCAGC; #2:

UUAUCAUUGAGCACGUAGUCCUUGC; #3:

UUCCCGUGAUGCAUGACCACUGCGG. USP49 siRNA and scrambled control

siRNA were transfected into human cells using Lipofectamine2000 (Invitrogen, Grand

Island, NY) according to the manufacturer’s protocol.

The sequences of the oligonulaotides cloned into pBabe-U6 for transcription of shRNAs

against USP49 in human cells are: #1: GGTCATGCATCACGGGAAA; #2:

GGACTACGTGCTCAATGAT.

ReferencesDignam J, Martin P, Shastry B, Roeder R. 1983. Eukaryotic gene transcription with

purified components. Methods Enzymol 101: 582-598.Eng J, McCormack A, Yates J. 1994. An approach to correlate tandem mass spectral data

of peptides with amino acid sequences in a protein database. Journal of The American Society for Mass Spectrometry 5: 976-989.

Erdjument-Bromage H, Lui M, Lacomis L, Grewal A, Annan R, McNulty\ D, Carr S, Tempst P. 1983. Examination of micro-tip reversed-phase liquid chromatographic extraction of peptide pools for mass spectrometric analysis. J Chromatogr A 826: 167-181.

Fuchs G, Shema E, Vesterman R, Kotler E, Wolchinsky Z, Wilder S, Golomb L, Pribluda A, Zhang F, Haj-Yahya M et al. 2012. RNF20 and USP44 Regulate Stem Cell Differentiation by Modulating H2B Monoubiquitylation. Mol Cell 46: 662-673.

Joo H-Y, Zhai L, Yang C, Nie S, Erdjument-Bromage H, Tempst P, Chang C, Wang H. 2007. Regulation of cell cycle progression and gene expression by H2A deubiquitination. Nature 449: 1068-1072.

Khodor YL, Rodriguez J, Abruzzi KC, Tang CH, Marr MT, 2nd, Rosbash M. 2011. Nascent-seq indicates widespread cotranscriptional pre-mRNA splicing in Drosophila. Genes Dev 25: 2502-2512.

Langmead B, Trapnell C, Pop M, Salzberg SL. 2009. Ultrafast and memory-efficient alignment of short DNA sequences to the human genome. Genome Biol 10: R25.

Luco RF, Pan Q, Tominaga K, Blencowe BJ, Pereira-Smith OM, Misteli T. 2010. Regulation of Alternative Splicing by Histone Modifications. Science 327: 996-1000.

Quinlan AR, Hall IM. 2010. BEDTools: a flexible suite of utilities for comparing genomic features. Bioinformatics 26: 841-842.

Renfrow MB, MacKay CL, Chalmers MJ, Julian BA, Mestecky J, Kilian M, Poulsen K, Emmett MR, Marshall AG, Novak J. 2007. Analysis of O-glycan heterogeneity in

IgA1 Myeloma Proteins by Fourier Transform Ion Cyclotron resonance Mass Spectrometry: Implications for IgA Nephropathy. Analytical and Bioanalytical Chemistry 389: 1397-1407.

Roberts A, Trapnell C, Donaghey J, Rinn JL, Pachter L. 2011. Improving RNA-Seq expression estimates by correcting for fragment bias. Genome Biol 12: R22.

Sebastiaan Winkler G, Lacomis L, Philip J, Erdjument-Bromage H, Svejstrup JQ, Tempst P. 2002. Isolation and mass spectrometry of transcription factor complexes. Methods 26: 260-269.

Trapnell C, Pachter L, Salzberg SL. 2009. TopHat: discovering splice junctions with RNA-Seq. Bioinformatics 25: 1105-1111.

Wang Z, Zang C, Cui K, Schones DE, Barski A, Peng W, Zhao K. 2009. Genome-wide Mapping of HATs and HDACs Reveals Distinct Functions in Active and Inactive Genes. Cell 138: 1019-1031.

Supplemental Experimental data

Figure S1. Mass spectrometry analysis of the USP49 complex. The identified peptides

corresponding to USP49, RVB1, and SUG1 are highlighted. Numbers represent amino

acid position.

Figure S2. USP49 co-immunoprecipitates with RVB1 and SUG1. Western blot

analysis of immunoprecipitates from USP49 antibody and control IgG with nuclear

extracts as input. Antibodies used are indicated on the left side of the panel.

Figure S3. USP49 inducible knockdown cell lines established with different shRNA.

USP49 knockdown results in a specific increase of uH2B levels with no effects on uH2A

levels.

Figure S4. Scatter plot of splicing completion values for control and USP49

knockdown cells. Most exons are completely spliced in control and USP49 knockdown

cells (see box and whisker plot insert).

Figure S5. A box and whisker plot of splicing completion values for up- and down-

regulated exons before and after USP49 knockdown. Control and USP49 knockdown

cell splicing completion values are lower for exons down-regulated in USP49 knockdown

cells.

Figure S6. A box and whisker plot of the change in splicing completion values in

control and USP49 knockdown cells. In USP49 knockdown cells, down-regulated

exons show a decrease in splicing completion compared to the splicing completion for

these exons in control cells.

Figure S7. USP49 knockdown caused defects in splicing of chromatin associated

pre-mRNA. Regions corresponding to intron-exon junctions were amplified.

Figure S8 USP49 knockdown does not affect mRNA decay. Intron containing

transcripts are elevated in USP49 knockdown cells (compare 0 time point for filled or

empty boxes), but intron containing transcripts are degraded at similar rates in control

and USP49 knockdown cells.

Figure S9. Exons from genes with USP49 bound transcription start sites (TSS) show

greater changes in alternative splicing. Exon inclusion levels for down-regulated exons

from genes whose TSS were bound by USP49 are lower than up-regulated USP49 TSS

bound exons (p-value=4.32 x 10-46) or than exon inclusion levels for up-regulated (p-

value=1.66 x 10-55) or down-regulated (p-value=.004125) exons whose TSS were not

bound by USP49.

Figure S10. Exons from genes with USP49 bound transcription start sites (TSS)

show greater changes in splicing completion. Splicing completion levels for down-

regulated exons from genes with TSS bound by USP49 are lower than up-regulated

USP49 TSS bound exons (p-value=6.03 x 10-4) or than exon inclusion levels for up-

regulated (p-value=7.88 x 10-5) or down-regulated (p-value=.015757) exons whose TSS

were not bound by USP49.

Figure S11. USP49 binds exons down-regulated in USP49 knockdown cells. USP49 is

relatively enriched at exons as well as 3’ (p-value=9.45 x 10-20) and 5’ (p-value=1.43 x

10-19) splice junctions of exons (p-value=.021686) down-regulated but not up-regulated in

USP49 knockdown cells.

Figure S12. Increase of unspliced transcripts for exons down-regulated in USP49

knockdown cells. The median level of unspliced transcripts is increased for down-

regulated exons but not up-regulated exons (p-value=1.03 x 10-50) in USP49 knockdown

cells.

Figure S13. Knockdown of USP22 and USP12 does not affect pre-mRNA splicing. A. Western blot assay of H2A and H2B ubiquitination in control and USP22 and USP12 knockdown cells. Antibodies used are labeled on the left side of the panels. B. RT-qPCR analysis of selected genes in control and USP22 knockdown cells. C. RT-qPCR analysis of selected genes in control and USP12 knockdown cells.

Figure S14. 3’ splice site strength is elevated at down-regulated exons. The median strength of 3’ splice sites (p-value=0.00102) is increased for down-regulated exons but not up-regulated exons in USP49 knockdown cells. 5’ splice site (p-value=0.453562) strength is not significantly different for exons down-regulated and up-regulated in USP49 knockdown cells. Splice site strength was calculated using MaxEnt.

Figure S15. USP49 binding and H2B ubiquitination are highly correlated.

Figure S16. Genes bound by USP49 show greater uH2B levels in response to USP49 knockdown. A box and whisker plot of uH2B RPKM normalized tags for genes, exons and introns.

Figure S17. Chow-Ruskey plot of the overlaps between decreased splicing completion, elevated USP49 binding, and fold change in uH2B levels.

Figure S18. Fold change in H3 levels for exons down-regulated or up-regulated in USP49 knockdown cells. H3 levels are relatively stable in response to USP49 knockdown.

Figure S19. RPM normalized H3 tag counts are relatively stable across exons and 5’ and 3’ splice junctions.

Figure S20. Normalized H2B tag counts RPM is increased at exons as well as 5’ and 3’ splice junctions for exons down-regulated in USP49 knockdown cells.

Figure S21. U1A immunoprecipitation and RT-qPCR assay on -actin transcripts. A. RT-qPCR analysis of input. B. RT-qPCR analysis of U1A immunoprecipitate. Regions for real time PCR amplification, corresponding to exon-intron junctions, are shown at bottom.

Figure S22. ChIP assay of U1A (A) and U2B (B) binding to the 5’ and 3’ splice site of MAD2L1 gene. Regions for real time PCR amplification, corresponding to exon-intron junctions, are shown at bottom.

supplemental data - genes & developmentgenesdev.cshlp.org/.../supplemental_data.docx · web...

Documents

supplemental material - genes &...

supplemental data statistics - genes &...

pax9-deficient mice lack pharyngeal pouch derivatives and...

supplemental information - genes &...

weber suppl-revision-final e4894 - genes &...

supplementary materials and methods - genes &...

genes &...

research paper deubiquitinase usp9x deubiquitinates β

usp15 regulates neuromuscular functions through the ...usp15...

yod1 deubiquitinates nedd4 involved in the hippo signaling...

supplemental data - genes &...

supplementary info legend - genes &...

releasing cohesin from chromosome ... - genes &...

chip-seq data processing - genes &...

sting recruits nlrp3 to the er and deubiquitinates nlrp3...

integrative genome analysis of ... - genes &...

amsh, an escrt-iii associated enzyme, deubiquitinates

supplemental data - genes &...

loh et al. supplemental material - genes &...

supporting online material - genes &...