protein, is the mammalian homolog of drosophila nona

11
MOLECULAR AND CELLULAR BIOLOGY, Sept. 1993, p. 5593-5603 0270-7306/93/095593-11$02.00/0 Copyright © 1993, American Society for Microbiology NonO, a Non-POU-Domain-Containing, Octamer-Binding Protein, Is the Mammalian Homolog of Drosophila nonA YIH-SHENG YANG,1 JEFFREY H. HANKE,lt LEON CARAYANNOPOULOS,1 CHERYL M. CRAFT,2 J. DONALD CAPRA,' AND PHILIP W. TUCKER'* Departments of Microbiology' and Psychiatry,2 The University of Texas Southwestern Medical Center at Dallas, 5323 Harry Hines Boulevard, Dallas, Texas 75235-9048 Received 11 January 1993/Returned for modification 17 February 1993/Accepted 14 June 1993 We have cloned the ubiquitous form of an octamer-binding, 60-kDa protein (NonO) that appears to be the mammalian equivalent of the Drosophila visual and courtship song behavior protein, no-on-transient A/dissonance (non4dis,). A region unprecedently rich in aromatic amino acids containing two ribonuclear protein binding motifs is highly conserved between the two proteins. A ubiquitous form of NonO is present in all adult tissues, whereas lymphocytes and retina express unique forms of NonO mRNA. The ubiquitous form contains a potential helix-turn-helix motif followed by a highly charged region but differs from prototypic octamer-binding factors by lacking the POU DNA-binding domain. In addition to its conventional octamer duplex-binding, NonO binds single-stranded DNA and RNA at a site independent of the duplex site. All VH and VL promoters and the immunoglobulin heavy- chain (IgH) enhancer contain an essential octamer motif (ATGCAAAT). Deletion of the promoter octamer results in a drastic reduction of Ig gene expression, to almost back- ground levels in B cells (10, 32). A heptamer motif (CT CATGA), located 2 to 21 bp upstream of the octamer, contributes to full promoter activity (7, 42), apparently by serving as a low-affinity, cooperative binding site for factors that bind the octamer (23, 29). Conventional octamer-binding proteins are part of a large gene family that contains a DNA-binding motif, the POU domain. OTF-1 (Oct-1), present in all cell types, and OTF-2 (Oct-2), found primarily in B cells and the brain, share no similarity other than their POU domains (reviewed in refer- ences 22, 45, and 51). All POU domain proteins appear to function as positive or negative transcription factors, with the major activating domain(s) of known members located outside the POU domain (22, 45). The findings that OTF-2 is found predominantly in B cells and that the octamer motif is conserved in all Ig promoters led to the hypothesis that this protein is necessary and sufficient for tissue-specific Ig gene expression (28, 48, 53). However, OTF-1 can stimulate an Ig VH promoter in vitro as efficiently as OTF-2 can (18, 41). Several pre-B-cell lines have no detectable OTF-2 RNA or DNA-binding activity yet transcribe VH genes as well as do mature B cells, which have moderate levels of OTF-2 (18). This finding raised the possibility that in vivo, OTF-1 and/or another octamer-binding factor is sufficient for Ig expres- sion, particularly during early B-cell differentiation. We isolated a nuclear protein from a murine B-cell leuke- mia line (BCL1) that footprinted the octamer motif indistin- guishably from OTF-2 (11). Here we characterize the com- plementary DNA clone encoding a ubiquitous form of this 60-kDa molecule, which we designate NonO. NonO lacks conventional POU domains, but it possesses distinct prop- erties sufficient to suggest functional roles for it in immuno- logic and/or neurologic gene regulation. These include a * Corresponding author. t Present address: Department of Molecular Genetics, Pfizer Inc., Groton, CT 06340. striking homology with no-on-transient A (nonAdiSS), a pro- tein in Drosophila melanogaster involved in at least two different abnormalities: reduced visual acuity and aberrant male courtship songs (38). nonAdiss mutants demonstrate an electrophysiological defect in characteristic on- and off- transients (16, 39). These signals arise from electrical activ- ity in neurons of the lamina, the first optic ganglia postsyn- aptic to photoreceptor cells of the eye. Extensive behavioral characterization of nonAdi,s flies revealed reduced visual acuity, decreased detection of intensity differences, and impaired optomotor responses (13), phenotypes suggestive of a defect in the processing of visual information. A second mutation, dissonance (diss), associated with aberrant court- ship songs, has many of the visual defects found in nonA (26). diss and nonzA alleles exhibit a complex pattern of complementation, leading to the speculation that the same nonA gene product (NONA) is used in different locations within the nervous system to affect different regulatory patterns (20, 26). This view was confirmed by the recent cloning of the gene and complementation of mutants (1, 20, 44). The NONA protein has no known function, but it contains consensus RNA-binding sites, prompting the sug- gestion (1, 44) of a role in RNA metabolism. MATERIALS AND METHODS Protein purification and peptide sequencing. The purifica- tion of NonO by virtue of binding to an Ig promoter element has been described previously (11). Sodium dodecyl sulfate (SDS)-polyacrylamide gel electrophoresis (PAGE)-separat- ed 60-kDa protein was electroeluted and subjected to trypsin digestion. The resultant peptides were fractionated by re- verse-phase high-pressure liquid chromatography (HPLC) using a C18 column and a trifluoroacetic acid-acetonitrile gradient. The major peaks were collected and subjected to amino acid sequencing on an Applied Biosystems 477A amino acid sequencer. The following peptide sequences were obtained: FGQAATME, FAQPGSFEYEYAMRW, IVEFSGK?AA, ?K?EMEMEAA, and MGQMAMGGAMG INN. Identification and sequencing of ubiquitous NonO cDNA. A cDNA library was constructed in an Okayama and Berg 5593 Vol. 13, No. 9

Upload: phamkhue

Post on 31-Dec-2016

220 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Protein, Is the Mammalian Homolog of Drosophila nonA

MOLECULAR AND CELLULAR BIOLOGY, Sept. 1993, p. 5593-56030270-7306/93/095593-11$02.00/0Copyright © 1993, American Society for Microbiology

NonO, a Non-POU-Domain-Containing, Octamer-BindingProtein, Is the Mammalian Homolog of Drosophila nonA

YIH-SHENG YANG,1 JEFFREY H. HANKE,lt LEON CARAYANNOPOULOS,1 CHERYL M. CRAFT,2J. DONALD CAPRA,' AND PHILIP W. TUCKER'*

Departments ofMicrobiology' and Psychiatry,2 The University of Texas Southwestern Medical Centerat Dallas, 5323 Harry Hines Boulevard, Dallas, Texas 75235-9048

Received 11 January 1993/Returned for modification 17 February 1993/Accepted 14 June 1993

We have cloned the ubiquitous form of an octamer-binding, 60-kDa protein (NonO) that appears to be themammalian equivalent of the Drosophila visual and courtship song behavior protein, no-on-transientA/dissonance (non4dis,). A region unprecedently rich in aromatic amino acids containing two ribonuclearprotein binding motifs is highly conserved between the two proteins. A ubiquitous form of NonO is present inall adult tissues, whereas lymphocytes and retina express unique forms of NonO mRNA. The ubiquitous formcontains a potential helix-turn-helix motif followed by a highly charged region but differs from prototypicoctamer-binding factors by lacking the POU DNA-binding domain. In addition to its conventional octamerduplex-binding, NonO binds single-stranded DNA and RNA at a site independent of the duplex site.

All VH and VL promoters and the immunoglobulin heavy-chain (IgH) enhancer contain an essential octamer motif(ATGCAAAT). Deletion of the promoter octamer results ina drastic reduction of Ig gene expression, to almost back-ground levels in B cells (10, 32). A heptamer motif (CTCATGA), located 2 to 21 bp upstream of the octamer,contributes to full promoter activity (7, 42), apparently byserving as a low-affinity, cooperative binding site for factorsthat bind the octamer (23, 29).

Conventional octamer-binding proteins are part of a largegene family that contains a DNA-binding motif, the POUdomain. OTF-1 (Oct-1), present in all cell types, and OTF-2(Oct-2), found primarily in B cells and the brain, share no

similarity other than their POU domains (reviewed in refer-ences 22, 45, and 51). All POU domain proteins appear tofunction as positive or negative transcription factors, withthe major activating domain(s) of known members locatedoutside the POU domain (22, 45). The findings that OTF-2 isfound predominantly in B cells and that the octamer motif isconserved in all Ig promoters led to the hypothesis that thisprotein is necessary and sufficient for tissue-specific Ig geneexpression (28, 48, 53). However, OTF-1 can stimulate an IgVH promoter in vitro as efficiently as OTF-2 can (18, 41).Several pre-B-cell lines have no detectable OTF-2 RNA orDNA-binding activity yet transcribe VH genes as well as domature B cells, which have moderate levels of OTF-2 (18).This finding raised the possibility that in vivo, OTF-1 and/oranother octamer-binding factor is sufficient for Ig expres-sion, particularly during early B-cell differentiation.We isolated a nuclear protein from a murine B-cell leuke-

mia line (BCL1) that footprinted the octamer motif indistin-guishably from OTF-2 (11). Here we characterize the com-

plementary DNA clone encoding a ubiquitous form of this60-kDa molecule, which we designate NonO. NonO lacksconventional POU domains, but it possesses distinct prop-erties sufficient to suggest functional roles for it in immuno-logic and/or neurologic gene regulation. These include a

* Corresponding author.t Present address: Department of Molecular Genetics, Pfizer Inc.,

Groton, CT 06340.

striking homology with no-on-transient A (nonAdiSS), a pro-tein in Drosophila melanogaster involved in at least twodifferent abnormalities: reduced visual acuity and aberrantmale courtship songs (38). nonAdiss mutants demonstrate anelectrophysiological defect in characteristic on- and off-transients (16, 39). These signals arise from electrical activ-ity in neurons of the lamina, the first optic ganglia postsyn-aptic to photoreceptor cells of the eye. Extensive behavioralcharacterization of nonAdi,s flies revealed reduced visualacuity, decreased detection of intensity differences, andimpaired optomotor responses (13), phenotypes suggestiveof a defect in the processing of visual information. A secondmutation, dissonance (diss), associated with aberrant court-ship songs, has many of the visual defects found in nonA(26). diss and nonzA alleles exhibit a complex pattern ofcomplementation, leading to the speculation that the samenonA gene product (NONA) is used in different locationswithin the nervous system to affect different regulatorypatterns (20, 26). This view was confirmed by the recentcloning of the gene and complementation of mutants (1, 20,44). The NONA protein has no known function, but itcontains consensus RNA-binding sites, prompting the sug-gestion (1, 44) of a role in RNA metabolism.

MATERIALS AND METHODS

Protein purification and peptide sequencing. The purifica-tion of NonO by virtue of binding to an Ig promoter elementhas been described previously (11). Sodium dodecyl sulfate(SDS)-polyacrylamide gel electrophoresis (PAGE)-separat-ed 60-kDa protein was electroeluted and subjected to trypsindigestion. The resultant peptides were fractionated by re-verse-phase high-pressure liquid chromatography (HPLC)using a C18 column and a trifluoroacetic acid-acetonitrilegradient. The major peaks were collected and subjected toamino acid sequencing on an Applied Biosystems 477Aamino acid sequencer. The following peptide sequenceswere obtained: FGQAATME, FAQPGSFEYEYAMRW,IVEFSGK?AA, ?K?EMEMEAA, and MGQMAMGGAMGINN.

Identification and sequencing of ubiquitous NonO cDNA. AcDNA library was constructed in an Okayama and Berg

5593

Vol. 13, No. 9

Page 2: Protein, Is the Mammalian Homolog of Drosophila nonA

5594 YANG ET AL.

e~~~~~~~~~~~~~~~~~~zQ

_t -<2.7kb.A.

.* ~ ~ ~ ~~~1v< kb

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23

FIG. 1. Two forms of NonO transcripts in various mouse tissues. Total RNA was extracted from the indicated mouse tissues and organs.Five to 10 ,ug of RNA was fractionated, transferred to a nylon membrane, and probed with a NonO cDNA coding-region fragment. Shownbelow the autoradiogram is the ethidium bromide-stained gel to assess relative loading among lanes. Indicated are the positions of theubiquitous and lymphoid cell-specific NonO RNAs and the 28S and 18S rRNAs.

vector (37) from the B-cell leukemia, BCL1, and screenedwith a32P-phosphorylated oligonucleotide, 5'-lTT GG(G/C)CAG GCT GCC AC(A/T/C) ATG GAG-3', derived from theamino acid sequence of one of the tryptic peptides. A partialcDNA was isolated and used to screen a library of BCL1cDNA in XZap (Stratagene). The oligonucleotide screeningprocedure used tetramethylammonium chloride (46), whilescreening with the partial cDNA clone was done with anick-translated fragment. The probe was hybridized with 6xSSC (lx SSC contains 150 mM NaCl and 15 mM sodiumcitrate) at 65°C. Final washing conditions were 0.1 x SSC atthe same temperature. A 2.4-kb insert was subcloned intoM13, and a series of deletion mutants were generated byexonuclease III digestion (14). The sequence of the completecDNA clone was achieved by sequencing individual deletionmutants or priming with synthetic oligomers with an auto-matic DNA sequencer (Genesis 2000; Dupont).

Protein preparation. NonO mRNA was synthesized invitro with T7 and T3 RNA polymerase (Pharmacia). Onemicrogram of plasmid DNA linearized at the 3' end of theNonO-5.7 insert was incubated with the enzyme at 37°C inreaction buffer (50 mM Tris-HCl [pH 8.0], 8 mM MgCl2, 2mM spermidine, 50 mM NaCl, 30 mM dithiothreitol) for 1 h.Synthesized RNA was purified by phenol extraction, thenphenol-chloroform extracted, and finally ethanol precipi-tated. The mRNAs were used as a template for translation ina rabbit reticulocyte lysate system (Promega). In vitro-translated proteins were internally labeled with [35S]methio-nine (NEN). Protein products were analyzed on SDS-10%polyacrylamide gels (27).

Bacterial fusion proteins were prepared for wild-typeNonO by blunt-end ligation of the MseI fragment of NonOcDNA into the SmaI site of pGEX-2T (49), which alonegenerates a 26-kDa N-terminal glutathione S-transferase(GST) fusion. Mutants N-RNP and ARNP-C were con-structed by polymerase chain reaction-T4 DNA polymerasemethods as described previously (54). The fusion geneconstructs were transformed into Eschenchia coli, and theculture was grown to log phase before isopropylthiogalacto-pyranoside (IPTG) induction (49). Cell pellets were har-vested from a 500-ml culture and washed in 10 ml of bufferA(50 mM Tris-HCl [pH 8.0], 25% sucrose, 10 mM EDTA),resuspended in 10 ml of buffer B (10 mM Tris-HCl [pH 7.4],

1 mM EDTA, 1 mM dithiothreitol, 1 mM phenylmethylsul-fonyl fluoride, 1 jig of leupeptin per ml, 1 ,ug of antipain perml, 1 ,ug of pepstatin A per ml), and sonicated, and the celldebris was centrifuged. The supernatant was mixed with 25ml of buffer C (20 mM N-2-hydroxyethylpiperazine-N'-2-ethanesulfonic acid [HEPES; pH 7.6], 100 mM KCl, 20%glycerol, 1 mM EDTA, 1 tg of leupeptin per ml, 1 ,ug ofantipain per ml, 1 ,ug of pepstatin A per ml), 1/10 volume of10% Triton X-100, and 4 ml of preswollen glutathione-agarose beads at 4°C for 2 h. The beads were spun andwashed four times with 5 ml of buffer C at 4°C. Finally, thefusion protein was eluted from beads with 2.5 ml of 10 mMglutathione (pH 7.5; Sigma) in buffer C.

Mobility shift gel electrophoresis. The electrophoretic mo-bility shift assay (EMSA) was performed as previouslydescribed (11), with some modifications. Five microliters(about 1 ,g) of GST or GST-NonO fusion protein purifiedfrom affinity beads was used in binding assays. Sampleswere incubated in the reaction mixture [20 mM HEPES (pH7.9), 20% glycerol, 25 mM KCl, 0.2 mM EDTA, 10 mMdithiothreitol, 1 mM phenylmethylsulfonyl fluoride, 0.05%Nonidet P-40, 1 p,g of poly(dI-dC)] in a final volume of 20 ,ulat room temperature for 15 min. The probe (final concentra-tion of 3 nM) was added to the mixture described above andincubated at room temperature for 30 min prior to electro-phoresis on 4% acrylamide gels in 0.5x TBE (50 mM Trisbase, 50 mM boric acid, 0.55 mM EDTA [pH 8.0]) at 4°C. Incompetition assays, competitors were added 10 min prior tothe probe. For the RNA competition assay, 40 U of RNAsin(Promega) was included in the reactions to retard degrada-tion. The dimerized BN oligonucleotide was cleaved atpolylinker sites from a plasmid containing subcloned BNoligonucleotide, gel purified, filled in with Klenow enzyme,and labeled as a binding probe.Western blot (immunoblot) analysis. Proteins were sepa-

rated by SDS-PAGE and electroblotted onto polyvinylidenedifluoride membranes (Millipore). Antisera were collectedfrom rabbits immunized with GST-NonO fusion protein andpurified on protein A-Sepharose (12). Pooled IgG elutedfrom the protein A column was depleted of antibodiesreacting with GST by GST affinity chromatography. Remain-ing antibodies were concentrated and used for immunoblot-ting according to the ImmunoSelect kit protocol (Bethesda

28S -

18S -

MOL. CELL. BIOL.

VW .. I1. I.5 ...

.1, .1.4 40 W* 40qp -, * ..

,W: .:-Il I...- 0

Page 3: Protein, Is the Mammalian Homolog of Drosophila nonA

VOL. 13, 1993 NonO, AN UNCONVENTIONAL OCITAMER-BINDING PROTEIN 5595

A 100bp

Q K RNP ,,. PNonO

F

B GCTACATCAG GGGCTGTGCA GCGTCGCCCA TACTCCGAGC AAAGAACTGT GGCCAGAGGC AGTCGAGGTT AGTGAGGACT GCGAGGCAGA CACTMTGCTG 100

TGTTCAAATC CAAGTCAAGG GTACAAAA ATG CAG AGC MT MA GCC mT MC TTG GAG MG CAG MT CAT ACT CCA AGG MG CAT CAT 18BM Q S N K A F N L E K Q N H T P R K H H

CAG CAT CAC CAC CAG CAG CAC CAT CAG CAG CM CAG CAG CAG CAG CAG CM CAG CCA CCC CCA CCA ATA CCT GCA MT GGC CAG 272Q H H H Q Q H H Q Q Q Q Q Q Q Q Q Q P P P P I P A N G Q

[-Q QE-lCAG G-C AGC AGC CAG MT GAA GGC TTG ACT ATT GAC CTG MG MT TTT AGG AM CCA GGA GAG AAG ACC TTT ACA CAG CGT AGC 356Q A S S Q N E G L T I D L K N F R K P G E K T F T Q R S.. S/TCGT CTC TTT GIG GGC MT CTT CCC CCT GAT ATC ACT GAG GAG GAA ATG AGG MA CTA TTT GAG MA TAT UGA MA GCA GGC GM 440R L F V G N L P P D I T E E E M R K L F E K Y G K A G E

GTT TTC ATT CAT MG GAT AM GGC m GGC TTT ATT CGC TTG GAA ACA CGA ACC CTA GCG GM ATT GTC MA GTG GAG CTG GAC 524V F I H K D K G F G F I R L E T R T L A E I V K V E L D

MC ATG CCC CTC CGT GGG MG CAG CTG CGA GTG CGC TTC GCC TGT CAC AGT GCA TCC CTT ACA GTC CGC AAC CTT CCT CAG TAC 608N N P L R G K Q L R V R F A C H S A S L T V R N L P Q Y

GTG TCG MC GAA CTG CTG GM GM GCC TTT TCT GTG TTC GGC CAG GTG GAG AGG GCT GFA GTC ATT GTG GAT GAC CGA GGA AGG 692V S N E L L E E A F S V F G Q V E R A V V I V D D R G R

CCC TEA GGG AAA GGC ATT GTT GAG TTC TCA GGG 'AG CCA GCT GCT CGG MA GCT CTG GAC AGA TGC AGT GM GGC TCC TTC TTG 776P S G K G I V E F S G K P A A R K A L D R C S E G S F L

CTG ACT ACA TTT CCT TGG CCT GTG ACT GTG GAG CCT ATG 6AC CAG TTA GAT GAT GAA GAG GGA CTT CCA GAG MA CTG GTT ATA 860L T T F P W P V T V E P ND Q L D D E E G L P E K L V I

S/T o..4AAA AAC CAG CAA TTC C AAGGAG AGA GAA CAG CCA CCC AGA TTTGACACM CCT GGC TCC TTT GAG TAT GAG TAT GCC ATG CGC 944K N Q Q F H K E R E Q P P R F A Q P G S F E Y E Y A M R

TGG AAG GCA CTC ATT GAG ATG GAG MG CM CAG CAG GAT CM GTG GAT CGG AC ATC MG GAG GCT CGT GAG MG CTG GAG ATG 1028W K A L I E N E K Q Q Q D Q V D R N I K E A R E K L E N

". HTH

GAG ATG GAG GCT GCA CGT CAT GAG CAC CAG GTT ATG CTA ATG AGG CAG GAT TTG ATG AGA CGT CM GM GAG CTT CGG AGA ATG 1112E N E A A R H E H Q V N L M R Q D L M R R Q E E L R R M

HTH '4 ". Basic/Acidic

GAG GXG CTG CAT MC CM GAG GTT CAG MG CGA AG CAG TTA GM CTC AGG CAG GM GAG GMACGC AGG CGC CGT GAG GM GAG 1196E E L H N Q E V Q K R K Q L E L R Q E E E R R R R E E E

ATG CGG CGA CXR CM GAG GM ATG ATG CGC CGA CAG CAG GM GGA TTC MG GGA ACC TTC CCT GAT GCG AGA GM CM GAG ATA 1280M R R Q Q E EN N R R Q Q E G F K G T F P D A R E Q E I

Basic/Acidic ...4

CGG ATG GGC CM ATG GCT ATG GGA GGT GCT ATG GGC ATA MC MT 'AA GGC GCG ATG CCC CCT GCT CCT GTG CCA CCT GGT ACT 1364R N G Q N A N G G A M G I N N R G A M P P A P V P P G T

" P ". T

CCA GCT CCT CCA GGA CCT GCC ACT ATG ATG CCA GAT GGA ACC CTT GGA TTG ACC CCA CCA ACA ACT GAA CGT TTT GGC CM GeT 1448P A P P G P A T M M P D G T L G L T P P T T E R F G Q A

P -4GCA ACA ATG GM GGA ATT GGA GCA ATT tVT GGA ACT CCT CCT GCA TTC MC CGT CCA GCT CCG GGA GCT GM M GCT CCA MT 1532A T N E G I G A I G G T P P A F N R P A P G A E F A P N

T'.4

AM CGC CGC CGA TAT TAG ATAMGTTG CATTGTCTAG MCCTGCAG CCCTTAAAAG MGGGCCCTT MGGACTAG CCAGAATTCT ACCCTGGAM 1629K R R R Y

AGTGTTAGGG GTTCTTCCCA ATAGATAGGC CTTCCCTGCT TGTACTACTC TAGGGATCAT GCTTGMGTC AGAGGGGCAG AGMGGGGTG GTATTCMCA 17294

AGTCAMAGTC TGTGGTATAT TGCMATCA AGACTGTCTG GTGCATTCCT GAACTATATT AATTGTTGAG GGCCTGGAGA ACCATGGGM MTGMCTCA 18293

GAGCTCCATT AATCTTGATC ATTCCTTCTC TCTCUTCTC TCTCTCTTGT MTTTMACT TTCTCATCTT TATTCCCCTC AACCCCTGAG ACACTGCCAT 1929

ATATACCACA AACCATMAC ATCCTCCAAT GACCTAGCCC CATCCCTCCA TTCACTCCCA GGTMGAATT CAGACMATG TCCACAGAGG TTACAGCATA 2029

CGTACGGTTG TGTTATATCT CATATATGAC CCCTTCATGT CCTMGGMG ACATUTCTC TTAGAGGM TCATUTAGT ATATCTTMA AGAATCTTGT 2129

GTTACCTTGC CTCCATCM TTCTTGGGTA AGGACTACAC MGTGTCTC TGATGTTGCT GTTCACAGCT TTTCTTGATA GGCCTAGTAC MTCTTGGGA 2229

ACAGGGTTGC TGTGTGGTGA AGGTCTGACA GTAGTTCTTA GTCTTGCCTA TCTTAGGTAG CTACGCTGTG CATTTTTATT GGTATACTAT GMTTGTTCC 2329

AGATACCTTC AGTTTGGAM GTTTTCTGAG AAATGGAGAC GTCATGCGGC ATCACCTTAT TAAAATGCAT TTGMAGCCTT TT 2411FIG. 2. Structure and sequence of ubiquitous NonO. (A) Schematic representation of NonO cDNA. The open box indicates the open

reading frame. The glutamine (Q)-rich region, proline (P)-rich region, HTH domain, and highly charged regions are as labeled. F,phenylanine-rich region. (B) Nucleotide and predicted amino acid sequences of NonO. Both nucleotide and amino acid sequences arenumbered from the beginning of the sequences. Amino acids in boldface are the potential HTH domain regions. Putative domains (discussedin the text) are bordered with arrows. The direct and inverted repeats at the 3' end of the UTR are marked with arrows.

Page 4: Protein, Is the Mammalian Homolog of Drosophila nonA

5596 YANG ET AL.

A

o~I

67-

43-

30-

0~~~~~~3

1 2 3M 45 6

_ lj- 943-

Preimmune serum

7'

7

Immune serum

1 2

FIG. 3. In vitro and in vivo identification of ubiquitous NonOproteins. (A) In vitro-translated [35S]methionine-labeled NonOproducts were analyzed on an SDS-10% polyacrylamide gel. Proteinmarkers are as indicated (in kilodaltons). Lanes: 1, in vitro-trans-lated product of full-length antisense RNA; 2, in vitro-translatedproduct of full-length sense RNA. The arrowhead marks the full-length NonO protein. (B) Western blot analysis of NonO from aB-cell leukemia nuclear extract. Nuclear extracts were preparedfrom BCL1 cells, fractionated on an SDS-10% polyacrylamide gel,and then transferred for Western blotting. The primary antiserumwas prepared from GST-NonO rabbit immunization. Lanes: 1 and 7,GST protein; 2 and 6, GST-NonO fusion protein; 3 to 5, BCL1nuclear extract. Lanes 1 to 3 were developed with preimmune rabbitsera and goat anti-rabbit IgG (Sigma); lanes 4 to 7 were developedwith immune rabbit sera; lane 4 was developed with the same

secondary antibody as in lanes 1 to 3; lanes 5 to 7 were developedwith a goat anti-rabbit antibody from Bio-Rad. The arrow denotesthe 60-kDa NonO protein in BCL1 extracts. The asterisk denotes thefull-length NonO fusion protein (positive control). The pre- andpostimmune rabbit sera were purified as described in Materials andMethods.

Research Laboratories), except that 5% dry milk was sub-stituted for 5% bovine serum albumin as a blocking agent.DNA and protein sequence analysis. Sequence manipula-

tion, homology analyses, and protein domain analyses werecarried out with DNAstar programs (DNAstar Inc.).

Northern (RNA) filter hybridization. Filters containingtotal RNA from various mouse tissues were kindly providedby Ray MacDonald (University of Texas SouthwesternMedical Center, Dallas). Total RNA was extracted from rattissues as described by Illaria et al. (17) and analyzed aspreviously described in detail (5). The probe was a nick-translated MseI fragment which includes the complete cod-ing region of NonO. Northern filter hybridization was per-formed at 42°C overnight in 5 x SSC buffer with 50%formamide, lx Denhardt's solution, 1% SDS, 100 ,g of calfthymus single-stranded DNA (ssDNA) 5 ,ug of poly(A) perml, 0.1% sodium pyrophosphate, 1 mM EDTA, 10 mM Tris(pH 7.5), and 106 cpm of probe per ml. After hybridization,the filters were washed twice in 2x SSC-1% SDS-0.1%sodium pyrophosphate for 15 min at room temperature, fourtimes in the same buffer in 0.1x SSC at 55°C, and finally in0.1 x SSC buffer alone at 55°C for 30 min.

RESULTS

Peptide sequencing and isolation of NonO cDNA clones.DNA-binding proteins were purified from BCL1 nuclearextracts as reported previously (11). Proteins that had beenaffinity column purified twice were fractionated by SDS-PAGE, and a 60-kDa protein band was electroeluted andthen digested with trypsin. The digested fragments wereseparated by HPLC and subjected to peptide sequencing.Several peptide sequences were obtained, some of whichwere used to construct oligomer probes. All of the sequencesfound in the protein fragmentation studies were subse-quently found in the NonO cDNA clone. A cDNA clone of2.4 kb was further characterized.

Ubiquitous and potentially lymphoid cell-specific forms ofNonO mRNA are present in normal mouse tissues. We probedNorthern filters containing total RNA isolated from variousmouse tissues for NonO transcripts. As shown in Fig. 1,there are two forms of NonO mRNA that accumulate tosufficient abundance to be detected. The larger mRNA isabout 2.7 kb, a size consistent with that of our cDNA clone,assuming average polyadenylation. It is ubiquitously ex-pressed in most tissues in various levels, although it is muchlower in submaxillary, sublingual, and parotid glands and inthe heart. Low signals in lung and kidney are inconclusivebecause of suboptimal RNA loading (Fig. 1, lower panel).The smaller form of NonO mRNA is about 1.6 kb. Itsexpression is restricted to certain organs or tissues that arepredominantly lymphoid (spleen and thymus) or are sites ofhigh lymphocyte migration or contamination (the small in-testinal segments, particularly the ileum). Although prelim-inary, these results suggest that the 1.6-kb message islymphoid cell specific.

Characterization of ubiquitous NonO cDNA and deducedprotein structure. A schematic of the ubiquitously expressedcDNA and deduced protein is shown in Fig. 2A, and thenucleotide sequence is presented in Fig. 2B. There is anopen reading frame encoding a protein of 473 amino acids.The deduced peptide has a molecular mass of 55 kDa. ThemRNA contains a long (860-nucleotide) 3' untranslatedregion (UTR). Although no consensus polyadenylation sig-nal was found, a poly(A) tract present at the 3' end of thecDNA clone suggests that the clone was not truncated. The3' UTR has three direct repeats and one inverted repeat (Fig.2B). One of the direct repeats is continuous from nucleotidepositions 1855 to 1874, containing only U and C nucleotides.Whether these repeats have any functional significance inmRNA stability or translation efficiency is unknown.The deduced amino acid sequence (Fig. 2B) revealed some

putative domains of interest (summarized in Fig. 2A). Thefirst is a glutamine-rich area near the N terminus. A stretchof 10 glutamines (Q), from amino acid residues 29 to 38, isimmediately bordered by 7 histidines (H) and 5 prolines (P).A second type of motif was located by using an algorithm

(6) to detect helix-turn-helix (HTH) DNA-binding motifs inprotein sequences. Two segments which have the potentialto form the HTH domain structure were identified betweenamino acids 289 and 310 and between amino acids 320 and341. The SD scores of these two segments, based on Doddand Egan's calculation method (6), are 1.8 and 1.7, respec-tively. The SD score of the 289 to 310 segment with the lastthree residues removed (EHQ) is 2.4. This score stronglysuggests that this segment is an HTH.The putative HTH domain is followed by a region rich in

both basic and acidic amino acids. From residues 320 to 370,32% (16 of 50) of the residues are basic amino acids (14

MOL. CELL. BIOL.

R"Z

Page 5: Protein, Is the Mammalian Homolog of Drosophila nonA

NonO, AN UNCONVENTIONAL OCTAMER-BINDING PROTEIN 5597

1o0bp

Q K RNP > HTH +/- P

F0F

NonO

NonAF

__-~~~~~~~~~~~~~~5

<*

MQSNKAFNLEKQNHTPRKHHQHHHQQHHOOOOOOOQQQPPPPIPANGQQASSQNEGLTIDLKNFRKPGEKTFTQR8RLFVQ N P Q G T L P E F R RL V

NQQRDNRNRSGPRPGGGAGGAM2gsTNMGGGGGGGGGGGPRGGEDFFITQRLRSISGPTFELEPVEVPTETI S9SRNRLYV

--------------------------------------------------- RNP ------------------------* 160 * ***15*

GNLPPDITEEEMRKLFEKYGKAGEVFIHKDKGFGFIRLETRTLAEIVKVELDNMPLRGKQLRVRFAC8A8RLTVRNLPQYGNL DIT E R F YG E F DK F F AE K LD G QLRVRFA L V NLGNLTNDITDDELREMFKPYGEISEIFSNLDKNFTFLKVDYH3fPKAKRALDGBMRKGRQLRVRFAPNATILRVSNLTPF

** ~~~~~*200* *

VSNELLEEAFSVFGQVERAVVIVDDRGRPSGKGIVEFSGKPAARKALDRCSEGSFLLTTFPWPVTVEPNDQLDDEEGLPEVSNELL F FG ERA VDDRG G GIVEF K A L C E F LT P V PM DD GLPEVSNELLYKSFEIFgPIERASITVDDRGKHMGEGIVEFAKKSSASACLRMCNEKCFFLTASL45CLVDPMEVNDDTDGLPB

1<------ HTH ------->|

K-LVIKNQQFHKEREQPPRFAQPGSFEYEYAMRWKALIEMEKQQQDQVDRNIKEAREKLSEMEAARNEHQVMLMRQDLMK K F ER PRFA P SFE EY RWK L K QD R K KLE ME AR E L RQ L

NON-A RAFNKRRPDFNQERSIGPRFADPNSFEHEYG§RWKQLHNLFKTKQDALKRELKMEEDKLEAQMEYARYEQETELLRQELR

I<------------------ BASIC/ACIDIC ------------------->1NON-O RRQEELRRMEELHNQEVQKRKQLELRQERRRREEEE----MRRQQEEMMRRQQEGFKGTFPDAREQEIRMGQMAMGGAM

R R R EE RR E M RQ E M RRQQE GGNON-A KEDNERKKLEWEMREKQAEEMRKREEETMRRHQTEMQSHMNRQEEDMLRR88ETLFMKAQQLNSLLDQQEGFGGGGGG

NON-O GIANRGAMPPAPVPPGTPAPPGPATMMPDGTLGLTPPTTERFGQAATMEGIGAB8GTPPAFNRPAPGAEFAPNKRRRYP P T F GG G P RRR

NON-A NNSTFDNFAGNSNSPFEVFRGNgigNNSTMIGNNAAPNTQDSFAFEFGVNNMNQAU A

RNP2LFVGNL *****E**L***F**FG*lITIIG D Y V

CRNPCONSENSUS

NON-ONON-A

- 70

RNP1K* KGFGF ********K********L*R R YA Y I

'8RSR LFVGNL PPDITZEEMRKLFEKYGKAGEVFIHKD KGFGFIRL ETRTLAEIVKVELDNMPLRGKQLRVRMW308RNR LYVGNL TNDITDDELREMFKPYGEISEIFSNLD KNFTPLKV DYHPNAEKAKRALDGSMRKGRQLRVR03

RNP2LFVGNL *****E**L***F**FG*IIYIKG D Y V

RNP1X* KGFGFVXF ********K********L*GR R YA Y I

NON-O AS LTVRNL PQYVSNELLEEAPSVFGQVERAVVIVDDRGRP SGKGIVUF SGKPAARKALDRCSEGSFLL?TF4NON-A J§JI LRVSNL TPFVSNELLYKSFEIFGPIERASITVDDRGKH MGEGIVUF AKKSSASACLRMCNEKCFFLTt*F

FIG. 4. Sequence homology and RNP consensus sequences of ubiquitous NonO and nonA. (A) Schematic comparison of NonO and nonAproteins, indicating conserved regions. Amino acids are given in one-letter code. (B) The complete NonO deduced amino acid sequencealigned with the carboxyl two-thirds of the nonA sequence. Capital letters between the sequences indicate amino acid identity. Asterisks markthe phenylalanines and tyrosines in NonO within the F domain (boldface). Stretches of glutamine (Q) and glycine (G) are underlined. Arrowsdenote the locations of introns in the NonA gene. Dashes indicate deletions introduced to maximize alignment. The RNP, HTH, andbasic/acidic regions in NonO are indicated. (C) RNP consensus sequence alignment of NonO and nonA. RNP1 and RNP2 consensussequences and other conserved residues are aligned with amino acid positions of RNP domains indicated. Asterisks denote nonconservedamino acid residues. Conservative changes within RNP consensus sequences are indicated in boldface.

arginines and 2 lysines), and 30% (15 of 50) are acidicresidues (all glutamic acids). The net charge of this region ispositive at physiological pH. Such an arrangement, with anHTH domain followed by a nearby charged region, ischaracteristic of the DNA-binding domain in OTF-1 andOTF-2 (3, 50).

Another distinctive area was found at the carboxyl-termi-nal region from amino acids 404 to 432, where 41% (12 of 29)of the residues are proline. Like the N-terminal glutamine-rich region, proline-rich domains may mediate protein-pro-tein interactions essential for transcriptional activation prop-erties (15, 21, 34, 40). Serine/threonine clusters critical for

VOL. 13, 1993

A

NON-O

NON-A

NON-O

NON-A

NON-O

NON-A

NON-O

RNPCONSENSUS

G, N, Q RICH G K RNP +1-

Page 6: Protein, Is the Mammalian Homolog of Drosophila nonA

5598 YANG ET AL.

3.8 kb-

2.7kb- 4;

- 28S

- 18S

1 2 3FIG. 5. Ubiquitous and retinal cell-specific NonO transcripts.

Total RNA, extracted from rat tissues, was fractionated, blotted toa nylon membrane, and probed with a fragment containing the entireNonO cDNA coding region. The larger NonO band present in retinais about 3.8 kb. The ubiquitous band is 2.7 kb.

transactivation in certain cases (52) are not observed inNonO. But curiously, the vast majority of both of theseamino acids are restricted to two regions: residues 51 to 225and residues 412 to 452 at the C terminus.

In vitro and in vivo characterization of ubiquitous NonOprotein. Sense and antisense RNAs from the 2.4-kb ubiqui-tous cDNA clone were synthesized, and in vitro-translatedproducts of these RNAs were analyzed by SDS-PAGE. Asshown in Fig. 3A, the molecular size of in vitro-translatedproduct NonO (55 to 60 kDa) corresponded with our previ-ous estimates derived by purification of the protein (11) andconfirmed the predicted open reading frame. The severalsmaller NonO in vitro products probably result from prema-ture termination or nonspecific initiation of translation.An MseI fragment, containing the entire coding region

sequence minus the N-terminal seven amino acids, was

ligated in frame to the carboxyl terminus of the GST gene.The expressed fusion protein, GST-NonO, had the predictedmolecular size (data not shown). Attempts to use thrombinto cleave NonO from GST were unsuccessful, as NonOcontains several potential thrombin cleavage sites (2). There-fore, the purified fusion protein was used directly. The60-kDa ubiquitous NonO protein was identified in Westernblots of BCL1 nuclear extracts by heteroantisera raised inrabbits by injection of GST-NonO bacterial fusion protein(see Materials and Methods). The major band detectedimmunologically (denoted by the arrowhead in Fig. 3B, lanes4 and 5) is the same size as the protein originally purified byDNA affinity chromatography (11) and is of a size consistentwith the predicted size from the cDNA sequence. The minorbands are proteins that react with the secondary antibody(compare lanes 4 and 5, which are immunostained withdifferent preparations of secondary antibody). The onlycommon band is the 60-kDa NonO protein.

Ubiquitous NonO is homologous to a protein, nonA, which isessential for visual processing in flies. Using the Lipman andPearson method (31) to search protein sequence homologies,we found a striking relationship between a portion of ubiq-uitous NonO and a portion of the Drosophila protein nonA(Fig. 4A). The amino acid sequence of nonA from residue227 to the C terminus aligns with the entire NonO sequence.Within the first 316 residues, NonO and nonA have 40%amino acid sequence identity, achieved with only one inser-tion in nonA at position 468. The sequence similarity be-tween NonO residues 72 to 326 and nonA residues 298 to 543

approaches 50% identity. The level of homology focusedwithin this segment implies similar domain structures withinthe two proteins.The NonO-nonA alignment has several interesting fea-

tures (Fig. 4B). Near the N terminus, a stretch of glutamines(residues 29 to 38 in NonO) are glycines in nonA. However,nonA also has a glutamine-rich area further toward its Nterminus (20) (Fig. 4A), outside our alignment. This region isalso rich in asparagine and glycine.Within the 200 residues of highest homology, there are 15

phenylalanines and 4 tyrosines in NonO (amino acids 70 to270) (marked with a asterisks in Fig. 4B). Fifteen of thosenineteen aromatic residues align identically in nonA. Twoexhibit a conservative change to another aromatic residue.Aromatic residues outside of this homology region are notconserved. Like NonO, nonA has a corresponding cluster ofcharged basic and acidic residues (amino acids 547 to 601).However, the other two potentially functional domainspredicted for NonO (HTH and proline-rich region) aremissing in nonA.

It has been suggested (1, 44) that nonA may bind RNA bymeans of tandemly arrayed ribonuclear protein (RNP)-bind-ing motifs (reviewed in reference 24). NonO has retainedboth RNP-binding motifs within the highly conserved, phe-nylalanine-rich region (Fig. 4C). Aromatic amino acidswithin the RNP domain are believed to be involved inring-stacking interactions between protein and RNA (24).Although neither of the RNP repeats is a perfect match withthe consensus, the more N-terminal motifs match veryclosely and are likely to be responsible for the observedRNA binding of NonO (see below).A third form of NonO transcript is present in retina tissue.

The observed similarity of nonA and NonO prompted us toexamine NonO expression in retina and pituitary (Fig. 5).Besides the 2.7-kb ubiquitous transcript, retina sampleshave a third form of 3.8 kb. This transcript was undetect-able in other neural tissues examined (Fig. 1 and data notshown).

Ubiquitous NonO fusion protein binds double-strandedDNA (dsDNA) and ssDNA with octamer specificity. The60-kDa protein was originally purified from a murine B-cellnuclear extract and was monitored by EMSA of an octamer-containing, VH promoter fragment (11). A similar assay wasused with synthetic oligonucleotides (Fig. 6A) to documentthe binding of the recombinant protein to DNA. The BNoligonucleotide, which contains wild-type octamer and hep-tamer (a low-affinity octamer site) sequences and was origi-nally used on a DNA affinity column to purify the NonOprotein (11), was employed as the probe. As shown in Fig.6B, four protein-DNA complexes were formed with thedouble-stranded BN oligonucleotide at increasing proteinconcentrations. In competition analyses, the most slowly-migrating complex (ds2) was competed by double-strandedoligonucleotides that contain wild-type octamer (BH; Fig.6C, lane 5) or octamer and heptamer (V1; lane 6) motifs.Little competition was seen with a mutated octamer duplex(BO; lane 4) and an unrelated duplex (Ro; lane 7) showed nocompetition. The fastest-migrating complex (dsl) did notshow octamer specificity in that complete competition wasachieved by Ro (lane 7).The binding of NonO to the BN oligonucleotide appeared

complex, and the affinity of the specific binding complex(ds2) was relatively low. Therefore, we dimerized the BNprobe in an effort to increase binding affinity. As shown inFig. 6D (lane 2), a single complex was observed under thesecircumstances. Competition analyses again confirmed the

MOL. CELL. BIOL.

"Ae,5.z,q §..,v .\.

ii..I.i:- ..Ww. ".1:.. I'- -:!

..,:" I.x,,,. :-,

Page 7: Protein, Is the Mammalian Homolog of Drosophila nonA

NonO, AN UNCONVEN'TIONAL OCTAMER-BINDING PROTEIN 5599

HEP OCT

BN TMGATGTACCCTGTCTCATGAATATGCAMTCAGGTGAGTCTATGGT

BO TAAGATGTACCCTGTCTCATGAATTGTACCCTCAGGTGAGTCTATGGT

BH TAAGATGTACCCTGTCTTCTTAATATGCAAATCAGGTGAGTCTATGGTHEP

Vi AACTTCACTCTCTACAACTTCAATCCTAGAGCTAATGATATAGCAGAAOCT

AGACATGCAAATTAGGCC

Ro AACCGACGAAGGTCCAATGGAAAAAGACGGCCAT

B25 ss1

ds1ds2

Proteinconcentration

0.5 1 2 3 4 5

4... .. X....- ds2>... -$32

ss

- dsl

*- Free ptobe2 3 4 5 6 7 (double-stranded)

DS

5-

1 2 3 4

Proteinconoentration

SsBN Ro VI BN

E

Proteinconcentration 0.1 0.5 1 2 3

2 3 4 5 6 7

4 5 20,

-ss1

-dsl

- Free probe

ss1

ss2

15--

- ss2 0

ia

-$3s1 10;E

0

5-

Free probe(single-stranded) 1 2 3 4 5

Proteinconcentration

binding specificity (lanes 3 to 22) for octamer and heptamermotifs. The NonO-BN dimer complex is self-competed forby the BN oligonucleotide (lanes 3 to 7) as well as anotheroligonucleotide, Vl (lanes 13 to 17), which has only theoctamer and heptamer motifs in common with BN (Fig. 6A).Neither an unrelated oligonucleotide (Ro; lanes 8 to 12) nor

the single strand of BN (lanes 18 to 22) could compete withinthe same concentration range as BN or Vl.We noticed that the relative abundances of the four

complexes varied among experiments (data not shown). Thisfinding along with the predicted RNP-binding motifs com-

mon to NonO and nonA prompted us to test whether thecomplex pattern of Fig. 6B might involve ssDNA binding.Indeed, with use of the BN sense strand as a probe, two (ssland ss2) of the four complexes seen with the double-strandedprobe were apparent (Fig. 6E). The formation of the ss2complex was concentration dependent, suggesting that it

- Free probe

4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22

FIG. 6. Binding specificity of ubiquitous NonO for octamermotifs. (A) Synthetic oligonucleotides for DNA binding and compe-tition assays. Sequence motifs within each oligomer (see text) are

indicated above wild types (BN and V1), with mutations of octamer(BO) and heptamer (BH) underlined. (B) Concentration-dependentformation of dsDNA-NonO complexes. The probe was double-stranded BN. Protein concentrations are indicated above each lanein microliters (5 p1 = 1 p.g). NonO-DNA complexes were quantifiedon a Betascope, and the values are plotted at the right as a functionof protein concentration. (C) EMSA competition analysis of double-stranded octamer-containing DNA with double-stranded competi-tors. The BN duplex oligomer was end labeled as the binding probe.GST affinity-purified NonO fusion protein was incubated with a finalconcentration of 3 nM probe in the presence of 25-fold molarexcesses of various competitors. Lanes: 1, probe only; 2, GSTvector control; 3, GST-NonO fusion protein; 4 to 7, with BO, BH,Vl, and Ro dsDNAs as competitors. (D) EMSA competition anal-ysis of double-stranded octamer-containing DNA with double- or

single-stranded competitors. The BN duplex head-to-tail dimer wasend labeled as the binding probe. The binding reactions are the sameas described above except that 0.8 nM (final concentration) probeand 0.5 p,g of poly(dI-dC) were used in each lane. Specific compet-itors were used at 50- to 1,000-fold molar excess for double-strandedcompetitors and 100- to 2,000-fold for single-stranded competitor.The complex indicated by the arrow migrates at the position of ds2.(E) Concentration-dependent formation of ssDNA-NonO com-

plexes. The probe was the sense strand of BN. The bindingconditions are as described for panel B.

might be an oligomer of ssl. Binding of neither complex wasstrictly octamer dependent, as demonstrated with ssDNAcompetitors (data not shown). Actually, NonO binds tossDNA with significantly higher affinity (Fig. 6B) than todsDNA, even though ssDNA binding does not show equiv-alent sequence specificity.NonO binds RNA and ssDNA at the same site. The finding

of consensus RNP motifs suggested that NonO might bind

A

20

.0'9 15.E 1

x.DC.EV 10-

C

Competitor

DS

BO BH Vl Ro

D

C

is2

5

VOL. 13, 1993

4,.;. i..:.: ..:, glw*4 .:

't

a1

Page 8: Protein, Is the Mammalian Homolog of Drosophila nonA

5600 YANG ET AL.

ANON-O

Q RNP in. +/- PF:::'' :::-.':::

F

N-RNP GST-

ARNP-C GST-

DS Ss

BN Vi BO RoBN V1 BO Ro S A S A S A A S

- Free probe

1 2 3 4 5 6

FIG. 7. RNA can compete for single-stranded but not double-stranded octamer-NonO complexes. The binding assay is identicalto that of Fig. 6C except that the amount of each RNA competitor(indicated above the corresponding lane) is 2 1lg.

RNA. We initially found that labeled in vitro-transcribedRNA or total RNA bound readily to NonO fusion proteinimmobilized on glutathione beads (data not shown). Giventhe DNA binding results presented above, we used variousRNA sources as competitors in the EMSA (Fig. 7). AllRNAs competed efficiently for ssDNA binding but not fordsDNA-protein complexes (lanes 4 to 6). However, simpleribonucleotide polymers such as poly(A) (lane 3) or poly(I-C)(data not shown) did not compete. These data suggest thatRNA and ssDNA bind to the same site, independently ofdsDNA.

Segregation of dsDNA- and ssDNAIRNA-binding domains.NonO contains tandem RNP domains N-terminal to a puta-tive HTH. domain followed by a highly charged region.Given our binding data, we suspected that the RNP domainsare responsible for the RNA/ssDNA binding and that theHTH and charged residues are necessary for dsDNA bind-ing. Mutants containing one or the other of these domainswere generated for further EMSA analysis (Fig. 8A). Asshown in Fig. 8B, N-RNP, which is the N-terminal half ofNonO and contains both RNP domains, bound to the BNsense-strand probe. The binding complex was competed forby all of the previously tested ssDNA competitors except foreither strand of Ro (Fig. 8B, lanes 7 to 14). The bindingcomplex was not eliminated by any dsDNA competitors(Fig. 8B, lanes 3 to 6). Reciprocally, ARNP-C, which con-tains only the putative HTH and highly charged region, wasable to bind the duplex octamer probe (Fig. 8C, lane 2) withsequence specificity (lanes 3 to 6). However, as opposed tothe N-RNP binding complex, the ARNP-C binding complexwas not competed for by any of the ssDNAs (lanes 7 to 14).

DISCUSSION

We previously isolated a novel 60-kDa protein from aB-cell leukemia (BCL1) nuclear extract which bound to themurine, IgH promoter-associated octamer (11). In thisstudy, we have characterized the complementary DNA thatencodes the ubiquitous form of this protein (NonO) anddocumented its nucleic acid binding specificity and its strik-ing homology with Drosophila nonA. By virtue of conserva-

Free probe

1 2 3 4 5 6 7 8 9 10 11 12 13 14

C

Competitor

DS ss

BN Vi BO RoBN Vl BO Ro S A S A S A A S

I Free probe

1 2 3 4 5 6 7 8 9 10 11 12 13 14

FIG. 8. Different domains in NonO are required for duplex andssDNA binding. (A) NonO truncation mutants containing separatedomains. Each mutant was constructed and expressed as a GSTfusion (Materials and Methods). (B) Binding of N-RNP to thesingle-stranded BN oligonucleotide. The sense strand of BN waslabeled as the probe. The binding reactions and competitors are thesame as described for Fig. 6. (C) Binding of ARNP-C to thedouble-stranded BN DNA probe. Reaction conditions were asdescribed above. S, sense; A, antisense. The arrow marks thebinding complex.

tion of potentially functional domains, aspects of the discus-sion below are likely to apply to both NonO and nonA.The amino acid sequence of five distinct peptides isolated

by enzymatic digestion of the electroeluted NonO proteinconfirmed that the cDNA corresponds to the purified DNA-binding protein. The molecular weight of recombinant orcellular NonO detected immunologically is indistinguishablefrom that of our purified protein (11) and is similar to that ofthe lymphoid cell-specific transcription factor OTF-2. Usinga computer algorithm (6), we found a potential HTH motifwithin NonO. HTH motifs are integral components of allhomeoboxes including that of POU (22, 45). However, theNonO potential HTH motif shares no homology with thePOU homeobox, nor does the remainder of NonO align withany position of OTF-1, OTF-2, or other POU domain-containing proteins (3, 8, 25, 50). Likewise, NonO shares no

similarity with Ku, another non-POU, octamer-binding pro-tein (33, 43).

RNACompetitor

Wt1 " W_ - ds2

- ssl

- dsl

B

Competitor

MOL. CELL. BIOL.

:, -I ., I ..*

i

i"... .4 iA .WW.WL44Wk, .:44 #A,,I

Page 9: Protein, Is the Mammalian Homolog of Drosophila nonA

NonO, AN UNCONVENTIONAL OCTAMER-BINDING PROTEIN 5601

Northern blots revealed three forms of NonO mRNA:ubiquitous (2.7 kb), lymphoid cell specific (1.6 kb), andretinal cell specific (3.8 kb). On the basis of size, weconcluded that the cDNA clone characterized here is theubiquitous form. Our inability to immunologically detect twospecies of NonO protein in BCL1 nuclei (Fig. 3B) could bedue to their similar molecular sizes, the absence of sharedepitopes, or their presence in different subcellular compart-ments. The difference in size between the three NonOmRNAs probably results from alternative processing of thesame precursor. This view is suggested by our finding (datanot shown) of no more than two hybridizing fragmentsgenerated in human and mouse genomic DNA, using variousrestriction enzymes that do not cut within the ubiquitouscDNA. Furthermore, initial sequences of potential lymphoidand retinal cell-specific cDNA candidates matched that ofthe ubiquitous form perfectly at different segments butcompletely diverged at both 5' and 3' UTRs (data notshown). Cell-type-specific inclusion versus exclusion of ex-ons has been observed for several genes (4, 35, 36). Indeed,at least three alternative splicing patterns giving rise to twopolypeptides were observed for the single nonA gene (20).Indirect evidence suggests differential tissue accumulation ofnonA during development (1, 44).Our previous observations (11) and the presence of a

potential HTH domain followed by a region high in chargedamino acids predicted that recombinant NonO would havethe ability to bind dsDNA. Direct examination by EMSAwith octamer-containing oligonucleotides confirmed this hy-pothesis. However, the binding of NonO is more complexthan anticipated, probably in part because of the presence oftandem RNP sites. NonO binding to dsDNA occurs inde-pendently of its binding to ssDNA. Given the consensusRNP sites, it was not surprising that NonO bound RNA.However, it also appears to do this independently of dsDNAbinding. This conclusion was suggested by the observationthat RNA competes with ssDNA but not octamer dsDNA.Further confirmation was provided by our segregation ofdsDNA- and ssDNA-binding activities in mutants in whichthe RNP or the putative HTHlfcharged region domains weredeleted.While several proteins that bind both forms of DNA have

been characterized, TFIIIA (reviewed in reference 55) maybe the only precedent for specific DNA and RNA binding.TFIIIA acts as a positive transcription factor by binding toan internal control region in the oocyte 5S RNA gene. Itperforms a second essential function by binding 5S RNA inthe cytoplasm, thereby stabilizing the RNA prior to ribo-some assembly. The specificity of RNA binding is contro-versial (discussed in reference 19); some reports claimed thatTFIIIA binding to RNA resembles its DNA binding, whileothers found no similarity between 5S DNA and 5S RNAbinding. However, both nucleic acids interact with the samelinear array of nine zinc fingers (30). NonO, on the otherhand, has evolved separate binding sites for dsDNA andRNA. A dual-site model is consistent with the fact thatneither the RNA (RNP)- nor the DNA (HTH)-binding motifsproposed for NonO have been shown to bind the othernucleic acid.Given the demonstration of dual and separate binding

sites, do both have physiological significance? Clearly du-plex octamer binding, although specific, is of relatively lowaffinity. This may account for our inability to date tofootprint bacterially expressed NonO-octamer complexes.Similar failures have been noted for other low-affinity pro-tein-DNA complexes, including the testis-determining factor

SRY (9) and the recombination signal sequence-bindingprotein T160 (47). However, we were able to footprint NonOpurified to apparent homogeneity from a B-cell line (11). Thediscrepancy could indicate the necessity for a posttransla-tional modification or a protein-protein interaction; neither isprovided by bacteria. ssDNA, and predictably RNA, inter-acts with the NonO RNP domain at significantly higheraffinity. The conservation of this region in Drosophila nonAsuggests that RNA binding is critical to the function of bothproteins.The most critical issue yet to be addressed is the function

of NonO and whether this function is shared by the tissue-specific forms. Although we have no functional data forrecombinant NonO, previously we showed that the same60-kDa material, affinity purified from BCL1 nuclear extract,transactivated the BCL1 promoter in vitro and that thetransactivation was octamer dependent (18). The similarsizes of NonO and OTF-2 raise the possibility that thisactivity was provided by copurified OTF-2. If so, OTF-2would have had to be present in amounts below the detectionlimits of our peptide analysis. This is unlikely, but theinterpretation is further complicated by the potential copu-rification of the lymphoid cell-specific form of NonO. Inaddition to nucleic acid-binding domains, an essential crite-rion for a transcription factor is a transactivation domain.Within ubiquitous NonO, there is a stretch of 10 glutamineresidues near the N terminus and a proline-rich region nearthe carboxyl terminus. Both of these domains have beenshown to be involved in protein-protein interactions withintranscriptional initiation complexes, and they function toenhance transcriptional activity (reviewed in reference 34).An alternative interpretation of our in vitro transcriptionresults (18) is that NonO functioned as a cofactor withconventional octamer-binding proteins to affect transcrip-tion.Although a role for ubiquitous NonO in Ig gene expression

remains to be established, its homology with nonA and itsalternative retinal form raise distinct functional possibilities.Since NonO is the mammalian equivalent of nonA, it isconceivable that a retinal cell-specific NonO has a high-affinity binding site for a gene involved in visual processing.Octamer- and degenerate octamer-binding proteins of thePOU family act as transcriptional activators in the develop-ing forebrain (reviewed in reference 45). Indeed, knockoutexperiments (4a) suggest that the most important functionfor OTF-2 is in the brain, not the immune system.

ACKNOWVLEDGMENTSWe thank William Watson for technical assistance. We also thank

Richard Baer for discussions and Fred Kruse for preparing themouse tissue Northern blot. We thank Jeff Hall and Kathy Rendahlfor sharing data prior to publication.

This work was supported by NIH grants AI-18499 and GM-31689(to P.W.T. and J.D.C.) and NS28126, VA research, and the Ella C.McFadden Charitable Trust (to CMC). J.D.C. holds the Edwin L.Cox distinguished chair in immunology and genetics.

REFERENCES1. Besser, H. V., P. Schnabel, C. Wieland, E. Fritz, R Stanewsky,

and H. Saumweber. 1990. The puff-specific Drosophila proteinBj6, encoded by the gene no-on-transient-A, shows homology toRNA binding proteins. Chromosoma 100:37-47.

2. Chang, J.-Y. 1985. Thrombin specificity; requirement for apolaramino acids adjacent to the thrombin cleavage site of polypep-tide substrate. Eur. J. Biochem. 151:217-224.

VOL. 13, 1993

Page 10: Protein, Is the Mammalian Homolog of Drosophila nonA

5602 YANG ET AL.

3. Clerc, R. G., L. M. Corcoran, H. J. LeBowitz, D. Baltimore, andP. A. Sharp. 1988. The B-cell-specific Oct-2 protein containsPOU box and homeo box-type domains. Genes Dev. 2:1570-1581.

4. Collier, V. L., W. A. Kronert, P. T. Donnell, K. A. Edwards, andS. I. Bernstein. 1990. Alternative myosin hinge regions areutilized in a tissue-specific fashion that correlates with musclecontraction speed. Genes Dev. 4:885-895.

4a.Corcoran, L. M., M. Karrelas, G. J. V. Nossal, Z.-S. Ye, T.Jacks, and D. Baltimore. 1993. Oct-2, although not required forearly B-cell development, is critical for later B-cell maturationand for postnatal survival. Genes Dev. 7:570-582.

5. Craft, C. M., D. H. Whitmore, and L. A. Donoso. 1990.Differential expression of mRNA and protein encoding retinaland pineal s-antigen during the light/dark cycle. J. Neurochem.55:1461-1473.

6. Dodd, I. B., and J. B. Egan. 1990. Improved detection ofhelix-turn-helix DNA-binding motifs in protein sequences. Nu-cleic Acids Res. 18:5019-5026.

7. Eaton, S., and K. Calame. 1987. Multiple DNA sequenceelements are necessary for the function of an immunoglobulinheavy chain promoter. Proc. Natl. Acad. Sci. USA 84:7634-7638.

8. Garcia-Blanco, M. A., R. G. Clerc, and P. A. Sharp. 1989. TheDNA-binding homeo domain of the Oct-2 protein. Genes Dev.3:739-745.

9. Giese, K., J. Cox, and R. Grosschedl. 1992. The HMG domain oflymphoid enhancer factor 1 bends DNA and facilitates assemblyof functional nucleoprotein structures. Cell 69:185-195.

10. Grosschedl, R., and D. Baltimore. 1985. Cell-type specificity ofimmunoglobulin gene expression is regulated by at least threeDNA sequence elements. Cell 41:885-897.

11. Hanke, J. H., N. F. Landolfi, P. W. Tucker, and J. D. Capra.1988. Identification of murine nuclear proteins that bind to theconserved octamer sequence of the immunoglobulin promoterregion. Proc. Natl. Acad. Sci. USA 85:3560-3564.

12. Harlow, E., and D. Lane. 1988. Antibodies: a laboratory man-ual. Cold Spring Harbor Laboratory Press, Cold Spring Harbor,N.Y.

13. Heisenberg, M. 1972. Comparative behavioral studies on twovisual mutants of Drosophila. J. Comp. Physiol. 80:119-136.

14. Henikoff, S. 1984. Unidirectional digestion with exonuclease IIIcreates targeted breakpoints forDNA sequencing. Gene 28:351-359.

15. Herr, W., R. A. Sturm, R. G. Clerc, L. M. Corcoran, D.Baltimore, P. A. Sharp, H. A. Ingraham, M. G. Rosenfeld, M.Finney, G. Ruvkun, and H. R. Horvitz. 1988. The POU domain:a large conserved region in the mammalian pit-1, oct-1, oct-2,and Caenorhabditis elegans unc-86 gene products. Genes Dev.2:1513-1516.

16. Hotta, Y., and S. Benzer. 1970. Genetic dissection of theDrosophila nervous system by means of mosaics. Proc. Natl.Acad. Sci. USA 67:1156-1163.

17. Ilaria, R., D. Wines, S. Pardue, S. Jamison, S. R. Ojeda, J.Snider, and M. R. Morrison. 1985. A rapid microprocedure forisolating RNA from multiple samples of human and rat brain. J.Neurosci. Methods 15:165-174.

18. Johnson, D. G., L. Carayannopoulos, J. D. Capra, P. W. Tucker,and J. H. Hanke. 1990. The ubiquitous octamer-binding pro-tein(s) is sufficient for transcription of immunoglobulin genes.Mol. Cell. Biol. 10:982-990.

19. Joho, K. E., M. K. Darby, E. T. Crawford, and D. D. Brown.1990. A finger protein structurally similar to TFIIIA that bindsexclusively to SS RNA in xenopus. Cell 61:293-300.

20. Jones, K. R., and G. M. Rubin. 1990. Molecular analysis ofno-on-transient A, a gene required for normal vision in Dro-sophila. Neuron 4:711-723.

21. Kadonaga, J. T., A. J. Courey, J. Ladika, and R. Tjian. 1988.Distinct regions of Spl modulate DNA binding and transcrip-tional activation. Science 242:1566-1570.

22. Kemler, I., and W. Schaffner. 1990. Octamer transcriptionfactors and the cell type-specificity of immunoglobulin geneexpression. FASEB J. 4:1444-1449.

23. Kemler, I., E. Schreiber, M. M. Muller, P. Matthias, and W.Schaffner. 1989. Octamer transcription factors bind to twodifferent sequence motifs of the immunoglobulin heavy chainpromoter. EMBO J. 8:2001-2008.

24. Kenan, D. J., C. C. Query, and J. D. Keene. 1991. RNArecognition: towards identifying determinants of specificity.Trends Biochem. Sci. 16:214-220.

25. Ko, H.-S., P. Fast, M. McBride, and L. M. Staudt. 1988. Ahuman protein specific for the immunoglobulin octamer DNAmotif contains a functional homeobox domain. Cell 55:135-144.

26. Kulkarni, S. J., A. F. Steinlauf, and J. C. Hail. 1988. Thedissonance mutant of Drosophila melanogaster, isolation, be-havior and cytogenetics. Genetics 118:267-285.

27. Laemmli, U. K. 1970. Cleavage of structural proteins during theassembly of the head of the bacteriophage T4. Nature (London)227:680-685.

28. Landolfi, N. F., J. D. Capra, and P. W. Tucker. 1986. Interac-tion of cell-type-specific nuclear proteins with immunoglobulinVH promoter region sequences. Nature (London) 323:548-551.

29. Landolfi, N. F., X.-M. Yin, J. D. Capra, and P. W. Tucker. 1988.A conserved heptamer upstream of the IgH promoter regionoctamer can be the site of a coordinate protein-DNA interac-tion. Nucleic Acids Res. 16:5503-5514.

30. Liao, X., K. R. Clemens, L. Tennant, P. E. Wright, and J. M.Gottesfeld. 1992. Specific interaction of the first three zincfingers of TFIIIA with the internal control region of the xenopus5S RNA gene. J. Mol. Biol. 223:857-871.

31. Lipman, D. J., and W. R. Pearson. 1985. Rapid and sensitiveprotein similarity searches. Science 227:1435-1441.

32. Mason, J. O., G. T. Williams, and M. S. Neuberger. 1985.Transcription cell type specificity is conferred by an immuno-globulin VH gene promoter that includes a functional consensussequence. Cell 41:479-487.

33. May, G., C. Sutton, and H. Gould. 1991. Purification andcharacterization of Ku-2, an octamer-binding protein related tothe autoantigen Ku. J. Biol. Chem. 266:3052-3059.

34. Mitchell, P. J., and R. Tjian. 1989. Transcriptional regulation inmammalian cells by sequence-specific DNA binding proteins.Science 245:371-378.

35. Mosthaf, L., K. Grako, T. J. Dull, L. Coussens, A. Ullrich, andD. A. McClain. 1990. Functionally distinct insulin receptorsgenerated by tissue-specific alternative splicing. EMBO J.9:2409-2413.

36. Munroe, S. H., and M. Lazar. 1991. Inhibition of c-erbAmRNAsplicing by a naturally occurring antisense RNA. J. Biol. Chem.266:22083-22086.

37. Okayama, H., and P. Berg. 1982. High efficiency cloning offull-length cDNA. Mol. Cell. Biol. 2:161-170.

38. Pak, W. L. 1991. Molecular genetic studies of photoreceptorfunction using Drosophila mutants. Prog. Clin. Biol. Res. 362:1-32.

39. Pak, W. L., H. Grossfeld, and S. Arnold. 1970. Mutants of thevisual pathway of Drosophila melanogaster. Nature (London)227:518-520.

40. Peterson, M. G., N. Tanese, B. F. Pugh, and R. Tjian. 1990.Functional domains and upstream activation properties ofcloned human TATA binding protein. Science 248:1625-1630.

41. Pierani, A., A. Heguy, H. Fujii, and R. G. Roeder. 1990.Activation of octamer-containing promoters by either octamer-binding transcription factor 1 (OTF-1) or OTF-2 and require-ment of an additional B-cell-specific component for optimaltranscription of immunoglobulin promoters. Mol. Cell. Biol.10:6204-6215.

42. Poellinger, L., B. K. Yoza, and R. G. Roeder. 1989. Functionalcooperativity between protein molecules bound at two distinctsequence elements of the immunoglobulin heavy-chain pro-moter. Nature (London) 337:573-576.

43. Reeves, W. H., and Z. M. Sthoeger. 1989. Molecular cloning ofcDNA encoding the p7O (Ku) lupus autoantigen. J. Biol. Chem.264:5047-5052.

44. Rendahl, K. G., K. R. Jones, S. J. Kulkarni, S. H. Baguily, andJ. C. Hall. 1992. The dissonance mutation at the no-on-tran-sient-A locus of D. melanogaster. Genetic control of courtship

MOL. CELL. BIOL.

Page 11: Protein, Is the Mammalian Homolog of Drosophila nonA

NonO, AN UNCONVENTIONAL OCITAMER-BINDING PROTEIN 5603

song and visual behaviors by a protein with putative RNA-binding motifs. J. Neurosci. 12:390-407.

45. Rosenfeld, M. G. 1991. POU-domain transcription factors: pow-erful developmental regulators. Genes Dev. 5:897-907.

46. Sambrook, J., E. F. Fritsch, and T. Maniatis. 1989. Molecularcloning: a laboratory manual, 2nd ed. Cold Spring HarborLaboratory Press, Cold Spring Harbor, N.Y.

47. Shirakata, M., K. Huppi, S. Usuda, K. Okazaki, K. Yoshida, andH. Sakano. 1991. HMG1-related DNA-binding protein isolatedwith V-(D)-J recombination signal probes. Mol. Cell. Biol.11:4528-4536.

48. Singh, H., R. Sen, D. Baltimore, and P. A. Sharp. 1986. Anuclear factor that binds to a conserved sequence motif intranscriptional control elements of immunoglobulin genes. Na-ture (London) 319:154-158.

49. Smith, D. B., and K. S. Johnson. 1988. Single-step purification ofpolypeptides expressed in Escherichia coli as fusions withglutathione-S-transferase. Gene 67:31-40.

50. Sturm, R. A., G. Das, and W. Herr. 1988. The ubiquitous

octamer-binding protein Oct-1 contains a POU domain with ahomeo box subdomain. Genes Dev. 2:1582-1599.

51. Stunn, R. A., and W. Herr. 1988. The POU domain is a bipartiteDNA-binding structure. Nature (London) 336:601-604.

52. Tanaka, M., and W. Herr. 1990. Differential transcriptionalactivation by Oct-i and Oct-2: interdependent activation do-mains induce Oct-2 phosphorylation. Cell 60:375-386.

53. Wirth, T., L. Staudt, and D. Baltimore. 1987. An octameroligonucleotide upstream of a TATA motif is sufficient forlymphoid-specific promoter activity. Nature (London) 329:174-178.

54. Yang, Y.-S., W. J. Watson, P. W. Tucker, and J. D. Capra.1993. Construction of recombinant DNA by exonuclease reces-sion. Nucleic Acids Res. 21:1889-1893.

55. You, W., N. Veldhoen, F. Baudin, and P. J. Romaniuk. 1991.Mutations in 5S DNA and 5S RNA have different effects on thebinding of Xenopus transcription factor IIIA. Biochemistry30:2495-2500.

VOL. 13, 1993