seed proteomics

12
Review Seed proteomics Ján A. Miernyk a , Martin Hajduch b, a USDA, Agricultural Research Service, Plant Genetics Research Unit, Department of Biochemistry, Interdisciplinary Plant Group, University of Missouri, Columbia, MO 65211, USA b Institute of Plant Genetics and Biotechnology, Slovak Academy of Sciences, Nitra, Slovak Republic ARTICLE INFO ABSTRACT Article history: Received 27 October 2010 Accepted 10 December 2010 Available online 21 December 2010 Seeds comprise a protective covering, a small embryonic plant, and a nutrient-storage organ. Seeds are protein-rich, and have been the subject of many mass spectrometry-based analyses. Seed storage proteins (SSP), which are transient depots for reduced nitrogen, have been studied for decades by cell biologists, and many of the complicated aspects of their processing, assembly, and compartmentation are now well understood. Unfortunately, the abundance and complexity of the SSP requires that they be avoided or removed prior to gel- based analysis of non-SSP. While much of the extant data from MS-based proteomic analysis of seeds is descriptive, it has nevertheless provided a preliminary metabolic picture explaining much of their biology. Contemporary studies are moving more toward analysis of protein interactions and posttranslational modifications, and functions of metabolic networks. Many aspects of the biology of seeds make then an attractive platform for heterologous protein expression. Herein we present a broad review of the results from the proteomic studies of seeds, and speculate on a potential future research directions. © 2010 Elsevier B.V. All rights reserved. Keywords: Electrophoresis Mass spectrometry Proteins Proteomics Seeds Seed storage proteins Contents 1. Proteomics .......................................................... 390 2. Seeds ............................................................. 390 3. Seed storage proteins .................................................... 390 3.1. Terminology ..................................................... 390 3.2. Seed storage protein synthesis and processing .................................. 391 3.3. Seed storage proteins; the dynamic range problem ................................ 392 4. Seed development ...................................................... 392 4.1. Endosperm development .............................................. 393 4.2. Embryo development ................................................ 394 JOURNAL OF PROTEOMICS 74 (2011) 389 400 Abbreviations: CM, central metabolism; CS, cellular structures; DIGE, difference in gel electrophoresis; ER, endoplasmic reticulum; HS, hormones and signalling; LCMS/MS, liquid chromatography coupled with tandem-MS; MALDI-TOF PMF, matrix-assisted laser desorption/ionization peptide mass fingerprinting; MT, membrane transport; MS, mass spectrometry; NA, nucleic acid metabolism; PF, protein folding; PS, protein synthesis; PSV, protein storage vacuoles; PT, protein targeting; PTM, posttranslational modification (s); PUF, proteins of unknown function; SSP, seed storage proteins; SR, stress response Corresponding author. Institute of Plant Genetics and Biotechnology, Slovak Academy of Sciences, Akademicka 2, P. O. Box 39A, 950 07 Nitra, Slovak Republic. Tel.: +421 37 6943346; fax: +421 37 7336660. E-mail address: [email protected] (M. Hajduch). 1874-3919/$ see front matter © 2010 Elsevier B.V. All rights reserved. doi:10.1016/j.jprot.2010.12.004 available at www.sciencedirect.com www.elsevier.com/locate/jprot

Upload: independent

Post on 15-Nov-2023

1 views

Category:

Documents


0 download

TRANSCRIPT

J O U R N A L O F P R O T E O M I C S 7 4 ( 2 0 1 1 ) 3 8 9 – 4 0 0

ava i l ab l e a t www.sc i enced i r ec t . com

www.e l sev i e r . com/ loca te / j p ro t

Review

Seed proteomics

Ján A. Miernyka, Martin Hajduchb,⁎aUSDA, Agricultural Research Service, Plant Genetics Research Unit, Department of Biochemistry, Interdisciplinary Plant Group,University of Missouri, Columbia, MO 65211, USAbInstitute of Plant Genetics and Biotechnology, Slovak Academy of Sciences, Nitra, Slovak Republic

A R T I C L E I N F O

Abbreviations: CM, central metabolism; CShormones and signalling; LC–MS/MS, liqudesorption/ionization peptide mass fingerprprotein folding; PS, protein synthesis; PSV, pproteins of unknown function; SSP, seed sto⁎ Corresponding author. Institute of Plant Gen

Nitra, Slovak Republic. Tel.: +421 37 6943346;E-mail address: [email protected] (M. Ha

1874-3919/$ – see front matter © 2010 Elsevidoi:10.1016/j.jprot.2010.12.004

A B S T R A C T

Article history:Received 27 October 2010Accepted 10 December 2010Available online 21 December 2010

Seeds comprise a protective covering, a small embryonic plant, and a nutrient-storageorgan. Seeds are protein-rich, and have been the subject of many mass spectrometry-basedanalyses. Seed storage proteins (SSP), which are transient depots for reduced nitrogen, havebeen studied for decades by cell biologists, and many of the complicated aspects of theirprocessing, assembly, and compartmentation are now well understood. Unfortunately, theabundance and complexity of the SSP requires that they be avoided or removed prior to gel-based analysis of non-SSP. While much of the extant data from MS-based proteomicanalysis of seeds is descriptive, it has nevertheless provided a preliminarymetabolic pictureexplainingmuch of their biology. Contemporary studies aremovingmore toward analysis ofprotein interactions and posttranslational modifications, and functions of metabolicnetworks. Many aspects of the biology of seeds make then an attractive platform forheterologous protein expression. Herein we present a broad review of the results from theproteomic studies of seeds, and speculate on a potential future research directions.

© 2010 Elsevier B.V. All rights reserved.

Keywords:ElectrophoresisMass spectrometryProteinsProteomicsSeedsSeed storage proteins

Contents

1. Proteomics . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3902. Seeds . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3903. Seed storage proteins . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 390

3.1. Terminology . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3903.2. Seed storage protein synthesis and processing . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3913.3. Seed storage proteins; the dynamic range problem . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 392

4. Seed development . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3924.1. Endosperm development . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3934.2. Embryo development . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 394

, cellular structures; DIGE, difference in gel electrophoresis; ER, endoplasmic reticulum; HS,id chromatography coupled with tandem-MS; MALDI-TOF PMF, matrix-assisted laserinting; MT, membrane transport; MS, mass spectrometry; NA, nucleic acid metabolism; PF,rotein storage vacuoles; PT, protein targeting; PTM, posttranslational modification (s); PUF,rage proteins; SR, stress responseetics and Biotechnology, Slovak Academy of Sciences, Akademicka 2, P. O. Box 39A, 950 07fax: +421 37 7336660.jduch).

er B.V. All rights reserved.

390 J O U R N A L O F P R O T E O M I C S 7 4 ( 2 0 1 1 ) 3 8 9 – 4 0 0

5. Seed germination . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3945.1. Endosperm-dominant seeds . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3945.2. Cotyledon-dominant seeds . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 395

6. Seeds as a biotechnology platform. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3957. The future of seed proteomics . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 395Acknowledgements. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 396References. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 396

“Animals are something invented by plants to move seedsaround. An extremely yang solution to a peculiar problemwhich they faced.” – Terence McKenna

1. Proteomics

As a discipline, proteomics is a broad, instrument-intensiveresearch area that has progressed rapidly since its genesisalmost twenty years ago. While protein isolation and separa-tion methods have improved during this period [1–3], it hasbeen improved instrumentation that has driven expansion ofthe field [4–6]. Significant improvements in sensitivity, massaccuracy, and fragmentation in recent years have led towidespread adoption of proteomic strategies [7]. As the fieldof proteomics matures there have been an increasing numberof very specialized studies; organism-specific [e.g., [8,9]], cellor organelle-specific [e.g. ][10,11], and analyses that targetnarrowphysiological [12] or developmental [13] targets. Hereinwe address the burgeoning field of seed proteomics.

2. Seeds

The most prominent case of plant domestication by man wasduring the Neolithic agricultural revolution [14]. Humans wereseed-gathering foragers during most of prehistory, but beganto cultivate plants starting in the Epipaleolithic/NeolithicPeriod. Cultivation and selection ultimately gave rise toseeds with increased levels of protein and oil, characteristicswhich stimulated both human evolution and, more recently,plant biology. Seeds can be protein-dense based upon arelatively small number of genes encoding seed storageproteins (SSP) [15–17], and this relationship between humandiet and seed protein density has encouraged the early studiesin plant proteomics [18–20].

Seeds comprise the three basic parts of embryonic plants;the embryo proper [21], a tissue containing nutrients stored foruse by the embryo prior to achieving autotrophy, and the seed-coat protective covering [22]. Formation of the seed completesa reproductive cycle that begins with development of flowers,and pollination [13]. The embryo develops from the zygoteformed by the fusion of an egg and a sperm cell, and the seedcoat develops from the integuments of the ovule [23]. Thenutrient-storage tissue can be the cotyledon(s), the endo-sperm, or the megagametophyte in gymnosperms. Theendosperm, derived from the parent plant via double fertil-

ization, is usually triploid, and is rich in either oil and proteinor starch and protein [24]. In some species, the embryo isembedded in the endospermwhile in others the endosperm isabsorbed by the embryo as the latter grows within thedeveloping seed, and the cotyledons of the embryo becomefilledwith the stored nutrients. Thusmature seedsmight haveno remaining endosperm (embryo dominant) or theymight bemostly endosperm (endosperm dominant) [13]. The embryo-dominant seeds include all of the agriculturally importantlegumes while endosperm-dominant seeds include all of thegrains plus a few exceptional dicots (e.g. Brazil nut (Bertholletiaexcelsa HBK)) and castor oil seeds (Ricinus communis L.) [13].

3. Seed storage proteins

3.1. Terminology

The terminology of SSP is complex and can be dauntinglyconfusing to the non-expert. The first systematic classifica-tion system was developed by Osborne [25], and grouped SSPon the basis of solubility in H2O (albumins), dilute saline(globulins), alcohol: water mixtures (prolamins), and dilutealkali or acid (glutelins). The majority of agriculturallyimportant SSP are albumins, globulins, or prolamins[16,17,26]. While albumins are found in all seeds, prolaminsand glutelins are most abundant in monocotyledon seeds andglobulins are prevalent in dicotyledon seeds [17,27].

The globulin SSP have additionally been grouped basedupon sedimentation, 7S and 11S [26]. While this systemappears simple and straightforward, some authors haveeschewed clarity for specific scientific detail. As a result, it isnot uncommon to additionally find reference to 3S or 12Sglobulins. Taking this to the extreme, there are also literaturereports of 2.2S and 11.3S globulins [28].

Furthermore, there are a plethora of trivial names assignedto SSP. Initially, the cereal prolamins were named based ontheir Latin generic names; the zeins from maize (Zea mays)[29], hordeins from barley (Hordeum vulgare) [30], secalins fromrye (Secale cereale) [31], etc. There are exceptions to even thisnomenclature however, and the wheat (Triticum aestivum)prolamins were named gliadins [32,33]. The 7S globulins arecalled vicilins [34,35], while the 11S globulins are calledlegumins [36]. A third group of globulins, the convicilinshave been found only among members of the legume tribeVicieae [37]. The convicilins arose from one or more insertionswithin the N-terminal region of vicilin-like progenitor, and arenot in any way related to the jackbean (Canavalia ensiformis)seed lectin, concanavalin A. Finally, the trivial name cactin

391J O U R N A L O F P R O T E O M I C S 7 4 ( 2 0 1 1 ) 3 8 9 – 4 0 0

has been used to describe the 2S albumin SSP from seeds ofCereus jamacaru [38].

Finally, in the postgenomics era it has been well estab-lished that SSP are encoded by multigene families of varyingsizes [33,39–42]. Unfortunately, even here this is no consis-tency in nomenclature as; α, α', A, B, 1, 2, etc. are all variouslyused, sometimes in confusing hybrid configurations (e.g., 2S1).

3.2. Seed storage protein synthesis and processing

The SSP are synthesized on the rough-endoplasmic reticulum(ER), and targeted to the ER lumen by an N-terminal signalsequence which is co-translocationally removed by signalpeptidase [43]. The SSP are then sorted from bulk proteintraffic through the secretory pathway to their site of deposi-tion, the protein storage vacuole (PSV) [44]. There are twotargeting pathways for SSP, one for prolamins and a second forthe non-prolamin SSP [44]. The prolamin-containing PSV ariseby direct vesiculation from the ER, and the prolamin SSP donot traverse the classical secretory pathway. Non-prolaminSSP do pass through the secretory pathway, and are sortedfrom a post-Golgi compartment to the PSV [45].

Fig. 1 – A schematic model for the synthesis, processing, and assprepro-polyprotein in the cytoplasm has a MW of 70,302 Da andco-translocational import into the lumen of the rough-ER, the sigand two high-mannose glycans are added to the primary sequonMW is increased to 71,736 Da by glycan addition. After transfer tomodified (Man removal; Xyl and Fuc addition). The slight decreatransport to the protein storage vacuole (PSV) the pro-sequencemodified glycan) is removed by a Kexin-type protease [155]. Withtwice within specific linker regions (blue) [156]. The internal fragand the two remaining fragments are joined by formation of a dglycolytic processing yield multiple products, each with a uniqueCys; ○, Man; □, GlcNAc; Δ, Fuc; ◊, Xyl.

The prolamin-PSV can have a complex internal architec-ture, which is thought to be the result of different rates ofsynthesis of the various subunits [46]. In contrast, non-prolamin PSV have a uniformly granular appearance [47]. Anadditional morphological characteristic of prolamin-PSV is,since they are directly derived from the rough-ER, that theyare studdedwith ribosomes [48]. Rice (Oryza sativa) endospermis unusual in containing both prolamin- and non-prolaminPSV [49].

The 2S, 7S, and 11S SSP are synthesized as large precursors(now known to be prepro-polyproteins) (Fig. 1) which undergoinitial processing and assembly while still within the ER [50–55]. In addition to the N-terminal signal sequence, thecanonical SSP-precursor primary sequence includes a pro-sequence containing the PSV-targeting information [56], plusat least two linker/protease cleavage sites [57,58]. It is notuncommon for the SSP pro-proteins to display the canonicalAsn-X-Ser/Thr glycosylation sequon, and become N-glycosy-lated during passage through the ER [59]. In some instancesthe N-glycosylation sequons are within the pro-sequence orlinker regions, and are subsequently removed during proteo-lytic processing (Fig. 1) [60]. In other instances, however, they

embly of a hypothetical 2S/7S/11S seed storage protein. Thewould be cleaved into 44 peptides by tryptic digestion. Uponnal sequence (green) would be removed by signal peptidases Asn-X-Thr/Ser. Despite removal of the signal sequence, thethe Golgi apparatus, the glycan located in the pro-sequence isse in MW (71,041) reflects removal of six Man residues. Upon(red) that contained the PSV sorting information (plus thein the PSV the primary sequence of the model SSP is cleavedment (grey) is degraded by non-specific vacuolar peptidases,isulfide bond. The multiple steps of both proteolytic andMW and pI value, and pattern of tryptic peptides. N, Asn; C,

Fig. 2 – Depletion of the abundant storage proteins as aprelude to proteomic analysis of seed proteins. Lane A,fractional solubility of the maize endosperm prolamins. A1,total proteins; A2, dilute-buffer soluble proteins, A3,extraction of the zein SSP with 35% (v/v) ethanol. Lane B,“salting out” of soybean globulin SSP [73]. B1, total proteinsfrom mature seeds, B2, globulin-depleted proteins; B3,proteins precipitated by incubation with 10 mM CaCl2.Lane C, reduction of the beta-conglycinins from a totalprotein fraction isolated from developing soybean seeds byimmobilized-lectin chromatography [74]. Lane D,immunoremoval of the agglutinin proteins from maturecastor seed endosperm preparations. D1, total endospermproteins, D2, ricin-depleted endosperm proteins, D3,removed castor SSP [75]. The positions of size markerproteins are indicated to the left and right of the Y-axes.

392 J O U R N A L O F P R O T E O M I C S 7 4 ( 2 0 1 1 ) 3 8 9 – 4 0 0

persist and are present in the final fully-processed SSP (Fig. 1)[61]. Proteolytic processing and assembly continue duringtransit through the secretory pathway, and are completedwithin the PSV [56,62,63].

After proteolytic processing has been completed, the finalpolypeptide associations are stabilized by formation of at leastone disulfide bond [64,65]. The final stable SSP structures canbe as simple as a “αβ-heterodimer” or as complex as a α3β3-heterohexamer [55,66]. It is noteworthy that the incisive SSP-based research of a generation of plant cell biologists has beenmechanistically verified and extended by recent results basedupon availability of genome sequence information andapplication of tandem mass spectrometry (MS/MS) [39,66,67].It is equally important to note that the understanding of SSPprocessing can greatly simplify interpretation of MS results(Fig. 1). This is especially true of gel-based results, where itcould be difficult to interpret the position of a protein that isvery different from the MW and pI of the primary translationproduct unless one has prior knowledge of both proteolyticand glycolytic processing events (Fig. 1).

3.3. Seed storage proteins; the dynamic range problem

This brings us to an important point in the review. Even if theseeds of a hypothetical plant have only, for example, a 2Salbumin and an 11S globulin as SSP, the complexity in 2D gelspot-patterns can be nearly overwhelming because of thecontributions of extended multigene families [26,27,33,41,42]plus heterogeneity in both proteolytic [68] and glycolytic[60,61,69] processing!

The abundance of the SSP can be a great benefit……if youare studying SSP. If not, especially if a gel-based strategy isemployed, then the SSP can substantially interfere withanalysis of total proteins (Fig. 2). In some instances, priorknowledge of the amino acid composition of the SSP can beexploited to allow their avoidance. An excellent example ofthis addresses the SSP of castor endosperm which containrelatively low levels of Lys [70]. By using a Lys-specific N-hydroxysuccinimide-activated Cy-dye to stain the gels, thereappear to be far fewer SSP spots than would have beenvisualized after staining with Coomassie Blue, Sypro, etc.Using this approach, itwas possible to visualize castor proteinsat the same pI and Mr values as the SSP [71].

Otherwise, SSP-depletion can be an important componentof sample preparation [56]. With the exception of theprolamins (Fig. 2A), fractional solubility has not proved to begenerally useful in SSP-depletion. However, if a “total protein”fraction has been initially isolated, then fractional solubility ofthe globulins can be exploited by selective precipitation [72,73](Fig. 2B). If the interfering SSP are N-glycosylated, it is possibleto use immobilized-lectin affinity chromatography for theirremoval. This strategy was useful in removing the acidicsubunit of beta-conglycinin from total soybean (Glycine max (L.)Merr.) seed proteins [74] (Fig. 2C). If suitable antibodies areavailable [75], then immunoremoval can be a useful strategyfor the depletion of SSP (Fig. 2D). Application of any of thesemethods will need to be individually refined. For example,antibodies and lectins might need to be chemically cross-linked in order to be useful if SSP are prepared underdenaturing conditions [e.g., 74].

4. Seed development

Specific characteristics of seeds simplify their study. Firstly, allcell division is completed within a few days after fertilization.This marks the line of demarcation between embryogenesisand seed development. Subsequent cellular specializationtakes place in the absence of cell division [13,76]. Secondly,nearly all biosynthetic activity during seed development isdirected toward the accumulation of storage polymers (oils,polysaccharides, and proteins) [77,78]. These compounds, andthe subcellular structures that contain them, are inert depotsawaiting the activities responsible for mobilizing them toprovide biosynthetic intermediates necessary until the devel-oping seedling becomes autotrophic [79,80]. Thirdly, seeds arerelatively simple anatomically; comprising the embryonicaxes, storage tissue(s), and the protective seed-coat.

During typical analyses, seed-coats are removed anddiscarded, while embryos are usually either removed or,because of their relatively small size, ignored. Thus, “seedproteomics” usually means proteomic analysis of the storagetissues/organs. While SSP are very abundant, accounting foras much as 60% of total seed protein, they are generallyconsidered inert. If they were not biochemically removed priortoMS analysis, then they have beenmanually subtracted fromthe analyses described herein.

393J O U R N A L O F P R O T E O M I C S 7 4 ( 2 0 1 1 ) 3 8 9 – 4 0 0

4.1. Endosperm development

The endosperm is composed of specialized cells that producelarge quantities of storage polymers [81,82]. The amount ofendosperm in a mature seed is variable. For example, itcomprises the majority of the whole seed mass in cereals,such as rice, maize, and wheat, and it is prominent in theseeds of a few dicots, such as castor and Brazil nut. In otherspecies, for examplemouse-ear cress (Arabidopsis thaliana), theendosperm is almost completely absent in the mature seed[22,83]. Regardless of the pattern of endosperm persistence, itis vital for embryo development in much the same way theplacenta is essential for mammalian embryos [22].

Similar patterns of identified protein distribution wereseen among the endosperm-dominant starch-rich seeds ofwheat, barley, rice, andmaize. Using a 2-DE plus eitherMALDI-TOF PMF [81,84,85] or LC–MS/MS [86–89] experimental design,a total of 1496 proteins were identified. When the proteins areseparated into 10 functional classes [13]; modified from [90],the distribution is: Central Metabolism (CM), 34%; CellularStructures (CS), 12; Stress Response (SR), 5; Nucleic Acidmetabolism (NA), 2; Protein Synthesis (PS), 2; Protein Folding(PF), 5; Protein Targeting (PT), 7; Hormones and Signalling (HS),2; Membrane Transport (MT), 2; and Proteins of UnknownFunction (PUF), 29 (Fig. 3). It is noteworthy that conditions thatmight be considered as stress for the non-seed components ofa plant are part of the normal developmental programof seeds

Fig. 3 – The functional annotation of MS-identified seed proteinsStructure (CS), Stress Response (SR), Nucleic Acid metabolism (NTargeting (PT), Hormones and Signalling (HS), Membrane Transponot included in the analysis. The starch-rich endosperm categorymaize [81,84–89]. The oil-rich endosperm category includes 437cotyledon-dominant seeds of A. thaliana [105] and B. napus [102,proteins were identified from the starch-rich cotyledon-dominanManual subtraction of the SSP reduced this total to 64. A total ofmegagametophyte of Cunninghamia lanceolata (Lamb) Hook seed

[91]. Water “stress” is a good example. With the possibleexception of recalcitrant seeds, all seeds become dehydratedas they approach maturity and prepare for quiescence [91].

The results from separate proteomic analyses of develop-ingwheat [81] and rice [88] endosperm, andwheat [89] and riceembryos [92,93] highlight the differences in these two seedorgans. The CM proteins were the most abundant in allinstances; however members of the PF category were relative-ly more abundant in endosperm, while proteins included inthe SR andHS categories weremore abundant in embryos. Thehigher proportion of proteins involved in signalling mightreflect the role(s) of the embryo in controlling metabolism inthe endosperm [c.f., [94]]. Proportionally, there are far morePUF proteins in endosperm than in embryos irrespective of theavailability of whole genome sequence data.

Castor is an unusual example of an oil-rich endosperm-dominant seed; most accumulate starch rather than oil. Usinga 2-DE plus LC–MS/MS strategy, Houston et al., [71] identified522 proteins from developing castor endosperm. Discountingthe SSP, themost abundant castor proteinswere involvedwithCM, PF, SR, and CS (Fig. 3).

Amyloplasts are non-green plastids specialized for thesynthesis and accumulation of starch [95]. Balmer et al. [96]described isolation of amyloplasts from developing wheatendosperm, and used a 2-DE LC–MS/MS strategy to identify289 proteins. In addition to all of the enzymes necessary forstarch biosynthesis, they were also able to identify many

. The 10 categories [13] are; Central Metabolism (CM), CellularA), Protein Synthesis (PS), Protein Folding (PF), Proteinrt (MT), and Proteins of Unknown Function (PUF). The SSP arecomprises 1496 proteins identified from rice,wheat, barley, orproteins identified from castor [71]. Analysis of the oil-rich106] yielded 1049 identified proteins, while a total of 278t seeds of pea (P. sativum) [109], and lentil (L. culinaris) [110].71 non-SSP proteins were identified from the oil-richs [134].

394 J O U R N A L O F P R O T E O M I C S 7 4 ( 2 0 1 1 ) 3 8 9 – 4 0 0

proteins involved with nitrogen and sulfur assimilation, andamino acid and lipid biosynthesis. The composition of thewheat endosperm amyloplast proteome is more similar tothat of castor endosperm leucoplasts [97] or even greenplastids [98,99] than the wheat endosperm cell cytoplasm.

4.2. Embryo development

Typically embryo-dominant seeds contain starch plus SSP asthe storage polymers (peas, beans, lentils), or oil plus SSP(mouse-eared cress, canola (Brassica napus L.)), soybean/barrelmedic (Medicago truncatula Gaertn.)/lotus (Lotus japonicus L.). Todate, the cotyledons of legume seeds have received the mostattention from “omics biologists,” in part due to theiragronomic importance [13,15,79,80,100].

There have been extensive proteomic studies of developingG. max [101,102], M. truncatula [77,103], and L. japonicus [104]seeds. Experimental strategies included 2-DE plusMALDI-TOF,2-DE plus LC–MS/MS, or GeLC–MS (Proteins are separatedusing 1-D SDS-gels which are subsequently divided intoequally-sized segments. Proteins in each segment are re-duced, alkylated, and digested in-gel. An LC-system then usedto separate and supply the peptides to the mass detector)analysis of tryptic peptides. Seed proteome data for G. max,M.truncatula, and L. japonicus have been collected, analyzed, andcan be retrieved from: http://bioinfoserver.rsbs.anu.edu.au/utils/PathExpress/pathexpress4legumes.php. A total of 1723proteins were identified. Manual subtraction of 316 SSP left1407 identified proteins from the three legume species. Whenthese proteins were categorized, the percent distribution was:CM, 49%; CS, 19; NA, 2; PS, 3; PF, 5; PT, 3; HS, 3; MT, 4; SR, 3; and,PUF, 10. In all three species a relatively large number of lateembryo abundant proteins were identified.

Similar studies of A. thaliana [105] and B. napus [102,106]seeds yielded similar results. A total of 1290 proteins wereidentified either by 2-DE plus LC–MS/MS or MudPIT. Aftermanual removal of 241 SSP entries, 1049 identified proteinsremained. Once again, the CM category was by far the mostpopulous (36%), followed by CS (19), and PUF (17).

Agrawal and Thelen [107] used the phospho-proteinspecific fluorescent dye ProQ Diamond as a probe for analysisof developing B. napus seeds. They were able to detect 234phospho-protein spots, 103 of which were identified by LC–MS/MS. Not surprisingly, most of the identified proteins werein the CM and HS categories. It was, however, somewhatsurprising that many of the cruciferin SSP subunits gave apositive phospho-protein stain since these proteins had notbeen previously described as phospho-proteins.

Jain et al. [108] used MudPIT to identify 80 proteins fromplastids purified from developing B. napus embryos. Theplastid proteins complement was enriched in enzymes ofthe CM grouping, and was mid-way between the non-greenplastids from wheat and castor endosperm [96,97] and thechloroplasts isolated from green organs [98,99].

In contrast with these oil plus SSP embryo-dominant seeds,the main starch plus SSP seeds that have been characterizedare pea (Pisum sativum) [109], and lentil (Lens culinaris) [110]. Acombination of MALDI-TOF PMF and LC–MS/MS was used toidentify 122 proteins from mature L. culinaris seeds. Manualsubtraction of the SSP reduced this number to 25. Of these 6

were grouped in CM, 4 each in CS and PT, and 3 each in NA andSR. Similar results were obtained when Bourgeois et al. [109]usedMALDI-TOF PMF to identify 156 proteins frommature peaseeds. Manual subtraction of the SSP left 39, distributedamong CM (16), PUF (8), CS (7), and PF (4). A persistent problemwith analysis of the starch plus SSP legume seeds is the lack ofgood genomic or EST resources.

5. Seed germination

Mature quiescent seeds are dispersed at low (5–15%) moisturecontent, andwithmetabolic activity at a standstill [91,111]. Forgermination, quiescent seeds need only be hydrated at asuitable temperature in the presence of O2. Germinationbegins with water uptake by the seed (imbibition), continuesthrough the elongation the embryonic axis inside the seed,and is visiblymanifest by protrusion of the radicle through theseed coat [112]. This sequence continues until the transition ofthe heterotrophic seedling to an autotrophic plant. Germina-tion involvesmany cellular andmetabolic events, coordinatedby complex regulatory networks. Seed dormancy, the intrinsicability to temporarily block radicle elongation in order tooptimize the timing of germination, will be treated herein as aspecialized variant of quiescence [113–115].

Dormancy is a process whereby germination is delayed inorder to avoid conditions adverse for seedling survival.Release from dormancy is controlled by perception of acombination of environmental signals [91,112,113,115,116].Temperature, light quality, and hormonal balance (abscisicacid and gibberellins) all play key roles in release fromdormancy.

The period between germination and assumption ofautotrophy is broadly referred to as postgerminative growth.It is during this period that the seed storage polymers (oil,polysaccharides, SSP) are mobilized to provide biosyntheticintermediates. In general, the enzymes necessary for polymerdegradation are synthesized de novo during postgerminativegrowth. Thus, SSP proteases (endo- and exo-), lipases and theglyoxylate cycle enzymes, and starch-degrading enzymes canserve as markers to define the period of postgerminativegrowth [114].

5.1. Endosperm-dominant seeds

Total endosperm proteins from germinated castor seeds wereanalyzed using a 2-DE plus MALDI-TOF/TOF MS strategy, andnearly 400 distinct proteins were identified [97]. Essentially allof these proteins are the same as were previously describedfrom analysis of developing castor endosperm [71]. Withouttheir removal in this study, the extremely high “background”of SSP and CM proteins precluded identification of anyproteins that could be specifically attributed to germination/postgerminative growth.

Castor endosperm plastidial and mitochondrial fractionswere also prepared and subjected to proteomic analysis [97].The proteins identified are the same as have been previouslydescribed for these organelles from other plant species andorgans [98,117,118]. Maltman et al. [119] used 2-DE, DIGE,MALDI-TOF PMF, and LC–MS/MS to show that more than 100

395J O U R N A L O F P R O T E O M I C S 7 4 ( 2 0 1 1 ) 3 8 9 – 4 0 0

proteins are differentially abundant when comparing the ERfrom developing castor endospermwith that from germinatedseeds. With the exception of a few contaminants from otherorganelles (RuBisCO, plastids; malate synthase, glyoxysomes),and PUF proteins, those identified were all either SSP ormembers of the PF and PT groups.

A proteomics strategy employing 2-DE and MALDI-TOFPMF analysis was used to identify nearly 200 proteins fromgerminating barley seeds [120]. Nearly all of the proteinsidentified are included in the CM, PF, CS, and SR categories. Bythree days after imbibitions there were large increases in theenzymes involved with starch breakdown and mobilization,such as the α- and β-amylases.

Because barley is used extensively in both the food andbrewing industries, there have been detailed analyses of a fewspecific proteins; α-amylase, peroxidases, and thioredoxins[85]. Hynek et al. [121] used LC–MS/MS to analyze a plasmamembrane-enriched fraction isolated from germinating bar-ley. Many of the proteins identified appear to be contaminantsfromother subcellular organelles, demonstrating howdifficultit is to obtain purified subcellular components. Despite thedifficulties, they also identified several bona fide plasmamembrane proteins including a H+-ATPase, a pyrophospha-tase, and a voltage-dependent anion channel [104].

Analysis of germinating rice endosperm using a 2-DE plusMALDI-TOF MS strategy allowed Yang et al. [122] to studyproteins that changed in abundance. They found that the SSP,along with members of the CM and CS groups decreased inabundance during germination, while different members ofthe CM group increased, some of which are involved withstarch breakdown. Results similar to those found withgerminating rice and barley endosperm proteins were alsofound during studies of wheat and maize [81,86].

Evidence is accumulating that indicates the cellularenvironment is increasingly oxidizing as seeds dehydrateand approach quiescence. Formation of disulfide bonds is onemanifestation of the changing redox environment. Subse-quently, in response to imbibition, the thioredoxin (NADPH,thioredoxin h, and NADP-thioredoxin reductase) and ascor-bate/glutathione systems are activated [123,124]. These sys-tems reduce disulfide bonds, and in doing so increases proteinsolubility, the rate of proteolysis, and ultimately the extent ofnitrogen and carbonmobilization [125]. The endosperm-basedthioredoxin paradigm was subsequently extended to includedicot seeds [126].

5.2. Cotyledon-dominant seeds

Essentially all reported proteomic analyses of “germinatingseeds” are actually from studies of postgerminative growth.Inevitably, 2-DE-based LC–MS/MS analyses of the postgermi-native growth of cotyledon-dominant seeds yield datasetsidentical to those of mature seeds and are dominated by thepresence of SSP [127–132]. If proteins were quantified, then themajor theme is the decrease in levels of SSP. In some instancesthis overall decrease is accompanied by transient accumula-tion of SSP degradation intermediates.

In contrast to most “shotgun” proteomic analyses of seeds,Müller et al. [133] reported the results from analysis of aspecific tissue, the endosperm cap, during germination of

cotyledon-dominant Lepisium sativum seeds. The cap is aspecific part of the endosperm surrounding the embryoradicle. Using a 2-DE plus LC–MS/MS strategy, 140 proteinswere identified. The largest group of proteins was the CMcluster, followed by SR, and PF. The endosperm cap proteomewas both qualitatively and quantitatively different from therest of the endosperm, but no proteins were identified thatmight be responsible for modification of the endospermstructure in response to radicle protrusion.

“Logically,” one might reasonably have expected the suiteof proteins identified in oil-rich seeds [12] to differ from thoseof starch-rich seeds, or for endosperm-dominant seeds todiffer from cotyledon-dominant seeds. It was surprising thenthat the patterns seen with these four groups are quite similar(Fig. 3). The implication, that a seed proteome is more similarto that of another seed than it is to another organ of even thesame plant [13], requires more attention. In Fig. 3, theapparently different pattern of proteins seen in the oil-richmegagametophyte must be considered in the context of arelatively small number of proteins identified from a singlespecies of gymnosperm (C. lanceolata) [134].

6. Seeds as a biotechnology platform

Because seeds have a high intrinsic capacity for proteinsynthesis, packaging, and storage, they should be an excellentplatform for production of transgenic products [135]. Proteinsynthesis can be driven by the strong, tissue-specific SSPpromoters [e.g., [136]]. Successful early targets have includedexpression of both industrial [137] and biomedical [138]proteins. There is relatively little information availableaddressing potential pleiotropic effects of heterologous pro-tein expression on expression of native seed proteins[139,140]. Should this become problematic, Schmidt andHerman [141] have developed a unique strategy for rebalan-cing the seed proteome in order to increase expression ofheterologous non-seed proteins.

7. The future of seed proteomics

One goal of this review was to identify areas that needincreased attention from proteomics-based researchers. Oneclear need is for more comparative analyses, especially interms of gymnosperm seeds, although in many cases this willrequire parallel development of better genomic/EST resourcesto facilitate protein identification. Proteomics researchers areincreasingly moving from gel-based to gel-free approaches,which will contribute to identification of more proteins and isamenable to use with automated, high-throughput platforms[142]. At the same time, gel-free analyses require applicationof more robust standards of statistical analysis [143]. In theforeseeable future there will remain applications for gel-basedanalyses, especially in targeted-proteomic studies such asthose involving DIGE. A combination of gel-based and gel-freemethods will likely comprise the state-of-the-art for years tocome [c.f., [144]].

There can be no debate about the need to move fromqualitative to quantitative proteomic analyses, and there are

396 J O U R N A L O F P R O T E O M I C S 7 4 ( 2 0 1 1 ) 3 8 9 – 4 0 0

proponents of both protein labeling and of label-free methodsto achieve this end [142,145]. While the debate will likelycontinue into the future, an increasing number of researchersare using a label-free method termed spectral counting [c.f., ][146–148]. Regardless of the method finally adopted, it is clearthat future proteomics-based studies of seeds will benefitfrom employing quantitative approaches.

One aspect of biology that cannot be addressed in anyother large-scale survey (e.g., transcript profiling) is theposttranslational modification (PTM) of proteins. Much ofthe elaboration of the proteome is the result of PTM [149]. Inaddition to the nature of the PTM, it is also important todetermine which residue is modified and the result of themodification. Do the PTM affect protein turnover, activity,interactions? In addition to using bottom up analysisstrategies, it is important that top down proteomic analysesare included in analysis of PTM [150]. Thus far only a handfulof studies of seed proteins have included systematic analysisof PTM, and these have addressed only phosphorylation [107].In addition to identification of the nature of PTM, it isimportant to determine their stoichiometry in the proteome.Changes in PTM stoichiometry likely indicate a specificfunctional change, while increase in protein amount with aparallel increase in the extent of PTM means that thestoichiometry has not changed and there is unlikely to beany functional consequence.

The othermajor aspect of biology that cannot be addressedin any sort of high-throughput or computational context isthat of protein interactions (e.g., the interactome). There areno extant publications addressing MS-based analysis ofprotein interactions as a component of seed biology. Butthere should be! The next wave of understanding of howproteins function will involve analysis of protein interactionsin a cellular or subcellular context [151,152].

Finally, an exciting and relatively new application of MS inproteomics studies is at the tissue- and sometimes evencellular level of analysis; imaging MS or MALDI-imaging[153,154]. By gating the detector to a specified mass (orrange), it is possible to detect the location of a protein inwhole tissue/organ mounts. Especially in conjunction withhigh-sensitivity instruments, this method could providesignificant insight into the cellular and even molecularorganization of seeds and their component tissues.

Acknowledgements

This investigation was supported by the Seventh FrameworkProgram of the European Union – International ReintegrationGrant (MIRG-CT-2007-200165). The graphics were created byM.L. Johnston. Professor J.J. Thelen provided the antibodiesused in Fig. 2D.

R E F E R E N C E S

[1] Desiderio C, Rossetti DV, Iavarone F, Messana I, CastagnolaM. Capillary electrophoresis-mass spectrometry: recenttrends in clinical proteomics. J Pharm Biomed Anal 2010;53:1161–9.

[2] Rabilloud T, Chevallet M, Luche S, Lelong C.Two-dimensional gel electrophoresis in proteomics: past,present and future. J Proteomics 2010;73:2064–77.

[3] Brewis IA, Brennan P. Proteomics technologies for the globalidentification and quantification of proteins. Adv ProteinChem Struct Biol 2010;80:1–44.

[4] May C, Brosseron F, Chartowski P, Schumbrutzki C,Schoenebeck B, Marcus K. Instruments and methods inproteomics. Meth Mol Biol 2011;696:3–26.

[5] Armirotti A, Damonte G. Achievements and perspectives oftop-down proteomics. Proteomics 2010;10:3566–76.

[6] Jacob RJ. Bioinformatics for LC–MS/MS-based proteomics.Meth Mol Biol 2010;658:61–91.

[7] Walther TC, Mann M. Mass spectrometry-based proteomicsin cell biology. J Cell Biol 2010;190:491–500.

[8] Wienkoop S, Baginsky S, Weckwerth W. Arabidopsis thalianaas a model organism for plant proteome research. JProteomics 2010;73:2239–48.

[9] Sabatinos SA, Forsburg SL. Molecular genetics ofSchizosaccharomyces pombe. Meth Enzymol 2010;470:759–95.

[10] Gilmore JM, Washburn MP. Advances in shotgun proteomicsand the analysis of membrane proteomes. J Proteomics2010;73:2078–91.

[11] Szklarczyk R, Huynen MA. Mosaic origin of themitochondrial proteome. Proteomics 2010;10:4012–24.

[12] Hajduch M, Matusova R, Houston NL, Thelen JJ. Comparativeproteomics of seed maturation in oilseeds reveals differences inintermediary metabolism. Proteomics 2011; In press.

[13] Miernyk JA, Preťová A, Olmedilla A, Klubicová K, Obert B,Hajduch M. Using proteomics to study sexual reproductionin angiosperms. Sex Plant Reprod 2011;24:9–22.

[14] Purugganan MD, Fuller DQ. Archaeological data reveal slowrates of evolution during plant domestication. Evolution2010 [Epub ahead of print].

[15] Kreis M, Forde BG, Rahman S, Miflin BJ, Shewry PR. Molecularevolution of the seed storage proteins of barley, rye andwheat. J Mol Biol 1985;183:499–502.

[16] Gibbs PE, Strongin KB, McPherson A. Evolution of legumeseed storage proteins—a domain common to legumins andvicilins is duplicated in vicilins. Mol Biol Evol 1989;6:614–23.

[17] Xu JH, Messing J. Amplification of prolamin storage proteingenes in different subfamilies of the Poaceae. Theor ApplGenet 2009;119:1397–412.

[18] Beyer K, Bardina L, Grishina G, Sampson HA. Identification ofsesame seed allergens by 2-dimensional proteomics andEdman sequencing: seed storage proteins as common foodallergens. J Allergy Clin Immunol 2002;110:154–9.

[19] Islam N, Woo SH, Tsujimoto H, Kawasaki H, Hirano H.Proteome approaches to characterize seed storage proteinsrelated to ditelocentric chromosomes in common wheat(Triticum aestivum L.). Proteomics 2002;2:1146–55.

[20] Østergaard O, Melchior S, Roepstorff P, Svensson B. Initialproteome analysis of mature barley seeds and malt.Proteomics 2002;2:733–9.

[21] De Smet I, Lau S, Mayer U, Jürgens G. Embryogenesis—thehumble beginnings of plant life. Plant J 2010;61:959–70.

[22] Haughn G, Chaudhury A. Genetic analysis of seed coatdevelopment in Arabidopsis. Trends Plant Sci 2005;10:472–7.

[23] Linkies A, Graeber K, Knight C, Leubner-Metzger G. Theevolution of seeds. New Phytol 2010;186:817–31.

[24] Wang X, Xu C, Wu R, Larkins BA. Genetic dissection ofcomplex endosperm traits. Trends Plant Sci 2009;14:391–8.

[25] Osborne TB. The vegetable proteins. Longmans. 2nd ed. NewYork: Green; 1924.

[26] Shewry PR, Napier JA, Tatham AS. Seed storage proteins:structures and biosynthesis. Plant Cell 1995;7:945–56.

[27] Weber H, Borisjuk L, Wobus U. Molecular physiology oflegume seed development. Annu Rev Plant Biol 2005;56:253–79.

397J O U R N A L O F P R O T E O M I C S 7 4 ( 2 0 1 1 ) 3 8 9 – 4 0 0

[28] Templeman TS, Demaggio AE, Stetler DA. Biochemistry offern spore germination: globulin storage proteins inMatteuccia struthiopteris L. Plant Physiol 1987;85:343–9.

[29] Lawton JW. Zein: a history of processing and use. CerealChem 2002;79:1–18.

[30] Bietz JA. Cereal prolamin evolution and homology revealedby sequence analysis. Biochem Genet 1982;20:1039–53.

[31] Kreis M, Forde BG, Rahman S, Miflin BJ, Shewry PR. Molecularevolution of the seed storage proteins of barley, rye andwheat. J Mol Biol 1985;183:499–502.

[32] Shewry PR, Tathamt AS. The prolamin storage proteins ofcereal seeds: structure and evolution. Biochem J 1990;267:1–12.

[33] Qi PF, Wei YM, Ouellet T, Chen Q, Tan X, Zheng YL. Thegamma-gliadinmultigene family in commonwheat (Triticumaestivum) and its closely related species. BMC Genomics2009;10:168.

[34] Shutov AD, Kakhovskaya IA, Braun H, Bäumlein H, Müntz K.Legumin-like and vicilin-like seed storage proteins: evidencefor a common single-domain ancestral gene. J Mol Evol1995;41:1057–69.

[35] Jin T, Albillos SM, Chen YW, Kothary MH, Fu TJ, Zhang YZ.Purification and characterization of the 7S vicilin fromKorean pine (Pinus koraiensis). J Agric Food Chem 2009;56:8159–65.

[36] Häger KP, Müller B, Wind C, Erbach S, Fischer H. Evolution oflegumin genes: loss of an ancestral intron at the beginning ofangiosperm diversification. FEBS Lett 1996;387:94–8.

[37] de Miera LE Sáenz, Ramos J, Pérez de la Vega M. Acomparative study of convicilin storage protein genesequences in species of the tribe Vicieae. Genome 2008;51:511–23.

[38] Aragão TCFR, Souza PAS, Uchôa AF, Costa IR, Bloch Jr C,Campos FAP. Characterization of a methionine-rich proteinfrom the seeds of Cereus jamacaru Mill. (Cactaceae). Braz JMed Biol Res 2000;33:897–903.

[39] Gehrig PM, Krzyzaniak A, Barciszewski J, Biemann K.Mass spectrometric amino acid sequencing of a mixture ofseed storage proteins (napin) from Brassica napus, productsof a multigene family. Proc Natl Acad Sci USA 1996;93:3647–52.

[40] Hou A, Liu K, Catawatcharakul N, Tang X, Nguyen V, KellerWA, Tsang EWT, Cui Y. Two naturally occurring deletionmutants of 12S seed storage proteins in Arabidopsis thaliana.Planta 2005;222:512–20.

[41] Magni C, Scarafoni A, Herndl A, Sessa F, Prinsi B, Espen L,Duranti M. Combined 2D electrophoretic approaches for thestudy of white lupin mature seed storage proteome.Phytochemistry 2007;68:997–1007.

[42] Feng L, Zhu J, Wang G, Tang Y, Chen H, Jin W, Wang F, Mei B,Xu Z, Song R. Expressional profiling study revealed uniqueexpressional patterns and dramatic expressional divergenceof maize alpha-zein super gene family. Plant Mol Biol2009;69:649–59.

[43] Tuteja R. Type I signal peptidase: an overview. Arch BiochemBiophys 2005;441:107–11.

[44] Frigerio L, Hinz G, Robinson DG. Multiple vacuoles in plantcells: rule or exception? Traffic 2008;9:1564–70.

[45] Herman EM, Larkins BA. Protein storage bodies and vacuoles.Plant Cell 1999;11:601–14.

[46] Lending CR, Larkins BA. Changes in the zein composition ofprotein bodies during maize endosperm development. PlantCell 1989;1:1011–23.

[47] zur Nieden U, Neumann D, Manteuffel R, Weber E. Electronmicroscopic immunocytochemical localization of storageproteins in Vicia faba seeds. Eur J Cell Biol 1982;26:228–33.

[48] Burr B, Burr FA. Zein synthesis in maize endosperm bypolyribosomes attached to protein bodies. Proc Natl Acad SciUSA 1976;73:515–9.

[49] Krishnan HB, White JA. Morphometric analysis of rice seedprotein bodies (Implication for a significant contribution ofprolamine to the total protein content of rice endosperm).Plant Physiol 1995;109:1491–5.

[50] Chrispeels MJ, Higgins TJ, Spencer D. Assembly of storageprotein oligomers in the endoplasmic reticulum andprocessing of the polypeptides in the protein bodies ofdeveloping pea cotyledons. J Cell Biol 1982;93:306–13.

[51] Lycett GW, Delauney AJ, Gatehouse JA, Gilroy J, Croy RR,Boulter D. The vicilin gene family of pea (Pisum sativum L.): acomplete cDNA coding sequence for preprovicilin. NucleicAcids Res 1983;11:2367–80.

[52] Bollini R, Vitale A, Chrispeels MJ. In vivo and in vitroprocessing of seed reserve protein in the endoplasmicreticulum: evidence for two glycosylation steps. J Cell Biol1983;96:999–1007.

[53] Gatehouse JA, Lycett GW, Delauney AJ, Croy RR, Boulter D.Sequence specificity of the post-translational proteolyticcleavage of vicilin, a seed storage protein of pea (Pisumsativum L.). Biochem J 1983;212:427–32.

[54] Dickinson CD, Hussein EH, Nielsen NC. Role ofposttranslational cleavage in glycinin assembly. Plant Cell1989;1:459–69.

[55] Jung R, Scott MP, Nam YW, Beaman TW, Bassüner R,Saalbach I, Müntz K, Nielsen NC. The role of proteolysis inthe processing and assembly of 11S seed globulins. Plant Cell1998;10:343–57.

[56] Niemes S, Labs M, Scheuring D, Krueger F, Langhans M,Jesenofsky B, Robinson DG, Pimpl P. Sorting of plant vacuolarproteins is initiated in the ER. Plant J 2010;62:601–14.

[57] Scott MP, Jung R, Muntz K, Nielsen NC. A proteaseresponsible for post-translational cleavage of a conservedAsn-Gly linkage in glycinin, themajor seed storage protein ofsoybean. Proc Natl Acad Sci USA 1992;89:658–62.

[58] Müntz K, Blattner FR, Shutov AD. Legumains—a family ofasparagine-specific cysteine endopeptidases involved inpropolypeptide processing and protein breakdown in plants.J Plant Physiol 2002;159:1281–93.

[59] Gomord V, Fitchette AC, Menu-Bouaouiche L, Saint-Jore-DupasC, Plasson C, Michaud D, Faye L. Plant-specific glycosylationpatterns in the context of therapeutic protein production. PlantBiotechnol J 2010;8:564–87.

[60] Faye L, Chrispeels MJ. Transport and processing of theglycosylated precursor of Concanavalin A in jack-bean.Planta 1985;170:217–24.

[61] Kimura Y, Ohno A, Takagi S. Structural analysis of N-glycansof storage glycoproteins in soybean (Glycine max. L) seed.Biosci Biotechnol Biochem 1987;61:1866–71.

[62] Otegui MS, Herder R, Schulze J, Jung R, Staehelin LA. Theproteolytic processing of seed storage proteins inArabidopsis embryo cells starts in the multivesicular bodies.Plant Cell 2006;18:2567–81.

[63] Kumamaru T, Uemura Y, Inoue Y, Takemoto Y, Siddiqui SU,Ogawa M, Hara-Nishimura I, Satoh H. Vacuolarprocessing enzyme plays an essential role in the crystallinestructure of glutelin in rice seed. Plant Cell Physiol 2010;51:38–46.

[64] Staswick PE, Hermodson MA, Nielsen NC. Identification ofthe cystines which link the acidic and basic components ofthe glycinin subunits. J Biol Chem 1984;259:13431–5.

[65] Marshall RS, Frigerio L, Roberts LM. Disulfide formation inplant storage vacuoles permits assembly of a multimericlectin. Biochem J 2010;427:513–21.

[66] Kochhar S, Gartenmann K, Guilloteau M, McCarthy J.Isolation and characterization of 2S cocoa seed albuminstorage polypeptide and the corresponding cDNA. J AgricFood Chem 2001;49:4470–7.

[67] Larre C, Penninck S, Bouchet B, Lollier V, Tranquet O,Denery-Papini S, Guillon F, Rogniaux H. Brachypodium

398 J O U R N A L O F P R O T E O M I C S 7 4 ( 2 0 1 1 ) 3 8 9 – 4 0 0

distachyon grain: identification and subcellular localization ofstorage proteins. J Exp Bot 2010;61:1771–83.

[68] Maruyama N, Fukuda T, Saka S, Inui N, Kotoh J, Miyagawa M,Hayashi M, Sawada M, Moriyama T, Utsumi S. Molecular andstructural analysis of electrophoretic variants of soybeanseed storage proteins. Phytochemistry 2003;64:701–8.

[69] Faye L, Sturm A, Bollini R, Vitale A, Chrispeels MJ. Theposition of the oligosaccharide side-chains ofphytohemagglutinin and their accessibility to glycosidasesdetermines their subsequent processing in the Golgi. Eur JBiochem 1986;158:655–61.

[70] Annongu AA, Joseph JK. Proximate analysis of castor seedsand cake. J Appl Sci Environ Manage 2008;12:39–41.

[71] Houston NL, Hajduch M, Thelen JJ. Quantitative proteomicsof seed filling in castor: comparison with soybean andrapeseed reveals differences between photosynthetic andnonphotosynthetic seedmetabolism. Plant Physiol 2009;151:857–68.

[72] Ferreira RB, Franco E, Teixeira AR. Calcium- andmagnesium-dependent aggregation of legume seed storageproteins. J Agric Food Chem 1999;47:3009–15.

[73] Krishnan HB, Oehrle NW, Natarajan SS. A rapid and simpleprocedure for the depletion of abundant storage proteinsfrom legume seeds to advance proteome analysis: a casestudy using Glycine max. Proteomics 2009;9:3174–88.

[74] Miernyk JA, Johnston ML. Chemical cross-linkingimmobilized-concanavalin A for use in proteomic analyses.Prep Biochem Biotech 2006;36:203–13.

[75] Miernyk JA, Johnston ML. Digging deeper into the seed proteome:Pre-fractionation of total proteins. In Seed Development: OmicsTechnologies Toward Improvement of seed Quality and Crop Yield,Editors, GK Agrawal, R Rakwal. Springer, in press.

[76] Chandler JW. Cotyledon organogenesis. J Exp Bot 2008;59:2917–31.

[77] Gallardo K, Le Signor C, Vandekerckhove J, Thompson RD,Burstin J. Proteomics of Medicago truncatula seeddevelopment establishes the time frame of diversemetabolic processes related to reserve accumulation. PlantPhysiol 2003;133:664–82.

[78] Hills MJ. Control of storage-product synthesis in seeds. CurrOpin Plant Biol 2007;7:302–8.

[79] Le BH, Wagmaister JA, Kawashima T, Bui AQ, Harada JJ,Goldberg RB. Using genomics to study legume seeddevelopment. Plant Physiol 2007;144:562–74.

[80] Gallardo K, Thompson R, Burstin J. Reserve accumulation inlegume seeds. C R Biol 2008;331:755–62.

[81] Vensel WH, Tanaka CK, Cai N, Wong JH, Buchanan BB,Hurkman WJ. Developmental changes in the metabolicprotein profiles of wheat endosperm. Proteomics 2005;5:1594–611.

[82] Sabelli PA, Larkins BA. The development of endosperm in grasses.Plant Physiol 209; 149: 14–26.

[83] Baubec T, Mittelsten Scheid O. Medea in full self-control.Trends Plant Sci 2006;11:469–71.

[84] Finnie C, Melchior S, Roepstorff P, Svensson B. Proteomeanalysis of grain filling and seed maturation in barley. PlantPhysiol 2002;129:1308–19.

[85] Finnie C, Svensson B. Barley seed proteomics from spots tostructures. J Proteomics 2009;72:315–24.

[86] Méchin V, Balliau T, Chateau-Joubert S, Davanture M,Langella O, Négroni L, Prioul JL, Thévenot C, Zivy M,Damerval C. A two-dimensional proteome map of maizeendosperm. Phytochemistry 2004;65:1609–18.

[87] Mak Y, Skylas DJ,Willows R, Connolly A, Cordwell SJ, WrigleyCW, Sharp PJ, Copeland L. A proteomic approach to theidentification and characterisation of protein composition inwheat germ. Funct Integr Genomics 2006;6:322–37.

[88] Xu SB, Li T, Deng ZY, Chong K, Xue Y, Wang T. Dynamicproteomic analysis reveals a switch between central carbon

metabolism and alcoholic fermentation in rice filling grains.Plant Physiol 2008;148:908–25.

[89] Kim ST, Wang Y, Kang SY, Kim SG, Rakwal R, Kim YC, KangKY. Developing rice embryo proteomics reveals essentialrole for embryonic proteins in regulation of seedgermination. J Proteome Res 2009;8:3598–605.

[90] Bevan M, Bancroft I, Bent E, Love K, Goodman H, Dean C,Bergkamp R, Dirkse W, Van Staveren M, Stiekema W, et al.Analysis of 19 Mb of contiguous sequence from chromosome4 of Arabidopsis thaliana. Nature 1998;391:485–8.

[91] Kranner I, Minibayeva FV, Beckett RP, Seal CE. What isstress? Concepts, definitions and applications in seedscience. New Phytol 2010;188:655–73.

[92] Woo SH, Fukuda M, Islam N, Takaoka M, Kawasaki H, HiranoH. Efficient peptide mapping and its application to identifyembryo proteins in rice proteome analysis. Electrophoresis2002;23:647–54.

[93] Wang W, Meng B, Ge X, Song S, Yang Y, Yu X, Wang L, Hu S,Liu S, Yu J. Proteomic profiling of rice embryos from a hybridrice cultivar and its parental lines. Proteomics 2008;8:4808–21.

[94] Perata P, Matsukura C, Vernieri P, Yamaguchi J. Sugarrepression of a gibberellin-dependent signaling pathway in barleyembryos. Plant Cell 1007; 9: 2197–2208.

[95] Neuhaus HE, Emes MJ. Nonphotosynthetic metabolism inplastids. Annu Rev Plant Physiol Plant Mol Biol 2000;51:111–40.

[96] Balmer Y, Vensel WH, DuPont FM, Buchanan BB, HurkmanWJ. Proteome of amyloplasts isolated from developingwheat endosperm presents evidence of broad metabolicactivity. J Exp Bot 2006;57:1591–602.

[97] Campos FAP, Nogueira FCS, Cardoso FC, Costa GCL, Del BemLEV, Domont GB, Da Silva MJ, Moreira RC, Soares AA, Jucá TL.Proteome analysis of castor bean seeds. Pure Appl Chem2010;82:259–67.

[98] van Wijk KJ. Plastid proteomics. Plant Physiol Biochem2004;42:963–77.

[99] van Wijk KJ, Peltier JB, Giacomelli L. Isolation of chloroplastproteins from Arabidopsis thaliana for proteome analysis.Meth Mol Biol 2007;355:43–8.

[100] Thompson R, Burstin J, Gallardo K. Post-genomics studies ofdevelopmental processes in legume seeds. Plant Physiol2009;151:1023–9.

[101] Hajduch M, Ganapathy A, Stein JW, Thelen JJ. A systematicproteomic study of seed filling in soybean. Establishment ofhigh-resolution two-dimensional reference maps,expression profiles, and an interactive proteome database.Plant Physiol 2005;137:1397–419.

[102] Agrawal GK, Hajduch M, Graham K, Thelen JJ. In-depthinvestigation of the soybean seed-filling proteome andcomparison with a parallel study of rapeseed. Plant Physiol2008;148:504–18.

[103] Gallardo K, Firnhaber C, Zuber H, Hericher D, Belghazi M,Henry C, Küster H, Thompson RD. A combined proteome andtranscriptome analysis of developing Medicago truncatulaseeds. Mol Cell Proteomics 2007;6:2165–79.

[104] Dam S, Laursen BS, Ornfelt JH, Jochimsen B, Staerfeldt HH,Friis C, Nielsen K, Goffard N, Besenbacher S, Krusell L, Sato S,Tabata S, Thøgersen IB, Enghild JJ, Stougaard J. The proteomeof seed development in the model legume Lotus japonicus.Plant Physiol 2009;149:1325–40.

[105] Hajduch M, Hearne LB, Miernyk JA, Casteel JE, Joshi T,Agrawal GK, Song Z, Zhou M, Xu D, Thelen JJ. Systemsanalysis of seed filling in Arabidopsis thaliana: Using generallinear modeling to assess concordance of transcript andprotein expression. Plant Physiol 2010;152:2078–87.

[106] Hajduch M, Casteel JE, Hurrelmeyer KE, Song Z, Agrawal GK,Thelen JJ. Proteomic analysis of seed filling in Brassica napus.Developmental characterization of metabolic isozymes

399J O U R N A L O F P R O T E O M I C S 7 4 ( 2 0 1 1 ) 3 8 9 – 4 0 0

using high-resolution two-dimensional gel electrophoresis.Plant Physiol 2006;141:32–46.

[107] Agrawal GK, Thelen JJ. A high-resolution two dimensionalGel- and Pro-Q DPS-based proteomics workflow forphosphoprotein identification and quantitative profiling.Meth Mol Biol 2009;527:3–19.

[108] Jain R, Katavic V, Agrawal GK, Guzov VM, Thelen JJ.Purification and proteomic characterization of plastids fromBrassica napus developing embryos. Proteomics 2008;8:3397–405.

[109] Bourgeois M, Jacquin F, Savois V, Sommerer N, Labas V,Henry C, Burstin J. Dissecting the proteome of pea matureseeds reveals the phenotypic plasticity of seed proteincomposition. Proteomics 2009;9:254–71.

[110] Scippa GS, Rocco M, Ialicicco M, Trupiano D, Viscosi V, DiMichele M, Arena S, Chiatante D, Scaloni A. The proteome oflentil (Lens culinaris Medik.) seeds: discriminating betweenlandraces. Electrophoresis 2010;31:497–506.

[111] Angelovici R, Galili G, Fernie AR, Fait A. Seed desiccation: abridge between maturation and germination. Trends PlantSci 2010;15:211–8.

[112] Bove J, Jullien M, Grappin P. Functional genomics in thestudy of seed germination. Genome Biol 2003;3REVIEWS1002.

[113] Finkelstein R, Reeves W, Ariizumi T, Steber C. Molecularaspects of seed dormancy. Annu Rev Plant Biol 2008;59:387–415.

[114] Penfield S, King J. Towards a systems biology approach tounderstanding seed dormancy and germination. Proc BiolSci 2009;276:3561–9.

[115] North H, Baud S, Debeaujon I, Dubos C, Dubreucq B, GrappinP, Jullien M, Lepiniec L, Marion-Poll A, Miquel M, Rajjou L,Routaboul JM, Caboche M. Arabidopsis seed secretsunravelled after a decade of genetic and omics-drivenresearch. Plant J 2010;61:971–81.

[116] Holdsworth MJ, Bentsink L, Soppe WJ. Molecularnetworks regulating Arabidopsis seed maturation,after-ripening, dormancy and germination. New Phytol2008;179:33–54.

[117] Lee CP, Eubel H, O'Toole N, Millar AH. Heterogeneity ofthe mitochondrial proteome for photosynthetic andnon-photosynthetic Arabidopsis metabolism. Mol CellProteomics 2008;7:1297–316.

[118] Huang S, Taylor NL, Narsai R, Eubel H, Whelan J, Millar AH.Experimental analysis of the rice mitochondrial proteome,its biogenesis, and heterogeneity. Plant Physiol 2009;149:719–34.

[119] Maltman DJ, Gadd SM, Simon WJ, Slabas AR. Differentialproteomic analysis of the endoplasmic reticulumfrom developing and germinating seeds of castor (Ricinuscommunis) identifies seed protein precursors as significantcomponents of the endoplasmic reticulum. Proteomics2007;7:1513–28.

[120] Østergaard O, Finnie C, Laugesen S, Roepstorff P, SvennsonB. Proteome analysis of barley seeds: identification of majorproteins from two-dimensional gels (pI 4–7). Proteomics2004;4:2437–47.

[121] Hynek R, Svensson B, Jensen ON, Barkholt V, Finnie C. Theplasma membrane proteome of germinating barleyembryos. Proteomics 2009;9:3787–94.

[122] Yang P, Li X, Wang X, Chen H, Chen F, Shen S. Proteomicanalysis of rice (Oryza sativa) seeds during germination.Proteomics 2007;7:3358–68.

[123] Hägglund P, Bunkenborg J, Maeda K, Svensson B.Identification of thioredoxin disulfide targets using aquantitative proteomics approach based on isotope-codedaffinity tags. J Proteome Res 2008;7:5270–6.

[124] Bønsager BC, Shahpiri A, Finnie C, Svensson B. Proteomicand activity profiles of ascorbate-glutathione cycle enzymes

in germinating barley embryo. Phytochemistry 2010;71:1650–6.

[125] Yano H, Kuroda M. Disulfide proteome yields a detailedunderstanding of redox regulations: a model study ofthioredoxin-linked reactions in seed germination.Proteomics 2006;6:294–300.

[126] Alkhalfioui F, Renard M, Vensel WH, Wong J, Tanaka CK,HurkmanWJ, BuchananBB,MontrichardF. Thioredoxin-linkedproteins are reduced during germination ofMedicago truncatulaseeds. Plant Physiol 2007;44:1559–79.

[127] Fu Q, Wang BC, Jin X, Li HB, Han P, Wei KH, Zhang XM, ZhuYX. Proteomic analysis and extensive protein identificationfrom dry, germinating Arabidopsis seeds and youngseedlings. J Biochem Mol Biol 2005;38:650–60.

[128] Sheoran IS, Olson DJ, Ross AR, Sawhney VK. Proteomeanalysis of embryo and endosperm from germinatingtomato seeds. Proteomics 2005;5:3752–64.

[129] Pawłowski TA. Proteomics of European beech (Fagus sylvaticaL.) seed dormancy breaking: influence of abscisic andgibberellic acids. Proteomics 2007;7:2246–57.

[130] Pawłowski TA. Proteome analysis of Norway maple (Acerplatanoides L.) seeds dormancy breaking and germination:influence of abscisic and gibberellic acids. BMC Plant Biol2009;9:48.

[131] Sghaier-Hammami B, Valledor L, Drira N, Jorrin-Novo JV.Proteomic analysis of the development and germination ofdate palm (Phoenix dactylifera L.) zygotic embryos. Proteomics2009;9:2543–54.

[132] Yang MF, Liu YJ, Liu Y, Chen H, Chen F, Shen SH. Proteomicanalysis of oil mobilization in seed germination andpostgermination development of Jatropha curcas. J ProteomeRes 2009;8:1441–51.

[133] Müller K, Job C, Belghazi M, Job D, Leubner-Metzger G.Proteomics reveal tissue-specific features of the cress(Lepidium sativum L.) endosperm cap proteome and itshormone-induced changes during seed germination.Proteomics 2010;10:406–16.

[134] Shi J, Zhen Y, Zheng RH. Proteome profiling of early seeddevelopment in Cunninghamia lanceolata (Lamb.) Hook. J ExpBot 2010;61:2367–81.

[135] Lau OS, Sun SS. Plant seeds as bioreactors for recombinantprotein production. Biotechnol Adv 2009;27:1015–22.

[136] Thelen JJ, Ohlrogge JB. Both antisense and sense expressionof biotin carboxyl carrier protein isoform 2 inactivates theplastid acetyl-coenzyme A carboxylase in Arabidopsisthaliana. Plant J 2002;32:419–31.

[137] Robić G, Farinas CS, Rech EL, Miranda EA. Transgenicsoybean seed as protein expression system: aqueousextraction of recombinant beta-glucuronidase.Appl Biochem Biotechnol 2010;160:1157–67.

[138] Moravec T, Schmidt MA, Herman EM, Woodford-Thomas T.Production of Escherichia coli heat labile toxin (LT) B subunitin soybean seed and analysis of its immunogenicity as anoral vaccine. Vaccine 2007;25:1647–57.

[139] Herman EM. Genetically modified soybeans and foodallergies. J Exp Bot 2003;54:1317–9.

[140] Brandão AR, Barbosa HS, Arruda MA. Image analysis oftwo-dimensional gel electrophoresis for comparativeproteomics of transgenic and non-transgenic soybean seeds.J Proteomics 2010;73:1433–40.

[141] Schmidt MA, Herman EM. Proteome rebalancing in soybeanseeds can be exploited to enhance foreign proteinaccumulation. Plant Biotechnol J 2008;6:832–42.

[142] Schulze WX, Usadel B. Quantitation inmass-spectrometry-based proteomics. Annu Rev Plant Biol2010 [Epub ahead of print].

[143] Nie L, Wu G, ZhangW. Statistical application and challengesin global gel-free proteomic analysis by mass spectrometry.Crit Rev Biotechnol 2008;28:297–307.

400 J O U R N A L O F P R O T E O M I C S 7 4 ( 2 0 1 1 ) 3 8 9 – 4 0 0

[144] Insenser MR, Hernáez ML, Nombela C, Molina M, Molero G,Gil C. Gel and gel-free proteomics to identifySaccharomyces cerevisiae cell surface proteins. J Proteomics2010;73:1183–95.

[145] Thelen JJ, Peck SC. Quantitative proteomics in plants:choices in abundance. Plant Cell 2007;19:3339–46.

[146] Stevenson SE, Chu Y, Ozias-Akins P, Thelen JJ. Validation ofgel-free, label-free quantitative proteomics approaches:applications for seed allergen profiling. J Proteomics 2009;72:555–66.

[147] Huang Y, Houston NL, Tovar-Mendez A, Stevenson SE,Miernyk JA, Randall DD, Thelen JJ. A quantitative massspectrometry-based approach for identifying proteinkinase-clients and quantifying kinase activity.Analyt Biochem 2010;402:69–76.

[148] Lundgren DH, Hwang SI, Wu L, Han DK. Role of spectralcounting in quantitative proteomics. Expert Rev Proteomics2010;7:39–53.

[149] Farley AR, Link AJ. Identification and quantification ofprotein posttranslational modifications. Meth Enzymol2009;463:725–63.

[150] Siuti N, Kelleher NL. Decoding protein modifications usingtop-down mass spectrometry. Nat Meth 2007;4:817–21.

[151] Gache V, Waridel P, Winter C, Juhem A, Schroeder M,Shevchenko A, Popov AV. Xenopus meioticmicrotubule-associated interactome. PLoS ONE 2010;5:e9248.

[152] Sharon M. How far can we go with structural massspectrometry of protein complexes? J Am Soc Mass Spec2010;21:487–500.

[153] Heeren RM, Smith DF, Stauber J, Kükrer-Kaletas B,MacAleese L. Imaging mass spectrometry: hype or hope? JAm Soc Mass Spectrom 2009;20:1006–14.

[154] Stauber J, MacAleese L, Franck J, Claude E, Snel M, Kaletas BK,Wiel IM, Wisztorski M, Fournier I, Heeren RM. On-tissueprotein identification and imaging by MALDI-ion mobilitymass spectrometry. J Am Soc Mass Spectrom 2010;21:338–47.

[155] Rautengarten C, Steinhauser D, Büssis D, Stintzi A, SchallerA, Kopka J, Altmann T. Inferring hypotheses on functionalrelationships of genes: analysis of the Arabidopsis thalianasubtilase gene family. PLoS Comput Biol 2005;1:e40.

[156] Yamada K, Shimada T, Kondo M, Nishimura M,Hara-Nishimura I. Multiple functional proteins are producedby cleaving Asn–Gln bonds of a single precursor by vacuolarprocessing enzyme. J Biol Chem 1999;274:2563–70.