mpikg public access

49
MPIKG Public Access Author Manuscript Published in final edited form as: Rath, C. B., Schirmeister, F., Figl, R., Seeberger, P. H., Schaeffer, C., & Kolarich, D. (2018}. Flagellin glycoproteomics of the periodontitis associated pathogen Selenomonas sputigena reveals previously not described 0-glycans and rhamnose fragment rearrangement occurring on the glycopeptides. Molecular and Cellular Proteomics, 17(4), 721-736. doi:10.1074/mcp.RA117.000394. Flagellin glycoproteomics of the periodontitis associated pathogen Selenomonas sputigena reveals previously not described 0-glycans and rhamnose fragment rearrangement occurring on the glycopeptides Cornelia B. Rath, Falko Schirmeister, Rudolf Figl, Peter H. Seeberger, Christina Schaffer, Daniel Kolarich Key words: Selenomonos sputigeno, flagellin glycosylation, periodontitis, glycoproteomics, glycan rearrangement

Upload: others

Post on 17-Oct-2021

2 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: MPIKG Public Access

MPIKG Public Access Author Manuscript

Published in final edited form as: Rath, C. B., Schirmeister, F., Figl, R., Seeberger, P. H., Schaeffer, C., & Kolarich, D. (2018}.

Flagellin glycoproteomics of the periodontitis associated pathogen Selenomonas sputigena

reveals previously not described 0-glycans and rhamnose fragment rearrangement occurring

on the glycopeptides. Molecular and Cellular Proteomics, 17(4), 721-736.

doi:10.1074/mcp.RA117.000394.

Flagellin glycoproteomics of the periodontitis associated pathogen Selenomonas sputigena reveals previously not

described 0-glycans and rhamnose fragment rearrangement occurring on the glycopeptides

Cornelia B. Rath, Falko Schirmeister, Rudolf Figl, Peter H. Seeberger, Christina Schaffer, Daniel Kolarich

Key words:

Selenomonos sputigeno, flagellin glycosylation, periodontitis, glycoproteomics, glycan rearrangement

Page 2: MPIKG Public Access

Flagellin glycoproteomics of the periodontitis associated pathogen

Selenomonas sputigena reveals previously not described

0-glycans and rhamnose fragment rearrangement occurring on the

g lycopeptides

Cornel ia B. Rath:t:**, Falko Schirmeister§#**, Rudolf Figl ll, Peter H. Seeberger§#,

Christina Schaffer:t:*, Daniel Kolarich§¢*

From the +Department of NanoB iotechnology, NanoG/ycobiology un it, U n iversitat fUr

Bodenkultur Wien, 1 1 90 Vienna, Austria; §Department of Biomolecular Systems, Max Planck

Institute of Colloids and Interfaces, 1 4424 Potsdam, Germany; #Department of B iology,

Chemistry, Pharmacy, Institute of Chemistry and Biochemistry, Freie Un iversitat Berl i n ,

1 4 1 95 Berl i n , Germany; IIDepartment of Chemistry, D ivision of Biochemistry, U n iversitat fUr

Bodenkultur Wien, 1 1 90 Vienna, Austria; ¢Institute for Glycomics, Gold Coast Campus,

Griffith Un iversity, Queensland , 4222, Austral ia

*To whom correspondence should be addressed :

NanoG/ycobiology un it, Department of NanoBiotechnology, Un iversitat fUr Bodenkultur Wien ,

1 1 90 Vienna, Austria . Phone: +43 1 47654-80203; Fax: +43 1 4789 1 1 2 .

Emai l: christi na .schaeffer@boku .ac.at;

Institute for Glycomics, Gold Coast Campus, Griffith Un iversity, Queensland, 4222, Austral ia ,

Phone: +61 7 5552 7026; Fax: +6 1 7 5552 9040

Emai l: d . kolarich@griffith .edu .au

**These authors contributed equal ly to this work.

Running title

Selenomonas sputigena flagel l in g lycoproteomics

Key words

Selenomonas sputigena, flagel l in g lycosylation, periodontitis , g lycoproteomics, g lycan

rearrangement

1

Page 3: MPIKG Public Access

The abbreviations used are:

AA, anthran i l ic acid;

ACN, aceton itri le;

BHI, Brain-Heart Infusion;

CBB, Coomassie Bri l l iant Blue G-250;

CID, col l is ion-induced dissociation;

0-Gal, 0-galactose;

0-GaiN, 0-galactosamine;

0-GaiUA, 0-galactu ronic acid;

o-Gle, 0-g lucose;

0-GicN , 0-g lucosamine;

0-GicUA, 0-g lucuronic acid;

dHex, deoxyhexose;

O-Man , 0-mannose;

0-Xyl, 0-xylose;

ETD, electron transfer d issociation;

FLO, fluorescence detection;

GdHCI, guan id in ium hydroch loride;

G lcNAc, N-acetylg lucosamine;

HexNAc, N-acetylhexosamine;

IPTG, isopropyl 13-0-1 -th iogalactopyranoside;

L-Ara, L-arabinose;

LB, Luria-Bertan i;

L-Fuc, L-fucose;

L-Rha, L-rhamnose;

0-Me-Rha, 0-methyl-rhamnose;

PAS, period ic acid-Sch iff;

PBS, phosphate-buffered sal ine;

PGC, porous graph itized carbon

2

Page 4: MPIKG Public Access

SUMMARY

Flagel lated, Gram-negative, anaerobic, crescent-shaped Selenomonas species are

colonizers of the d igestive system, where they act at the interface between health and

disease. Selenomonas sputigena is also considered a potential human periodontal pathogen,

but information on its viru lence factors and underlyi ng pathogenicity mechan isms is scarce .

Here we provide the fi rst report of a Selenomonas g lycoprotein, showing that S. sputigena

produces a diversely and heavily 0-glycosylated flagel l in C9L Y 1 4 as a major ce l lu lar prote in,

which carries various h itherto undescribed rhamnose- and N-acetylg l ucosamine l inked 0-

g lycans in the range from mono- to hexasaccharides. A comprehensive g lycemic and

glycoproteomic assessment revealed extensive g lycan macro- and m icroheterogeneity

identified from 22 un ique g lycopeptide species. From the multi ple s ites of g lycosylation, five

were unambiguously identified on the 437 -amino acid C9L Y 1 4 protein (Thr1 49, Ser1 82,

Thr1 99, Thr259, and Ser334 ), the only flagel l in prote in identified . The 0-g lycans addit ional ly

showed modifications by methylation and putative acetylation . Some 0-glycans carried

hitherto undescribed residues/modifications as determ ined by their respective m/z values,

reflecting the h igh d iversity of native S. sputigena flagel l i n . We also found that

monosaccharide rearrangement occurred during col l is ion- induced d issociation (CID) of

protonated g lycopeptide ions. This effect resu l ted in pseudo Y1 -g lycopeptide fragment ions

that ind icated the presence of addit ional g lycosylation s ites on a s ing le g lycopeptide . CID

oxonium ions and electron transfer d issociation, however, confirmed that just a single s ite

was g lycosylated, showing that g lycan-to-peptide rearrangement can occur on g lycopeptides

and that th is effect is inf luenced by the molecular nature of the g lycan moiety. This effect was

most pronounced with d isaccharides . This study is the first report on 0-l i nked flage l l in

g lycosylation i n a Selenomonas species, reveal ing that C9L Y14 is one of the most heavily

g lycosylated f lagel l ins described to date. This study contributes to our understand ing of the

largely under-investigated surface properties of oral bacteria . The data have been deposited

to the ProteomeXchange with identifier PXD005859 .

3

Page 5: MPIKG Public Access

INTRODUCTION

Flagel lar moti l i ty is one of the most extensively stud ied processes in prokaryotic

microbiology. The bacteria l flagel lum, a complex multi protein assembly, is best known as the

locomotive organel le that a l lows m icrobes to actively move towards favorable environments

by chemotaxis (1 ). In many pathogenic bacteria such as Escherichia coli, Vibrio cholerae,

Helicobacter spp . , Campylobacter spp., Pseudomonas aeruginosa, Borrelia burgdorferi or

Treponema spp. , and Salmonella spp the flagel lum represents an essentia l structure that is

crucial for fu l l pathogenesis (2). While it was in it ial ly bel ieved that flagel la are i nvolved in

bacterial pathogen icity solely by conferring moti l i ty, it has now become evident that this

organel le can play a role in every step of the infection cycle (3).

The typ ical bacteria l flage l lum consists of six components - a basal body, a motor, a

switch, a hook, a fi lament, and an export apparatus (4 ) . Escherichia coli and Salmonella

enterica sv. typhimurium have long served as the model organisms for studying flagel lar

assembly (4 ) , with more than 50 genes involved in flagel lar b iosynthesis and function .

Approximately half of these genes encode the structural components of the flage l lum, and

the rest is responsible for either the regu lation of flagel lar assembly or the detection and

processing of environmenta l s ignals to which flagel la respond (5) . However, there is

extensive d iversity among bacteria i n the contents and organ ization of the gene complexes

that specify flagel la as well as structural variation in the flagel lum itself (6, 7) .

The flagel la fi lament is bui l t from -20,000-30,000 copies of f lagel l in monomers (8).

The major structural flagel la proteins and their specific g lycosylation have become a focus of

recent research . Fol lowing the first description about 30 years ago, an i ncreasing number of

reports on flagel la post-translational mod ifications from archaea and bacteria has been

publ ished and reviewed (9- 1 1 ) . In most bacteria l flagel l ins described to date, the g lycans are

attached to the prote in via an 0-glycosidic l inkage. Genes for g lycosyltransferases and the

genetic information govern ing carbohydrate biosynthetic pathways that are required for

flagel l in modification are, i n many cases, located in close proximity to the genes encoding the

flagel lar apparatus. Interesting ly, flage l l in g lycosylation is main ly found in Gram-negative

4

Page 6: MPIKG Public Access

species and seems to be more commonplace in, but not l imited to, organ isms producing

polar flagel la (9) . Especial ly in pathogenic bacteria such as Campylobacter spp. ,

Helicobacter pylori and Pseudomonas aeruginosa, the important role of g lycosylation for both

flagel lar assembly as well as a means for exerting b iolog ical i nteractions has clearly been

establ ished ( 1 2- 1 5).

Selenomonas sputigena is a flagel lated, motile, anaerobic, crescent-shaped, Gram­

negative bacterium that was orig inal ly isolated from the periodontal pockets of patients

suffering from periodontitis ( 1 6) . Ora l members of the bacterial genus Selenomonas have

repeated ly been associated with periodontal d isease and recent stud ies support the

association of Selenomonas with the etiology of periodontitis, based on the detection of

Selenomonas species at sign ificant levels in subging ival b iofi lm samples of subjects with

both chronic ( 1 7) and aggressive ( 1 8) periodontitis. It appears that Selenomonas species

contribute considerably to the structural organ ization of mu ltispecies oral biof i lms ("denta l

plaque"), which is the condition under which periodontitis develops ( 1 9) . Periodontitis

conti nues to be the most frequently occurring i nflammatory d isease world-wide; in its chronic

form, it is the major cause of tooth loss and can also impact systemic health (20), underl in ing

the urgent need for therapeutic interference. However, whi le a g roup of d istinct, non-moti le

Gram-negative anaerobes (the "red complex") has been identified as keystone periodonta l

pathogens, the pathogenesis of periodontitis is sti l l not fu l ly understood (20) . Flagel lation as

present on S. sputigena cells m ight represent a hitherto unknown/unrecogn ized strategy of

the bacterium to colonize its n iche in the multispecies oral biof i lm, thereby contributing to the

establ ishment of the d isease .

To date there is essential ly no knowledge avai lable on flagel l in g lycosylation of oral

Selenomonas spp. Un l ike in mammal ian g lycoprotein synthesis, g lycosylation pathways in

bacterial species are h igh ly d iverse (21 ) , making d ifferent orthogonal approaches a necessity

to collect i nformation on monosaccharide identity, g lycan composition, and prote in

attachment. Mass spectrometry has a lways been a core technology for determin ing the

primary structure of g lycoprotein g lycans. In particular g lycopeptide product ion spectra

5

Page 7: MPIKG Public Access

provide a h igh ly sensitive and selective opportunity to i nvestigate g lycosylation levels and

s ites of g lycan attachment. Col l ision-induced d issociation (CI D) of g lycopeptides usual ly

resu lts in a prominent Y1 -ion that faci l itates identification of the peptide- l inked

monosaccharide (22). Such Y1 -assigments are, however, i ncreasingly d ifficult if more than

one site of g lycosylation is present on a s ingle peptide or if gas phase monosaccharide

rearrangements occur (23) . Halim and Zauner independently reported the detection of smal l

product ion signals in some 0-g lycopeptide product ion spectra acquired from human urinary

and human fi bri nogen g lycopeptides that they assigned to a hexose rearrangement (24, 25).

S i nce these pseudo Y1 -g lycopeptide signals were, if detected at al l , just of very low intensity,

they did not interfere with accurate assignment. The vast majority of monosaccharide

rearrangements are reported for the analysis of protonated g lycans, i n part icular when one or

more fucose residues are present on the g lycan (26). In the analysis of N-g lycopeptides,

deoxyhexose or hexose rearrangements have on ly been found to occur between N-g lycan

antennae or towards the innermost N-acetylglucosamine (GicNAc) of N-g lycans, as

comprehensively reviewed by Wuhrer et a/. (26). However, besides these few reports,

monosaccharide rearrangement and its possible effects on g lycoproteomic data analysis has

hard ly been further i nvestigated.

In th is study, we employed a variety of g lycemic and g lycoproteomic approaches to

investigate the g lycosylation of the S. sputigena flagel l in protein (Un iProt Accession

C9L Y 1 4 ), demonstrating that C9L Y 1 4 is a heavily g lycosylated protein exh ib it ing a surprising

g lyco-heterogeneity at multi ple sites of 0-g lycosylation and carrying h itherto not descri bed

0-g lycan structures. We also show that CID of 0-g lycan carrying g lycopeptides induced a

g lycan structure dependent gas phase monosaccharide rearrangement resu lting in the

formation of strong pseudo Y1 -ions. These ind icated the presence of two sites of

g lycosylation on a s ing le g lycopeptide , whi le electron transfer dissociation (ETD) confirmed

that a d isaccharide was present on a s ingle site of g lycosylation only.

6

Page 8: MPIKG Public Access

EXPERIMENTAL PROCEDURES

Bacterial Strains and Growth Conditions-Selenomonas sputigena A TCC 351 85

was obtained from the Leibniz Institute DSMZ - German Collection of Microorgan isms and

Cell Cu ltu res (Braunschweig, Germany). Bacteria were grown anaerobica l ly at 3rC in 37 g/L

Brain-Heart-Infusion (BHI) broth (Oxoid, Hampshire, UK), supplemented with 5 g/L yeast

extract (Oxoid, Hampshire, U K), 0 .5 g/L L-cysteine (Sigma), 5 IJg/mL hemin (Sigma) and

2 .0 IJg/mL menadione (Sigma). For the preparation of BHI-agar plates (swarming plates),

0 .5% w/v agar was added . Cel ls were harvested by centrifugation (5,000 x g, 20 min, 4°C),

washed twice with phosphate-buffered sal ine (PBS; pH 7 .5), and either used instantly or

stored at -20°C.

Escherichia coli DH5a cel ls and E. coli BL2 1 (DE3) cel ls (both Life Technolog ies,

Carlsbad, USA) were grown at 3rC with shaking at 200 rpm in Luria-Bertan i (LB) broth or on

LB agar p lates supplemented with 50 IJg/mL kanamycin .

Flagellin Enrichment from Bacterial Cultures-Enrichment of native S . sputigena

flagel l in protein was ach ieved by d ifferential centrifugation as described previously (27), with

minor modifications . Briefly, bacteria l cel ls from BHI-swarm ing plates were i nocu lated into

1 0 ml of BHI med ium and incubated overn ight at 3rC. Five ml of this pre-cu ltu re were then

transferred to 500 ml of ha lf-strength BHI med ium (d i luted 1 : 1 with PBS) and incubated

again overn ight at 3rC. Cel ls were harvested by centrifugation (5,000 x g, 20 min) and

resuspended in PBS (20 ml per 1 g of wet weight of ce l ls) . The suspension was blended for

2 min using a commercial blender to shear off flagel la, fol lowed by centrifugation (5,000 x g,

30 min) . The collected supernatant was centrifuged at 1 6 ,000 x g for 1 5 min and fu rther

processed by centrifugation at 40,000 x g for 3 h. Al l centrifugation steps were carried out at

4°C . The remain ing pel let was resuspended in 0 .5 ml of Mi l l i-Q (MQ) water and freeze-dried .

The purity of the sample was analyzed by SDS-PAGE.

7

Page 9: MPIKG Public Access

SDS-PAGE and Western-lmmunoblotting-SDS-PAGE was carried out on 1 2% slab

gels accord ing to Laemml i (28). Protei n bands were visual ized using Coomassie Bri l l iant

B l ue G-250 (CBB; Serva, Heidelberg, Germany) and carbohydrates were sta ined with

periodic acid-Schiff (PAS) reagent (29) . For Western- immunoblotting, prote ins were

transferred onto a n itrocel lu lose membrane (Peqlab, Erlangen, Germany) and detection was

performed at 800 nm on an Odyssey I nfrared Imaging System (LI-COR, Lincoln, USA). For

visual ization of native flagel l in C9L Y 1 4, a flagel l in-specific antiserum was used in

combination with IR Dye 800CW goat anti-mouse antibody (LI-COR). Detection of His6-tag

on recombinant C9L Y 1 4 (C9L Y 1 4R) was done with a mouse anti-His6 antibody (Life

Technolog ies) i n combination with the IR Dye 800CW goat anti-mouse anti body (LI-COR).

General Molecular Biology Methods-Restriction enzymes and l igase were

purchased from Thermo Scientific (Vienna, Austria) . Genomic DNA of S. sputigena

ATCC 35 1 85 was isolated as described previously (30). Plasmid DNA was purified from

transformed cel ls using the GeneJET™ Plasmid Min iPrep Kit {Thermo Scientific, MA, USA).

PCR fragments were ampl ified by Phusion® High-Fidel ity DNA Polymerase {Thermo

Scientific) accord ing to the manufacturer's protocol . PCR fragments and d igested plasmids

were purified from agarose gels using the GeneJET™ Gel Extraction Kit {Thermo Scientific).

Chemically competent E. coli DH5a cel ls and E. coli BL2 1 (DE3) ce l ls were transformed

accord ing to the protocol provided by the manufacturer. Recombinant cel ls were analyzed by

restriction mapping and confirmed by sequencing (Microsynth, Austria) .

Expression and Purification of Recombinant Flage//in-Recombinant S. sputigena

ATCC 351 85 flage l l in (C9LY 1 4R) was expressed as C-terminal ly hexah istid ine (His6)-tagged

protein in E. coli BL2 1 (DE3) . The cod ing sequence for C9L Y 1 4 was ampl ified from

S. sputigena ATCC 351 85 genomic DNA by PCR using the primers {Thermo Scientific)

002-fw-fla-Ncol (5'-CCGACACCATGGCATTGGTAGTTAAGAACAAC-3') and

00 1 -rev-fla-Xhol (5'-GACGACTCGAGCAGGCTGAGAACGCCGGAG-3') . The PCR product

8

Page 10: MPIKG Public Access

was d igested with Ncoi/Xhol and l igated i nto the pET28a(+) expression vector (Novagen,

Wisconsin, USA). Subsequently, pET28a_C9LY1 4R_His6 was transformed into E. coli BL2 1

(DE3) and freshly transformed cel ls were grown in LB med ium at 3rC with shaking at

200 rpm to an 00600 of -0.6. Protein expression was induced with a fina l concentration of

1 mM isopropyl 13-D-1 -th iogalactopyranoside (IPTG; Thermo Scientific) and cultu res were

grown for addit ional 3 h .

His6-tagged C9L Y 1 4R for detection purposes was purified under denaturing cond itions

using N i-NTA Agarose (Qiagen, H i lden, Germany) accord ing to the manufactu rer's protocol .

The purified protein was dia lyzed against MQ water overnight at 4 a c.

Purification of Recombinant Flagellin for Immunization of Mice-Escherichia coli

BL2 1 (DE3) cel ls harboring the expression plasmid were harvested by centrifugation

(5,000 x g, 30 m in , 4°C} and resuspended in 50 mM sod ium citrate buffer (pH 6.2) conta in ing

0 . 1 % Triton X-1 00. After add ition of lysozyme (800 IJg/mL; Sigma-Aldrich) and benzonase

(50 U/ml; Sigma-Aldrich), cel ls were incubated for 30 min at 3rC. U ltrasonication (Branson

son ifier, duty cycle 50%; output 6) was used to further break open bacteria applying ten

cycles of 1 0 pu lses with 30-s breaks, each, and soluble protein was separated from the

insoluble materia l by centrifugation (25,000 x g, 30 min, 4°C}, with both fractions contain ing

C9LY 1 4R . To i ncrease the yield of C9LY1 4R, the remain ing protein was extracted from the

pel let with 50 mM sodium citrate buffer (pH 5.5) conta in ing 5 M GdHCI, 20 mM imidazole and

0.5 M NaCI for 1 . 5 h at 4°C and shaking at 200 rpm . The extract was centrifuged (45,000 x g ,

1 h , 4°C} and the supernatant subjected to membrane-fi ltering (0.45 IJm pore s ize). The

result ing prote in extracts were combined and appl ied to a 1 -ml HisTrap HP column (GE

Healthcare, Little Chalfont, UK) . The recombinant protein was recovered in e lution buffer

(50 mM sod ium citrate buffer [pH 5.5], 5 M GdHCI, 1 M imidazole, 0 .5 M NaCI) fol lowed by

dialysis against 50 mM sodium citrate buffer (pH 5.5) . Immun ization of m ice and preparation

of polyclonal antiserum against purified C9L Y 1 4R was done at EF-BIO s . r.o . (Bratislava,

Slovakia) . 9

Page 11: MPIKG Public Access

Protein Digestion and Glycan Release-For g lycoproteomic analyses, the

preparation enriched in native S. sputigena flagel l in was separated by SDS-PAGE and

protein bands of interest were excised and desta ined (31 ) . Any cystei ne residues were

reduced with 1 0 mM d ithiothreitol (D0632; Sigma-Aldrich) for 1 h at 56°C and carbamide­

methylated by incubation in 55 mM iodoacetamide (16 1 25; Sigma-Aldrich) for 1 h at room

temperature in the dark. Subsequently, the protein was in-gel digested at 3rC with trypsin

( 1 1 04784 1 00 1 ; Roche, Basel , Switzerland) or chymotrypsin ( 1 1 4 1 8467001 ; Roche),

respectively, in 25 mM ammonium bicarbonate buffer using an enzyme-to-protein ratio of

1 :50 (w/w). Tryptic d igests were carried out overn ight, whi le chymotrypsin digests were

term inated after 4 h. Result ing (g lyco)peptides were extracted from the gel pieces and dried

in a centrifugal evaporator (in vacuo). For reversed-phase nanoLC-ESI-MSMS analyses, the

samples were resolub i l ized in 40 IJL of 0 . 1 % formic acid (9431 8; Sigma-Aldrich ) from which

1 -IJL al iquots were injected per analysis . Glycoproteomic data has been acquired for three

flagel l i n preparations that were performed on three independently g rown S. sputigena

cultu res. A technica l trip l icate analysis was performed from a single batch for the in E coli­

expressed recombinant flagel l in prote in .

0-g lycans were released from tryptic g lycopeptides by reductive 13-e l imination . Dried

g lycopeptides were incubated with 50 IJL of 0 .5 M sodium borohydride (7 1 32 1 ; Sigma­

Aldrich ) in 50 mM potassium hydroxide (22 1 473; Sigma-Aldrich) for 1 6 h at 50°C. The

released 0-g lycans were desalted over an AG 50W-X8 resin ( 1 42- 1 45 1 ; B io-Rad

Laboratories, Hercu les, USA) and purified by solid-phase extraction with porous graph itized

carbon (PGC; 2 1 0 1 0 1 , Grace Bio-Labs, Oregon, USA) (32) . For g lycemic analyses by PGC

nanoLC-ESI-MSMS, the dried 0-g lycans were resolub i l ized i n 1 5 IJL of 1 0 mM ammonium

bicarbonate of which 5 IJL were injected per analysis. Due to l im itations i n sample materia l,

g lycemic data has been acquired for a s ingle S. sputigena flagel l in preparation.

Protein and Glycan Analysis-Peptides and glycopeptides were analyzed by

reversed-phase nanoLC-ESI-MSMS on a Dionex U lt imate 3000 UHPLC onl ine coupled to a

1 0

Page 12: MPIKG Public Access

Bruker amaZon Speed ETD ion trap mass spectrometer in positive ion mode. Reversed

phase chromatography was performed on PepMap C1 8 columns (precolumn: Thermo

Fisher, PepMap1 00, 1 00 IJm [ ID] x 2 em, C 1 8, 51Jm particle size, 1 00 A pore size, P/N

1 64564; analytical column : Thermo Fisher, Accla im PepMap RSLC, 75 IJm [ID] x 1 5 em,

C 1 8, 2 IJm particle s ize, 1 00 A pore size, P/N 1 64534) using a l inear g radient of 0 . 1 % formic

acid (solvent A) and 90% acetonitri le (ACN) conta in ing 0 . 1 % form ic acid (solvent B) at a

constant flow rate of 400 nl/min at 45°C. The samples were loaded onto the trap column i n

solvent A. The analytical gradient started a t 5% of solvent B and gradual ly increased to 45%

over 30 m in . The columns were flushed with 90% solvent B for 1 0 min after each sample

before re-equi l ibrating to starting conditions.

The ion trap was set to scan from m/z 400 to 1 ,600 with an SPS Target Mass of

m/z 1 ,350 using the instrument's Enhanced Resolution mode. The ICC target was set to

200,000 with a maximum accumulation time of 50 ms. The three most intense signals of

each MS scan were selected for CID and the resu lting fragments were recorded from

m/z 1 00 to 2 ,000 in the U ltraScan mode. Besides C I D, electron transfer d issociation (ETD)

was employed for (g lyco)peptide fragmentation in separate analyses. The ETD reagent

(flouranthene) ICC Target was set to 500,000 with a maximum accumu lation t ime of 1 0 ms.

The ETD reaction time was set to 1 00 ms. The g lycopeptide signal i ntens ities were fu rther

enhanced using a Bruker CaptiveSpray NanoBooster™ ion ization source using nitrogen as

dry gas (3 Lim in at 1 50°C) and ACN as dopant solvent. An Active Exclusion setting (exclude

after th ree spectra , release after 0.5 min) was employed in the data-dependent acqu isition

(DDA) method .

The acqu ired MSMS data was analyzed using Bruker DataAnalysis 4.2 and Bruker

Prote inScape 3 . 1 with Matrix Science MASCOT Server 2 .3 . A custom database of 3,308

S. sputigena proteins was retrieved from Un iProt

(http://www.un iprot.org/uniprot/?query=selenomonas+sputigena) on June 1 0, 20 1 4 . This

database was employed for the MASCOT protein search (the search parameters are l isted in

Supplementary Table 2) . Glycopeptides were identified manual ly from the CID and ETD

1 1

Page 13: MPIKG Public Access

fragment spectra . The ExPASy tool "PepSweetener"

(http://g lycoproteome.expasy.org/pepsweetener/app/) assisted in the in it ia l identification of

the peptide and glycan moieties of the g lycopeptides based on precursor masses (33). The

onl ine tool Fragment lon Calcu lator ( I nstitute for Systems B iology, Seattle, USA;

http ://db .systemsbiology.net:8080/proteomicsToolkit/Frag lonServlet .html) was further used to

verify the detected fragment spectra . A technical dup l icate analysis of each proteolytica l ly

digested sample was performed .

The 0-g lycans released by reductive 13-e l imination (32) were analyzed by PGC

nano-flow chromatography onl ine coupled to a Bruker amaZon Speed ETD ion trap mass

spectrometer in negative ion mode . A Dionex U ltimate 3000 UHPLC was equipped with

Hypercarb columns (precolumn: KappaGuard 0 .3 mm [ID] x 30 mm, 5 1-1m particle s ize,

250 A pore s ize, P/N 35005-0303 1 5; ana lytical column: 75 1-1m [ID] x 1 00 mm, 31-1m particle

s ize, 250 A pore size, P/N 35003- 1 00065, both Thermo Scientific, Massachusetts, USA).

Solvent A was 1 0 mM ammon ium bicarbonate and solvent B was 60% ACN in 1 0 mM

ammonium b icarbonate . The flow rate was held constant at 0.9 IJL/m in at 40°C. The samples

were loaded onto the trap column i n solvent A. The analytical g radient i ncreased from 2 to

35% solvent B over a time period of 33 m in . The ion trap was operated in the U ltraScan

mode in the m/z range from 200 to 1 , 000 with an SPS Target Mass of m/z 600. The three

most i ntense signals were se lected for CID and the MSMS scan range was set to m/z 1 00 to

2 ,500 in the Enhanced Resolution mode. The acquired mass spectra (MS and MSMS) were

analyzed manual ly using Bruker DataAnalysis 4 .2 . 0-g lycans were analyzed as a technica l

dupl icate .

Monosaccharide Analysis by Reversed Phase (RP)-HPLC-To determ ine the

identity of the flagel l in 0-g lycan monosaccharide constituents, the monosaccharides

obta ined after acid hydrolys is of re leased f lage l l in 0-g lycans and of g lycopeptides were

fluorescence-tagged with anthrani l ic acid (AA) and analyzed by RP-H PLC (34 ). Further

investigation on the natu re and methylation status of the sugars was performed by

hypermethylation of the AA-Iabeled sample (35). All monosaccharide standard compounds

1 2

Page 14: MPIKG Public Access

(D-Gic, D-Gal, O-Man , D-Xyl , L-Ara, L-Fuc, L-Rha, 6-deoxy-D-g lucose, D-GicN, D-GaiN, D­

GicUA, D-GaiUA) as well as AA and methyl-d3 iodide (CD31) were obta ined from Sigma­

Aldrich, 6-deoxy-talose was prepared i n-house as described previously (35). Sequencing­

grade trypsin was purchased from Promega (Madison, WI, USA).

SDS-PAGE-separated flagel l in g lycoprote in bands were excised, destained,

carbamidomethylated, and incubated with trypsin overn ight at 3rC. The extracted material

was loaded onto RP cartridges (Phenomenex; Strata C- 1 8E [55 IJm, 70 A] 1 ml [50 mg])

equi l ibrated with 80 mM form ic acid (buffered to pH 3 .0 with ammonia) . After washing of the

sample three times with 0 .5 ml of the above mentioned buffer, the reta ined (g lyco)peptides

were eluted with 0.4 ml of 65% ACN in buffer and dried in vacuo. In para l le l , 0-g lycans were

re leased from excised gel p ieces by in-gel reductive P-el imination . For this purpose, excised ,

desta ined and dried gel pieces were covered with 1 ml of 50 mM NaOH conta in ing 1 M

sodium borohydride and incubated overn ight at 50°C. The a lkal ine solution conta in ing the

re leased 0-g lycans was loaded onto PGC cartridges (Supelco; Supelclean ENVI Carb SPE

Tubes 3 ml [0.25 g]) previously equi l ibrated with 80 mM formic acid (buffered to pH 9 .0 with

ammonia) . After washing the sample three times with 1 ml of buffer, g lycans were eluted

with 1 .2 ml of 50% ACN in buffer and dried in vacuo. Dried g lycopeptides and g lycans were

dissolved in 0 .3 ml of 4 M trifluoroacetic acid and hydrolyzed at 1 oooc for 4 h fol lowed by

drying in vacuo. Hydrolyzed samples and the monosaccharide standards (1 nmol each ) were

derivatized with AA fol lowing the microvial procedure (34 ) . In short, d ry samples were

resuspended in 1 0 IJL of MQ water, mixed with 70 IJL of label ing solution and incubated at

80°C for 1 h. Labeled sugars were then analyzed by RP-HPLC us ing a volati le buffer system

(35). Separation was performed with a core-shel l Kinetex C 1 8 column (250 x 4 .6 mm, 5 11m

particle s ize, 1 00 A pore size; Phenomenex, Torrance, CA, USA) with a Security Guard Ultra

precolumn (Phenomenex) on a Nexera X2 HPLC system with a RF-20Axs Fluorescence

Detector, equipped with a sem imicro flow cel l (Shimadzu, Korneuburg, Austria) . Solvent A

consisted of 80 mM form ic acid, buffered to pH 3 .4 with ammonia, solvent B was 80% ACN

in solvent A. The appl ied g radient started at 9.5% solvent B, ra ised to 2 1 .4% over 27 min and

1 3

Page 15: MPIKG Public Access

further to 65% in 4 min at a flow rate of 0 .85 ml/min; the column oven and flow ce l l

thermostat were set to 33°C . Fluorescence was measured with wavelengths Ex/Em 360 nm

and 425 nm. Data was analyzed i n Postrun Analysis with LabSolutions 5.73 (Shimadzu,

Germany) . For larger injection volumes, e.g. , for fractionation of peaks of interest for

inspection by MS, labeled samples were d i luted with MQ water ad 200 IJL. U p to 2 IJL were

d i rectly injected without d i lut ion. The remainder of the solution was d irectly used for

hypermethylation of AA-Iabeled sugars .

For the fractionation of peaks followed by MS analysis, dried fractions after HPLC

were redissolved i n MQ water and separated on a B iobasic column (C 1 8, 320 1-1m x 1 50 mm,

5 1-1m particle s ize; Thermo Scientific) run at a flow rate of 6 IJL/min at 35°C with an U lt imate

3000 system coupled to an amaZon ion trap mass spectrometer (Bruker, B remen, Germany).

Solvent A consisted of 80 mM form ic acid (buffered to pH 3.0 with ammonia) and solvent B of

80% ACN in solvent A. A gradient was appl ied from 1 .3% to 1 5% solvent B in 5 m in , to

56% B over 6 min and 95% B in 2 .5 m in . After mainta in ing a solvent concentration of 95% B

for 0 .4 min the column was flushed back to starting conditions over a t ime period of 0.6 min

and equi l ibrated i n starting cond itions. The ion trap was operated in the positive ion mode

with DDA. Specific parameters were optimized for a low mass range; target mass m/z 300

and a m/z range from 1 50 to 600 in the Enhanced Resolution mode with 100,000 ICC target

and 200 ms maximum accumulation t ime.

For the hypermethylation of AA-Iabeled sugars excess reagent was removed via a

self-made microcrysta l l ine cel lu lose cartridge in HILIC mode (36). The eluate was transferred

into a g lass vessel , dried over a stream of n itrogen, red issolved in 1 00 IJL of d imethyl

su lfoxide and mixed with a few crumbs of tritu rated NaOH. Subsequently, 25 IJL of CD31 were

added fol lowed by incubation for 30 min with s l ight shaking . The reaction was stopped by

addition of 0 .8 ml of 30% acetic acid and products were extracted with trichloromethane.

After washing the organic phase once with 30% acetic acid and twice with MQ water, it was

dried under a stream of n itrogen and red issolved in 25 IJL of MQ water for capi l la ry HPLC

with ESI-MS detection . F ive IJL of hypermethylated AA-monosaccharides were injected with

1 4

Page 16: MPIKG Public Access

a 20 IJL sample loop in Microl iter PickUp mode and separated on a Dionex Accla im Pepmap

(C1 8, 300 1-1m x 1 50 mm, 2 1-1m particle size; Thermo Scientif ic) at a flow rate of 6 IJL/m in at

35°C with an U ltimate 3000 system coupled to an amaZon ion trap mass spectrometer

(Bruker, Bremen, Germany). Solvent A was 80 mM formic acid (pH 3 .0) and solvent B was

80% ACN in solvent A. After 5 min of flushing with 1 .3% solvent B it was i ncreased to 38.8%

in 2 min and subsequently to 55% in m inute 32. The ion trap was operated i n the positive ion

mode with DDA. Specific parameters were optimized for a low mass range - target mass

m/z 400 and m/z range from 1 50 to 600 in the Enhanced Resolution mode with 1 00,000 ICC

target and 200 ms maximal accumulation time. MSMS parameters used were cut-off

selection defau lt, fragmentation t ime 40 ms, activated FxD, U ltraScan mode and m/z 80-800

scan range. Selected ion chromatograms and average MSMS spectra were generated using

Bruker Data Analysis 4.0. Monosaccharide analyses were performed i n dup l icate from two

b iological repl icates.

EXPERIMENTAL DESIGN AND STATISTICAL RATIONALE

A detai led g lycomics and glycoproteomics evaluation of the major flagel l in

g lycoprote in of S. sputigena was performed using a variety of orthogonal LC-MS based

techniques . Glycopeptides were generated from the SDS-PAGE separated flage l l in by

proteolytic i n-gel digestion using two proteases with orthogonal cleavage specificities.

Reversed phase LC-ESI MS/MS analyses using CID and ETD fragmentation were used to

identify and characterize the (g lyco )peptides generated by the proteolytic enzymes.

0-g lycans were released by reductive 13-el imination from flagel l in tryptic g lycopeptides

extracted from gel p ieces obta ined after SDS-PAG E separation . The released 0-g lycans

were identified by PGC-nanoLC ESI MS/MS. Monosaccharides were produced by acid

hydrolys is of flagel l in g lycopeptides and released 0-g lycans using anthran i l ic acid label l i ng

and reversed phase HPLC and MS. The same hydrolyzed monosaccharide samples have

been used for hypermethylation analyses. Sample preparation, followed by MS analysis was

1 5

Page 17: MPIKG Public Access

performed i n trip l icates (g lycopeptides/peptides)

( 0-g lycans/monosaccharides/hypermethylation ) .

DATA AVAILABILITY

and dup l icates

The mass spectrometry g lycoproteomics data have been deposited to the

ProteomeXchange Consortium (http://proteomecentra l . proteomexchange.org) via the PRIDE

partner repository (37) with the dataset identifier PXD005859.

RESULTS

Indications of Flagellin Glycosylation in S. sputigena ATCC 35185-The

separation of a crude S. sputigena cel l extract by SDS-PAGE and subsequent sta in ing with

PAS reagent revealed a s ingle prominent carbohydrate-positive protei n band with an

apparent s ize of -60 kDa (Figure 1 a) . The identity of th is band was determ ined based on

tryptic peptides as S. sputigena flagel l in Un iProt Accession C9L Y 1 4 (Supplementary

Table 1 ) .

Whi le native C9L Y 1 4 protein obtained from a crude S. sputigena cel l extract traveled

at 55-60 kDa, a recombinant His6-tagged flagel l in (C9L Y1 4R) produced in E. coli was

downsh ifted on the SDS-PAGE gel migrating as a PAS-negative band around the size

calcu lated based on amino acid composition (46 . 1 kDa; Figure 1 a). Western- immunoblotting

using a flagel l in specific antibody revealed a -1 0-1 5 kDa d ifference i n molecu lar size

1 6

Page 18: MPIKG Public Access

between C9L Y 1 4 and C9L Y 1 4R, provid ing a first ind ication of native C9L Y 1 4 being post­

translational ly modified .

C9L Y14 is the Only Detectable Flagellin in S. sputigena ATCC 35185-Enriched

native S. sputigena f lagel l in was separated by SDS-PAG E and the CBS-stained protein band

was first analyzed using tradit ional bottom-up proteomics . The prote in was proteolytical ly in­

gel digested and the (g lyco)peptides were extracted from the gel . Reversed-phase nanoLC­

ESI-MSMS confirmed the identity of the protein being S. sputigena flagel l in C9L Y 1 4

(Supplementary Figure 3) . The in it ial MASCOT search a l lowed annotation of 63.2% of the

protein sequence. Manual ana lysis of the fragment spectra showed the presence of several

0-glycopeptides. Taking a l l of the identified g lycopeptides into account, this data resulted in

a total sequence coverage of 80 .8% (Supplementary Figure 3; for fu rther detai ls p lease see

section Glycoproteomics).

Native C9L Y 1 4 exh ibited a broad protein band in SDS-PAGE analysis (F igure 1 b) ,

which is l i kely to be caused by its prote in g lycosylation . Heavy post-translational mod ification

of C9L Y 1 4 was also ind irectly supported by the fact that a plain tryptic digest performed on

equal amounts of non-g lycosylated C9L Y 1 4R resu lted in 30% higher sequence coverage

(94.5% versus 63.2% for the native prote in , Supplementary Figure 3) .

Glycomics on C9L Y14 Flagellin Identifies Novel 0-Giycan Structures-0-g lycans

were re leased from tryptic g lycopeptides of S. sputigena f lage l l in using reductive P­

el imination and analyzed us ing negative mode PGC nanoLC-ES I-MSMS . We identified s ix

different 0-g lycans in the range from tri- to pentasaccharides that consisted mostly of

deoxyhexose (dHex) and N-acetylhexosamine (HexNAc) residues (Table 1 ; Supplementary

Table 5) . Separate monosaccharide analyses identified these sugars to be rhamnose (Rha)

and GlcNAc, respectively (vide infra). The GlcNAc residue i n the Rha3GicNAc

tetrasaccharide was identified in its reduced state, ind icating that the g lycan is attached via

the GlcNAc to the protein (F igure 3) . Specific fragments at m/z 246 and 290 also indicated

1 7

Page 19: MPIKG Public Access

the presence of a branched g lycan structure as these ions were not predicted for a l inear

g lycan of the same composition by in silica fragmentation using Glycoworkbench 2 (38). In

the case of the Rha5 pentasaccharide, MSMS data confirmed the calcu lated composition.

Taken together, these data provided evidence that at least two d ifferent forms of

0-glycosylation are present on S. sputigena flagel l in (F igure 3) . In addition , th ree mod ified

ol igosaccharides were detected exh ibit ing +1 4-Da or +42-Da modifications. The exact

molecular nature of these ol igosaccharide modifications cou ld not be clarified based on this

analysis alone (Table 1 ) .

C9LY14 Flagellin 0-G/ycans Consist of Rhamnose and

N-Acety/g/ucosamine-Without any supporting i nformation on S. sputigena's biosynthetic

protein g lycosylation mach inery it was impossible to assign monosaccharide identities

d irectly from the g lycomics data . Therefore, monosaccharide analyses were performed on

the re leased 0-g lycans as well as on g lycopeptides obta ined after tryptic d igestion . Fol lowing

acid hydrolysis and AA-Iabel ing, the monosaccharide constituents were identified by RP­

HPLC. The main monosaccharides present on S. sputigena flagel l in C9L Y 1 4 were assigned

based on the comparison with defined monosaccharide standards. B lank SDS-PAGE gel

pieces from the same run were used to estimate background signals .

From the trypsin-digested g lycopeptides and the 13-el iminated 0-g lycan samples,

Fuc/Rha were identified as the major monosaccharide constituents (F igure 4a) . However,

Fuc and Rha share the same retention time in the classical reversed phase HPLC separation

with FLO and cou ld , thus, at f i rst not be d istingu ished. Hypermethylation of the a lka l ine­

released AA-Iabeled samples, however, al lowed d ifferentiation of these two deoxysugars,

reveal ing that Rha was the deoxysugar i ncorporated i nto C9L Y 1 4 0-glycans. Minor amounts

of Fuc were detected that were derived from the sample matrix as they were also present i n

HPLC-FLD of the ge l b lank (Figure 4a).

Selenomonas sputigena flagel l in C9L Y1 4 contained two additiona l sugars for which

the retention t imes did not agree with any of the avai lable standards (retention t imes 26 min

1 8

Page 20: MPIKG Public Access

and 29.8 min; F igure 4a) . Their late elution during reversed-phase chromatography (after

25 min) indicated that these compounds exh ibited a h igher hydrophobicity, potential ly

introduced by modifications such as methylation . Peaks of interest were col lected for further

investigations by LC-ESI-MS . For the peak elut ing at 26 min, so far no sol id conclusions

cou ld be drawn; for the fraction conta in ing the peak with retention time 29.8 min, a signal

was detected at m/z 300 . 1 , which corresponded to a protonated, s ingly methylated dHex

residue carrying the AA fluorescence tag (Figure 4c). Hypermethylation of the AA-Iabeled

sample, performed with deuterated methyl iodide, identified this dHex sugar as methylated

Rha (Figure 4b) .

As a consequence of the acid hydrolysis appl ied during sample processing, GlcNAc

res idues were detected as GleN (F igure 4a). The reducing end sugar should i n principal no

longer be avai lable for label ing v ia reductive amination after reductive 13-el im ination . Given

the fact that the peak area of the GleN peak was clearly increased in the tryptic g lycopeptide

sample and no fu rther peaks could be assigned either to ManN (eluting between GleN and

GaiN; (35)) or GaiN , the GleN signal detected in the 13-el iminated sample was considered to

be a matrix background effect rather than a non-reducing end GlcNAc of 0-glycans. These

resu lts were in agreement with the nanoLC-ESI-MSMS 0-g lycomics resu lts. In summary,

they confirm that the G lcNAc-contain ing 0-g lycans are l inked to the peptide via the G lcNAc.

Glycoproteomics Uncovers C9L Y14 Flagellin to be a Highly Heterogeneous

Glycoprotein-In total we identified 22 un ique g lycopeptide species derived from

S. sputigena flagel l in C9L Y 1 4 (Table 2) . This l ist of un ique glycopeptides was disti l led from

64 detected compounds exhib it ing g lycopeptide-specific features (Supplementary Table 3) .

For 27 of these compounds, the g lycopeptide identity was confirmed from the CID and ETD

data (Supplementary Table 4 ) . The remain ing 37 compounds were classif ied as l i kely

g lycopeptides based on their precursor masses, oxonium ion peaks, and/or peaks

corresponding to the sequential loss of monosaccharides from g lycopeptides, but no peptide

specific fragment ions were detected in these spectra (Supplementary Figu re 1 ) . A precursor

1 9

Page 21: MPIKG Public Access

mass (deconvoluted from measured m/z and z) was taken as an ind icator for a g lycopeptide

if it was equal (max . mass error± 0 .3 Da) to the sum of a peptide (from in silica digests) and

a p lausib le g lycan mass (composition range GlcNAc0_2Rha0_5 p lus possib le modifications). In

addition, oxon ium ions and/or specific fragmentation patterns were requ ired for confident

g lycopeptide assignment (Supplementary Figure 1 ) .

From al l detected g lycopeptides, s ix were exclusively detected as g lycopeptides,

whi le s ix addit ional ones were also found in their non-g lycosylated form, which lead us to

conclude that C9L Y 1 4 flagel l in also shows a considerable degree of g lyco­

macroheterogeneity (Figure 2) . The site of g lycosylation cou ld be unambiguously identified

for five g lycopeptides at amino acid positions Thr1 49, Ser1 82, Thr1 99, Thr259, and Ser334.

In addition to these 5 s ites Thr1 96 and Ser200 were also identified as being g lycosylated

carrying d i- and monosaccharides, respective ly, with further not yet defined substitutions of

+52 . 1 Da and +267. 1 Da (Figure 2, Table 2, Supplementary Table 4 ) . Substantial g lyco­

microheterogeneity was present on most detected g lycopeptides (F igure 5, Supplementary

Table 4, Supplementary Figure 5) . In agreement with the g lycomics data and the

monosaccharide analyses, the attached 0-glycans were main ly comprised of Rha and

G lcNAc residues. Interesti ng ly, the 0-g lycan compositions detected on the g lycopeptides

exhib i ted even larger composition diversity than found by the g lycomics ana lyses. Many

glycopeptides were detected having one, two, three or five Rha residues attached to the

peptide backbone, others were found to carry G lcNAc extended by up to five add itional Rha

res idues. Glycans consisting of an odd number of Rha residues appeared more frequent

compared to g lycans with two or four Rha res idues (Supplementary Table 4 ) .

Add itional ly, either of two disti nct modifications was detected on 3 1 out of a l l 64

detected g lycopeptides contributing to h igh g lyco-microheterogeneity. A + 1 4-Da mass sh ift

was present on several precursor ions ind icating that part of the g lycans experienced some

form of methylation during biosynthesis. Furthermore, several g lycopeptide molecu les

exh ibited a +42-Da sh ift possibly corresponding to putative acetylation (Table 2, Fig u re 5,

Supplementary Figure 5) . The g lycomics data and glycopeptide fragment spectra clearly

20

Page 22: MPIKG Public Access

showed that these modifications were present on the g lycan moieties and not on any of the

amino acids (Figure 5, Supplementary Fig u re 5) . Besides methylation and putative

acetylation, fu rther modifications were detected on some g lycopeptides that cou ld not yet be

unambiguously annotated. On Ser200 of peptide 191VNQGKTETT§F201 , one GlcNAc residue

was detected that showed an add itional mass increment of 267 . 1 Da, which yet cou ld not be

associated with any known molecule {Table 2 , Supplementary Table 4 , Supplementary

Fig u re 2) . The same peptide sequence was also detected with another mod ified g lycan

corresponding to two Rha residues, one methylation and an add itiona l +52 . 1 -Da mass

increment, a l l forming one molecule attached to Thr1 96 {Table 2, Supplementary Figure 5J).

Peptide 179NIHSQDTITVSY1 90 was found having an addit ional mass of 533.4 Da attached to

it {Table 2, Supplementary Figure 5V) . C I D fragmentation showed that two Rha residues and

one methylation are part of that modification, but to date the remain ing 227.3 Da could not be

expla ined by any of the known g lycosylation mod ifications.

Non-reducing End Rhamnose Rearrangement Produces Pseudo Y1-lon

Fragments-Manual data evaluation revealed a series of g lycopeptides carrying a

RhaGicNAc d isaccharide including their methylated and putatively acetylated species.

Interesting ly, in numerous g lycopeptide CID product ion spectra, intense Y1 -g lycopeptide

ions were detected in the same product ion spectrum that ind icated the attachment of one

GlcNAc and one Rha residue at independent s ites with in a s ingle peptide backbone

(F igures 5 and 6) . Rha-l inked 0-g lycans were also identified on S. sputigena flagel l in

C9L Y 1 4 before {Table 1 , Fig u re 3) , thus the occurrence of a g lycopeptide with two sites of

g lycosylation was entirely p lausib le. However, the detected oxon ium ions contradicted this

interpretation as these h inted towards the attachment of a s ingle d isaccharide. Subjecting the

respective g lycopeptides to ETD analyses showed that a disaccharide moiety was attached

to a s ingle amino acid (Figu re 6). These resu l ts lead us to conclude that the observed Rha­

Y1 g lycopeptide signals were more l ikely the product of a Rha rearrangement result ing from

the migration of the Rha res idue towards the peptide (Figure 6b) . This interpretation was

2 1

Page 23: MPIKG Public Access

further substantiated by the observation that the level of Rha migration was influenced by the

molecu lar natu re of the g lycan moiety. Simi lar high levels of Rha rearrangement were

observed for the native RhaGicNAc d isaccharide and its s ingly methylated form . However, if

Rha was mod ified by putative acetylation, hard ly any pseudo Rha-Y1 s ignals were detected

in the product ion spectra but a large s ing ly charged oxonium ion at m/z 392 . 1 (F igure 5). In

the case of these RhaGicNAc g lycopeptides, this behavior appeared to be i ndependent of

the peptide backbone, as a l l g lycopeptides carrying these d isaccharides exhib ited s imi lar

rearrangement fragmentation patterns.

DISCUSSION

In this study, we provide the first clear evidence that S. sputigena ATCC 35 1 85, a

Gram-negative bacterium that has emerged as a potential periodontal pathogen,

g lycosylates its flagel la . We show that the S. sputigena flagel l in C9L Y14 is modified with a

heterogeneous set of 0-l inked g lycans at mult iple serine and threonine res idues and report

h itherto not described prokaryotic 0-g lycans that S. sputigena uses to heavily modify its

flagel lar g lycoprote in . From the mult iple s ites of g lycosylation, five (i. e . , Thr1 49, Ser1 82,

Thr1 99, Thr259 and Ser334) could be unambiguously identified and from our data it is

evident that the 437 -amino acid long flagel l in protein exh ibits extensive g lycan macro- and

microheterogeneity. This g lyco-heterogeneity explains why the PAS-sta ined flagel l in­

enriched sample migrated as a rather broad a g lycoprotein band on the SDS-PAGE gel

(F igure 1 b) . It needs to be mentioned that next to C9L Y 1 4, a second protein (Un iProt

Accession F4EY J4) has been annotated as f lagel lar prote in Fl iS in the S. sputigena

ATCC 351 85 genome sequence (Nucleotide Accession NC_0 1 5437 . 1 ). C9L Y 1 4 and F4EY J4

show 58% amino acid identity and both proteins share s imi larities with flagel l ins from

Selenomonas ruminantium and other oral Selenomonas species, as well as with Anerovibrio

lipolyticus and species belonging to the genus Mitsokuel/a , al l of them members of the

Acidaminococcaceae fami ly. F4EY J4, despite its mass difference of only -2 kDa to C9L Y 1 4,

22

Page 24: MPIKG Public Access

was not detectable in any of the analyzed SDS-PAGE separated bands, confirm ing that a l l

presented data on flagel l in g lycosylation are attributable to C9L Y 1 4 .

A considerably heterogeneous pool of 0-g lycans ranging from mono- to

hexasaccharides was identified on C9L Y 1 4, consisti ng main ly of Rha and GlcNAc residues

(Table 1 , Fig u re 2). Our data on these hitherto not described 0-g lycans supported evidence

for the presence of branched and l i near structures, ind icating that the S. sputigena

glycosylation machinery can produce and attach a large variety of different g lycans onto

different sites of g lycosylation (Figure 3) . Glycopeptides were detected carrying g lycans

composed of either one, two, three or five Rha residues or one GlcNAc residue extended by

one to five Rha res idues. Futu re work wi l l show whether S. sputigena 0-glycosylation is

accompl ished by stepwise addition of s ing le monosaccharides or whether preformed g lycan

precursors are transferred onto the flagel l in prote in , and which g lycosyltransferases are

responsib le for C9L Y 1 4 g lycosylation . The CAZy database for Carbohydrate Active EnZymes

(http://www.cazy.org/b 1 644.html ) ind icates 45 g lycosyltransferases affi l i ated to various

g lycosyltransferase fami l ies to be present in the S. sputigena ATCC 351 85 genome. Four of

them (i. e. , C9L Y1 0- 1 3) are encoded by genes that are located in immed iate vici n ity of the

flagel l in structural protein C9L Y 1 4, and, thus, constitute prom ising candidates for

involvement in flagel l in g lycosylation . Further, F4EWP7 possesses a Wzy_C motif typ ical of

0-antigen polymerases, l igases and ol igosaccharyltransferases. However, s ince

S. sputigena A TCC 351 85 has been recalcitrant to genetic manipu lation so far, it was

impossible to prove the involvement of any of these enzymes in S. sputigena flagel l in

g lycosylation. The observed g lycan microheterogeneity together with the occurrence of both

threonine and serine res idues as acceptor amino acids rather h ints towards stepwise

addition and elongation, as bacteria with preformed precursors usually exhib it a more

homogenous glycosylation pattern (39). Nevertheless, relaxed specificities of bacterial 0-

ol igosaccharyltransferases have also been described previously (40).

The 0-g lycan monosaccharide constituents were unambiguously identified after AA­

Iabel ing using RP-HPLC as Rha, 0-Me-Rha and GlcNAc (Figu re 4) . The deoxyhexose Rha

23

Page 25: MPIKG Public Access

is widely d istributed among bacteria and plants . Whi le 0-Rha is main ly found as a constituent

of EPS or LPS of Gram-negative bacteria, the L-Rha isomer is more common (4 1 ) . In

flagel lar g lycans, however, Rha is rather unusual . So far Rha has been described as a

component of flagel l in 0-glycosylation i n the opportun istic human pathogen Pseudomonas

aeruginosa, and in the plant-growth promoti ng rh izobacterium Azospirillum brasilense . In

P. aeruginosa strains, L-Rha represents the reducing-end sugar, either as a s ingle

modification or further substituted through sequential extension with a heterogeneous g lycan

( 1 1 ) . In Azospiril/um brasi/ense, both L-Rha and L-Fuc have been reported in a repetitive

g lycan structure l inked to the bacteria l flagel l in (42) . Furthermore, Rha is part of a common

un ique trisaccharide on the flagel l ins of the phytopathogenic bacteria Pseudomonas syringae

pv. glycinea race 4 and P. syringae pv. tabaci 6605 (43) . The genomic information avai lable

for S. sputigena ATCC 351 85 supports the g lycoproteomics data of L-Rha modification of

flagel l in as presented in this study, s ince the bacteri um possesses homologs of the four

enzymes RmiA-D required for the biosynthesis of dTDP-L-rhamnose (4 1 ) . Specifica l ly, these

are C9LU R1 (glucose-1 -phosphate thymidi lyltransferase, RmiA), C9LUQ9 (dTDP-0-g lucose

4,6-dehydratase, RmiB), C9LURO (dTDP-4-dehydrorhamnose 3 ,5-epimerase, RmiC), and

C9LUI6 (dTDP-4-dehydrorhamnose reductase, RmiD). However, a dedicated

rhamnosyltransferase for the transfer of Rha from its nucleotide-activated form as the

reducing-end sugar of a g lycan to a polypeptide backbone remains to be found .

N-Acetylg lucosamine is a common precursor in many carbohydrate biosynthetic

pathways in bacteria, including that of peptidog lycan and, thus, is synthesized in the course

of the bacterium's housekeeping functions. There is , however, the necessity of a dedicated

0-GicNAc transferase that catalyzes the transfer of the GlcNAc residue (or a preassembled

glycan with a reducing-end GlcNAc) to the prote in . Twine et at. (44) identified an 0-l i nked

HexNAc modification - later to be recognized as 0-GicNAc (45) - on the flagel l in of

Clostridium difficile and described a g lycosyltransferase (UniProt Accession CD0240) from

the flagel lar biosynthesis gene cluster to be involved in the g lycosylation process . In the

S. sputigena genome, two genes encod ing g lycosyltransferase fami ly 2 proteins (Un iProt

24

Page 26: MPIKG Public Access

Accessions C9L Y 1 2 and C9L Y 1 3) are present adjacent to the flagel l in protein C9L Y 1 4

encod ing gene that exhib it s imi larity to the C. diffici/e transferase (36% a n d 32% sequence

identity with C9L Y 1 2 and C9L Y1 3 , respectively). Other genes on the S. sputigena

ATCC 35 1 85 genome annotated to encode g lycosyltransferases are C9LY 1 0 and C9LY1 1 .

Future work wi l l show what role these cand idate enzymes p lay i n flage l l in g lycosylation and

whether the gene locus encompassing the genes C9L Y1 0 through C9L Y 1 4 represents a real

flage l l in g lycosylation gene locus of S. sputigena ATCC 35 1 85. Probing a flagel l in-enriched

preparation or a S. sputigena cel l lysate with a 13-0-l inked GlcNAc specific monoclonal

antibody (OGT Monoclonal Antibody RL2, Thermo Scientif ic) d id not g ive any signal

(Cornel ia B . Rath, Christina Schaffer, unpubl ished observation) . However, as most GlcNAc

res idues on C9L Y 1 4 are further substituted by Rha residues and/or other modifications

(Figure 3) it is l ikely that antibody recogn ition is impaired by these mod ifications.

Mass spectrometry is one of the most frequently appl ied methods to determine the

structure of g lycoconjugates. In the analysis of re leased g lycans, monosaccharide

rearrangement occurs commonly when protonated g lycans are analyzed by C ID . However,

this has rarely been reported to occur on g lycopeptides. In these reports, however, it did not

s ign ificantly inf luence correct g lycopeptide assignment (24-26). Our data clearly shows that

monosaccharide rearrangement observed in C I D analyses of protonated g lycopeptides can

also sign ificantly i nterfere with correct data interpretation as pseudo Y1 -ions are generated

that pretend the presence of more than a s ingle site of g lycosylation (F igures 5 and 6) . The

extent of this pseudo Y1 -ion formation was g lycan structure- and s ize-dependent, whi le the

peptide moiety d id not seem to influence the observed behavior. In agreement with earl ier

stud ies performed on methylated and peracetylated g lycans, the rearrangement was

essential ly abol ished in the presence of a putative acetylation (F igure 5) (46, 47).

Nevertheless, the question remains if and how a specific g lycan structure influences the

extent of Rha migration and whether the observed fragmentation patterns possib ly offer

additional opportun ities to extract more deta i ls on the g lycan structure . It wou ld be h igh ly

interesting if information such as Rha l i nkage and/or stereochemistry cou ld be gathered from

25

Page 27: MPIKG Public Access

these observed fragmentation patterns, however earl ier stud ies performed on protonated

ol igosaccharides ind icated that this phenomenon occurred for a l l g lycosidic l inkage types and

seemed to be independent of the type of residue lost (48). I n any case our data support that

in add ition to conventional CID a combination of orthogonal fragmentation techn iques such

as ETD should be used in the primary structure evaluation of g lycoconjugates from poorly

understood organisms.

Besides g lycosylation, add itional mod ifications were observed on both the released

flagel l in C9L Y 1 4 g lycans and the g lycoprote in . Observed mass increments of + 1 4 Da or

+42 Da were annotated here as methylation, or are indicative of putative acetylat ion,

respectively. The presence of Rha methylation was confirmed by hypermethylation of AA­

Iabeled monosaccharide samples. Mod ification by acetylat ion, however, needs further

verification by orthogonal methods such as NMR. As only a l imited amount of purified

flagel l in materia l was avai lable, future work wi l l clarify this matter. Besides methylation and

putative acetylation other mod ifications were also detected on some g lycopeptides. These

cou ld, however, not be annotated based on the detected mass increments . Peptide

19 1VNQGKTETTSF201 was identified carrying a s ingle GlcNAc residue attached to Ser200

that further exh ib ited a mass increment of +267 . 1 Da, which molecu lar nature needs yet to

be determined (Supplementary Figure 2) . A methylated di-Rha g lycosylated g lycopeptide

was detected with a +52 . 1 -Da increment (Table 2) . This mod ified d isaccharide was attached

to Thr1 96 on the very same peptide (Supplementary Figure 5J). Glycopeptide

179NIHSQDTITVSY1 90 was found to carry an additional mass of +533.4 Da, which could be

partia l ly characterized to consist of a s ing ly methylated Rha d isaccharide carrying an

additional modification of +227.3 Da (Supplementary Fig u re 5V). This mass increment, too,

requires future investigations to uncover its deta i led molecular nature. Our data clearly

ind icate that the mod ifications present on the S. sputigena flagel l in g lycoprote in require a yet

to be identified subset of different enzymes that introduce the multitude of identified

modifications. The fact that even scrupulous manual data investigations did not a l low

annotation of d isti nct peptides present in the native flagel l in (e.g. , amino acid residues 1 53-

26

Page 28: MPIKG Public Access

1 78 and 2 1 6-244), ind icates that the native flagel l in is subject to other forms of mod ification

that we were not ab le to decipher, yet. Both were clearly identified in the recombinant form of

the protein (Supplementary Figure 3) .

The g lycosylation s ites of most flagel l in g lycoproteins described to date appear to be

focused around the central variable reg ion of the prote in (49, 50) , which forms the outside

surface-exposed domains (02 and 03) in the assembled fi lament ( 1 1 ) . In contrast, mu ltiple

amino acid sequence a l ignment ind icates that the N- and C-terminal reg ions, corresponding

to the DO and 01 domains, are wel l conserved in flagel l in prote ins ( 1 1 ) . Several amino acids

in the D0/01 reg ion have, in many cases, proven to be crucial for the formation of the

flagel lar fi lament and, therefore, moti l i ty (5 1 ) . Taking into consideration that we identified

potential g lycopeptides located at the N-terminal part of the S. sputigena flagel l in in addition

to C-termina l ly located ones (Figure 2), i t could well be that g lycosylation in these parts of the

protein influences flagel l in monomer i nteraction and, therefore, fi lament stab i l ity.

The innate immune receptor Toll-l ike receptor 5 (TLR5), of which f lagel l in is the

natu ra l l igand, recognizes a site of the protein that l ies predominantly in the N-terminal 01

domain , where it is centered on a stretch of eight amino acids (89-96 in Salmonella flagel l in) ,

with contri bution from the C-terminal 01 domain add itional ly required (5 1 -53). Recently, a

13-hairpin structure adjacent to the N-terminal 01 domain in S. enterica ssp. enteritidis 90- 1 3-

706 has been identified as a third reg ion fundamental for TLR5 activation in mammals (54).

In the S. sputigena flagel l in C9LY 1 4 , the g lycosylated residues T 1 49, S 1 82 and T1 99 are

very close to the N-term inal 01 /13-hairpin reg ion (Figure 2) . Glycosylation of flagel l in so close

towards the N-terminus of the prote in is unusual and has so far on ly been described for

C. jejuni FlaA (55) and for the peri plasmic flagel la of the spirochete Treponema denticola , for

which modification with a novel 0-glycan with i n the conserved 01 domain has been reported

very recently (56). It is tempting to specu late that g lycans in the proteins' N-terminal reg ion

cou ld potential ly affect recogn ition of flagel l in monomers by TLR5 and it wi l l be an interesti ng

future task to establ ish the role post-trans lational modifications of these residues play in a

larger systems biology context.

27

Page 29: MPIKG Public Access

Sti l l , the intrigu ing question remains what function S. sputigena flagel l in g lycosylation

fu lfi l ls . Being exposed on the bacterial cell surface and especia l ly being part of an important

motil ity structure, an essentia l function for flagel la assembly, motil ity or bacterium­

host/bacterium-environment interactions can be assumed . Only a relatively small subset of

oral bacteria produce flagel la and, in most cases (with the exception of Treponema dentico/a

(57)) flagel la and moti l i ty have not been strongly correlated with the colon ization or viru lence

of bacteria identified from denta l p laque. It is, however, l i kely that oral Selenomonas -

constituting a relevant portion of the biomass (58, 59) - do in some way contribute to the

pathogenic processes involved in periodontal d isease. Being motile, S. sputigena could

locate a favorable environment with in dental plaque and develop a strong association with

distinct bacteria upon selective recogn ition of the partner cel l su rface (60) . As unfortunately

no genetic tools are currently avai lable that a l low manipu lation of the S. sputigena genome, it

wi l l remain d ifficult to e lucidate the functional role of S. sputigena flagel l in g lycosylation .

Considering the divers ity of un ique g lycopeptide species and their d istribution over a

large portion of the protein sequence, the S. sputigena flagel l in cu rrently represents one of

the most heavily g lycosylated flagel l i ns . This study represents the first step towards

understanding how these Rha and GlcNAc/Rha contain ing 0-g lycans with their various

mod ifications influence or control flagel lum function and, eventual ly, pave the way for

S. sputigena's establ ishment with i n dental p laque.

This study i ncreases our understand ing of the surface properties of oral bacteria,

covering a critical gap not only in the biology of the oral cavity, but of f lagel lated

Selenomonas species in genera l . These are associated with d ifferent parts of the digestive

system - mouth , throat, gut (61 , 62) - where they can be beneficial or involved in the

degradation of local site health, either by faci l itati ng the co-organization of other bacteria or

by stimulating the host negatively. Digestive tract health is a vital component of systemic

health . Various metagenomics approaches are currently undertaken to understand the

biology of the bacteria l cel l surface and bacterial assembly in the context of systemic health.

These, however, are frequently l im ited by the lack of experimental data for mechan istic

28

Page 30: MPIKG Public Access

val idation . Bacterial cell surface g lycoproteins are known regulators of bacterial cel l-ce l l

interactions and modulators of host cells' functions . Detai led g lycoproteomics i nsights into

their individual g lycoprotein s ignatures thus are ideal ly su ited to complement metagenomics

datasets to decipher the b iolog ica l basis of d igestive tract systemic health .

29

Page 31: MPIKG Public Access

REFERENCES

1 . Adler, J . ( 1 966) Chemotaxis in bacteria . Science 1 53 , 708-71 6

2 . Josenhans, C . , and Suerbaum, S. (2002) The role of moti l ity a s a viru lence factor i n

bacteria . lnt J Med Microbio/ 29 1 , 605-61 4

3 . Duan, Q., Zhou , M . , Zhu , L., and Zhu, G . (20 1 3) Flagel la and bacterial pathogenicity.

J Basic Microbia/ 53, 1 -8

4 . Macnab, R. M. (2003) How bacteria assemble flagel la . Annu Rev Microbial 57, 77-

1 00

5 . Macnab, R. M. (2004) Type Il l flagel lar protein export and flagel lar assembly. Biochim

Biophys Acta 1 694, 207-2 1 7

6 . Penn , C . W. , and Luke, C . J . ( 1 992) Bacterial flagel lar d iversity and sign ificance in

pathogenesis. FEMS Microbial Lett 1 00 , 331 -336

7 . Bardy, S. L., Ng, S. Y . , and Jarre l l , K . F. (2003) Prokaryotic moti l ity structures.

Microbiology 1 49, 295-304

8. Namba, K., and Vonderviszt, F. ( 1 997) Molecular architecture of bacteria l flage l lum. Q Rev Biophys 30, 1 -65

9 . Logan, S . M. (2006) Flagel lar g lycosylation - a new component of the motil ity

repertoire? Microbiology 1 52 , 1 249- 1 262

1 0 . Hayakawa, J., and lsh izuka , M. (20 1 2) Flagel lar Glycosylation: Current Advances.

Glycosylation, 1 27- 1 52

1 1 . Merino, S . , and Tomas, J. M. (201 4) Gram-negative flagel la g lycosylation . lnt J Mol

Sci 1 5 , 2840-2857

1 2 . Goon, S . , Kel ly, J. F ., Logan, S . M . , Ewing , C. P., and Guerry, P. (2003) Pseudaminic

acid, the major modification on Campylobacter flagel l in, is synthesized via the Cj 1 293 gene.

Mol Microbia/ 50, 659-67 1

1 3 . Guerry, P . , Ewing, C. P . , Sch irm, M., Lorenzo, M., Kel ly, J., Pattar in i, D . , Majam, G . ,

Th ibau lt, P., and Logan, S . (2006) Changes in flagel l in g lycosylation affect Campylobacter

autoagglutination and viru lence . Mol Microbio/ 60, 299-3 1 1

1 4 . Schirm, M . , Soo, E . C . , Aubry, A. J . , Austin, J . , Th ibau lt, P., and Logan, S. M. (2003)

Structural, genetic and functional characterization of the flagel l in g lycosylation process in

Helicobacter pylori. Mol Microbio/ 48, 1 579-1 592

1 5 . Arora, S. K., Neely, A. N . , B la i r, B ., Lory, S., and Ramphal, R. (2005) Role of motil ity

and flagel l in g lycosylation in the pathogenesis of Pseudomonas aeruginosa burn wound

infections. Infect lmmun 73, 4395-4398

30

Page 32: MPIKG Public Access

1 6 . Tanner, A. C . , Haffer, C ., Brattha l l , G . T., Visconti, R. A . , and Socransky, S. S. ( 1 979)

A study of the bacteria associated with advancing periodontitis in man . J Clin Periodontal 6,

278-307

1 7 . Paster, B . J., Boches, S . K . , Galvin, J . L., Ericson, R. E ., Lau, C . N . , Levanos, V. A . ,

Sahasrabudhe, A . , and Dewh i rst, F . E . (200 1 ) Bacteria l d iversity in h uman subgingival

p laque . J Bacteriol 1 83 , 3770-3783

1 8 . Faveri, M . , Mayer, M. P., Feres, M . , de Figueiredo, L. C ., Dewh irst, F . E ., and Paster,

B . J. (2008) Microbiological d iversity of genera l ized aggressive periodontitis by 1 6S rRNA

clonal analysis. Oral Microbiol lmmunol 23, 1 1 2-1 1 8

1 9 . Drescher, J., Schlafer, S . , Schaud inn, C . , Riep, B . , Neumann , K . , Friedmann, A. ,

Petrich, A . , Gobel, U . B ., and Moter, A. (20 1 0) Molecu lar epidemiology and spatia l

d istribution of Selenomonas spp. in subging ival b iofi lms. Eur J Oral Sci 1 1 8 , 466-4 7 4

20. Hajishengal l is , G . (20 1 5) Periodontitis: from microbial immune subversion to systemic

inflammation . Nat Rev lmmunol 1 5 , 30-44

2 1 . Schaffer, C . , and Messner, P. (20 1 7) Emerging facets of prokaryotic g lycosylation .

FEMS Microbial Rev 4 1 , 49-9 1

22. Ritch ie, M. A . , G i l l , A. C . , Deery, M. J . , and Li l ley, K. (2002) Precursor ion scanning

for detection and structural characterization of heterogeneous g lycopeptide mixtures. J Am

Soc Mass Spectrom 1 3 , 1 065-1 077

23. Hinneburg , H . , Stavenhagen, K . , Schweiger-Hufnagel, U . , Pengel ley, S . , Jabs, W. ,

Seeberger, P. H . , S i lva, D . V . , Wuhrer, M . , and Kolarich, D . (201 6) The Art of Destruction:

Optimizing Col l ision Energ ies in Quadrupole-Time of Fl ight (Q-TOF) Instruments for

G lycopeptide-Based Glycoproteomics. J Am Soc Mass Spectrom 27, 507-51 9

24 . Hal im, A., N i lsson, J . , Ruetschi , U . , Hesse, C . , and Larson , G . (20 1 2) Human urinary

g lycoproteomics; attachment site specific analysis of N- and 0- l inked glycosylations by C I D

a n d E C D . Mol Cell Proteomics 1 1 , M 1 1 1 0 1 3649

25. Zauner, G . , Hoffmann , M., Rapp, E . , Koeleman, C. A., Dragan, 1 . , Deelder, A. M.,

Wuhrer, M., and Hensbergen, P. J. (20 1 2) Glycoproteomic analysis of human fibrinogen

reveals novel reg ions of 0-g lycosylation. J Proteome Res 1 1 , 5804-58 1 4

26. Wuhrer, M., Deelder, A. M., and van der Burgt, Y . E . (201 1 ) Mass spectrometric

g lycan rearrangements. Mass Spectrom Rev 30, 664-680

27. Montie, T. C . , Craven, R. C . , and Holder, I. A. ( 1 982) Flagel lar preparations from

Pseudomonas aeruginosa: isolation and characterization . Infect lmmun 35, 281 -288

28. Laemml i, U . K. ( 1 970) Cleavage of structural prote ins duri ng the assembly of the

head of bacteriophage T4. Nature 227, 680-685

3 1

Page 33: MPIKG Public Access

29. Doerner, K. C ., and White, B . A. ( 1 990) Detection of g lycoproteins separated by

nondenaturing polyacrylamide gel electrophores is using the period ic acid-Schiff sta in . Anal

Biochem 1 87 , 1 47- 1 50

30. Cheng, H. R . , and Jiang , N . (2006) Extremely rapid extraction of DNA from bacteria

and yeasts . Biotechnol Lett 28, 55-59

3 1 . Kolarich, D . , Jensen, P. H., Altmann, F . , and Packer, N . H . (20 1 2 ) Determination of

s ite-specific g lycan heterogeneity on g lycoproteins . Nat Protoc 7, 1 285-1 298

32. Jensen , P. H . , Karlsson, N. G ., Kolarich, D ., and Packer, N. H. (20 1 2) Structural

analysis of N- and 0-glycans released from g lycoproteins . Nat Protoc 7, 1 299- 1 3 1 0

33. Domagalski, M . J., Alocci , D ., Almeida, A . , Kolarich , D ., and Lisacek, F . (20 1 7)

PepSweetener: a web-based tool to support manual annotation of intact g lycopeptide MS

spectra . Proteomics Clin Appl in revision

34 . Anumu la , K. R. ( 1 994) Quantitative determination of monosaccharides i n

g lycoprote ins by h igh-performance l iquid chromatography with highly sensitive fluorescence

detection . Anal Biochem 220, 275-283

35. Windwarder, M., Fig l , R., Svehla, E ., Mocsai , R . T. , Farcet, J. B . , Staudacher, E . ,

Kosma, P . , and Altmann , F . (201 6) "Hypermethylation" of anthran i l ic acid-labeled sugars

confers the se lectivity required for l iquid chromatography-mass spectrometry. Anal Biochem

5 1 4, 24-3 1

36. Hitchcock, A. M., Costel lo, C . E . , and Zaia, J. (2006) Glycoform quantification of

chondroitin/dermatan su lfate using a l iquid chromatography-tandem mass spectrometry

p latform . Biochemistry 45, 2350-2361

37. Vizcaino, J . A., Deutsch, E . W., Wang, R . , Csordas, A. , Reisi nger, F . , Rios, D . ,

Dianes, J . A . , Sun, Z . , Farrah , T . , Bandeira, N . , B inz, P. A., Xenarios, 1 . , Eisenacher, M . ,

Mayer, G ., Gatto, L., Campos, A . , Chalkley, R . J . , Kraus, H . J., Albar, J . P . , Martinez­

Bartolome, S., Apwei ler, R . , Omenn, G . S . , Martens, L., Jones, A. R., and Hermjakob, H.

(20 1 4) ProteomeXchange provides global ly coord inated proteomics data submission and

d issemination. Nat Biotechnol 32, 223-226

38. Ceroni, A., Maass, K., Geyer, H., Geyer, R . , Del l , A . , and Haslam, S. M. (2008)

GlycoWorkbench: a tool for the computer-assisted annotation of mass spectra of g lycans. J

Proteome Res 7 , 1 650- 1 659

39. Scott, N . E . , Parker, B . L., Connol ly, A. M . , Paulech, J . , Edwards, A. V., Crossett, B .,

Falconer, L., Kolarich, D . , Djordjevic, S. P., Hojrup , P., Packer, N . H., Larsen, M. R., and

Cardwel l , S . J. (20 1 1 ) S imu ltaneous g lycan-peptide characterization using hydrophi l ic

interaction chromatography and paral lel fragmentation by CID, h igher energy col l is ional

d issociation, and electron transfer dissociation MS appl ied to the N- l inked glycoproteome of

Campylobacter jejun i . Mol Cell Proteomics 1 0, M00003 1 -MCP000201

32

Page 34: MPIKG Public Access

40. lwashkiw, J. A . , Vozza, N . F . , Kinsel la, R. L., and Feldman, M. F. (20 1 3) Pour some

sugar on it: the expanding world of bacterial protein 0-l inked glycosylation . Mol Microbiol 89,

1 4-28

41 . Maki , M . , and Renkonen, R. (2004) Biosynthesis of 6-deoxyhexose g lycans i n

bacteria . Glycobiology 1 4 , 1 R-1 5R

42. Belyakov, A. Y . , Burygin, G . L . , Arbatsky, N . P . , Shashkov, A. S . , Sel ivanov, N . Y . ,

Matora, L. Y . , Kn i re l , Y . A., and Shchyogolev, S. Y. (20 1 2) Identification of an 0-l i nked

repetitive g lycan chain of the polar flagel lum flagel l in of Azospiril/um brasilense Sp7.

Carbohydr Res 36 1 , 1 27- 1 32

43 . Takeuchi , K., Ono, H . , Yosh ida, M., Ish i i , T . , Katoh, E . , Taguchi , F . , Miki, R., Murata,

K . , Kaku, H., and l ch inose, Y. (2007) Flagel l i n g lycans from two pathovars of Pseudomonas

syringae conta in rhamnose in D and L configurations in different ratios and mod ified 4-

amino-4,6-dideoxyglucose. J Bacteriol 1 89, 6945-6956

44 . Twine, S. M . , Reid , C. W., Aubry, A . , McMul l i n , D. R., Fulton , K. M . , Austin, J., and

Logan, S . M. (2009) Moti l ity and flagel lar g lycosylation in Clostridium difficile. J Bacteriol 1 9 1 ,

7050-7062

45. Faulds-Pain, A., Twine, S. M., Vinog radov, E . , Strong , P. C ., Del l , A . , Buckley, A. M . ,

Douce, G . R., Val iente, E . , Logan, S . M. , and Wren, B . W. (20 1 4) The post-translational

modification of the Clostridium difficile f lage l l in affects moti l ity, ce l l surface properties and

virulence. Mol Microbiol 94, 272-289

46. BrOi l , L. P., Heerma, W. , Thomas-Oates, J., Haverkamp, J., Kovacik, V . , and Kovac,

P. ( 1 997) Loss of internal 1 - 6 substituted monosaccharide res idues from underivatized

and per-0-methylated trisaccharides. Journal of the American Society for Mass Spectrometry

8, 43-49

47. Zaia, J . (2004) Mass spectrometry of ol igosaccharides. Mass Spectrom Rev 23, 1 6 1 -

227

48. Bru l l , L . P., Kovacik, V., Thomas-Oates, J . E . , Heerma, W., and Haverkamp, J. ( 1 998)

Sod ium-cation ized ol igosaccharides do not appear to undergo 'i nternal res idue loss'

rearrangement processes on tandem mass spectrometry. Rapid Commun Mass Spectrom

1 2 , 1 520- 1 532

49. Samatey, F . A., lmada, K . , Vonderviszt, F . , Sh i rakihara, Y., and Namba, K. (2000)

Crystal l ization of the F4 1 fragment of flagel l in and data collection from extremely th in

crystals. J Struct Biol 1 32 , 1 06-1 1 1

50. Beatson, S . A., Minamino, T . , and Pallen, M. J . (2006) Variation in bacterial flagel l ins :

from sequence to structure. Trends Microbiol 1 4 , 1 5 1 - 1 55

33

Page 35: MPIKG Public Access

51 . Andersen-N issen , E . , Smith, K. D . , Strobe, K. L . , Barrett, S. L . , Cookson, B . T . ,

Logan, S. M . , a n d Aderem, A . (2005) Evasion of Tol l- l ike receptor 5 by flagel lated bacteria.

Proc Nat/ Acad Sci U S A 1 02 , 9247-9252

52. Hayash i , F ., Smith, K. D . , Ozinsky, A . , Hawn, T. R . , Yi, E. C . , Goodlett, D. R . , Eng, J .

K . , Akira, S., U nderh i l l, D . M . , and Aderem, A. (200 1 ) The innate immune response to

bacteria l flagel l in is mediated by Toll-l ike receptor 5. Nature 4 1 0 , 1 099- 1 1 03

53 . Smith, K. D ., Andersen-N issen , E . , Hayash i, F . , Strobe, K., Bergman, M. A . , Barrett,

S . L., Cookson , B . T., and Aderem, A. (2003) Tol l- l ike receptor 5 recogn izes a conserved s ite

on flagel l in required for protofi lament formation and bacterial moti l ity. Nat lmmunol 4, 1 24 7-

1 253

54 . de Zoete, M. R . , Keestra, A. M . , Wagenaar, J . A . , and van Putten , J. P. (201 0)

Reconstitution of a functional Tol l - l ike receptor 5 bind ing s ite in Campylobacter jejuni

flagel l i n . J Bioi Chern 285, 1 2 1 49-1 2 1 58

55. Zampronio, C . G ., B lackwel l , G . , Penn , C . W. , and Cooper, H . J. (20 1 1 ) Novel

g lycosylation sites local ized in Campylobacter jejuni flagel l in F laA by l iquid chromatography

electron captu re d issociation tandem mass spectrometry. J Proteome Res 1 0 , 1 238-1 245

56. Kurn iyati, K . , Kelly, J. F . , Vinogradov, E . , Robotham, A . , Tu, Y., Wang, J . , L iu , J . ,

Logan, S. M. , and Li , C . (201 7) A novel g lycan modifies the flage l lar fi lament proteins of the

oral bacterium Treponema denticola . Mol Microbiol 1 03, 67-85

57. Lux, R . , Mi l ler, J. N . , Park, N . H . , and Sh i , W. (2001 ) Moti l i ty and chemotaxis in tissue

penetration of oral epithelial cel l layers by Treponema denticola. Infect lmmun 69, 6276-6283

58. Kumar, P. S . , Griffen , A. L . , Barton , J. A., Paster, B. J., Moeschberger, M. L . , and

Leys, E . J. (2003) New bacterial species associated with chronic periodontitis. J Dent Res

82, 338-344

59. Ebersole, J. L . , Holt, S. C . , Hansard, R . , and Novak, M. J . (2008) Microbiolog ic and

immunolog ic characteristics of periodontal d isease in Hispanic americans with type 2

diabetes. J Periodontol 79, 637-646

60. Kolenbrander, P. E ., Andersen, R. N . , and Moore, L. V. ( 1 989) Coaggregation of

Fusobacterium nucleatum, Selenomonas flueggei , Selenomonas infel ix, Selenomonas noxia,

and Selenomonas sputigena with strains from 1 1 genera of oral bacteria . Infect lmmun 57,

3 1 94-3203

61 . Kingsley, V. V . , and Hoeniger, J. F . ( 1 973) Growth, structure, and classification of

Selenomonas. Bacterial Rev 37, 479-52 1

62. Ricke, S . C . , Martin, S. A . , and N isbet, D . J . ( 1 996) Ecology, metabol ism, and

genetics of ruminal selenomonads. Grit Rev Microbiol 22, 27-56

63 . Varki, A., Cummings, R. D . , Aebi, M . , Packer, N. H . , Seeberger, P. H., Esko, J. D.,

Stan ley, P., Hart, G . , Darvi l l , A . , Kinoshita, T . , Prestegard, J . J . , Schnaar, R. L . , Freeze, H .

34

Page 36: MPIKG Public Access

H., Marth, J . D ., Bertozzi, C. R., Etzler, M. E ., Frank, M., Vliegenthart, J. F . , Lutteke, T . ,

Perez, S., Bolton, E ., Rudd, P . , Pau lson, J . , Kanehisa, M. , Toukach , P . , Aoki-Ki nosh ita, K . F . ,

Del l, A . , Narimatsu, H . , York, W. , Taniguchi , N . , and Kornfeld, S . (201 5) Symbol

Nomenclatu re for Graph ical Representations of Glycans. Glycobiology 25, 1 323-1 324

64 . Zhang, R . , Sioma, C. S., Wang, S . , and Regnier, F . E. (200 1 ) Fractionation of

isotopica l ly labeled peptides i n quantitative proteomics. Anal Chern 73, 51 42-51 49

35

Page 37: MPIKG Public Access

ACKNOWLEDGEMENTS

Funding-This work was supported by the Austrian Science Fund FWF, projects P26836-B22

(to C .S . ) and the Ph D Programme "B iomolecu lar Technology of Prote ins" W1 224. F .S,

P.H.S. and O . K. gratefu l ly acknowledge generous funding by the Max Planck Society. O .K.

acknowledges support by the European Union (Seventh Framework Programme

"Giycoproteomics", g rant number PCIG09-GA-201 1 -293847 and IBD-BIOM project, grant

number 305479) . OK is the recipient of an Austral ian Research Counci l Future Fel lowsh ip

(project number FT1 60 1 00344) funded by the Austral ian Government.

AUTHOR CONTRIBUTIONS

C . B . R. and F .S . designed and performed the substantive majority of the experiments and

wrote the manuscript. R .F . performed the monosaccharide ana lyses. P.H.S . ed ited the

manuscript. O .K . and C.S . supervised the project and provided overa l l gu idance on study

design, execution and manuscript preparation and editing . All authors d iscussed the results

and commented on the paper.

COMPETING FINANCIAL INTERESTS

The authors declare that they have no competing f inancial interests .

36

Page 38: MPIKG Public Access

TABLES

Table 1 : 0-glycans re leased from tryptic peptides of S. sputigena C9L Y 1 4 flage l l in using reductive 13-e l imination . Released 0-g lycans were

detected using PGC nanoLC-ESI MSMS. RT, retention time in minutes; a l l mass values are in Dalton; calc. m/z, theoretical m/z value . Structures

of 0-g lycans in bold letters were determined based on MSMS spectra in negative ion mode. For a l l others, no product ion spectrum was

generated, assignment based on MS 1 level only.

RT m/z calc.

l:!.m/z Composition z m/z

24.7 457 . 1 6 1 457. 1 9 -0 .03 Rha3 28.3 749.34 1 749.31 0.03 Rha5 39 .9 763 .34 1 763 .32 0 .02 Rha5 38.6 791 .36 1 791 .32 0 .04 Rha5 24 .5 660 .30 1 660.27 0 .03 Rha3 GlcNAc 26 . 1 660 .30 1 660.27 0 .03 Rha3 GlcNAc 29.5 660.30 1 660.27 0.03 Rha3 G lcNAc

37 .9 702 . 3 1 1 702.28 0 .03 Rha3 GlcNAc

Modification

37

+ 1 4 Da +42 Da

+42 Da

Relative abundance [%]

7 .6

59.0

0 .9

0 .5

3 .8

8 .4

1 9.3

0 .5

Page 39: MPIKG Public Access

Table 2 : Summary of a l l 22 un ique g lycopeptide species identified from S. sputigena flage l l i n C9L Y 1 4 . Identified g lycosylation sites are ind icated i n

bold, red letters . RT, retention time in minutes; a l l mass values are in Dalton . * Indicates g lycopeptides where the s ite of g lycosylation cou ld not be

unambiguously identified .

RT m/z z calc. m/z !l.m/z

23.2 785.42 2 785.36 -0.06

23.4 842.48 2 842 .39 -0.09

25.8 956.02 2 956 .04 0 .02

25. 1 1 1 64 .09 2 1 1 64.08 -0 .0 1

25.4 1 1 7 1 .08 2 1 1 7 1 .07 -0 .0 1

25.3 1 3 1 0 . 1 2 2 1 3 1 0 . 1 3 0 .0 1

1 6 . 1 58 1 .79 2 581 .77 -0.02

1 7 .0 56 1 .28 3 561 .23 -0.05

1 7 .4 780.84 2 780.87 0 .03

1 7 .7 787.84 2 787.87 0 .03

1 8 .8 80 1 .9 1 2 80 1 .92 0 .0 1

1 8 .9 947.92 2 947.93 0 .0 1

1 9 .2 780.88 2 780.87 -0 .0 1

20 . 1 80 1 .86 2 80 1 .94 0 .08

20.5 947.92 2 947.95 0 .03

20.9 742 .43 3 742 .40 -0.03

24.3 842 .94 2 842 .93 -0 .0 1

24.5 982 .05 2 981 .99 -0.06

24.7 835.97 2 835 .93 -0.04

24.9 856.89 2 856.98 0 .09

24.9 1 003.08 2 1 002.99 -0.09

25.3 724.35 2 724 .34 -0 .0 1

Peptide 191VNQGKTETTSF201 179N I HSQDTITVSY1 90 179NIHSQDTITVSY1 90* 326AQALGLQGSDGSTLNISTR344 326AQALGLQGSDGSTLNISTR344 326AQALGLQGSDGSTLNISTR344* 1 96TETTSFK202 19 1VNQGKTETTSF201 19 1VNQGKTETTSF201 19 1VNQGKTETTSF201 * 19 1VNQGKTETTSF201 19 1VNQGKTETTSF201 142NQKVTATTTTF152* 142NQKVTATTTTF152 142NQKVTATTTTF152 136LVDGTHNQKVTATTTTF1 52* 256 AAGTGVGKSVGG ITF270 256 AAGTGVGKSVGG ITF270

256 AAGTGVGKSVGG ITF270

256 AAGTGVGKSVGG ITF270

256 AAGTGVGKSVGG ITF270 245TASGESAISL Y255*

38

Glycan Modification

Rha2 + 1 4 Da +52 . 1 Da Rha2 + 1 4 Da Rha2 + 1 4 Da +227.3 Da Rha3 Rha3 + 1 4 Da Rha5 Rha GlcNAc G lcNAc +267 . 1 Da Rha GlcNAc Rha GlcNAc + 1 4 Da Rha GlcNAc +42 Da Rha3 GlcNAc +42 Da Rha GlcNAc Rha GlcNAc +42 Da Rha3 GlcNAc +42 Da Rha GlcNAc +42 Da Rha GlcNAc + 1 4 Da Rha3 GlcNAc Rha GlcNAc Rha GlcNAc +42 Da Rha3 GlcNAc +42 Da Rha GlcNAc

Page 40: MPIKG Public Access

FIGURES

a

35

25

1 5

SDS

35

25

1 5

PAS

35

25

1 5

WB

b

J;- J;-� � �qj � � � kDa�--��

kDa�----� BB - BB

1 00 1 00 70 70 55 55

40

35

25

1 5

SDS

40

35

25

1 5

PAS

Figure 1 1 (a) Comparison of native flagel l in C9L Y 1 4 and E. coli-produced C9L Y 1 4R. SDS-

PAGE (SDS), period ic acid-Sch iff (PAS) sta in ing and Western- immunoblotting (WB) reveal a

clear sh ift in molecular s ize between native flagel l in C9L Y 1 4 from a crude S. sputigena cel l

extract and C9L Y 1 4R ind icati ng post-translational modification of C9L Y14 by g lycosylation .

PageRuler Presta ined Prote in Ladder (Thermo Scientific) was used as marker. (b) Flagel l in

enrichment (FE) from a S. sputigena culture fol lowed by separation by SDS-PAGE (SDS)

and PAS sta in ing (PAS) .

39

Page 41: MPIKG Public Access

1 MALVVKNNMM AVN T LNTLNK NQSALSTSLK QVS SGMKING AVDDASGYAI 5 0

5 1 SERMRVQ I RS LDQDNQNTQN GNSMMKVAEG AVS STVD I LK TLKEKVINAA 1 0 0

1 0 1 NDTNTDDDRL TIQKELNQS I DQ INDNANIT FNGKYLVDGT

vAc ;Me ;Me � _...__-----,

1 5 1 TFTNQSMS TA TVATSAM TGL LDR TGRN LNI HSQDT I TVSY

2 0 1 FKV S T Q T FSE VIKRLSGTGI VGGMA TDVGA TSVIGTNVAG DTVYTASGES 2 5 0

;;Ac ;Me v V.Ac

2 5 1 AI SLYAAGTG VGKSVGGI TF S I TDTQGNIN KSANAVLDAF SETVRAQDNS 3 0 0

r Jrf. 3 0 1 QDNSMNFQI G TRANQAI RVG MTDMRAQALG LQGSDGSTLN I STRDKANAA 3 5 0

l 3 5 1 INVLDNAI SK ALDQQTT I GA VQSRLNYTSQ NLTTASENVQ ASE STIRDAD 4 0 0

4 0 1 MAKAMTNYTK NNVLMQAAQA MLAQANQSSS GVLSLLQ

A Not detected

A Detected as putative glycopeptide

A Detected (sequence confirmed)

A Identified glycosylation site

A Detected as glycopeptide only

T Rhamnose

• N-Acetylglucosamine

Me Methylation

Ac Putative Acetylation

O 52. 1 Da

0 227.3 Da

0 267.1 Da

Figure 2 I Schematic overview of C9L Y 1 4 f lagel l i n primary structure , combin ing data

obta ined from tryptic and chymotryptic (g lyco)peptides (Table 2, Supplementary Table 4) .

Bold amino acids ind icate sequence confirmation on MSMS level, amino acid stretches

conta in ing non-bold letters ind icate assignment as g lycopeptide by characteristic

g lycosylation fragments , but insufficient peptide fragment coverage. F ive sites of

40

Page 42: MPIKG Public Access

glycosylation (Thr1 49, Ser1 82, Thr1 99, Thr259, and Ser334) were unambiguously identified

using C ID and ETD fragmentation (red labeled Ser/Thr residues). In addit ion, Thr1 96 and

Ser200 were found to carry di- and monosaccharides, respectively, with fu rther not yet

defined substitutions of +52 . 1 Da and +267. 1 Da. The putative g lycan structures detected at

those s ites are described using the SNFG nomenclatu re (63). If more than a s ing le structure

was found attached to a g lycopeptide (microheterogeneity), the structures ind icated on top of

a g lycopeptide are sorted accord ing to their mass. The identity of the respective

monosaccharides was determ ined by separate monosaccharide analyses (Figure 4) . Six

g lycopeptides were detected exclusively in their g lycosylated form (underl ined sequence

stretches), whi le the others were also found in a non-g lycosylated form

(macroheterogeneity). Six add itional g lycopeptides, for which no specific g lycosylation s ite

cou ld be assigned, are ind icated by the wide brackets pointing towards the g lycan . These

glycopeptides were assigned based on their precursor mass, oxon ium ions, and ions

representing neutral loss of monosaccharide residues from the g lycopeptide .

41

Page 43: MPIKG Public Access

a lntens.

�--------------------;:::=======::::;-----:j_Mwsi:-, 2!z9i:"i.sirmiiiinln X 1 as lntens . .--------E:-:I""c"""s":":so'""'.2:::7:-:-±o=-.3:-o T Rhamnose

2

1 .0

0.5

b lntens . x 1 06

0.8

0.6

0.4

0.2

y

x106 1 .25 29.5 • N-Acetyi-Giucosamine

1 .00 T' Glycosidic Cleavage

0.75 9J Cross-Ring Cleavage

0.50 26.1 .¢. Reduced Reducing End

0.25 24.� .JA .L 0.00 +-.---.-,......-4-Il,.::�::::;:::;.ojollc����

1 0 20 30 Time [min]

� 368.09

'i � 't

457 . 1 6

204.06 l '1' 35

r2 J 246.07 29�. 1 1

200 250 300 350 400 450 500 550

EIC 749.31±0.3 lntens. x106

4 28.3

3

2

1

0 10 20 30 Time [min]

660.32

y

600 650 m'z -MS, 28.3 min

749.37

51 1 �4 524.00

1 .1 .ll 0 0 +--------------------------����----------------�--��� x 1 0'

5

4

3

2

! 351 . 1 4

,L I ,J;;' " I ! .1. I ��� ,1 /I �09��144 � 437. 1 6 �57. 1 8 551 .22

-MS2(749 .37)

i ·Me 7 1 7.26 ·H,O

! t ·H

,O

/ ),1<; 1 � ...._,___ I l 601 .23 64 1 .23 671 _26 731 .25 24� 1 2 1 " '1_ � il L . � l ..1. J. 0 +-��������������������L-�+-����������-T

200 300 400 500 600 700 m'z

Figure 3 Negative mode C I D product ion spectra of PGC-nanoLC separated Rha3GicNAc

and Rha5 0-g lycans obta ined using reductive 13-e l imination from tryptic peptides of

S. sputigena C9L Y1 4 flage l l in . The monosaccharide identities are assigned based on

separate monosaccharide analyses (Figure 4 ) . The putative g lycan structures are

represented using SNFG nomenclature (63). (a) Three isoforms of the Rha3GicNAc

tetrasaccharide were detected [extracted ion chromatogram (EIC) inset], but product ion

spectra were only obta ined for the tetrasaccharide elut ing at 29.5 min . The m/z 204 oxonium

ion ind icated that the g lycan was attached to the prote in v ia the GlcNAc residue, as th is

value corresponds to the reduced GlcNAc oxonium ion when analyzed as deprotonated

molecu les in negative ion mode. Fragments m/z 246 and 290 ind icated a branched g lycan

structure as these ions were not pred icted for a l inear structure of the same composition [in 42

Page 44: MPIKG Public Access

silica fragmentation using Glycoworkbench 2 (38)] . (b) The Rha5 pentasaccharide was

detected as a s ingle structure (see EIC inset). Based on the product ion spectra a l inear

structure provides the best explanation for this ol igosaccharide. The fragment ions m/z 7 1 7

and 551 corresponded to the loss of water and a methyl g roup (Me).

43

Page 45: MPIKG Public Access

a

b

c

Cll C) c Cll C) rn � 0 ;j u::

X

GleN I

1--J'--------------�� �--� II

I I I I I

Pep n.d I

0-Me-Rha I

X

I I - - - ----- - - - - --- ---- --' L-- - - -�- - ---- - - - - \_ Pep blank � .... _ _ _ -------- -- --- --- - - - - --- ------ _,,_ -- _.,._

8.41 GleN I

GleN �aiN "

x1 08 1 .5

1 .0

� 0.5 'iii c Cll x1 07 .E 6

4

2

0

x106 3

� ·� 2 Cll .E 1

0

1 0 1 5

Standard

Sample

5 1 0 1 5

25,1 1 274.1

250 260 270 280 290

I I

in -gel 26.04 n.d. I

29.77 0-Me-Rha

I

\ _ _ _ _ J!1.::9�!!>��!_1_!( __ -- -- ------------ -·- --- ---·

20

6dTal

20

Me1 dHex 300.1

Standard

Rha MeRha

25

32 1 .2

25

295::J 306.2 31 3.2 1,325 2

300 31 0 320 330

30 Time [min]

Fuc

mlz 388.3 ± 0.05

mlz 388.3 ± 0.05

30 35 Time [min]

340.3

.1�422

340 mlz

Figure 4 1 Identification of monosaccharide constituents of the S. sputigena C9L Y 1 4

flagel l in 0-g lycan . (a) HPLC with fl uorescence detection of anthran i l ic acid (AA)-Iabeled

monosaccharides revealed four major residues. The C9L Y1 4 flagel l in 0-g lycan

monosaccharide constituents were derived from tryptic peptides (Pep, g reen traces) and in-

gel a lka l ine g lycan re lease ( in-gel, red traces). Empty SDS-PAGE gel p ieces (blank) were

used as controls. The identity of Rha ( 1 7 . 96 min) and 0-Me-Rha (29.77 min) was determined

by hypermethylation (see parts b and c). X refers to non-glycan related matrix peaks. (b)

Capi l lary RP-LC-ESI-MS of CD31 hypermethylated AA-Iabeled monosaccharides.

Comparison with appropriate standards (top base peak chromatogram) a l lowed identif ication

44

Page 46: MPIKG Public Access

of Rha and methylated Rha as the deoxyhexoses present on S. sputigena C9L Y 1 4 flagel l in

(bottom base peak chromatogram). Retention t ime sh ifts were due to the deuterium effect

(64) . (c) MS spectrum of a fraction col lected after HPLC of AA-Iabeled monosaccharides.

The m/z of the detected signal corresponded to the singly methylated, AA-Iabeled version of

Rha.

45

Page 47: MPIKG Public Access

a lntens. x 1 08

6

5

4

3

2

1 5 .0

1

1 7.5 20.0 22.5

--

EIC 780.88±0. 1 0 T Rhamnose --

EIC 787.85±0. 1 0 • N-Acetyi-Giucosamine T' Glycosidic Cleavage

Ac Putative Acetylation

--

EIC 801 .86±0 . 10 * Precursors of h igher

charge states Me Methylation

; 1 191VNQGKTETTSf2°1 [M+2Hj2• = 780.84 ;Me 2 191VNQGKTETTSF201 [M+2Hj2• = 787.84 ;Ac 3 191VNQGKTETTSF201 [M+2Hj2• = 801 .88

; 4 142NQKVTATTTTF152 [M+2Hj2• = 780.88 ;Ac 5 142N QKVTATTTTF152 [M+2H]2• = 801 .86

25.0 27.5 30.0 32.5 35.0 Time [min] b lntens . 1 x 1 07 ' Pep

+MS2(780.84 ), 1 7.4 min 3

2

Pep 2+

606.29 Pip

2+ 707.83 ; Pep

1 2 1 1 .71

191VNQGKTETTSf2°1 ,! I 2+ T 2031 .96 ! 67

.

9

�.:3f3 Pep 35oTo9 1 357.78

0 ��--��----�����----------------����� �� 5 l: +MS2(787.84), 1 7 .7min

x1os 2 'I' 4 Pep Pep

3

2+ 2+ 606.29 'fMe 707.85 Pep ;Me

Pep 1 2 1 1 .73

2 2+

191VNQGKTETTSF201

1 32 1 .78

,! !Me 686

11

. 3l8 "' PI�• 203l.98

1 371 .81 3541. 1 1

I 0 ��----��--�--._������--�--�----------�--������ x 1 os 3 +MS2(801 .88), 1 8.8 min

6 Pep 1 70.94

: Iii' 0 J 11

203.94

r 392. 1 0

200 400

' Pep 'fAc Pep 2+ Pep 2+ 606·28 2+ 707. 84 ..1700.3jl

;Ac 191VNQGKTETTSF201

600 800 1 000

1 2 1 1 .72 ' Pep 1 4 1 4 .78

'fAc Pep

1 399.75 '

1 200 1 400 m'z

Figure 5 1 Example of g lyco-microheterogeneity shown for five selected g lycopeptides with

a peptide backbone mass difference of just 0 .036 Da C91VNQGKTETISF201 ,

M = 1 2 1 0 .583 Da and 142NQKVTA TTTTF152 - ' M = 1 2 1 0 .61 9 Da). (a) Extracted ion

chromatograms (EIC) showing three m/z values reflecting the RhaGicNAc d isaccharide and

its mod ifications (methylation and putative acetylation) attached to the two peptide

46

Page 48: MPIKG Public Access

backbones. Whi le the non-mod ified d isaccharide is the dominant form on g lycopeptide 1 9 1 -

20 1 , the later el uting g lycopeptide 1 42-1 52 shows h igher levels of putative acetylation . A

single peak was detected for the methylated version of g lycopeptide 1 9 1 -20 1 , but none for

g lycopeptide 1 42-1 52 . Both g lycopeptides exh ibited three peaks for the putative acetylated

g lycoform , possib ly ind icating that different hydroxyl g roups are modified on the rhamnose

(see also (b)) in a site-specific manner. (b) Representative C I D product ion spectra for the

th ree glycoforms of g lycopeptide 19 1VNQGKTETISF201 . The oxonium ions clearly indicate

that methylation (364 . 1 Da) and putative acetylation (392 . 1 Da) are exclusively occurring on

the non-reducing end rhamnose. The native d isaccharide and its methylated form exhib ited a

strong pseudo Y1 -g lycopeptide signal reflecting the peptide plus one Rha that is best

explained by a Rha rearrangement (signals at 679 .332+/1 357.78+ and 686 .382+/1 371 .8 1 + ,

respectively). This is a lmost enti rely abolished when the Rha residue is modified by putative

acetylation (bottom spectrum). Red letters ind icate sites of g lycosylation unambiguously

identified in separate ETD product ion spectra (see also Figure 6 and Supplementary

Fig u re 5).

47

Page 49: MPIKG Public Access

a lntens. x 1 07

3

2

0 b x 1 06 8

6

4

2

CID • Pep

+MS2(780 .84), 1 7 .4 min

'T Rhamnose 2+

707.83 • N-Acetyi-Giucosamine Pep

Pep 1 2 1 1 . 7 1

1' Glycosidic Cleavage 2+ 606.29 ;

! 2�r ETD

lntens. x 1 07

4

! 35l09

2+ 551 .80 2+

780.84

'T Pep 2+

679.33

; Pep

MS 2+ 780.86

191VNQGKTETTSP01

[M+2H]2+ = 780.84 +MS2(780 .84), 1 7.4 min

; c10 ; 191 �hNLQ!(i I(ITlEJT1"lSlpo1 1 4 1 2.73

1 �ri 82

; s;11.�1 z1 1 T ) 1 543.81

(c-1)7 (z+1 )7 I (z+1)10 500 750 1 000 1 250 m'z 773.46 1 1 46 57 c9 1 445.72

. 1 325.71 r.:

543.32 644.34 8�5

-1!� 9 1 9 49 1 20

13.58 I '

(c-1)5 (c-1)6 (z+1)8 l � 51 6.74

l l 1 l l .

I . IW I . .... J l .. ll .JI.l o������--��u-�����--��--�--������������ 200 400 600 800 1 000 1 200 1 400 m'z

Figure 6 1 Example of Rha rearrangement occurring on g lycopeptides. (a) U nder low energy

CID conditions this rearrangement resu lts in the formation of a pseudo Y1 -g lycopeptide

signal s imu lating two independent sites of g lycosylation. (b) ETD ana lys is of the very same

precursor ion verified that just a single d isaccharide was attached on one s ite rather than two

monosaccharides on two independent s ites .

48