kite proteins: a superfamily of smc/kleisin partners ... volume 23 supplemental information kite...

5
Structure, Volume 23 Supplemental Information Kite Proteins: a Superfamily of SMC/Kleisin Partners Conserved Across Bacteria, Archaea, and Eukaryotes Jan J. Palecek and Stephan Gruber

Upload: vanngoc

Post on 29-Mar-2018

216 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Kite Proteins: a Superfamily of SMC/Kleisin Partners ... Volume 23 Supplemental Information Kite Proteins: a Superfamily of SMC/Kleisin Partners Conserved Across Bacteria, Archaea,

Structure, Volume 23

Supplemental Information

Kite Proteins: a Superfamily of SMC/Kleisin

Partners Conserved

Across Bacteria, Archaea, and Eukaryotes

Jan J. Palecek and Stephan Gruber

Page 2: Kite Proteins: a Superfamily of SMC/Kleisin Partners ... Volume 23 Supplemental Information Kite Proteins: a Superfamily of SMC/Kleisin Partners Conserved Across Bacteria, Archaea,

 

 

Figure S1, related to Figure 1  Structural alignment of bacterial and human kite WHA and WHB subdomains. 

(A) PDB structures were aligned using FATCAT algorithm. Respective P‐values are given for ScpB structures ranked 

among the top‐five hits (number in brackets denotes the rank). The best ranked result not belonging to the family of 

kite and MAGE proteins are given in the right column. (B) Example of WHA (left panel) and WHB (right panel) domain 

superimposition (human NSE3 compared to Geobacillus stearothermophilus ScpB; 3W6JB). 

 

Page 3: Kite Proteins: a Superfamily of SMC/Kleisin Partners ... Volume 23 Supplemental Information Kite Proteins: a Superfamily of SMC/Kleisin Partners Conserved Across Bacteria, Archaea,

WHA 1 2 3 1 2 extension ScpB---HHHHHHHHHHHHHHH-------------HHHHHHHHH---------------HHHHHHHHHHHHHHHH---------SSSSSS--------SSSSSS-----HHHHHHHHHHH S.pn---MSTLAKIEALLFVAG---EDGIR-----VRQLAELLS----LP---------PTGIQQSLGKLAQKYE---KDPDSSLALIET-SG-----AYRLVT-KPQ-FAEILKEYSKA G.st---KPAKAIVEALLFAAG---DEGLS-----LSQIAAVLE----VS---------ELEAKAVIEELQQDCR---REERG-IQLVEL-GG-----VFLLAT-KKE-HAPYLKKLVEA C.te-QRQQLLRSLEALIFSS----EEPVN-----LQTLSQITA--HKFT---------PSELQEAVDELNRDYEATGRT----FRIHAI-AG-----GYRFLT-EPE-FADLVRQLLAP M.tu--ADELKRVLEALLLVI----DTPVT-----ADALAAATE----QP---------VYRVAAKLQLMADELTG--RDSG--IDLRHT-SE-----GWRMYT-RAR-FAPYVEKLLLD B.su---VNWKAIVEALLYAAG---DEGLT-----KKQLLTVLE----IE---------EPELNTIMADVADEYRGDTRG----IELIEY-AD-----TYMLST-KKD-FAPYLKKLIEV M.ge---ANLVAAIYGLLFVSG---EKGLT-----LAELNRVLR---KVG---------LEKIKAALVQLERKLSL--DDESG-IEIKKF-GH-----SFRLVT-KME-IKDFIHRYLPN F.nu---MSIKNQVEAIIFLGG----DENK-----IKDLAKFFK----IS---------IEDMLKILLELKDD-----RKDMG-INIEID-SE-----IVYLST-NPL-YGEIINNYFEQ P.ae---HELATLLEGILLAAG----KPLS-----LERLAELFD---EAERPE------PGQFRDALAILALS--CAGRS----FELKEV-AS-----GYRLQI-RER-FSPWVGRLWEE T.pa---APDLALLEAILFVEG----VRLS-----YACLARKLG----LS---------EQAVGECVARLGEALASGARGGGG-LELHCN-EQ-----GVALLP-AAT-VRERLATLYGK T.ma---MQLKAAIEALIFASN-----GIT-----LERLIKILE----KD---------PEEIKRALEELKKEYED-EAHG---VVLREV-NG-----RYRFFT-KPE-YAGFVSKLSGR P.ma---ISLPAKLEAVLYLKG----KPLS-----LSEMAELVN----ET---------EDITEQALFELMAGYSQ--RDTA--LEINEK-KG-----KYSLQL-KTG-LGELVKNLLPV T.el--LRTLTMRVEAILYLKA----QPLT-----LTELATLAG----TD---------REAIELALIELLNDYAH--RQTA--LEIVQI-DD-----KYSLQL-KSA-FQELVQSLVPV C.ag---VTLLQLIEAVLFVAG----EPVT-----LEQLARVLE----VS---------PEQIEAAIEELSASYAQ--RG----IRLQRH-GD-----QLMLVS-APE-AAPVIRRFLGS A.fu---MELKKIVEAILFSSS----EPVD-----ARELRKITG----KD---------KVEILNAIGELIKDYES--RDTS--IEIIKV-GE-----KYLMRV-KPQ-YAEYVERFTVR P.ab---LEDKALVEAALFVAG----RPLS-----VKELSKALG----IKS--------LDYLEKLIELIASEYSE--RKSA--IEIVKVAGD-----KWVMQV-KQE-YSQKVIHLMPK T.vo---MDDETKVEAILYATR----NPLS-----VRSISLILG----IE---------AGAISRIIKKLRLEYKK--RNTS--LEIAKI-GN-----KYRIQL-KKE-YYDFAYRVMEP Nse1----HHHHHHHHHHHHH------------HHHHHHHHHHHHH----------------HHHHHHHHHHHHHHH------SSSSS---------SSSSS-------HHHH S.c.----ATAKYLLQYILSA-----RGICHE-NALILALMRLETD-ASTLNTEWS------IQQWVDKLNDYINAI-VKLx-DYAVLQSIVLPESNRFFVYVNLASTEETKL S.p.----DKHKFILQYIMCR------TAGV--DNEQVRELVQEQY-GETAT----------VEDVINELNNSLHNF------DFKIKRVQDQLDGRLTLHFQNLSGDPVSQM D.d.----NAQHRLLLDQFTK-----RRIIS---TETLKKLVMTVNRITQVNIS--------LDDYINSLNSKIMEV---GL-QIKILN--TDGIN--DYILTNLKPDECARY M.o.----NLHRGFLQAMMAR-----GSMTLE-EAQPILSSLHNAE-KSVGAAGIx------LEAVLSMIREAISPL------DYDIKKHRHQTTKEEVWAFININSDLSTQL D.r.----DGHRLFLQNMMTN-----GIVSAA-QAGMLHKKCCELH---GGQEK--------IDDFINVVNTHLQPL------FMHIRKGMSEEDGQEHFVLVNMAETDITRM X.t.----ESHQRFLQVLMSH-----GIMESS-LVRALHRHCCEVH-KVNYMHDN-------LDDFVGVLNKHLQPL------FMKIEKGVGEEDGLTYYALVNRVENDITKM G.g.----DAHRRFLQVLMSH-----GIMEGA-EARKLHRCCCEIS-KAYYAQDK-------LDDFVSTINNQLQPL------FMQIRKGMSEVDGRTHYALVNMAETEITKM O.a.----DAHRRFLQLLMSH-----GIMEGS-EARKLHRHCCEKH-KVYYAHDK-------LDDFIGIINSLLQPL------FMEIRKGMSEEEGKPYYALVNLTETEITKM M.d.–---DSHRQFLQVLMSN-----GIIDAP-EARRVHRLFCEQH-KVYYAHEK-------LDEFVGVINTHLHPL------FMEIRKGRSEDNGKMFYALVNLAITEATKL D.n.----DVHRRFLQLLMTH-----GVLEEC-DVKRLQKHCYKVH-DCNATVEK-------LEDFINNINSVLESL------YIEIKKGVTEDDGRPIYALVNLATTSVSRM M.m.----DVHRRFLQLLMTH-----GVLEEW-EVRRLQNHCYQVH-DRNATVDK-------LEDFINNINSVLESL------YIEIKKGVTEDDGRPIYALVNLATTSVSKM H.s.----DVHRRFLQLLMTH-----GVLEEW-DVKRLQTHCYKVH-DRSATVDK-------LEDFINNINSVLESL------YIEIKRGVTEDDGRPIYALVNLATTSISKM Nse3-HHHHHHHHHHHHHHHHHH-----------HHHHHHHHHH--------------HHHHHHHHHHHHHHHH----------SSSSS----------SSSSS----HHHHH S.c.-KENPVARKMVRYILSRGESQNSIIT----RNKLQSVIHE---AAREENIAKPSFSKMFMDINAILYNVY------G---FELQG-----xQK--FILL----xTDRDL S.p.-NFQLLVRNVVRYAICSQT-SHNTIT----RKDIVQKAFP---EGTSRNL----FQSVFEEADRQLQLSF------G---FRLVA-----xHR--YWVLR---xKDSRL D.d.-ERYKLVYEYVRLLLFSNR-KKVPIT----KTEINKIILA---RFKDKSL----QGFVYKAGREYLKEFF------G---YEVVE-----xST--YILK----xQLDSI M.o.---TQLIKKLVRYALACEY-SRTPIR----REGIRDKVLG----AHGRS-----FRHVFDGAQKQLRAVF------G---MEMVE-----xNT--YILVT---xRTAAI D.r.-QIDHKVAEVVQFILIKDQ-KKIPIR----RADIGKHVIK---DYKHI------YAEVMNRVCRTFEQVF------G---LKLVE-IDLKQHI--YILIN---xRGQTV X.t.-QINLKVGEVVQYLLIKDQ-KKLPIK----RADIVRNVVK---EYKDI------YPEIFRRAQIALQQVF------G---FQLEE-IDTKSHI--YILTT---xQGDGM G.g.-EINRKVTELVQFLLVKDQ-KKIPIK----RVDILKKVIR---EYKDV------YSEIVNRAGRTLQQVF------G---LQMVE-IDTKHHI--YILTS---xEGENL O.a.–QVDQKVNELVQYLLVKDQ-KKLPIK----RADILRNVIK---EYKGV------SSEIVKGAGQVLEKVF------G---LHLKE-IDQKNHV--YIIVN---xEGDNM M.d.-QIQQKVNELVQFLLVKDQ-KKVPIR----RADMVKTVLQ---DYKDM------ASVIIERAGQTLEEVF------G---LQLTE-IDRKHHA--YILIN---xEGDGM D.n.-QLDLKVGELVQFLLIKDQ-KKIPIK----RTDILKHVIG---DYYKDV-----FPDLLKLAAERLQYVF------G---YRLVK-LEPKHNT--YVLIN---xGDAEM M.m.-QLELKVAELVQFLLIKDQ-KKIPIK----RTDILKHVVG---DYRDV------YPNLLKLAAERLQYVF------G---YKLVE-LEPKSHS--YILIN---xEDAEM H.s.-QLELKVSELVQFLLIKDQ-KKIPIK----RADILKHVIG---DYKDI------FPDLFKRAAERLQYVF------G---YKLVE-LEPKSNT--YILIN---xEDAEM MAGE-HHHHHHHHHHHHHHHHHHH----------HHHHHHHH----------------HHHHHHHHHHHHHHHH----------SSSSSS--------SSSSSS------HHH A3--AALSRKVAELVHFLLLKYRA-REPVT----KAEMLGSV---VGNWQYF------FPVIFSKASSSLQLVF------G---IELMEV-DPIGH--LYIFAT-----CLGL A4---ALSNKVDELAHFLLRKYRA-KELVT----KAEMLERV---IKNYKRC------FPVIFGKASESLKMIF------G---IDVKEV-DPASN--TYTLVT-----CLGL B18--PLNKKVVSLVHFLLQKYET-KEPIT----KGDMIKFV---IRKDKCH------FNEILKRASEHMELAL------G---VDLKEV-DPIRH--YYAFFS-----KLDL C2--YTLDEKVAELVEFLLLKYEA-EEPVT----EAEMLMIV----IKYKDY------FPVILKRAREFMELLF------G---LALIEV-GPDHF--CVFANT------VGL D1--ALLQERANKLVKYLMLKDYT-KVPIK----RSEMLRDI---IREYTDV------YPEIIERACFVLEKKF------G---IQLKEIDKEEHL--YILIST--PESLAGI E2a--PLEDRSIALVNFMRMKSQT-EGSIQ----QSEMLEFL----REYSDQ------FPEILRRASAHLDQVF------G---LNLRVIDPQADT---YNLVS---KRGFQI E2b--TMNDKANDLVQLAISVTEE-MLPIH----QDELLAHT---GKEFEDV------FPNILNRATLILDMFY------G---LSLIEVDTSEHI---YLLVQ---xEQVML F1---RLNRTVAELVQFLLVKDKK-KSPIT----RSEMVKYV---IGDLKIL------FPDIIARAAEHLRYVF------G---FELKQFDRKHHT---YILIN-----KLKP L2---PLDERANALVQFLLVKDQA-KVPVQ----RSEMVKVI---LREYKDE------CLDIINRANNKLECAF------G---YQLKEIDTKNHA---YIIIN-----KLGY Ndn--QLVQKAHELMWYVLVKDQK-KMIIW----FPDMVKDV---IGSYKKW------CRSILRRTSLILARVF------G---LHLRLTSLHTME---FALVK-----ALEP MukE--HHHHHHHHHHHHHHHHH-------------HHHHHHHHH---------------HHHHHHHHH---------------SSSSS----------SSSSS-----LOOP- E.co--QALANPLFPALDSALRS--GRHIGLDE---LDNHAFLMD---FQ----------EYLEEFYAR----------YN---VELIR-APEGF----FYLRP----RSTTLI H.du--IAIANPIFPQLDSQLRA--GRHISIEM---LDEHAFLMD---FQ----------TELESFYRR----------YH---VDLIR-APEGF----FYLRP----KASTLI V.ch--KAIANPLFPALDSLLRA--GRHVSSDD---LDNHAFLSD---FE----------PDLALFYQR----------YH---TELVR-APEGF----FYLRP----RSTSLI T.au--QAIANPLFPKLDTALRS--GKHISADD---LDSHSYLLD---YH----------DELETFYNR----------YQ---VELIK-APEGF----FYLRP----RSTSEI Oce---QAIANPLFPALDNQLRS--GRHITADE---LEQHSLLQE---YY----------SELDAFYQR----------YQ---AELVR-APEGF----YYLRP----RSTSEL Y.pe--QALANTLFPALDSQLRA--GRHIGIDE---LDNHAFLMD---FQ----------EQLEEFYAR----------YN---VELIR-APEGF----FYLRP----RSTTLI H.in--VAIANPIFPAVDSLLRS--GRHISTEH---LDNHAFLMD---FQ----------NELDGFYRR----------YN---VELIR-APEGF----FYLRP----KATTLI A.sa--EAIANPLFPRIDTALRS--GRHISADD---FEQHSALVE---YH----------NELEIFYGR----------YQ---VELIK-APEGF----FYLRP----RPSADI P.mi--QALANSLFPELDSQLRA--GRHIGIDS---LDNHAFLMD---FQ----------DELTDFYAR----------YN---VELIR-APEGF----FYLRP----RSTTLI S.ce--AAIADEHFPEVDLMLRR--GRHIGRDD---GTAYDYLAD---AQ----------AILEGFYRR----------FG---CELVQ-QSDGY----FYLLP----SGERLG MksE------HHHHHHHHHHH-------------HHHHHHH--------------HHHHHHHHHHHHHHH--------------SSSSS----------SSSSS-----HHHH Bac-------MEVVINYLFSH-----NFL-----LKEFQRE------KYQL----AVRNKDIIKRYLKVI----------G---WDFLV--DEKHGC--IVIASPHYEHRLKL Eub-------RKTIQDLLRQT---CILQMKCDP-VTLIQRD----NPRYQV----CLRNREFISDYLAVL----------D---CELVH--DQQEHL--FRIT----GDGVML Geo-------LRRAASIALDR----QFLFGDKSRDQRSFHQ--------------ILDAEDYYRNLFDAL----------N---LELIC--DRTAGY--VGVV--PRESHLTV Pse-------APIFRELFKGY------HISHR--DPELYTQ--------------LSSHQDQYRGLFRAM----------G---FELVC--DTRGF---YYFV--PEQVGAQV Aci-------AELAGRLLASG---VVWREHSRP-EAALYDD--------------AIQCEQLLREWFACI----------G---FVLVH--DSDARL--LRLY---PPGEGGG Cor-------RKALVQLLKGP---MVNALQ----HVEVWRA--------------ITTDQDALNAVLNNL----------F---LELVL--DEDAG---VAFT----xQEVLV Pho-------RRVLVSLLRQG-----VILSSQ--KAKLFEL--------------LCRYQSAVRKHLSEV----------Y---LRLVL--DEKAG---VAFI-------AGF

Figure S2, related to Figure 2, Part 1 Structure based sequence alignment of kite WHA domains. See "part 2" for abbreviations of species names.

Page 4: Kite Proteins: a Superfamily of SMC/Kleisin Partners ... Volume 23 Supplemental Information Kite Proteins: a Superfamily of SMC/Kleisin Partners Conserved Across Bacteria, Archaea,

WHB 1 2 3 1 2 4 ScpB-------HHHHHHHHHHHHHHH--------HHHHHHHH-------------------HHHHHHHHHHH------SSSSSS-----------SSSSS---------HHHHHHHH- S.pn-------SRAALETLSIIAYKQ---PIT--RIEIDAIR----GVN------------SSGALAKLQAF---DL-IKEDGK-KEVLGRP---NLYVT--T------DYFLDYMG G.st------LSQAALETLAIIAYRQ---PIT--RAEIEEIR----GVK------------SDKPLQTLMAR---AL-IKEVGR-AEGTGRP---ILYGT--T------PEFLDYFG C.te--IQRRLSRSMLEVLAVVAWHQ---PVT--KGEIQQIR----GAS------------PDYSIDRLLAR---GL-IEVRGR-ADSPGRP---LQYGT--T------EVFLDLFH M.tu---RTKLTRAALETLAVVAYRQ---PVT--RARVSAVR----GVN------------VDAVMRTLLAR---GL-ITEVGT-DADTGA----VTFAT--T------ELFLERLG B.su--------QASLEVLAIVSYKQ---PIT--RAEIEEIR----GVK------------SERILHSLVAK---AL-LCEVGR-ADGPGRA---ILYGT--T------PTFLEQFG M.ge--------SKTMEVLAIIAYNQ---PCT--RPRINEIR----GAD------------SFQIVDDLLEK---EL-IVELGR-KDTPGRP---FIYEV--S------PKFYDLFG F.nu--------SASIETLSIIAYKQ---PIT--KSEIESIR----GVS------------VDRIISNLEER---KF-VRNCGK-QETGRRA---NLYEV--T------SKFLSYLG P.ae--------RALLETLVLIAYRQ---PIT--RGEIEEIR----GVAV-----------NTQIVKTLMER---EW-IRIVGY-REVPGRP---AMLAT--T------KAFLDYFN T.pa--------RAAMETLSIVAYAQ---PVT--RAEIEAIR----GVGA------------DTMIRLLSER---RL-ICEVGK-KDIPGKP---AQYGT--T------EEFLTAFR T.ma--------DTQMEVVALLLISG---PIP--KSEIDAFR----GKDS------------SAVLSSLQRM---GI-VRKKRK-----GKS---YLYQL--S------PSFVESTM P.ma--------GATLRTLGTIALKK---RIL--QSELVDLR----GSSA------------YEHIKDLVEK---DF-VERKRQ---REGRS---YWLTL--S------EKFHRTFS T.el--------VAAQRTLALIALRG---PIR--QPEVIALR----GANA------------YQHIQELLTL---GF-IRRRRD---SQSRS---YILQV--T------ERFHQYFQ C.ag--------HAALETLAIIAYRQ---PIT--RAQIEAIR----GVDS------------SAALRALLAR---DL-ICEVGR-LETLGRP---ILYAT--T------PMFLQQFG A.fu--------RGTLRTLAVIALKQ---PIT--LAKVAKIR----GNKC------------YEHVKKLQER---GL-VKAEKK-----GRS---TILTT--T------EEFATYFG P.ab--------AGELKTLALIAYLQ---PVE--QSKIVKLR----GSQA------------YEHIKRLLEM---GL-IYAEPY-----ERT---KLLGT--T------EKFAELYG T.vo--------KYETGFLATVALNE---GAS--LSFFRKRY----GSRT------------DDMISKLKTM---SL-IRTSKK-----GNGT--AIYLG---------ENFEKVFG Nse1----HHHHHHHHHHHHHHHHHH--------HHHHHHHHH---------------HHHHHHHHHHHHHH------SSSS------------SSSS-------HHHHHHHHHHHHHHH S.c.----NQNEIEFMKWAIEQFMISG-xIVKEVNRILVAAT---xTNLFQFQELT--ATDIEDLLLRLCEL---KW-FYRTQ-----EG----KFGID------LRCIAELEEYLTSMY S.p.----PPVQIELMRKIIEWIMKCDDYQYSLTTLQIQKLS-----RKEMGLAP----SVIESHLHTFERD---GW-LRQR------EG----IWTFT------NHALAELDAYLHNEY D.d.----SGDELKFFKLILKMFIESR-VGLK--KNDILTLG-----RDELKIKL----SDADNLFRKFAED---GW-LRLSA-----SK----SFTTL-----TNRALSDMAPLLD--- M.o.----TADEMSFIKRLLDAIFDTY-xLMCITADQARKLS---xQSATDKGLK---HSEVDALMASLTEE---GW-LEKSA-----AG----FYSLA------PRALLELWSWMVESY D.r.----AENELELFRKIMDLIVESDSGSAS--STAILNSAD----KLISKKLK---KKEAELVLNKFVQD---KW-LKEQ------DG----EYTLS------VRCIVEMEPYMRTIY X.t.----AENELELFRKTMELIIISENGFAP--SISILNLAD----ELQSKKMK---KKEVEQLLQSFVQD---KW-LIGR------NG----EYTLH------TRCIMELEHYILNTY G.g.----AENELELFRKTMDLIILSENGFAS--STDILNSAD----QLKTKKMK---KKEAEQVLKIFVDD---KW-LSER------NG----EYTLH------TRCIMEMEQYILSTY O.a.----AENELELFKKTMDLIIISENGFAP--SMSILNLSD----QLQTKKMK---KKEVEQLLHNFVRD---KW-LSER------GG----EYTLH------TRCIMEMEQYIRHSY M.d.----AENELELFKKTMDLIVESESGYVS--STSILNLSD----KLQSKKMK---KKEVEQVLQMFVQD---KW-LSEK------QG----DYTLH------TRCIMELDQYICEMY D.n.----AENELDLFRKALDLIIDSETGFAS--STNILNLVD----QLKGKKMR---KKEAEHVLQKFVQN---KW-LIKE------EG----EFTLH------SRAILEMEQYIRETY M.m.----AENELDLFRKALELIVDSETGFAS--STNILNLVD----QLKGKKMR---KKEAEQVLQKFVQS---KW-LIEK------EG----EFTLH------GRAILEMEQFIRESY H.s.----AENELDLFRKALELIIDSETGFAS--STNILNLVD----QLKGKKMR---KKEAEQVLQKFVQN---KW-LIEK------EG----EFTLH------GRAILEMEQYIRETY Nse3----HHHHHHHHHHHHHHHHHH-------HHHHHHHHHHH----------------HHHHHHHHHHHH------SSSSSS---------SSSSS----------HHHHHHHHHHHH S.c.--------GVLSVILCIVFFSK--NNIL-HQELIKFLETF-GIPSDGSKIAILNI-TIEDLIKSLEKR---EY-IVRLEEKSDTDGEV-ISYRI--GRRx----LESLEKLVQEIM S.p.--------GFLMTVIAFIAVSH--CSVG-HSELQSFLQEL---LTEEETTPLHLD--ITRSLSLLVRQ---GY-LDRVK---DDTHNQ-FVYYI--GSRx----IEGLKSFVTEFF D.d.--------TLLTIILSIIFLEN--GHVE-SPQLLQFLSVL---GFSQNEPHPVYGD-LEKLLEKFCRE---QY-LTRRKN--VVDNQ--IIWVYEMGQRx----KRFILNSISDIY M.o.--------GLYSMIVTIIQLNR--GELS-DPKLKRYLQRL---NAETNTPVEK----TDLLLQRLIRQ---NY-IVKTVERNAQGDDDAITWRV--GPRx----DEAMASIVRDVY D.r.----NPKMGLLFVILSVIFMK--GGTIK-ENLVWNTLKKL--RLDPGEKHDEFGD-VKKVVTEEFVRQ---KY-LEYGKI-PHTEPVE-YEFRW--GLRx----KLKLLEFVGELF X.t.----TSKLGLLMVILSLIFMK--GNTAK-ESAVWEMLRRL--RIEPAEKHSDFGD-VKKLITEEFVKQ---KY-LEYSKV-LHTDPVE-YEFRW--GQRx----KMQVLEFVSKIQ G.g.----TAKLGLLIVILSFIFMK--GNSAK-DSAVWEFLRRL--RVHPGEKHEVFGD-VKKLVMEEFVRQ---KY-LEITPI-PLTDPPE-FNFQW--GPRx----KKDILSFVAKMQ O.a.----TAKMGLLMVILSLIFMK---GSATNESVIWETLRKL--RVDTRERHEVFGD-VKKLVTEEFVRQ---KY-LEYNRI-PHTEPVE-FEFQW--GARx----KMQVLNFVAKGP M.d.----VAKMGLLMVILSLIFMK--GNSAR-ESLVWDVLKKL--RVDPEKRHKTFGD-VKKLVKDEFVRQ---KY-LEYIRV-PHSEPPE-YEFLW--GPRx----KMQVLRFVAKIQ D.n.----QPTTGLLMIILGLIFMK--GNCIK-ESELWRFLRRL--GVYPTKKHLVFGD-PKKLITGEFVRQ---RY-LKYQRL-PHTDPVD-YELEW--GPRx----KMKALKFVAKIH M.m.----TPISGLLMIVLGLIFMK--GNTIT-ETEVWDFLRRL--GVYPTKKHLIFGD-PKKLITEDFVRQ---RY-LEYRRI-PHTDPVD-YELQW--GPRx----KMKVLKFVAKVH H.s.----TPTTGLLMIVLGLIFMK--GNTIK-ETEAWDFLRRL--GVYPTKKHLIFGD-PKKLITEDFVRQ---RY-LEYRRI-PHTDPVD-YEFQW--GPRx----KMKVLKFVAKVH MAGE-------HHHHHHHHHHHHHHH-------HHHHHHHHHH-----------------HHHHHHHHHHHH------SSSSSS---------SSSSS----------HHHHHHHHHHHH A3--------KAGLLIIVLAIIARE--GDCAP-EEKIWEELSVL--EVFEGREDSILGD-PKKLLTQHFVQE---NY-LEYRQV-PGSDPAC-YEFLW--GPRx----YVKVLHHMVKIS A4--------KTGLLIIVLGTIAME--GDSAS-EEEIWEELGVM--GVYDGREHTVYGE-PRKLLTQDWVQE---NY-LEYRQV-PGSNPAR-YEFLW--GPRx----YVKVLEHVVRVN B18-------KTGLLMIALGVIFLN--GNRAP-EEAVWEIMNMM--GVYADRKHFLYGD-PRKVMTKDLVQL---KY-LEYQQV-PNSDPPR-YEFLW--GPRx----KMKVLEFVAKIH C2--------ENSLLIIILSVIFIK--GNCAS-EEVIWEVLNAV--GVYAGREHFVYGE-PRELLTKVWVQG---HY-LEYREV-PHSSPPY-YEFLW--GPRx----KKKVLEFLAKLN D1--------KLGLLLVILGVIFMN--GNRAS-EAVLWEALRKM--GLRPGVRHPLLGD-LRKLLTYEFVKQ---KY-LDYRRV-PNSNPPE-YEFLW--GLRx----KMKVLRFIAEVQ E2a-------KASLLALVLGHILLN--GNRAR-EASIWDLLLKV---xKPQRINNLFGN-TRNLLTTDFVCM---RF-LEYWPV-YGTNPLE-FEFLW--GSRx----KMEALKFVSDAH E2b-------TQEYVMPILGLIFLM--GNRVK-EANVWNLLRRF-----SVDVGRKHSI-TRKLMRQRYLEC---RP-LSYSN--PVE-----YELLW--GPRx----KMKVLEYMARLY F1--------RLGLLMMILGLIYMR--GNSAR-EAQVWEMLRRL--GVQPSKYHFLFGY-PKRLIMEDFVQQ---RY-LSYRRV-PHTNPPE-YEFSW--GPRx----KMEVLGFVAKLH H1----------SLLMSILALIFIM--GNSAK-EALVWKVLGKL--GMQPGRQHSIFGD-PKKIVTEEFVRR---GY-LIYKPV-PRSSPVE-YEFFW--GPRx----KLKVMHFVARVR L2--------KFGLLMVVLSLIFMK--GNCVR-EDLIFNFLFKL--GLDVRETNGLFGN-TKKLITEVFVRQ---KY-LEYRRI-PYTEPAE-YEFLW--GPRx----KMLVLRFLAKLH Ndn-------MTGLLLMILSLIYVK--GRGAR-ESAVWNVLRIL--GLRPWKKHSTFGD-PRKLITEEFVQM---NY-LKYQRV-PYVEPPE-YEFFW--GSRx----KMQIMEFLARVF MukE--------HHHHHHHHHHHHHHH-------HHHHHHHH------------HHHHHHHHHHHHHHHHHH------SSSSS-----------SSSSS------------HHHHHHHH- E.co------ELDMMVGKILCYLYLSP-xIFTQ-QELYDELL---xGSDVD---RQKLQEKVRSSLNRLRRL---GM-VWFMG----HDSS---KFRIT--ES--------VFRFGADV H.du------EMEMLVGKVLCYLYLSP-xIFSQ-DDVYEELL---xGSDLD---RAKLAEKVGGALRRLARI---GI-ITRVG---EQNSK---KFIIS--EA--------VFRFGADV V.ch------ELDMLVGKVLCFLYLSP-xIFTN-QELYDELL---xGSDLD---REKLFEKVRTSLRRLRRL---GM-VITIG-----DTA---KFRIT--EA--------VFRFGADV T.au------ELDMLVGKVLCYLYLSP-xIFSL-QDLQEEIV---xGTDLD---KKKLQERIRTSMRRLRRL---GM-VTALG-----TGD---KFRVN--EA--------VFRFAADV Oce-------ELEMLVGKVLCYLYLSP-xVFSV-EDLQEEIL---xGSDLD---KRKLADKLKSAIRRLKRM---GM-VSSVG-----SQD---KFRIT--EA--------VFRFAADV Y.pe------ELDMMVGKILCYLYLSP-xIFSQ-QELYEELL---xGSDLD---KQKLQEKVRTSLNRLRRL---GM-IYFMG----NDST---KFRIT--EA--------VFRFGADV H.in------ELEMLVGKVLCYLYLSP-xIFST-QEVYDELL---xGSDLD---KQKLAEKVRAAIGRLRRL---GM-IQTVG---EQNSG---KFTIS--ES--------VFRFGAEV A.sa------ELDMLVGKVLCYLFLSP-xVFAM-GELQEEVL---xGTDLD---KKKLLEKIRTSMRRLRRL---GM-ITALG-----NSD---KFRVN--ES--------VFRFAADV P.mi------EMDMLVGKILCYLYLSP-xIFTV-QELFDELR---xGSDLD---LQKLQEKMRTSLNRLRRL---GM-ISFLP----NDTQ---RFSIT--ES--------VFRFGADV S.ce------AGEMLVGQTLALLYLDP-xLVAR-EALLQRLS---xDERVA---AETVRAQVGEALRRLADL---GF-VDLLD-----EA----RLRLR--PA--------LMRFAEPV MksE------HHHHHHHHHHHHHHH--------HHHHHHHHHH------------------HHHHHHHHHHH------SSSSS-----------SSSSS----------HHHHHHHHH Bac-------KDETIWLLVLRLIYE--xPFTT-LQEIKGKYET----FRLTFVS-------KTKLRELVQMGKQNQL-LRPID--NDIELDDC-RFQLF----------HSCIHVLQQ Eub-------LLTARIVIIMKIIYR---xTTN-LAEIREYGRN--TNLITRKLT-------NQEWSDALLLMKTHQM-IELPG-AIANLEDNTPIYIYG---------TVNIFCSAMD Geo-------TEHSLFLLVLRVIYE---xFTD-SEVMLDTFVA--HTGRKRPG--------LVRLREILRTFSRQGL-LEIDE---DEDKAI--RFRIR-----------PSIRDIVT Pse-------RLALFTFILVEHLAD---xPLL--EKYRDLFLQ------AEVQT-------QEELEEKVMRR-LTQL-GFASE---DSG-----VYRFM---PP-----MHRFLDVCL Aci-------RDFVAAVIALRFLYT---xAIS-LEELSQAVVS--LLAHKLPNAASE----RMVLLRELRKH---RV-LHFVE-GDDAGDMQMGLAVLR---PVMSFVSDEALEEALR Cor-------HFDTLIILILRQELT---xIVD-REEIREQVLL---YRVDEERDEAKL---AKRFDAAFRRI--VDYSLAKKT----ETPE---RFEVS---PALRQ--IFDADTVAG Pho-------LYDTLLLLVLRKHYQ---xIID-IERIESHLTP--FLPLTNSTKSDRRK--LKGALDKMVTK---KI-LSSVR-----GSED--RFEIT---PVIRY—-VVSAEFLES

Streptococcus pneumoniae = S.pn, Geobacillus stearothermophilus = G.st, Chlorobium tepidum = C.te, Mycobacterium tuberculosis = M.tu, Bacillus subtilis = B.su, Mycoplasma genitalium = M.ge, Fusobacterium_nucleatum = F.nu, Pseudomonas_aeruginosa = P.ae, Treponema_palladium = T.pa, Thermotoga_maritima = T.ma, Prochlorococcus_marinus = P.ma, Thermosynechococcus_elongatus = T.el, Chloroflexus_aggregans = C.ag, Archaeroglobus_fulgidus =A.fu, Pyrococcus_abyssi = P.ab, Thermoplasma_volcanium = T.vo

Escherichia coli = E.co, Haemophilus ducreyi = H.du, Vibrio_cholerae = V.ch, Tolumonas_auensis = T.au, Oceanimonas GK1 = Oce, Yersinia_pestis = Y.pe, Haemophilus_influenzae =H.in, Aeromonas_salmonidiae = A.sa, Proteus_mirabilis = P.mi, Sorangium_cellulosum = S.ce,

Schizosaccharomyces pombe = S.p., Saccharomyces cerevisiae = S.c., Dictyostelium discoideum = D.d., Magnaporthe oryzea = M.o., Danio rerio = D.r., Xenopus tropicalis = X.t., Gallus gallus = G.g., Ornithorhynchus anatinus = O.a., Monodelphis domesticus = M.d., Dasypus novemcinctus = D.n., Mus muscullus =M.m., Homo sapiens = H.s.

Bacillus_cereus_B4264_Type2 = Bac, Eubacterium_rectale_ATCC33656_Type3 = Eub, Geobacter_metallireducens_GS-15_Type4 = Geo, Pseudomonas_aeruginosa_PAO1_Type1 = Pse, Acidovorax_delafieldii_2AN_Type7 = Aci, Corynebacterium_glutamicum_ATCC13032_Type5, = Cor, Photorhabdus_luminescencs_laumondii_TT01_Type6 = Pho

ScpB, MukE, NSE1, NSE3 alignments are based on crystal structures, MksE structure was predicted using I-TASSER, hydrophobic pattern from crystal structures (highlighted hydrophobic residues are mostly intramolecular keeping the WH fold) have been used as master pattern for alignment of the other sequences (based only on secondary structure)

Figure S2, related to Figure 2, Part 2 Structure based sequence alignment of kite WHB domains.

Page 5: Kite Proteins: a Superfamily of SMC/Kleisin Partners ... Volume 23 Supplemental Information Kite Proteins: a Superfamily of SMC/Kleisin Partners Conserved Across Bacteria, Archaea,

 

Figure S3, related to Figure 3  Comparison of kleisin/kite and kleisin/heat interactions. 

Cartoon representation of (A) a ScpAB  (PDB: 3W6K), (B) a MukEF (PDB: 3EUH) and (C) a Scc1/Scc3 (PDB: 4PJU) sub‐complex illustrating extensive interactions of kite and heat proteins with an extended kleisin‐peptide.