supplementary information · expressing evd: epi12 (f9) and epi35 (f9) previously described in...
TRANSCRIPT
Reconstructing de novo silencing of an active plant retrotransposon.
Arturo Marí-Ordóñez, Antonin Marchais, Mathilde Etcheverry, Antoine Martin, Vincent Colot and Olivier Voinnet.
SUPPLEMENTARY INFORMATION:
Supplementary Figure 1: EVD vs ATR expression and LTR methylation.
Supplementary Table 1: Figure 1 and Supplementary Figure 1 bisulfite sequencing raw data.
Supplementary Figure 2: Arabidopsis gynoecium and ovule development.
Supplementary Figure 3: EVD in situ hybridization in WT and two additional epiRILs.
Supplementary Figure 4: Suppression of silencing assay in N. benthamiana.
Supplementary Figure 5: GAG RNA binding domain conservation and EVD polyprotein product
production in EVD constructs.
Supplementary Figure 6: Mapping of 21-22-nt siRNAs to EVD in epi15 F14 and EVD copy
number in epi454 and 438 F8.
Supplementary Figure 7: Bisulfite sequencing dot-plots of EVD LTR in epi15 generations and
EVD copy number and DNA methylation in epi439 generations.
Supplementary Table 2: Figure 4 and Supplementary Figure 7 bisulfite sequencing raw data.
Supplementary Figure 8: Correlation of LTR 24-nt siRNAs abundance with EVD copy number
and 21-24-nt siRNAs mapping correlation on EVD.
Supplementary Table 3: Figure 5 bisulfite sequencing raw data.
Supplementary Figure 9: Bisulfite sequencing dot-plots of RPP4 Solo-LTR in epi15 generations.
Supplementary Table 4: Figure 6 and Supplementary Figure 9 bisulfite sequencing raw data.
Supplementary Figure 10: EVD life cycle in Arabidopsis.
Supplementary Table 5: List of primers.
Nature Genetics: doi:10.1038/ng.2703
(bp)
-578
-448 -415
-163
ATR (AT1G34967)
EVD (AT5G17125)
1026 bp
448 bp
448 bp 578 bp
415 bp 163 bp BglII BglII
BglII
RT+
WT met1 epi15 F8F11O6(Chr1)
F2K13(Chr5) gDNA
WT
epi1
5 F8
m
et1
EVD (AT5G17125) 5’LTR ATR (AT1G34967) 5’LTR
0% 10% 20% 30% 40% 50% 60% 70% 80% 90%
100%
WT met1 epi15 F8 WT met1 epi15 F8
EVD ATR
% m
ethy
late
d cy
tosi
nes
CG CHG CHH
a b
c
Supplementary Figure 1
Supplementary Figure 1: EVD vs ATR expression and LTR methylation.
(a) Restriction fragment length polymorphism (RFLP) analysis of RT-PCR products
shown in Figure 1a. genomic DNA (gDNA), and PCR products obtained from F11O6
BAC (comprising chromosome 1 ATR, AT1G34967, copy) and F2K13 BAC (compris-
ing chromosome 5 EVD, AT5G17125, copy), were digested with BglII to determine
whether transcripts originated from EVD or ATR. (b) Bisulfite sequencing-based meth-
ylation analysis at the EVD and ATR 5’LTR in WT, met1 and epi15 F8. The 5’LTR of
AT5G17125 (EVD) and AT1G34967 (ATR) was specifically amplified using a primer
inside the LTR and a specific primer outside of the LTR 5’ end. Cytosine methylation
levels are given at CG, CHG and CHH sites. Error bars represent the 95% confidence
limits given by the Wilson score interval (see Supplementary Table 1 for details). (c)
Dot-plots depiction of bisulfite sequencing data obtained for individual clones used to
generate the results shown in Figure 1b, Supplementary Figure 1b and Supplementary
Table 1. Filled circles indicate methylated cytosines and empty circles unmethylated
cytosines.
CHH CHG CG
WT
epi1
5 F8
m
et1
Nature Genetics: doi:10.1038/ng.2703
L2
L1
MMC
MMC Sync FM
b
ad
medial domain: fused lateral margins
niamod l ar et al
cmm ab
p p
a
Supplementary Figure 2: Arabidopsis gynoecium and ovule development.
(a) Scheme of a cross-section of the Arabidopsis gynoecium. The gynoecium is formed
by the congenital fusion of two carpels (modified foliar organs that bear the ovules)
through their lateral margins (cmm, yellow). Placental tissues (p, pale yellow), from
which ovules arise (see b), differentiate from the cmm at the intersections of lateral and
medial domains21-23. ab, Abaxial lateral domain (green); ad, adaxial lateral domain
(blue); abaxial medial domain (orange). Adapted from ref. 23. (b) Ovule development
in Arabidopsis. Ovule primordia emerge from the placentas, where a somatic cell from
the sub-epidermal L2 cell layer (in red) differentiates into the megaspore mother cell
(MMC) that undergoes meiosis, forming a syncytium (sync) of haploid nuclei, of which
three degenerate, leaving a single haploid product: the female megaspore (FM). The
FM then gives rise to the female gametophyte. Adapted from ref. 24.
Supplementary Figure 2
Nature Genetics: doi:10.1038/ng.2703
epi15 WTa
3
1 2
5
7
9
4
6
8
10
WT epi15 epi12 epi35 b
21
15141312
111098
75
43
6
WT non transf. WT pLTR:GUS-GFPc
Supplementary Figure 3
Supplementary Figure 3: EVD in situ hybridization in WT and two additional
epiRILs.
(a) In-situ hybridization study of EVD transcript accumulation, using an antisense
probe, in developing embryos of WT and epi15 F10. Longitudinal sections of embryos
at globular- (1-2), heart- (3-4), mid-torpedo- (5-6), late-torpedo- (7-8) and mature
green-stage (9-10). (b) In-situ hybridization study of EVD transcript accumulation,
using an antisense probe, in various tissues of WT, epi15 F10 and two other epiRILs
expressing EVD: epi12 (F9) and epi35 (F9) previously described in ref.18. Longitudinal
section of the SAM region from a young Arabidopsis rosette (1-4). Longitudinal sec-
tions of flower bud (5-7). Gynoecium transversal sections at floral developmental stage
8 (8-11). Longitudinal sections of ovule primordia at floral developmental stage 12
(12-15). (c) Longitudinal section of plant rosettes at the SAM region of non-transgenic
WT and pLTR:GUS-GFP transgenic WT plants, following GUS-staining. Bars = 50 ȝP�
for (a) 1-4, (b) 12-15; 100 ȝP�IRU�(a) 5-10, (b) 1-7 and (c); 10 ȝP�IRU�(b) 8-11.
Nature Genetics: doi:10.1038/ng.2703
DCL4DCL2
dsRNA
21nt-siRNA
(A)n
AGO1
RDR6
AGO1
GFP GFP + P19
DCL4DCL2
dsRNA
21nt-siRNA
AGO1
RDR6
P19
DCL4DCL2
dsRNA
21nt-siRNA
AGO1
RDR6
GFP + P19m
(A)n (A)n
AGO1
P19m
@EVD
GFP + P19 GFP + P19m GFP + EVD 4 dpi 8 dpi 4 dpi 8 dpi 4 dpi 8 dpi
@GFP
@U6
#�·*$*�
@GFP
EtBr
mR
NA
siR
NA
s
a b
-4.2kb
-21nt -24nt
-21nt -24nt
-0.7kb
Supplementary Figure 4
Supplementary Figure 4: Suppression of silencing assay in N. benthamiana.
(a) Suppression of silencing assay by Agrobacterium-mediated transient expression in
N. benthamiana leaves. Agrobacterium tumefaciens, infiltrated into plant leaves medi-
ates transfer of transgenes into plant cells, allowing ectopic expression of the genes of
interests, but it rapidly decreases 3-4 days post infiltration (dpi) due the onset of
PTGS30. This feature of the system has been routinely used to characterize proteins
with VSR activity by co-infiltrating the protein of interest with a GFP reporter transgene
acting both as a trigger and a target of PTGS30 . In the scheme, when infiltrated alone,
GFP protein levels (visualized under UV illumination) are reduced due to the action of
PTGS; however co-infiltration with the VSR P19, prevents the onset of PTGS in the
infiltrated area allowing significantly higher levels of GFP accumulation. P19 acts as a
head-to-tail homodimer that specifically binds 21-nt small RNA duplexes, preventing
their loading into AGO. This property is abolished in the point mutant P19m, which
dimerizes but is unable to bind small RNAs. (b) Analysis of p35S:EVD transient
expression in N. benthamiana. The potential suppression of silencing activity of EVD
was tested by co-infiltration of EVD constructs (Figure 2g) with the p35S:GFP reporter
construct. p35S:GFP co-infiltrated with p35S:P19 or its mutant version p35S:P19m
were used as controls for positive and negative suppression activity, respectively.
Northern analysis of High-molecular weight (mRNA) and low-molecular weight RNA
(siRNA) was carried out on samples extracted from infiltrated leaves at 4 and 8 dpi.
Ethidium bromide (EtBr) staining of ribosomal RNA and detection of U6 are shown as
loading controls.
Nature Genetics: doi:10.1038/ng.2703
MOCK
HA-EVD-FLAG
HA-EVD¨R
BD -FLAG
MOCK
HA-EVD-FLAG
HA-EVD¨R
BD -FLAG
@HA @FLAG
-35 kDa
-55 kDa
Coom. Coom.
c35S GAG PR IN RT-RNase
35S GAG PR IN RT-RNase
...YKNEDRKLLTCDHCKKKGHTKDKCWLLHPHLKPA...
...YKNEDRKLLT-----------------------------WLLHPHLKPA...
HA-EVD-FLAG
HA-EVD6RBD-FLAG
HA
HA
FLAG
FLAG
b
130 140 150 160 170 180 190 200 210 220 230 240 250 260 270 305 315 325 374 384 394 404
MPMV/1-275SRV-1/1-275SERV/1-274JSRV/1-280SMRV-H/1-285MMTV/1-280LPDV/1-269RSV/1-298HERV-K10/1-287HIV-2/1-243SIVMAC/1-245HIV-1/1-244SIVAGM/1-248SIVMND/1-253EIAV/1-242FIV/1-227VMV/1-227SA-OMVV/1-226CAEV/1-228BIV/1-240STcLV2PP1664/1-213HTLV-2/1-213HTLV-1/1-213KoRV/1-251GALV/1-252MdEV/1-252PERV-MSL/1-255BAEVM/1-241RCHO-K1/1-237MuLV/1-250FeLV/1-241REV/1-227HERV-E/1-235RTVL-Ia/1-228WdSV/1-209Cereba/1-255CRM/1-261Galadriel/1-235Monkey/1-159Legolas/1-269Tma/1-118Retrosat-2/1-273pCretro3/1-221pCretro6/1-229Pyggy/1-234Real/1-233Cgret/1-271Pyret/1-248Skippy/1-245Maggy/1-241Dane-1/1-218Cer5/1-245Cer6/1-262Cer4/1-250Peabody/1-273Beetle1/1-240Cft-1/1-259Sushi-ichi/1-253marY1/1-249Reina/1-236Gloin/1-227Ifg7/1-159Micropia/1-240Blastopia/1-226MGLR3/1-246Grasshopper/1-279Cigr-1/1-257Mag/1-214SURL/1-245Osvaldo/1-267Cer1/1-222RIRE2/1-236Ogre/1-290Tat4-1/1-249Tft2/1-239Ty3-1/1-255Circe/1-277Skipper/1-230Athila4-1/1-229Cyclops-2/1-234SIRE1-4/1-314Mdg3/1-227Endovir1-1/1-299Oryco1-1/1-279Poco/1-257Batata/1-237Tto1/1-236V12/1-239RTvr2/1-233Tnt-1/1-238Tork4/1-231Xanthias/1-2351731/1-230Tricopia/1-242Yokozuna/1-207Copia/1-238Hopscotch/1-276Retro!t/1-281Koala/1-286Hydra1-1/1-227Hydra1-2/1-237CoDi6.3/1-266CoDi6.2/1-287Zeco1/1-231Olco1/1-230Cico1/1-224GalEa1/1-271CoDi6.5/1-296Mtanga/1-211CoDi6.4/1-289Oryco1-2/1-270Humnum/1-235PyRE1G1/1-243Koco/1-241Sto-4/1-262Fourf/1-234Osser/1-257Melmoth/1-256TSI-9/1-297Tse3/1-214Ty4/1-212EVADE/1-265
QQAGNWDF DML T GSGNYS S T DAQMQYDPGL F AQ I QAAAT KAWRKL P VKGDPGAS L T GVKQGPDEP F ADF VHRL I T T AGR I F GSAEAGVDYVKQL AY ENANPACQAA I R - - P Y RKK - - - T DL T GY I RL CSD I GP S YQQGL AMAAAF SGQT VKDF LQQAGNWDF DML T GSGNYS S T DAQMQYDPGL F AQ I QAAAT KAWRKL P VKGDPGAS L T GVKQGPDEP F ADF VHRL I T T AGR I F GSAEAGVDYVKQL AY ENANPACQAA I R - - P Y RKK - - - T DL T GY I RL CSD I GP S YQQGL AMAAAF SGQT VKDF LQQAGNWDF DML T GSGNYANT RAQMQYDPGL F SQ I QAAAT KAWRKL P VKGDPGAS L T AVKQGPNEP F SDF VHRL MT T AGR I F GNAE T GVDY VKQL AY ENANPACQAA I R - - P Y RKK - - - T DL T GY I RL CSD I GP S YQQGL AMT AAF SGQT VRDF LRQQG I T S Y EML I GEGPYQAT DT QL NF L PGAYAQ I SNAARQAWKKL P S S S T KT EDL SKVRQGPDEP YQDF VARL L DT I GK I MSDEKAGMV L AKQL AF ENANSACQAAL R - - P Y RKK - - - GDL SDF I R I CAD I GP S YMQG I AMAAAL QGKS I KE V LRP P PKDGP L K I PGAS P YQNNDKQAQF P PGL L T Q I QSAGL KAWKRL PQKGAAT T S L AK I RQGPDES Y SDF V SRL QE T ADRL F GSGES E S S F VKHL AY ENANPACQSA I R - - P F RQKE - L S T MSP L L WYCS - - - - AHAVGL A I GAAL QNL APAQL LKRKK - V S L DML L GT GQF L S P S SQ I KL SKDV L KDV T T NAV L AWRA I P P PGVKKT V L AGL KQGNEE S Y E T F I S RL E EAV YRMMPRGEGSD I L I KQL AWENANS L CQDL I R - - P I RKT - - - GT I QDY I RACL DAS PAV VQGMAYAAAMRGQKYS T F V- - V T V AV L KG - SDPAMAT PQL QAARMR - GRE I QASCQASV S I CGGRT VGKRT DPWT KV T QGL GEP F L S F AERL L NAY EKSQL P EAAKNAV F RDCVKQQGNML T QQYML GAPS T ENT - - AE L V KY L L KRENEG - - AAQGNAAA I V AAL K - - - - - -E RT NL DRL KGL ADGMVGNPQGQAAL L RPGE L VA I T ASAL QAF RE VARL AE PAGPWAD I T QGPS E S F VDF ANRL I KAV EGSDL P P T ARAP V I I DCF RQKSQPD I QQL I RAAP S T L T T - PGE I I KY V L DRQKT AP L T DQG I AAAMSSA I QP L VMAV- P P V I DADQL L G I GQNWST I SQQAL MQNEA I EQVRA I CL RAWEK I QDPGS T CP S F NT VRQGSKE P Y PDF VARL QDVAQKS I ADEKAGKV I V E L MAY ENANPECQSA I KP L KGKV PAGSDV I S E Y VKACDG I GGAMHKAML MAQA I T GV V L GGQVRE PRGSD I AGT T S T V E EQ I QMF RPQNPV P VGN I Y RRWI Q I GL QKCVRMYN - P T N I L D I KQGPKE P F QS Y VDRF YKS L RAEQT DPAVKNWMT QT L L VQNANPDCKL V L KGL G - - - - M - NP T L E EML T ACQGVGGPGQKARL MAEAL KE V I GPAP IRE P SGSD I AGT T S T V E EQ I QMYRQQNP I P VGN I Y RRWI QL GL QKCVRMYN - P T N I L DVKQGPKE P F QS Y VDRF YKS L RAEQT DPAVKNWMT QT L L I QNANPDCKL V L KGL G - - - - T - NP T L E EML T ACQGVGGPGQKARL MAEAL KEAL APAP IRE PRGSD I AGT T SNL QEQ I GMT N - NP P I P VGD I Y KRWI I L GL NK I V RMYS - P V S I L D I RQGPKE P F RDY VDRF YKT L RAEQAT QEVKNWMT E T L L VQNANPDCKS I L KAL G - - - - T - GAT L E EMMT ACQGVGGPSHKARV L AEAMSQAQHT N I MRDPRGSD I AGT T S S VAEQ I E T F NANPRVDVGR I Y RGWV I L GL QKCVKMYN - P I S V L D I RQGAKE P F KDY VDRF YQAL RAEQT PQDVKNWMT E T L L I QNANPDCKL V L KGL G - - - - I - HP T L E EML T ACQGVGGPGHKAKL MVEAMQQMQGVNMVRT P SGSD I AGT T S T V E EQL AMNMQQNA I NVGT I Y KSWI I L GMNRL VKSHC - P I S I T DVRQGPKEAF KDY VDRF YNVMRAEQASGEVKMWMQQHL L I E NANP ECKQ I L RS L G - - - - K - GAT L E EML EACQGVGGPQHKARL MAEMMRT V VGQSQNPPQGP I PMT ARF I RGL GVPRERQMEAF DQF RQT YRQWI I E AMSEG I KVM I GKPKAQN I RQGAKE P Y P E F VDRL L SQ I KS EGHPQE I S KF L T DT L T I QNANEECRNAMRHL R - - - - P - E DT L E EKMYACRD I GT T KQKMML L AKAL QT GL AGP F KGPRP L P Y F T AAE I MG I GL T QEQQAARF APARMQCRAWY L EAL GKL AA I KAKS PRAVQL RQGAKEDY S S F I DRL F AQ I DQEQNT AE VKL Y L KQS L S I ANANAECKKAMSHL K - - - - P - E S T L E EKL RACQE I GS PGYKMQL L AEAL T KVQVVQSK- - - - - - V L T VDQ I MGVGQT NQQASANMDQARQ I CL QWV I T AL RS VRHMSHRPGNPML VKQKNT E S Y EDF I ARL L E A I DAE P V T DP I KT Y L KV T L S Y T NAS T DCQKQMDRT L GT RVQ - QAT V E EKMQACRDVGSEGF KMQL L AQAL RPQGKAGHK- - - - - - V L T VDQ I MGVGQT NQQASANMDQARQL CL QWV I T AL RS VRHMSHRPGNPML I KQKNSE S Y EDF I ARL L E A I DT E P V T DP I KT Y L KV T L S F T NAS T DCQKQMDRV L GT RVQ - QAS V E EKMQACRDVGSEGF KMQL L AQAL RP PRKEGKQ- - - - - GGL T VDQ I MGVGQT NQAAAANMDQARQ I CL QWV I NAL RAVRHMAHRPGNPML VKQKT NE P Y EDF AARL L EA I DAE P V T QP I KDY L KL T L S Y T NASADCQKQMDRT L GQRVQ - QAS V E EKMQACRDVGSEGF KMQL L AQAL RPGKGKGNG- - - - - - - - - T DE I I GKGL S SAEQAT I S V ECRE T F RQWV L QAAMEVAQAKHAT PGP I N I HQGPKE P Y T DF I NRL VAAL EGMAAPE T T KE Y L L QHL S I DHANEDCQS I L RP L G - - - - P - NT PMEKKL EACRV VGSQKSKMQF L VAAMKEMG I QS P I- - - - MT GYNPMAGP L RMQANNP - - - AQQGL RRE YQNL WL AAF SAL PG - NT RDP SWAA I L QGL E E P YCAF V ERL NVAL DNGL P EGT PKE P I L RS L AY SNANKDCQKL L Q - - ARGHT N - - S P L GDML RACQAWT PKDKARV L V VQPRKP - - - - - - -- - - - MT GYNPMAGP L RMQANNP - - - AQQGL RRE YQNL WL AAF S T L PG - NT RDP SWAA I L QGL E E P YCAF V ERL NVAL DNGL P EGT PKE P I L RS L AY SNANKECQK I L Q - - ARGHT N - - S P L GEML RT CQAWT PKDKT KV L V VQPRKP - - - - - - -- - - - I T GYNP L AGP L RVQANNP - - - QQQGL RRE YQQL WL AAF AAL PG - S AKDP SWAS I L QGL E E P YHAF V ERL N I AL DNGL P EGT PKDP I L RS L AY SNANKECQKL L Q - - ARGHT N - - S P L GDML RACQAWT PKDKT KV L V VQPKKP - - - - - - -- - T QL ENL I NEAF P L NRPQWDHNT AEGRERL L V YRRT L VAGL KGAARRP T NL AKVRE V L QGP T E P P S V F L E RL MEAYRRY T P F DP S S EGQKAAVAMS F I GQSAPD I KKKL QRL EGL - QDHS L QDL I KE AEKV YHKRE T E E EKQEREKKE T E ERE- - T QL ENL I NEAF P L NRPHWDYNT AAGRERL L V YRRT L VAGL KGAARRP T NL AKVRE V L QGPAE P P S V F L E RL MEAYRRY T P F DP S S EGQQAAVAMAF I GQSAPD I KKKL QRL EGL - QDY S L QDL VKEAEKV YHKRE T E E ERQEREKKEAE EKE- - T AL PNL VDEAF P L NRPNWDYNT AEGRGRL L V YRRT L VAGL RGAARRP T NL AKVRE V L QGQT E P P S V F L E RL MEAYRRY T P F DP S S EGQKAAVAMAF I GQSAPD I KKKL QRL EGL - QDY T L QDL VKEAEKV YHKRE T E E ERQEREKKE V E ERE- - T RL QNE I DMGF P L T RPGWDYNT AEGRE S L K I Y RQAL VAGL RGASRRP T NL AKVRE VMQGPNEP P S V F L E RL L E AF RRY T P F DP T S EAQKAS VAL AF I GQSAL D I RKKL QRL EGL - QEAE L RDL VKEAEKV Y YKRE T E E EREQRKERERE ERE- - T QL PNE I DEGF P L T RPDWDYE T APGRE S L R I Y RQAL L AGL KGAGKRP T NL AKVRT I T QGKDES PAAF MERL L EGF RMY T P F DP EAP EHKAT VAMS F I DQAAL D I KGKL QRL DG I - QT HGL QE L VREAEKV YNKRE T P E EREARL I KEQE ERE- - T QL PNE I DAACP L ERP EWDF T T EAGRT HL RL YRQL L VAGL RGAGRRP T NL AQVKQV I QGAE E S PAT F L E RL KGAYRMY T P YDP EDPGQAT T L AL S L I WQSAPD I RNKL QRL ENL - QGY T L QDL L KEAER I S NKRE T QT ERE ERWRKEARRRD- - T QL PNE VNAAF P L ERPDWDY T T T EGRNHL V L YRQL L L AGL QNAGRSP T NL AKVKG I T QGPNES P SAF L ERL KEAYRRY T P YDP EDPGQE T NV SMS F I WQSAPD I GRKL ERL EDL - KS KT L GDL VREAEK I F NKRE T P E ERE ER I RRE T E EKE- - T QL PNV I DE T F P L T RPNWDF AT PAGREHL RL YRQL L L AGL RGAARRP T NL AQVKQVVQGKEE T PAAF L ERL KEAYRMY T P YDP EDPGQAAS V I L S F I YQS S PD I RNKL QRL EGL - QGF T L SDL L KEAEK I Y NKRE T P E ERE ERL WQRQEERD- - V T DERE I E AQF PAT RPDWDPNT GRGNDNL ERYRQ I L L RGL RAAARKP T NL SK I T E V RQGADES P T AY L ERL YQAYRT WSP I DPRAP ENQAA I V I QF V SQSAPD I RKK I QK I DGF - QGKS L S E L VA I AQKV F DQREDPAKAT HE L T QKMAKV L- - QNPQEY VRT QL PGT DPQWDPNR - EDMQRL NRYRKAL L EGL KRRAQKAT N I NKV S E V I QGKE E S PAKF HERL CEAYCMY T P F DPDS P ENQRM I NMAL V SQS T ED I RRKL QKKAGF - AGMNT SQL L E I ANQV F VNRDAASRKE T T RMN - - - - - -- - - - - - T QKL SKAF D I QQEKDEGP I RF L DRL KEQMKQY T GL NL EDP L GQRML K I HF V T KGWPDVSKKL QKAQKVY VRRDREKQKQKSKPML S T F QQVAPNP YAT KGF QGARN - - - - - - - - - - - Y KRSQASQT QF RE T KP SARGPKS T F PRP PKE- - - - - AF T Y KP PQT AHE Y VKHAE I I F KNNSGL EWQHAT V P F I NMVVQGL P PKV T RS L MSGNPDWS T KT I PQ I I P L MQHY L L QSRQDAK I KQT P L V L QL AMPAQT MNGNKGYVG - - - - - - - - - - S Y P T NE P Y Y S F QQQQRPAPRAP PGNVP SN - -AN I P T T WAAL KT AMHT RWVPP Y YQCE L L QNL QRL RQGKKS V E E Y YQE L QT GM I RCG I V EDNEA - ML ARF MGGL NRD I QS I L KYKDYN - - - - T HS F I S SCL QS T SAGT G I ART NF SA - GHT S SWT PG I T S T S T CP S T SAP P T SAANS T RDT RKQANNMPQT WDAL KRVMRARF V P S Y YARDML NKL QQL RQGT KS V E E Y YQE L QMGML RCN I E EGE E S - AMARF L GGL NRE I QD I L AYKDYANV T RL F HL ACKAERE VQGRRASARSNV SA - GKS T PWQQRT T T SMT GRT L AP T P S P SRPAP P P S S SDKRPR I DT WDKL I KEMRDQF L P SNASWL ARDKL KRL RQT G - S V RE Y I KE F T S VML D I QNMSDED - - KL HNF I SGMQGWAQNE L RRQNVK - - - - - - - - - - - - - - DL PGA I AAADS L VDF - RT T RP S T DV P S T SKNKKKNEKKGEWRKDSRKENANDK- - - V DT WEDL KRE L RT QF L P ENT E F VARRKL RQL HQS T - S I RDY VKQF SAL ML D I QDMSEKD - - KL F S F L DGL KPWAQQE L NRRNV T - - - - - - - - - - - - - - E V VGA I AAAERL T DF - V S S EDP T RRKQSS SNL P PKHSRGKE L GGKQKKKS SHKGDEVRS F ADF EDE F NKKY F P P EAWDRL ECAY L DL VQGNRT VRE YDE E F NRL RRY VGRE L E E EQAQL RRF I RGL R I E I RNHCL VRT F NS V S E L V ERAAM I E EG - - - I E E E RY L NREK - AP I RNNQS T KPADKKRKF DKVDNT KSDAKT GECV T CGRQAD I SWADF VAE F NAKY F PQEAL DRMEAHF L E L T QGS - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -P GT E V T WAE F CRS F RKAQVPDGVVAQKKRE F RAL HQGNRT V T E Y L HE F NRL ARYAP EDVRT DAEKQEKF MAGL DDE L T NQL I SGDYADF ERL VDKA I RQEDQRNKMDRKRKAAQF R - APQGSHQRPRF T PGQQGGP T T M I V RQHRP F NS SNF HQ- - - - - I RL RCS PAVA I H I T SCE T AKSAWDT L KNMYAT QGT MAK - - - V T L V RRL QKY EMSEGADMEDE L RKL T EMRAEAAEAGG - - K I T DEDF AF N I L S AL P P SWES F VAAQN - - - - - - - - - - - NV E K - - - S SDV I GRV L S ENL RRQSK - - GDT T- - - - - I RL RV SDAV L V Y I S AAT T AKEAWDT L MQT F QPSGMYSY - - - I L AY RKF YQT RCPDGGD I P EHL GQMRRL RDE I HT I AGT AR I SDRE F AF AL L I S L PDSWDQF I QA I D - - - - - - - - - - - AT ADNL T SAQV I AR I L QEDKRRKEREGAKGD- - - - - S I DE L WHF MDQQYDDPHQKSKAL DRL L NL RQGKL S VRDYHME F NRL E I QSGERF GEAAKKSMF L KGL HT KL QEAL AT VDEDL - S Y EQL I NKA I RT SDNL Y - - - - - - - - - - - - - - - - - - RV S L T AR - ARQGHT L AEQS T PKT T QREAS - -- - - - - T V VGL WEHMDT HF QDVHQQQRAL NKL RHL KQGRRP I RDY V S E F NQL RV E SRQQF S P V VAREMF S EGL RE E L QKL ML HT PKNG - S L KE YMDKA I E L SDDL Y - - - - - - - - - - - - - - - - - - R I QL HGRNQRS T AHDGAQNHPRAVQREAS - -- - - - - - F DGY VKAL QS L F L DPDEKRQAERDL SNL RQNK - S AT L Y AAE F RRL AAR - - L DMT DE SKV F AF YQGL KDDVKDEMAKVDT P P - S F L DY V E YA I K I DNRL F E RRKEKGEKR - - - - - - - - QNPNSGRKYQWQPQQRF NNQRNDNRPRNNWN- - - - - - F AGYKRT F RS L F QE PDKKRQAERNL ANL RQT K - S ASAY T T E F KRL AT R - - L D I T E KT K I L QF YQGL KL R I KDE V SKL DRP E - DF L E Y V E L A I KL DNR I Y E RRQEKQGGQR - I F V T L I RQT NT GRKYQHPQP T YHRNNGGWQQPRHMAP- - - - - GRHEMEEAL KMAF GT I DEKGQAERK I KT L KQT G - S AS T L GV E F L QL ASK - - L PWDQDV L MS F F F DAL KEQVQQE L WEKDRPR - T L V E Y I NMAVK I DDRQF AWRT - - - - - - - - - - - - - - RNSRGNKGRQDNKPRYHANQGRT RQT D - - - -- - - - - DY T DL L QY L E EAF GDPDHVQNAQNKL YAL KQRNVDF AE Y L S E F QRL S L EG - - EMP EDAL P P L L F QG I S E E L QDML L HNPAP SRQYHE F T RHL QS L DNRYRQHQQYKN - - - - - - - - - - - RQT R - - - - - T P RAAAP PARAAPRT QP - D I P R- - - - - DY ED I L D I L DQAF GDPNCVNNARNE L F RF QQNNKE F GL F F AE F QHL AL EG - - EMP E E T L S T L L EQL I NRE L KGML MYNQPP T - DYHE F AKF L QE L ENRRRHYE I NL Q - - - - - - - - - - - S AS RNY PA I T RT AT SQL L RT NY T T L P RT I E N- - - - - S D I Y S I S F DDT V T AL T K I F GS T KS L MMRRQQCL Q I CRANGL SQDY L DY T NS I SDAV L DSKL S SMT SDEWS I F L F L RGL NS PGDEKAKL Y L MQYV EAS EKKNEKL KL S - - - - - - - - - - - DVHDEWMKF I QMHQQSK I V S VKP SKS SQQVD- - - - - T DF L G I P F T E T L QNL T KAF GSHRSMMMRRQQCL Q I S RANGES L DP L E Y SNRVGDAV L EAKL ASMS T DEWSV F I F L RGL DAPGDNPAKAY L MQF CE T AEKKGEKL KL S - - - - - - - - - - - D I HDEWI RF I QMKQQT KV V SAL P VKP T PQQA- - - - - CD I Y E V P F V T T V EAL EKAL GSHRS L M I RRQT CL HL SRT NCT S L DP L KWT NQ I S E AV L DAE L AKMS T SDWG I F L F L KGL DAPGDAQAKAY L MQF Y E T SDRKGEKL T L S - - - - - - - - - - - NMHDEWVRF L QT KSQT K I V SHS - - - - S P KT S- - E V I T WDV F RRE F MT KY Y P EDVRGKKE I E F L E L KQGNMSV T DYAAKF V E L SKF Y PHY T GAGAE F SKC I KF ENGL RS E I KKAVGYQK I R I F T E L VDSCR I F E EDNNAHYK I V SDRRGKQHQNRGKPYDAP VGKGKQGAAPAQRT SRGGAPAGYS- - K I KSWKKL KKKMEAKF L P S T Y T L DL YNQL SDL F QGT KNVE T Y I HE Y EKL MMKL DVQERE EQT MARF VKGL DRD I QKKL E L QS Y S T L DE L F RL AL KV EKHQKDKKKKV F SKG - - - - - - - - - - T S Y AKGGSS S Y VKP T S T S P F T P KS EAGSKF L- - ADGV F KS YNHL KHAMKSV F GVSNE I AT AVRV I QHL T QKT S T AE YAAKF QEYAQL T DWDDEAL QVMYRRGL KEHVKDE L MRDGRK I DGL GDL VQV T I DL DDKL Y ERAMERRYDSK - - - - - - - V SGKAGY T PGYDNRNRGF NDNYNKPKDKP Y Y- - - - - L S SDY SAF RRE F KAV F EHP T YGEDAASRL L AL QQGSRS VAE Y T L E F R I L AAE SRWGE T AL RSAYRRGL S EA I KDL I V RDRP S S L NE L I T L S L QMDERL RERRQERAQRAGGS T RQL SHRT S SAPDF S L T S T AAP P PH I L L QS PAHP S PR- - - - I L RQT KE L GS L P F S T WGDF RAE F VRHF CP L NERHRAV T RL E S S T YHQGRRSVDDY I DE F KDL VDMAGYGSGL P V VKF RRGL NRE I QDQ I AQML EGRPADDD I L AWYEAAS L CDENRL T NAAF NAAF HAP T T T HT P Y T F NRS L PRPNP I P T- - - - - - WE E F CGCL NARF GRDQHSML I RQF YH I HQT NSMSE Y V EQF DV L L HQL L AHEGQL T PAM I T ARF L DGL KE E VRS V V V I QRPNNL DAACS L AL L QEDV L MHSGRREHR - - - - - - - - - - - RQDGGGF SKP L QKGT T MAGSS L SAT T NRVAY- - - - - L L L RF GNL KSRGPSQS L L C I KQT GF V V E Y I HQF EDL S SQVSGL DD I KL E E I F L NGL S L KMQE I VHMWNL QNL P EM I GVARDMES S L I HRV V L KE L QSGKGGSK I QS E - - - - - - - - - - - S RAL V P YQP L VWKSRP F I T E P SGT KERAT P I- - - - - - WSNF S I S I MCKT HKRY V L QVY I WNQ I S L YG I NGF AL VNKL S L GQL L RKNQH I MR I QKAT S F F SQF I S L KQKDS I MEHEKAF RL VRKMESK I MAT GKHT T H I Y KDGS - - - - - - - - - - - - V F AP S L PQP I ML T PQKL E EKRAKG - - - - - -- - - - MT WQE F QE L F L QRF E T E E T P AAT F L NL L NSRP T AAECYAV YASRL V T QL T T KWRNME I E E I AV T T V L AHMAN I DSRL QRV L F T SNVRT RSKL QAE L KAF T F DKKRHAR - - - - - - - - - - - - - DDNL GPDQKNRKAS P V VCHF CSKPGRR I A- - - - - - E KQKY VQARSKM I GSAE L F L E S ECV SGY T E L KE L L I E E F SGS YNSAV I HKKL QDRKKKRE E T L HDY L L QMKK I AAL GE V E T VAL I T H I V NGL D I KKE YKGAML RCK - - - - - - - - - - - - T L KE L KQE F E I Y E S L N I V DKPN I QPKPKQ I- - - - DQ I T T YDG I MGHL RE E YKDT KL EDT ARDE L DKL I MS T KDKF M I F KNKF VKAGQSGL PKRSWKRE L HRRL P T NL RVAMV I YHQDT NV S F DAY VRT ADG I S YDL T KAYAS - - - - - - - - - - - RT EDKNKT KT T S I KKT T GGSV I P RAGGPARTRDGAYRY T DF L DY L ERT YDDPHKRAQAL AE L E T L KMKPGQS F AQF I A I F E RT L AT AGGL AWADEVRT NF L RF RV S PR I RE ACVGRGMGDGT Y L GAVA I Y RQVAQDL EA I E L DRRF GPHRAGAAT APRP PKDEDT PMT GVAAMGSRPNGGARGRR- - - - - - Y E E I I RT L RS V Y I KKKNN I Y S RY L L L S RQQSP E E S I P E Y I QE L KS L SRQCE F T AV EANQYRE E L T RDAF I DGL S S S V I RQRL L E VDS L DL MKAF E L ADS L DRAQKYS T SMGRNAAP T P S YAASAPQHQHEPKRSAPQHQYE E T VCRQQ- - - QL T Y ANAVAML AAHL QPKP S I L AERYKF RQRRQL NE S I ADY L T E L KKL SKNCE F GSS L DENL RDQMVCGL KS E I I RQRL F AE EKL E YKRAV T L AL S L EAAERDA I AV ER - - - - - - - - - - - - - T P I E E VNK I NF NECSRCGDRRHQAKDC I Y- - - - - - GE SMKL EKV L DKY E E YCMPKRNL T Y ERHRF F T RSQL EHE S VDQYAT E L RSRAQSCE F S S L KDS L I RDR I I CG I L DAGL RERL L RQDDL T L DKT L QVCRAAEQT RT QAQAL S SAGENAS I P VDVDS F SKT KKKF SGSKSKSKQSKPQDH- RHWRT WGT F VRS F QE F F F AEDY L EDL KDE VKRRKQMVDEP F K I YMV EMQT L MRP L RYGPDHEMKL I Y NNS I P DL RAYARP YQF QS L ME L MKL ADE F E E L ERDRERL RRL QRPART RL MAMEEDDGHEE EML RRGAL E E S PRPAPRT GAT GQRT- - - - - K I MDT DE I NHYDVCREM I I RHEWNVSK I NEKQCL NSKL S EKRGKVQNAENF VNQNT QNNF KP F S PNKAADNSRNSWNNNSQNNSAASQN I S REQSWKT I S V PQKHQN - - - - - - - - - - - P SDRCSDCQQRGWHMF WCSKKSKDNASQKCD- - - - - SWAE L RDHF I ANF QGT F ERPGT QYDL YNV I QKSGES L RDY I RRF S EQRNK I SD I T DDV I I AAF T KG I RHE E L VGKF GRKP PKT VKL MF EKANEYAKAEDAV T ASKQS - - - - - - - - - - - GP SWKPNKGT PAKGGGGSNNHKDRKRKP E E LMDMAPSRT QL QS L CQKSNES F KE YAQRWRE L AARVQPPML ERE L T DM I GT MGSCP F GS F SDV V I CGERT E S L I KT GK I QDVGSS S SKKP F AGAPRRREGE T NAVQHRRDQNR I E Y RQAAAV T I P APQPRQQQQQRVQQPQQQQQQRPYQPRQRM- - - - - S F L E L S I AF L KHY SMY I E RGT T HADF WT MAQGPKE P L RT F I DRF KAV V SR I AV PDEAA I S AL RNAV SHE SRF CDDL L L NRP V Y L EDAL HKAT RY I EME E EKL AF SRKHT SN - - - - - - - KQSKEKVQDDYY E PRQHYNQEYRQE T ERKAA- - - - - NF NE L SAV F L KQYS I L MDKS I S I P T SG I Y L KGRMRHF GL L S PNL KT CF QSSQGSHNSRL YRL CKKDYVM I RGL RK I I L HKPDT I QDAL F RANNWMEVEDEKE I F AKRN - - - - - - - - - KQAKL AV T F P PKKF E PRENQGPRKF GSQP F NN- - - - - - F DT F I QGL YQHF YKP PD I NK I F NA I T QL S EAKL G I E RL NQRF RK I WDRMPPDF MT EKAA I MT Y T RL L T KE T YN I V RMHKPE T L KDAMEEAYQT T AL T E RF F PGF E L DADGDT I I GAT T HL QEE YDSDYDS EDNL T QNGYVHT VRT RRSAL AF YWRVHRS VDNMNWNV L CRRL KKRYQDQRSDRE I K I AMRRRKEGS T ENF DDF L DAML S I ADS L S E PMQD I E I T V EDRHNL KP E I KHE L L H I DT PNL A I L RKECHRHEDF F RS T RT KP I QRSNT SKRF VNA I L HEDDSEDQSE E EQDVEDE I- - - - AQDV I KN I E A I MS YHNV I NP V Y L I HGT AE VQL E Y L RS I L DNN I L AL L KMVV PDWL QT DVA I KL NT QT D I T ECREA I T E L V L NNADKF SRRRP SDSASNNDEKF P L KKN - - - - - - - - - - - HAT PNHNSHNRNS F DAQADK I RAA I L P E I RK- - - - - T WDDCKKV F L S KF F SNART ARL RNE I SGF SQKT GES F CEAWERF KGY T NQCPHHGF T KAS L L S T L Y RGV L PR I RML L NT ASNGNF QNKDVE EGWE L V ENL AQSDGNYN - - - - - - - - - - E DCDRT VRGT ADSNDKHRKE I KAL NDKL DR I- - - - - DWNV L E EKF I E RF F SHNRF MESKT A I S V F SQGT NAS L NEAWERF KSML RKCKGHGF DE L T Q I H I F KNGF QPNCKT L L DAT PGGS L L S KNAQEAT D I I NRMAL NDL QS - - - - - - - - - - - QS RGNS L KKPRV L E L GT NDA I L AQNKL I S PQ- - - - - L F NGVDKN I F RL I NT CT VAKDAWE I L K I T HEGT SKVK I S RL QL L AT KF ENL KMKEE EC I HDF HMN I L E I ANACT AL GER - - I T DEKL VRK I L RS L PKRF DMKV T A I E - - - - - - - - - - - E AQD I CNMRVDE L I GS L QT F E L GL SDRAEKK- - - - - HL R I L L I T KL KGNAQVWL HAHPARL I E P I DNL L DQL S L T F GEQSSKAE I RRKF E SRKWKT E ENF CS Y YDEKMAL SNG I N I DDDE L L DQM I EG I P L QNF RT QAR I QCF S - - - - - - - - - - - T P S EML RAF SN I RL PARRE P P VQP T DYKDA- - - - - I F NS VNQNQF KR I QNCESAKEAWDKL AKAY EGT S S VKRSR I DML ASQF ENL T MDES EN I E E F SGK I S A I AS EAHNL GKK - - Y KDKKL VKKL L RCL P SRF E SKRT AMG - - - - - - - - - - - T S L DT DT I DF E E V VGML QAYE L E I T SGKGGY- - - - - I QQGVAE S L F P R I I GAKKSKKAWDKL KE E F QGSQKV L AVKL QT L RRQF QNL L MKES EKVKDY F SRV I E I V NQMRL YGED - - I NDQKV V EK I L I S L P E KY E Y I V AA I E - - - - - - - - - - - E S KDL S T L T I QQL MSS L E SHE ERKL QREGSS- - - - - L Y S AV S EA I F P R I I AL DT AKQ I WNY L QEE F HGNER I I QMQV L NL RRE F EMQKMKE T E T I RDF SNKL L S I V NKVRL L GED - - L PDKRV V EK I L V T L P ERF E SK I S S L E - - - - - - - - - - - E S KDL SK I S L AE L L NAL QAQEQRR I I RE - E S- - - - - I RQWVEDNV L NH I I NE T HARS L WNKL E T L YASKT GNNK - - - L F L L KQMMN I RYREGT L I NDHVNDF QGV L DQL SGMG I K - - F EDE V L GL WL L NT L PDSWE T F RV S L T - - - - - - - - - - - NS APNGVV T MEY VKSG I L NE EARRRSQDT S T- - - - - I RQF V EDNVYNH I SGV T HARS L WDKL E E L YASKT GNNK - - - L F Y L T KL MQVKY V EGT T VADHL NE I QG I VDQL SGMG I K - - F DDE V L AL MV L AT L P E SWE T L KV S I T - - - - - - - - - - - NS APNGVVNME T VKSG I L NE EMRRRSQGT S S- - - - - I RQWVDQSV F HHVAKE VDAY S L WQKL E S L Y ERKT AQNK - - - AF M I RRL VNL KYKDGNSVAEHL SNF QGL L NE L S T MKL E - - L DDE VQAL L L L S S L PDSWE T L V V S L S - - - - - - - - - - - NS T PNGV I T VNMVKDNMF NEEARRKE L G I S S- - - - - V DL SGS L L M I MF I I L L L MRHMQEAYG I S L S PCMPVRT N - - - KY L L KNF GE L KYKEGT P V SNHL S E F QGRYDQL AGVG I K - - F DDDL L DL F L L NS L PDSWE T F RV SM I - - - - - - - - - - - S V T PNGD I S L QMAKSGAL NE EMRRKT RGT S S- - - - - I RL HL SDDVVNN I I DEDT ARG I WT RL E S L YMSKT L T NK - - - L Y L KKQL YAL HMSEGT NF L SHL NV F NGL I T QL ANL GVK - - I E E EDKA I L L L NS L P S S YDNL AT T I L - - - - - - - - - - - HGKT T - I E L KDV T SAL L L NEKMRKKP ENQGQ- - - - - I ML CL ADDV I T E V SDE E T AAGL WL KL E S L YMT KS L T NK - - - L L L KQRL F GL RMAEGT QL REHL EQL NT L L L E L RN I DVK - - I E DEDAAL I L L V S L PMS F ENF VQS F I - - - - - - - - - - - V GKDT - V S L E E VRSAL HS - RE L RHKANGT S T- - - - - L F L NL KE T QL MH I KACT T S ADAWKKL CDVHL PGGP I RK - - - VQL YQKL SRL RML EGDNVVQYVNKF AE T VNKL AEMD I T - - I NDE L KS I ML L S S L PNSWENF V VA I E - - - - - - - - - - - T RDT L P T F E T V KVKL L E EGARKQE - - RDDRE- - - - - I I L S V KS SQL GYVKGCL T AAEAWKV L QDVHQPKGP L RT - - - VML YKKL L SKRL L EGQS I S SH I KE F KE I F DAL DAV E I G - - I T E KL RS V V L L S S L P E S F ENF V VA I E - - - - - - - - - - - T RDDV P L F DAL C I KL I E EDT RRGG - - AEQQR- - - - - M I L S I S S T E L RQ I KNCV T S L EMWCKL E ECYQSKGPARK - - - AT L L KNL I L CKMSDGDDVRSHL NNF F AS VDKL V EMNVD - - I NPDL L A I L ML Y S L P PNF DNF RCA I E - - - - - - - - - - - S RDNL P T P EAL R I K I I E E SDARKGNSRENSS- - - - - I V NCL SDKHL E Y VRSAT SAKEM I ENL RK I F KRKS T L S A - - - L Y V RKKL L T L KCDPS V E L SDHF NNF DML I RQL E E T GS T - - V NEQDKVCHL L L T MPDSYNT T I T AL E - - - - - - - - - - - T T N - V E L T T E Y VKSKL L DAE L KNQNNSMES -- - - - - I I E Y L SDS F L NF AT SD I T ARQ I L ENL DAV Y ERKS L ASQ - - - L AL RKRL L S L KL S S EMS L L SHF H I F DE L I S E L L AAGAK - - I E EMDK I SHL L I T L P SCYDG I I T A I E - - - - - - - - - - - T L S E ENL T L AF VKNRL L DQE I K I KNDHNDT S- - - - - L F S S L T RE I RSQVAGAP T AAAAWKT I E NT F S T RSHAGA - - - - I NV RL AL T T T QKGQS T V T E Y V SKMRAL GDE I AAT GKP - - I DDE E L VAY I I NGL DS E F DP V V EAL I - - - - - - - - - - - AKNAS V T VAE V Y SQL L GF ENRVK I R - - - - - -- - - - - L L S S L S RDV L I QVAT CKT AAEAWRS I E AL Y S T GT RARA - - - - V NT RL AL T NT KKGT MK I AE Y VAKMRAL GDEMAAGGHP - - L DE EDL VQY I I AGL NEDF S P I V SNL C - - - - - - - - - - - NKSDP I T VGE L Y SQL VNF E T L L DL YRS T G - -- - - - - L L S T I GREAL AQV T MCNT AF SAWNT I E T MF S S V T RARS - - - - I NT R I AL AT I KKGDL S I T E Y VGKVRAL GDE L AASGKK - - I DE E E L V S Y I I T GL DE E F E P I V S S L V - - - - - - - - - - - S KT E S V T VGE V Y SQL L HF EQRMAL RQASHGD- - - - - - - - I AL I I RDAKDKGREAF K I L KDHYAS T SKPR I I T L YNQL T S - - - - - - - - - L KKF T NE S I T DY V I RAEKSAT S L NT AGES I SDS L L I AMV L KGL PDDYRAF VA I I T - - - - - - - - - - - QS E T SNNF QKF KQAL RS F E E T E Y T RT T H I E -- - - - - - - - L S L F MRDAADDGRKAL R I L RAY YAGKGKPR I I S L Y T E L T S - - - - - - - - - L QKKP S E S V T DY I I RAE T A I T AL RNAE E T L SDGL L I AM I L KGL PDS F KP F A I H I T - - - - - - - - - - - QSDT E I T F AE F KT KL RNF E S T KNF HS VAS E -- - - - - I AF NL VDNCT T AEQPEGNCK I AWERL VHKYAPKT AP S Y I QL KKDF ANSKL AS VDT DPDEWMT DL ECL RS EMNKV T I PGKT DMSE VDL I I H I L SNL P E E Y E VAV S E L E - - - - - - - - - - - E KL KNT YRP L SME T VREKL NSRF ER I T KN - -- - - - - V AF N I V E T AKS T DY P EGNARL AWL HL ANKYAPKT GT S Y I QL MRS F VNSKL D - L GT DPDDWI T E L E S L RT EMDKVK I SGKT DMSDVDL I I H I I AS V P E E Y E VAV SDL E - - - - - - - - - - - DRL T SGADK I D I E I V RE KL SARF DRL KKNDV- - - - - V DL DSDT GMT T L MAKL DDV F L KE EKDRAY EAY S Y F DG I T KDS S V SMADY I I DF EQRYN - - - RKG I I L PDAV I AF KL L DT ACL DDKNRQL AL T ACT E L T F S SMKSAL K - - - - - - - - - - - R I F GGKAAGL SNG I QVNQDVAF F T KQR - - - -- - - - - E DL NKDT GME T L L V KL DDL F L KE EKDR I Y E AY S S F DR I T RS SNV SMT DY I I DF EQRYGRMHKYKME L PDAV L AF KL L DT ACL DVKDRQL AL T ACAE L T F AAMKSAL K - - - - - - - - - - - R I F GGKMSG - - S T VG I NQDAAY I T EQ - - - - -- - - - - E DL HNENGMVQL L EKL DT AF KE EK I DEAYN I Y T NF I NF NKKEGS T MADY I L E F EHL ASKMSMYEMNL P S T I L AF KL L DGAKL SQSERQL AL T VASDL KF NSMKSAL K - - - - - - - - - - - R I F S KNCEG - - E S S T T I KE EAF I MRQK - - - -- - - - - AV L E SDDGVKT I L E KL DGL F L GDKGRGQF NAF QE L YNL RRADG I E I RKF I SQF EHT Y YKF I KQT MS L PDS VRAF ML L ASCKL S EGEQQL VMSA I SDV T Y ENMT NAL N - - - - - - - - - - - Q I F SGGF GGHASDSKHNAS I V DS I KT E P V F V- - - - Y L VWACQDRAF N I VDKVKDNVAF KAWQML KKKY E PKE VDAY VQL QEE F QECKMESDKDDP T V F I E E L E V I NGYMGA I QDKY EKDDVE I I S H I F S KL PAS Y S E F I T SQKT KG I KNT S V ED I KDGL E E YWKRT I KT S T KT T AT E EKAL S V SD- - - - - I AL L VDDCQHP L I RDCKT AKGT WDAL QKHHQKT T MS T K - - - - - V S L L KKL CKAE YDE SGDMEAHL F RMDE L F S S L MNAGQE L DS S L KVAMV L KSMPE S YDHL T T T L E T - - - - - - - - - - RSDDDL T ME L VKSKL L DEAQKRMEKSHQSE -- - CE KT AY S Y V ENAV T E RAPMGDAYEAWKKL CERY E PKE I E SDY T S I E HS F KDCT I GS I NENV E EWF L E L E YWNT RMGK I KQS YMRDDL QMKAH I I DQL P EAY EAVKVKL SGAY T T T SMEAF KR I I L DF WKRHSKT DNNKVMSHSE VKEKCGHC- VQEAWEA I KT RR I GVQHVREANAQQL RRE F GD I I F KDGE T VDDF SMR I GV L ANNL RT L GDN I T DAE V VQKL L QV V P EHL QQ I A I S I E T L L DVNE L S L E E V T GS L RS V EQRKQRKAVAAS SRVDAYGCL L F T E E EWL AKF RKAAS L QDAAHSS S- - - - - L V L T I DP S L Y VHVKS VQKT HE L WKKL KNL F DDSGF T RR I S L L RS L I S I RL ENS V SMT S Y V T Q I I E T SQKL SGT GF K I NDDWVAS L L L AGL T DKY T PM I MA I E NCG I P - - - - - - - - - - L T ADA I KT KL L DMEE SGNARE SDNSASNSGGA- - - NV RAL AQ I KMAVKPHL MNM I ADCKT AAE VWECL RV L F DHE T AT RRGE L E EAL AT L RKQPNE T I ME Y VGRT KG I RDDL AT AGVNKDEHS F CL D I L RGL PAP F KT I RT V L RN - - - - - - - - S GKL L T L P T I ANRL ME T E L RAE SDDDVPQAVQA- - - - KRQS E L RKVHL L R I AT RF L I S RQAT KRRAKF S T V T L VMSAKVWL RNWRCANVY F HNYQVRCHF I F S I S L MS L VNCL L VQNRNGMPSS YNGV I T A I E T L S EDNL T I AF V KN - - - - RL L DHE I K I NNDNRDT SRKVMKVV VQNKSNNNKY T H- - - NDKCM I I L KRS I KE P L KS S I P ECE T AKE Y L ERVASHHQGSSKAYACS L MT E F VNAKYDGNGVRP F I QKM I S I V AK I NKY L GS P L HE E F V V F M I MKS L PKE F E T F H I QYNT S V T DKWN I DQL L AQCVQEE ERL KQT GDSVNL I KGNVPRQNK- - - - - KE L WDAL VGKF GV T DAGSE L Y L MEQL YDYKMVENRS V V EQAHE F QAL AKE L E L F PCP L PDKF VAGG I I AKL P P SWKDF AT S L KHKRQE F NV E E L I GT L DV E ERART K - - - - - - - - - - DSGKGVE T S T ANV VQKRNF RKF NKKKNQNKPE- - - - ARDL L NAL DERYKGL MEHHV T RL WRDF NAF KMQPNQS L RDY I KR I KD I V E E L T ACKQPVADANAV L I AL KGL P SRYDP Y VAP Y Y S T T L MPKT L DE L ENNL L GV E T V I DERAS T S V L PNT L F AGKGDQDRGNERGGGNSRGGGYRRGNKYG- - - - - L L NS V S PQ I Y RS I L RMNDASD I WRDL NSRF NV T NL PRT YNL T QE I QDF RQGT L S L S E Y Y T RL KT L WDQL DS T EAL DE PCT CGKAMRL QQKAEQAK I V KF L AGL NE S YA I V RRQ I I AKKAL P S L GE V YH I L DQDNSQQS F SNV VAP PAAF- RV VD I L QAS L CKP E YDR I S S E E L AQ I WDKL KKYHEGSNAVKT RKF E I Y RKE F DAF SQL PGE S I E DL F ARF QV I VNKMKAL NKDMPYNDHARAL RL L HS L T DEWDMKVEA I V E S AGYE T L T L T S F S V S L SQKRL MSS P SANKNP T SAS S SDV PM- - - - - WF HS Y L T T YRN I HGKS P S T QEACAACREQYGNES F DE EN I QNL L DL Q I NF SNRE PG I T KF L QYRHL L ERMNSHE L I L H I Y I HT L P PA I RNY I F SQKAKGL DEAMAL T RR - - - - - - - - E G I RNGQNRKL P S E F RKDKVNSKKV F KSKF KQ- - - - - - L GE S L D I MNT NT T D I F R I I DGKNKYRS L HGRDVR I RAWEKV L VDT T CRNSAL L MNKL QKL V L MEKWI F S KCCQDCPNL KDY L QEA I MGT L HE S L RNS VKQRL YN I P - - - - - - - - - - - - - HDVG I DHE E F L I NT V I E T V I DL S P I ADDQ- - - QAV L AL L QNS L E T S I L EGY S YCE T AKE L WDT L KNV YGNESNL T RV F E VKKA I NE L SQEDL E F T KHF GKF RS L WSE L E S L RPGT L DPK I L HERREQDKV F GL L L T L NPGYNDL I KHL L RS EKL P S L DE VCSK I QKEQGS T GL F GGKSE L I T A
- - - E KGGCCF KCGKKGHF AKNCHEHAHNNAEPKV P- - - E KGGCCF KCGRKGHF AKNCHEH I HNNSE T KAP- - - DKG - GCF RCGKRGHF AKDCHENQNKNPEAK I P- QKSGNSGCF VCGQPGHRAAVCP - - QKHQT S VNT P- NPA I F QE T APKK I QP P T QL P T QPNAPQAS L I KNL- QGAEGPVCF SCGKT GH I RKDCKD - - E KGSKRAP P- E ARAGANCF KCGAVGHMRRDCPS - - - - - - L NKR -- GGRARGL CY T CGSPGHYQAQCPK - - - - - - KRKSG- - - - - - GKCYNCGQ I GHL KKNCPV L NKQN - - - - - -- - - RKAF KCWNCGKEGHSARQCRAPRR - - - - - - - -- - P RKP I KCWNCGKEGHSARQCRAPRR - - - - - - - -- - GQRK I KCF NCGREGHL ARNCRAPRK - - - - - - - -- - GRGPPRCF KCGQ I GH I QKDCPRAGP - - - - - - - -QP T GRKP I CF NCNKEGHVARF F KAPRR - - - - - - - -GP L KAAQT CYNCGKPGHL S SQCRAP - - - - - - - - - -- - - - S GP VCF NCKKPGHL ARQCRDV - - - - - - - - - -- - - - V NQKCYNCGKPGHL ARQCRQG - - - - - - - - - -- - - - GVQKCYYCGKPGHL ARQCRQG - - - - - - - - - -- - - - QPQRCYNCGKPGHQARQCRQG - - - - - - - - - -S GP EDGRRCYGCGKT GHL KRNCKQ - - - - - - - - - - -- - - P P T QPCF RCGKT GHWSRDCT L - - - - - - P RP P P- - - P P T QPCF RCGK I GHWSRDCT Q - - - - - - P RP P P- - - P PNQPCF RCGKAGHWSRDCT Q - - - - - - P RP P PRQEKNL T K I L AAV V S EKGF RGRQAGNL SNRAMRAPPKKKNL T K I L AAV V SREGS T GRQT GNL SNQAKKT PRQERNL SK I L AAV I NDRQSEKGRT GF L GNRAVKP PRQEKNL T K I L AAV V EGKSNT ERERDF RK I RSGPRQKRDKHL T K I L AAV V T EKRAGK - - - - - - - - - - - - - -KRNREMSRL L AT V V T GQR - - - - QNRQRDGR - - - - -DRHREMSKL L AT V V SGQR - - - - QDRQGGER - - - - -KRHKEMT KV L AT V VAQNRDKDRE E SKL GDQ - - - - -RAERGSKKT P PGK - - - - - - - - - - - - - - - - - - - - - -- V RPGE T RL L AAA I RGV P PKEARQKGGPGKE T QP -- - - - - - - - - KARPKNPKT ERGEGQD - - - - - - - - - -- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -- - - - AT T F ANS T P SRPAY S S S S SMAF T GQ - - - - - -- - - - S S T NSAT KSAQKPAGSAS S VAS T GR - - - - - -- - - - QMKDGKDRPKSKDGNSKG - - - - - - - - - - - - -- - - - - - - - - - P NPKGKAAKPGG - - - - - - - - - - - - -- - - - S GT CWKA I GACGRCGSKDHA I QSCPKMEPGQ- - - - - - - - - - - - GRF G - - - - - - - - - - - - - - - - - - -- - - - GSQNHHGSQPNRGAAPRP PMAPAQSGQPAQA- - - - - - L V ARNG - - - - - - - - - - - - - - - - - - - - - - -- - - - - - L L AKSK - - - - - - - - - - - - - - - - - - - - - - -- - - - - - - - - - P E AMDWEP T KVGQAR - - - - - - - - - -- - - - - - - - - - P E AMDWEPSKV SQAR - - - - - - - - - -- - - - - - RQS T S YGQHSGPMDL SAT Q - - - - - - - - - -- - - - - - Y N - T AY S T HNGPMDL SAT Q - - - - - - - - - -- - - - - - - - - T S YGT EAGPMA I GMT K - - - - - - - - - -- - - - - - P KPNAE L P L NDPMDL S SQR - - - - - - - - - -- - - - - - L QRT QPDVHNNAMDL L S I C - - - - - - - - - -- - - - - - V NKVDT NRSKKKKKP I P - - - - - - - - - - - -- - - - - - V E VNKVDPRT PKKKKF K - - - - - - - - - - - -- - - - - - ADVNK I S V KY P PKKPAY - - - - - - - - - - - -- - - - ML V RL VHKNNVMHCL K I KKCF PCG - - - - - - -- - - - - - GKKVAE S T SKKDD - - - - - - - - - - - - - - - -- - - - - - PME L DV T EKGRK I RNSKGNRR - - - - - - - -- - - - - - E E PMQ I GRSRL SRQEREQR - - - - - - - - - -- - - - - - T P AVNV P V PMDVDAARSRAR - - - - - - - - -- - - - - AS RL EDSKL T AL KAYRKAKG - - - - - - - - - -- - - - - - - - P RKHF T DV E L DEKRRKG - - - - - - - - - -- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -- - - - - - - - - S KMRQDRRAKPQREKS - - - - - - - - - -- - - V KADHCF NCGSREHKRKDCT L P - - - - - - - - - -- - - - - - - - AVDEGQKGKL S V E EMRG - - - - - - - - - -- - - - - QT QP SDT NRRDT RPRAQWVPSD - - - - - - - -- - - - - - - KC I F CGGF AHSRT NCPAR - - - - - - - - - -- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -- - - - - - KT CSRCGNRHSKDKCPAQG - - - - - - - - - -- - - - P S RACRVCGQEGHRAVRCRNR - - - - - - - - - -- - - - - - - - - - QSGWHMASCF KL KNR - - - - - - - - - -- - - - - - AT HS SRQRSRVNT F DK I MN - - - - - - - - - -S L PMSYAE L L P E L L RL GL V E L CT MAPP - - - - - - - -- - - - - E VDP VAQPANL EKARRT DE E - - - - - - - - - -- - - - - - - KQF QGKGRSNT WVRDE T S - - - - - - - - - -- - - - - MSNHRNRRNNNPSRE EC I KN - - - - - - - - - -- - T S E K I KCWNCDESGHGYQNCL KT R - - - - - - - - -- - - - - - - V DL KK I RS S F I V Y RQSKG - - - - - - - - - -- - - - - - - - - - - QQKHVHF L VD - - - - - - - - - - - - - -- - - - - - - - - - - - QQ I KE L RE P SKAK - - - - - - - - - -- - - - - - NL AF V SNDEGEEDE YDL NT DEGL T NAV V L- - - - - - - - CANCNSRGHKAD I CKKP - - - - - - - - - -- - - - - - GV AL AV S S EKNE - - - - - - - I QE L KDSMSM- - - - - - NAF QSKL S F RPQ - - - - - - - - - - - - - - - - -- - - - - - GAF QAKMHVQ - - - - - - - - - - - - - - - - - - -- - - - - - S D I L V T DDRGRNKQK - - - - - - - - - - - - - -- - - - - - S E V L AV T T RGRSQNK - - - - - - - - - - - - - -- - - - - - T E AL V I E RRRRSKSRKP S - - - - - - - - - - -- - - - - - - S V L I T ENKGRSQKKE - - - - - - - - - - - - -- - - - - - I T EGRGRSYQRS SNN - - - - - - - - - - - - - -- - - - - - QP SGL F T S SGKGRKN - - - - - - - - - - - - - -- - - - - - V QAV Y VHT KL HGT AK - - - - - - - - - - - - - -- - - - - - QT E SAKAF T AVHKPQ - - - - - - - - - - - - - -- - - - - - M I ANKSGQREYKKKL - - - - - - - - - - - - - -- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -- - - - - - VMNA I V HNKNNT YKNNL F - - - - - - - - - - -- - - - - - T ACAAT SGNRGSG - NQGRG - - - - - - - - - -- - - - - - QGGAAF VANRGRG - GGGGG - - - - - - - - - -- - - - - - S HGS I NMANRGRGRS F QHW- - - - - - - - - -- - - - - - - - - - - - - - - HSDT K I MKF K - - - - - - - - - -- - - - - - - - - - - - - - - DNVMKANAS P - - - - - - - - - -- - - - - - - - - - - - - - - - - - - - AE AKE - - - - - - - - - -- - - - - - AVNE T AL SAL GV L L EDEDL - - - - - - - - - -- - - - - - - P QRREKRNEMWQSGHS - - - - - - - - - - - -- - - - - - - - - RRQL NKPWSHNDKQ - - - - - - - - - - - -- - - - - - - QF KKNKMNAVKQ - - - - - - - - - - - - - - - -RGGRRGRQPYRARDNYRP S T EQN - - - - - - - - - - - -- GGRF KGT CNYCGKPGHKA I DCF KRKKEQGENGGN- - - - - - - - - - S I L RVGP E - - - - - - - - - - - - - - - - -- - - HSQDKCWSKP ENKHL RPNGKQG - - - - - - - - - -- - - DRQGQGRGNKDDGASKE T QPKPAN - - - - - - - -- - - - - Y KKNRNVGS T AAT RDNAQMS - - - - - - - - - -- - - - - - - - AMAL KKDEKKGQQDKKP - - - - - - - - - -- - - - - - NF KNRV T KPNKF F KGRPRN - - - - - - - - - -- - - - - - KKF KKQGKQNNQASS SNNNQP - - - - - - - -- - - - - - - KP VHT AQF KKKNNNNKGK - - - - - - - - - -- - - - - G I RCF QL QGNRS L RQGMPKP - - - - - - - - - -- - - - - - - E I T QS P SMDP T VCY VQNG - - - - - - - - - -- - T NP V P P S F AL S S L CHVADDE L E T L DDDQL V L L S- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -- - - - - - - - - - E V V ANKGVYKNEDRK - - - - - - - - - -
- - - - - - - - L CPRCKRGKHWANEC - - - - - - - - - - - -- - - - - - - - L CPRCKRGKHWANEC - - - - - - - - - - - -- - - - - - - - L CPRCKRGRHWANEC - - - - - - - - - - - -- - - - - - - - L CPRCKKGKHWARDC - - - - - - - - - - - -- - - - - - T T KCPRCKKGF HWASEC - - - - - - - - - - - -- - - - - - - - L CPRCKKGYHWKSEC - - - - - - - - - - - -- - - - - - GARCWSCGGAGHL ARDC - - - - - - - - - - - -- - - - - - RE RCQL CDGMGHNAKQC - - - - - - - - - - - -T T GRE P PDL CPRCKKGKHWASQC - - - - - - - - - - - -- - - - - - - QGCWKCGKPGH I MT NC - - - - - - - - - - - -- - - - - - - QGCWKCGKMDHVMAKC - - - - - - - - - - - -- - - - - - - KGCWKCGKEGHQMKDC - - - - - - - - - - - -- - - - - - - NKCL KCGKPGHL AKDC - - - - - - - - - - - -- - - - - - - KGCWNCGAMDHQKAQC - - - - - - - - - - - -- - - - - - - KVCF KCKQPGHF SKQC - - - - - - - - - - - -- - - - - - - KKCNKCGKPGHL AAKC - - - - - - - - - - - -- - - - - - - I I CHHCGKRGHMQKDC - - - - - - - - - - - -- - - - - - - I I CHHCGKRGHMQKDC - - - - - - - - - - - -- - - - - - - I I CHNCGKRGHMQKEC - - - - - - - - - - - -- - - - - - - QKCYHCGKPGHQARNC - - - - - - - - - - - -- - - - - - - - P CP L CKDPSHWKRDC - - - - - - - - - - - -- - - - - - - - P CP L CQEP SHWKRDC - - - - - - - - - - - -- - - - - - - - P CP L CQDP T HWKRDC - - - - - - - - - - - -GRP P L DKDQCAYCKERGHWAREC - - - - - - - - - - - -GRP P L DKDQCAYCKEKGHWAREC - - - - - - - - - - - -RKT P L E KDQCAF CKEKGHWAKDC - - - - - - - - - - - -NRT P L DKDQCAYCKERGHWARNC - - - - - - - - - - - -RRPKVDKDQCAYCKERGHWI KDC - - - - - - - - - - - -RGPHL DRDQCAYCKEKGHWAKEC - - - - - - - - - - - -RRPQL DKDQCAYCKEKGHWAKDC - - - - - - - - - - - -RK I P L GKDQCAYCKEKGHWVRDC - - - - - - - - - - - -GRP P L GKNQCAYCKE EGHWKKNC - - - - - - - - - - - -GCQS L QCNQCAYRKE I GYWKNKC - - - - - - - - - - - -- - - - - - - - KCYKCGRT AP F KREC - - - - - - - - - - - -- - - - - - - - T CF F CKQPGHWKADC - - - - - - - - - - - -- - - - T HE V I CHCCKGGGHHAREC - - - - - - - - - - - -- - - - T RDV L CYRCKGYGHVQRDC - - - - - - - - - - - -- - - - - - - - - CWT CGG - PHL AKSC - - - - - - - - - - - -- - - - - - - - - CF L CGG - PHMVREC - - - - - - - - - - - -- V L GE E T RT CF YCGKT GHL KREC - - - - - - - - - - - -- - - - - - - - E CYHCRERGHL RPNC - - - - - - - - - - - -- E T GVKPGSCF NCGE L GHF ADKC - - - - - - - - - - - -KS KF L KGV F CHGCGKEGHL KNVC - - - - - - - - - - - -- - - - - - - - - CHNCGKQGHL KKDC - - - - - - - - - - - -- - - - KHQF KCYNCGKSGHMARQC - - - - - - - - - - - -- - - - RAP L T CY SCGKPGH I ARDC - - - - - - - - - - - -NDNWKKDKQCF NCNQKGHL AY EC - - - - - - - - - - - -- - - - P KSGKCF KCGKT GY I AKNC - - - - - - - - - - - -- - - - - - - - - CYNCGKKGHYEREC - - - - - - - - - - - -- - - - - - - - L CYRCGSQEHF VAKC - - - - - - - - - - - -- - - - - - - - E CF HCGSP EHMVRNC - - - - - - - - - - - -S KKKGE I P T CF YCNKKGHYAT NC - - - - - - - - - - - -NE KKE F I P T CF T CKKSGHT APNC - - - - - - - - - - - -P KT S KNV I T CYRCGAAGHMQYDC - - - - - - - - - - - -ADCKHKEM I CF NCGEEGH I GSQC - - - - - - - - - - - -- - - - - - - I KCF KCHGYGHYQAQC - - - - - - - - - - - -P P S SRE T RT CYGCGKPGH I ARDC - - - - - - - - - - - -- - - - L RDQL CL YCGNNGHF I QAC - - - - - - - - - - - -- - - - - KL DL CYRCGEPGHRAGAC - - - - - - - - - - - -- - - - - - - - L CF KCGGKWGHAHRC - - - - - - - - - - - -- - - - - - - - L CF KCDAHY F KGHVC - - - - - - - - - - - -- - - - - - - - L CY SCDSKY T KGHKC - - - - - - - - - - - -- - - - - - NV T CYRCGQPGHF SNQC - - - - - - - - - - - -- - - - - - - T KCF SCNQEGH I S S KC - - - - - - - - - - - -- - - L I RT GKCF QCREPGH I S RNC - - - - - - - - - - - -Y QRRRE T GACL RCGNSGHQVADC - - - - - - - - - - - -- - - - - - NEDCF KCGRRGHF SRAC - - - - - - - - - - - -- - - - - - - - V CS SCHE T GHL RRMC - - - - - - - - - - - -- - - - - - - KT CKKCGKQNHF AKCC - - - - - - - - - - - -- - - - - AL DF CWQCGR I GVRT VACC - - - - - - - - - - -- - - - - - - - ACF RCNEMGH I AWNC - - - - - - - - - - - -- - - - - - - S QCPHHPNSNHVAKDC - - - - - - - - - - - -P GYDANVRCDF HSGAPGHHT EKC - - - - - - - - - - - -- - - GGL DQYCDF HKRSGHS T AAC - - - - - - - - - - - -- - - - - - - - Y CD I HRV T RHL T KDC - - - - - - - - - - - -- - - - - - - RL CF YCKKEGHRL NEC - - - - - - - - - - - -- - - - - - R I I CYGCGT P E V YKPNC - - - - - - - - - - - -- - - - - - - - Y CL NCGKSNHS T S T C - - - - - - - - - - - -- - - - - - - - DEQYQVQDGEGNQL E E - - - - - - - - - - -- - - - - - Q I T CCE L CKGDHDT GF C - - - - - - - - - - - -KP SHSKG I QCHGCEGYGH I I AECP T HL KKHRKG - -- - - KRE PGSCYACGQL GHL VAQC - - - - - - - - - - - -RNSKRS E I QCHECQGYGH I KAECP S L KRKDL KCS EKRY VQF KF VCD I CQKS SHT T DMCWKK - - - - - - - - -KRYGNF P - P YQHCKKT NHL QKYCWWRPD - - - - - - -S RS RYKD I ECHYCGKKSH I KKY S - - - - - - - - - - - -S - NKF ANV ECHYCKKKGH I KRF C - - - - - - - - - - - -S KS RKE - I KCF YCGKPGH I KREC - - - - - - - - - - - -S KS RYKNV ECHYCHKT GH I KKNY - - - - - - - - - - - -NRSKSRVRNCYNCNQPGHF KRDC - - - - - - - - - - - -S KGAKPDDVCNYCKEKGHWKF DC - - - - - - - - - - - -L E KHS F KGKCY I CEKF GHRAREC - - - - - - - - - - - -S AKKRKDVVCYNCGERRHF KANC - - - - - - - - - - - -KHSKT T E I RCF KCNK I GHRVADC - - - - - - - - - - - -- QDAHS F L SCF KCGKT GHKA I QC - - - - - - - - - - - -KGNSKYKVKCHHCGREGH I KKDC - - - - - - - - - - - -RGGT DNRP T CQVCHKKGHVVADC - - - - - - - - - - - -T GGQDRRP T CQVCF KRGHT AADC - - - - - - - - - - - -V GGSEGRVKCQL CF KRGHT VMEC - - - - - - - - - - - -- GN I KKV L T CY T CGT I GHKS S ECQAKT KPK - - - - -E GL AY SD I T CYNCQQKGHKMRT CPAL NCKR - - - - -AF KKQYKGRCSNCGEYGHKSGDCSERDKP S SGT GNAF VKQF KGRCNKCGT YGHKAAT CP S VKNDGGDPGTL DKYGKRSRCA I CQS T F HWADC - - - - - - - - - - - - -L DRYGRRSKCA I CQS T F HWAKDC - - - - - - - - - - - -L DKL GRV SRCV I CDSKL HWAAKC - - - - - - - - - - - -L GT DGQV T RC I I CDSKF HWARGC - - - - - - - - - - - -NGKNKS E V T CF RCKKKGHKSY ECP - - - - - - - - - - -- - - - - KK I T CHRCRKPGHMKRDC - - - - - - - - - - - -- GKNKKD I QCF NCGKKGHYKSDCR - - - - - - - - - - -- L GGRNPGNCKNCGKRGHWAKDC - - - - - - - - - - - -- KS KPK I I T CYKCKQT GHYRNQC - - - - - - - - - - - -- - - - - - - - V CF YCKKPGHK I ADC - - - - - - - - - - - -- - - - - - N I KCHHCGKEGH I KRDC - - - - - - - - - - - -GGF S V P PDT CL YCKKT GHYKRKC - - - - - - - - - - - -- - - - - - - GGCF VCGSDQHWAREC - - - - - - - - - - - -- - - - - - KKECAKCHL L GHT AE EC - - - - - - - - - - - -- - P NKGRP I CS F YNRVGH I AERC - - - - - - - - - - - -RRNRRRS EGCF T CGERGHF AADCPNKRT F DQGNNS- - - - - - - - KCGYCGF T GHSENEC - - - - - - - - - - - -- - - - - - - - S CMYCKS V F HCS I NCK - - - - - - - - - - -- - - - - - L L T CDHCKKKGHT KDKCWL L HPHL KPAKF
Conservation
CxxCxxxxHxxxxC
Retroviridae
Ty3/Gypsy
Ty1/Copia
a
Supplementary Figure 5: GAG RNA binding domain conservation and EVD poly-
protein product production in EVD constructs.
(e) Alignment of GAG amino acid sequences from representative members of Retrovir-
idae and Pseudoviridae (Ty3/Gypsy and Ty1/Copia). The bottom sequence (black
arrow) corresponds to the EVD-encoded GAG protein. Coloured columns in the align-
ment indicate residues with conserved physico-chemical properties. The black box
displays the consensus sequence logo of highly conserved residues corresponding to
the RNA-binding motif (Cx2Cx
4Hx
4C). (c) Scheme of p35S:HA-EVD-FLAG and
p35S:HA-EVDǻ5%'-FLAG constructs, see methods for details. (d) Accumulation of the
EVD polyprotein N-terminal and C-terminal products from p35S:HA-EVD-FLAG and
p35S:HA-EVDǻ5%'-FLAG was assessed by Western analysis of total proteins extracted
from N. benthamiana-infiltrated leaves at 4 dpi. Equal loading was verified by Coomas-
sie staining of the membranes after blotting.
Supplementary Figure 5
Nature Genetics: doi:10.1038/ng.2703
5’LTR 3’LTR GAG PR IN RT-RNase
a
5630000 5631000 632000 5633000 634000 5635000
100
010
0
Coordinates
Abun
danc
e of
siR
NAs
(rpm
) s
as
epi15 F14
21-22 nt siRNAs
0
5
10
15
20
25
35
30
epi454 F8 epi439 F8
b
EVD
cop
y nu
mbe
r
Supplementary Figure 6
Supplementary Figure 6: Mapping of 21-22-nt siRNAs to EVD in epi15 F14 and
EVD copy number in epi454 and 438 F8.
(a) Deep sequencing analysis of 21-22-nt siRNAs mapping to EVD in epi15 F14. y
axis: number of normalized small RNA reads per million sequences (rpm); x axis:
Arabidopsis chromosome 5 coordinates corresponding to the EVD locus found in
Col-0 (TAIR 9). s, sense; as, antisense. (b) EVD copy number in epi454 and 439 F8
calculated as the total EVD insertions inferred from the number of heterozygous and
homozygous insertions obtained by sequencing (Gilly et al, submitted) including the
original EVD and ATR loci.
Nature Genetics: doi:10.1038/ng.2703
EVD (AT5G17125���·LTR
WT
epi1
5 F8
ep
i15
F11
epi1
5 F1
4
a b
c
d
Supplementary Figure 7
ATR
EVD
EVD insertions in epi439 F8
20
30
40
50
10
0
EVD
cop
y nu
mbe
r
ND
epi439 F9 1 2 3
epi439 F15 epi439 F17
40%
60%
80%
100%
20%
0%
epi439 F9 epi439 F15 epi439 F17
% D
NA
met
hyla
tion
ND
Heterozygous
Ch.I
Ch.II
Ch.III
Ch.IV
Ch.V
1 2 3 1 2 3 4
1 2 3 1 2 3 1 2 3 4
Supplementary Figure 7: Bisulfite sequencing dot-plots of EVD LTR in epi15
generations and EVD copy number and DNA methylation in epi439 generations.
(a) Dot-plots depiction of bisulfite sequencing data obtained for individual clones used
to generate the results shown in Figure 4a and Supplementary Table 2. Filled circles
indicate methylated cytosine and empty circles unmethylated cytosine. (b) Scheme of
Arabidopsis chromosomes showing EVD insertions found in the ddm1-derived epi439
line at F8. The original EVD and ATR loci are marked with blue triangles, red triangles
indicate new transposition events. The three new insertions were selected for further
analysis. (c) EVD copy number measured by qPCR in individual plants of epi439 F17.
EVD copy number was not determined (ND) in the F9 and F15 generations. Error bars
represent standard deviation from two independent experiments involving duplicate
PCRs each. (d) McrBC-qPCR-based methylation analysis at the LTRs of EVD new
insertions shown in (b) in the epi439 F9, -F15 and -F17 individual plants used in (c).
Error bars represent standard deviation from duplicate PCRs in c-d.
CHH CHG CG
Nature Genetics: doi:10.1038/ng.2703
0
10
20
30
40
50
05
101520253035404550
WT WT x epi15 F14 epi15 F14
EVD
cop
y nu
mbe
r
EVD
cop
y nu
mbe
r
@LTR
#�·*$*
@U6
-21nt -24nt
-21nt -24nt
}F1�
F2
a(1)
(2)
(3)
b
21-to
-24-
nt C
orre
latio
n
Supplementary Figure 8
Supplementary Figure 8: Correlation of LTR 24-nt siRNAs abundance with EVD copy number and
21-24-nt siRNAs mapping correlation on EVD.
(a) To explain the PTGS-to-TGS shift in EVD silencing, we considered the possible emergence of discrete
EVD loci with peculiar features allowing production of 24-nt LTR siRNAs with potent trans-TGS activities.
This hypothesis had two predictable and interrelated implications: (i) such loci would produce dispropor-
tionate amounts of 24nt siRNAs independently of EVD copy numbers and, (ii) these loci would be easily
segregated away from an F14. We therefore tested these ideas here. (1) EVD copy number measured by
qPCR in WT, epi15 F14 and the F1 progeny of the F14 X WT cross. (2) Same as in (1) but in distinct
plants of the F2 progeny, grouped according to their respective copy numbers. Error bars represent
standard deviation from two independent experiments involving triplicate PCRs each. (3) Northern analy-
sis of EVD LTR 24-nt siRNA levels in F2 progeny plants containing similar numbers of EVD insertions as
determined in (2). U6 detection is shown as loading control. The results clearly show that (i) all F2s, with
no exception, produce 24nt siRNAs and (ii) 24nt-LTR siRNA levels strictly correlate with the copy number
of segregating EVD copies, making the emergence of the hypothesized, specific EVD loci highly unlikely.
(b) Pearson correlation coefficient (Y-axis) between the 5’-end of all 24mers and all 21-mers mapping the
2,000 first nucleotides of EVD with the computation of virtual a lag from -10 to +10 nucleotides (X-axis)
between the first nucleotide of all 21-mers and all 24-mers. This allows to assess if DCLs process the
24-mers and the 21-mers at the same position of a shared dsRNA substrate.
0.00
0.05
0.10
0.15
0.20
0.25
0.30
-10 -9 -8 -7 -6 -5 -4 -3 -2 -1 0 1 2 3 4 5 6 7 8 9 10
Lag (nt)
Nature Genetics: doi:10.1038/ng.2703
WT
epi1
5 F8
ep
i15
F11
epi1
5 F1
4 ep
i15
F15
RPP4 (AT4G16860) SOLO LTRSupplementary Figure 9: Bisulfite sequencing
dot-plots of RPP4 Solo-LTR in epi15 generations.
Dot-plots of bisulfite sequencing from individual clones for
results shown in Figure 6b and Supplementary Table 4.
Filled circles indicate methylated cytosine and empty
circles unmethylated cytosine.
CHH CHG CG
Supplementary Figure 9
Nature Genetics: doi:10.1038/ng.2703
M
M
M
p p
M
M
M
}
M
�·LTR �·LTR GAG PR IN RT-RNase
GAG PR IN RT
RDR6
DCL4/2
AGO1/2
�·LTR �·LTR GAG PR IN RT-RNase
GAG PR IN RT
RDR6
DCL4/2
�·LTR �·LTR GAG PR IN RT-RNase
DCL3
AGO4
RDR2DCL3
AGO4
PolV/IV
AGO1/2
PolV/IV
?
FM
FM
ov ov
VLP VLP
ORF 21-22-ntsiRNAs
ORF 21-22-ntsiRNAs
ORF 24-ntsiRNAs
LTR 24-ntsiRNAs
Reve
rse
Tran
scrip
tion
/ Int
egra
tion
Reve
rse
Tran
scrip
tion
/ Int
egra
tion
DNA methylationDNA methylation
�·LTR �·LTR GAG PR IN RT-RNase
DNA methylation
SOLO LTR
DNA methylation
cis-
trans-� �
i) ii) iii)
(1)
(2) (3)
(4)
(5) (6)
(7)(8)
(9)
(10)
(10)
Supplementary Figure 10a
b
Nature Genetics: doi:10.1038/ng.2703
Supplentary Figure 10: EVD life cycle in Arabidopsis.
(a) EVD expression pattern ensures host integrity through developnet and transmis-
sion of new insertions to the next generation. Once reactivated, adaxial L2-specific
EVD expression (in red) restricts potential transposition to somatic tissue. Meristem
(harbouring the stem cells from which aerial organs derive; M) exclussion will then
allow the plant to develop without the accumulation of EVD new insertions. However,
L2 expression and transposition in gynoecia (flower female reproductive organ),
particularly at the placental regions (P) which give rise to the ovules (ov), from which a
L2 cell differentiates into the haploid female megaspore (FM) , will allow transmission
of EVD new insertions (see Supplementary Fig.2 for more details). Therefore, after
fertilization, and embryo development, each seed will display different number and
pattern of EVD insertions. (b) Rise and demise of EVD. In early proliferation stages
(A), a low EVD copy number (light blue; i) accounts for moderate transcription and
dsRNA synthesis by RDR6 (1). Processing of dsRNA by DCL4 and DCL2 produces
3’GAG 21-22-nt siRNAs (2) loaded into AGO1/2 (3) that can only partly degrade the
EVD RNA (4) due to its protection by GAG as part of putative viral-like particles (VLPs;
5). Because single seeds acquire distinct patterns and numbers of EVD insertion over
generations, an increasing amount of progeny plants will attain the ֤40 copies thresh-
old (dark blue; ii) beyond which saturating dsRNA levels allow processing, by DCL3, of
3’GAG 24-nt siRNAs loaded into AGO4 (6), these induce de novo methylation of the
3’GAG DNA (7), which, in turn, initiates antisense transcription, possibly by PolIV/PolV,
and coordinated spreading of methylation towards the LTR (8). Eventual epigenetic
silencing of EVD is achieved over a few subsequent generations (red, iii) resulting in
TGS maintained at the LTR by RdDM (9). Ensuing 24-nt LTR siRNAs display potent
cis- and trans-silencing properties (10).
Nature Genetics: doi:10.1038/ng.2703
Total % methylated # Clones # Exp
Wilson Score Interval (95% confidence level)
CG CHG CHH CG CHG CHH CG int- CHG int- CHH int- CG int+ CHG int+ CHH int+
EVD
WT 85 102 1274 84,70% 7,84% 10,12% 17 2 75,57% 4,03% 8,58% 90,83% 14,71% 11,90% met1-3 80 96 1199 6,25% 3,12% 3,00% 14 2 2,70% 1,07% 2,17% 13,81% 8,78% 4,13%
epi15 110 132 1647 3,66% 1,51% 1,94% 22 2 1,44% 0,41% 1,38% 9,01% 5,35% 2,73%
ATR
WT 80 96 1197 88,75% 18,75% 22,05% 16 2 79,98% 12,20% 19,79% 93,97% 27,70% 24,49% met1-3 103 125 1560 3,88% 4,00% 6,28% 21 2 1,52% 1,72% 5,18% 9,56% 9,02% 7,59%
epi15 89 106 1345 85,39% 17,92% 21,04% 18 2 76,59% 11,78% 18,95% 91,26% 26,30% 23,30%
Supplementary Table 1
Supplementary Table 1: Figure 1 and Supplementary Figure 1 bisulfite sequencing raw
data.
Raw data from the bisulfite sequencing analysis in Figure 1b and Supplentary Figure 1. The
number of CG, CHG and CHH sites as well as the number of sequenced clones are indicated.
Wilson score intervals are shown for a confidence level of 0.95.
Nature Genetics: doi:10.1038/ng.2703
Total % methylated # Clones # Exp
Wilson Score Interval (95% confidence level )
CG CHG CHH CG CHG CHH CG int - CHG int - CHH int - CG int + CHG int + CHH int +
WT 90 108 1350 82,22% 10,28% 3,92% 18 2 73,05% 5,85% 3,01% 88,75% 17,44% 5,09% epi15 F8 75 90 1125 2,66% 1,12% 2,76% 15 2 0,73% 0,20% 1,95% 9,20% 6,04% 3,89%
epi15 F11 75 90 1125 8,00% 3,33% 3,82% 15 2 3,72% 1,14% 2,85% 16,37% 9,34% 5,11% epi15 F14 90 108 1350 93,33% 21,29% 13,85% 18 2 86,21% 14,63% 12,11% 96,91% 29,93% 15,80%
Supplementary Table 2
Supplementary Table 2: Figure 4 and Supplementary Figure 7 bisulfite sequencing raw
data.
Raw data from the bisulfite sequencing analysis in Figure 4a and Supplementary Figure 7. The
number of CG, CHG and CHH sites as well as the number of sequenced clones are indicated.
Wilson score intervals are shown for a confidence level of 0.95.
Nature Genetics: doi:10.1038/ng.2703
CG CHG CHH CG CHG CHH CG int- CHG int- CHH int- CG int+ CHG int+ CHH int+WT 176 80 581 94,88% 32,50% 8,60% 16 2 90,56% 23,24% 6,58% 97,28% 43,36% 11,16%epi15 F8 182 89 660 4,39% 3,37% 2,57% 18 2 2,24% 1,15% 1,61% 8,43% 9,45% 4,08%epi15 F11 194 90 663 26,28% 4,44% 2,26% 18 2 20,59% 1,74% 1,37% 32,89% 10,87% 3,70%epi15 F14 252 115 850 60,31% 33,91% 6,58% 23 2 54,16% 25,90% 5,10% 66,15% 42,96% 8,45%WT 117 104 585 77,77% 2,88% 0,68% 13 2 69,42% 0,98% 0,26% 84,35% 8,13% 1,74%epi15 F8 162 144 810 5,55% 0,69% 0,86% 18 2 2,95% 0,12% 0,42% 10,21% 3,82% 1,77%epi15 F11 198 176 990 30,80% 11,36% 5,65% 22 2 24,79% 7,47% 4,38% 37,54% 16,90% 7,27%epi15 F14 180 159 899 77,33% 8,80% 3,67% 20 2 70,68% 5,31% 2,62% 82,84% 14,23% 5,11%
Total % methylated Wilson Score Interval# Clones # Exp
5'GAG
3'GAG
Supplementary Table 3
Supplementary Table 3: Figure 5 bisulfite sequencing raw data.
Raw data from the bisulfite sequencing analysis in Figure 5d-g. The number of CG, CHG and
CHH sites as well as the number of sequenced clones are indicated. Wilson score intervals are
shown for a confidence level of 0.95.
Nature Genetics: doi:10.1038/ng.2703
CG CHG CHH CG CHG CHH CG int- CHG int- CHH int- CG int+ CHG int+ CHH int+WT 189 112 1451 0,00% 0,89% 1,72% 19 2 0,00% 0,16% 1,17% 1,99% 4,88% 2,53%epi15 F8 180 108 1385 2,22% 0,00% 1,01% 18 2 0,87% 0,00% 0,60% 5,57% 3,43% 1,69%epi15 F11 190 113 1460 1,05% 0,00% 1,43% 19 2 0,29% 0,00% 0,94% 3,75% 3,29% 2,18%epi15 F14 189 114 1460 13,75% 10,52% 17,39% 19 2 9,56% 6,12% 15,53% 19,39% 17,49% 19,42%epi15 F15 200 119 1533 51,11% 26,05% 25,11% 20 2 44,23% 19,00% 23,00% 57,95% 34,60% 27,34%
Total % methylated Wilson Score Interval (95% confidence level)# Clones # Exp
Supplementary Table 4
Supplementary Table 4: Figure 6 and Supplementary Figure 9 bisulfite sequencing raw
data.
Raw data from the bisulfite sequencing analysis in Figure 1b and Supplementary Figure 9. The
number of CG, CHG and CHH sites as well as the number of sequenced clones are indicated.
Wilson score intervals are shown for a confidence level of 0.95.
Nature Genetics: doi:10.1038/ng.2703
Supplementary Table 5: List of primers.(Non%converted,bisulfite,primers,sequence,is,indicated,in,braquets)
EXPERIMENT PRIMER SEQUENCE (5'-3')EVD-RT F GATCTTGGTGAACTCAAATACTTEVD-RT R TACTCCCCCTCAAGCGCAAGACEVD-qRT F GATAGAGGAGATAGAAGATCTACAACTGGEVD-qRT R CTCTATACTCCGATTCTGCACTCGAACAEVD-P F TTTCATGGTATCAGAGCATAAGATEVD-P R CACCCGAGTCTATCACAATTGAAC
EVD-LTR F TTGATCAAGACTCAAATAAGAAAGGCCEVD-LTR R TATGCTCTGATACCATGAAAGAATATEVD-P F TTTCATGGTATCAGAGCATAAGATEVD-5G R TTTTTAAGTGTATCCCACAACTCTTEVD-3G F TAAGTCAAGAAGACTTAGAGTTTAEVD-3G R ACTTTGCTCCTCATGAGTTTCTTEVD-PI F GGTGGTTCAAAAGGAGAACTCTATGTTCTEVD-PI R GAAACATCATTGACCGCGCCACCTCCATEVD-IR F GAAGGATTGGGAGAACCTAAAAGATCTTACEVD-IR R CTTATACTTGATTGTGAATACCCACCTTGAEVD-RR F CAAAGACGGTATAGACTCTACCAAGACEVD-RR R CTCTATACTCCGATTCTGCACTCGAACAEVD-RN F GATAGAGGAGATAGAAGATCTACAACTGGEVD-RN R TACTCCCCCTCAAGCGCAAGACmGFP-F AGTAAAGGAGAAGAACTTTTCACTmGFP-R TTCCGTCCTCCTTGAAATCGAEVD ORF attB1 GGGGACAAGTTTGTACAAAAAAGCAGGCTTCATGGAGACTTCTCAAAAGATGEVD ORF attB2 GGGGACCACTTTGTACAAGAAAGCTGGGTTTCAAGAGTGAGATAGATCCEVD dRBD F AGGATAGAAAACTGTTGACTTGGTTGCTACATCCACACCTEVD dRBD R AGGTGTGGATGTAGCAACCAAGTCAACAGTTTTCTATCCTHA-EVD F ATGTACCCATACGATGTTCCAGATTACGCTGGTGGAGGCGAGACTTCTCAAAAGATGAEVD-FLAG R TCACTTGTCATCGTCATCCTTGTAATCGCCACCTCCAGAGTGAGATAGATCCACAAHA-EVD attB1 GGGGACAAGTTTGTACAAAAAAGCAGGCTTCATGTACCCATACGATGTTCCEVD-FLAG attB2 GGGGACCACTTTGTACAAGAAAGCTGGGTTTCACTTGTCATCGTCATCCTTGT454 EVD-A F CTACTGGCGTGGATGCTTTT454 EVD-A R GTAAACAGAGTTCAGATTTAAACCAA454 EVD-B F TTCTTCCAAAATTTGATTACTTGC454 EVD-B R TCCGAAAATGGCATAAGAGG454 EVD-C F TTGGAATTCGATTCAACTAGAGG454 EVD-C R CTCAAATTTCATTGGAGACACATC454 EVD-D F TGTGCGTAGCTTTTCCTTGA454 EVD-D R ATTGAGGAACCAGCATCACC454 EVD-E F GCCTTAATGAATGGCCCAATA454 EVD-E R ACTCCACGGGACTACCTCAC454 EVD-F F GACCCCAACCAATCAATCAC454 EVD-F R GACGTTATGGTGGAATATGTGC454 EVD-G F AAAATGTCACTTATCATTGCAAGC454 EVD-G R TTTTAGTGGTGCGGGCTTTA454 EVD-H F CCAACGTGGGTTAGATCCAT454 EVD-H R ACGGCTATTGGAGTTTTTGA454 EVD-I F CGGTGTTTGTAGTGGGAACC454 EVD-I R GCAAAAACATTCGTACATTTCG454 EVD-J F TCTCTTGTCTTGTGCGACTCA454 EVD-J R CACACCATGGATAATGCTCTT439 EVD-ChI F ATCAACAAATAGATATCTTTGCATTTT439 EVD-ChI R TTCTTTTCGGATTCTGTGTGG439 EVD-ChIII F ATGGTTCGATTGTGGAGGAA
EVD-LTR BS F TTTATAARCATAAAAACATAATCTTATRCTCTAATACCATA (TTTATAAGCATAAAGACATGATCTTATGCTCTGATACCATG)
EVD-LTR BS R GGAAAAAAAYGAAGGAATTTTAGAATAYGATTTGGTAATTT (GGAAAAAAACGAAGGAACCCTAGAACACGATTTGGCAATTC)
ATR-LTR BS R TTTTAATGAGGTGATGAATTATGTGTTTTAAGTTTAGATTT (TTTTAATGAGGTGATGAATCATGTGTTCCAAGCTTAGACTC)
RT-RN probe
EVD RT-PCR/RFLP
EVD copy# qPCR/qRT-PCR
EVD ISH/Southern/ HMW-Northern probe
LTR Bisulfite PCR
LTR probe
5'GAG probe
3'GAG probe
PR-IN probe
IN-RT probe
RN probe
mGFP probe
EVD WT GAG-POL ORF CloningEVD dRBD
Site-directed Mut.
EVDHA-GAG-POL-FLAG
Cloning
epi454/439 EVDinsertionszygosity
Nature Genetics: doi:10.1038/ng.2703
439 EVD-ChIII R CAACATTCTAGGGTTTTACCTTGC439 EVD-ChV F TCTCCCACCCAAAATCTCAG439 EVD-ChV R GCGTGTTGTTTTGCAGTGATEVD-LB R ACTGGTCATTGGTTGACCTTGEVD-RB F CCAAGACCAAGGAACCTCTTC454 EVD-A qF ATGAAAATATTTGGCGTGTTCAT454 EVD-A qR TCGAATCTGTCTTCTAGATTATGTGTG454 EVD-B qF GCTCAGAGGCAGTTGTGAAAG454 EVD-B qR ACTTTTCAAGCAACGTTGTGC454 EVD-C qF CATCAGCTTGATGATCTACAGAGAG454 EVD-D qF TTGTCCCGAGGATAGTCTTC454 EVD-D qR ACCAGAGTATATAGAACATGATATTGAAGTG454 EVD-E qF CGTTTTAAGACGAAGGGTGTG454 EVD-E qR AGGACTGACGTCACACCCTA454 EVD-F qF CCTTTCGAGTCGAGATCCAAT454 EVD-F qR TCATATTCACATATGTTGGTGTTGG454 EVD-G qF CTGTACATTTCAATATTTTATCCATCTTG454 EVD-G qR AAATCACAACATATGTAAATGTATCAAGT454 EVD-H qF GAAGACTTTATGCCGTTCATCA454 EVD-H qR TTACATTCTTAGCGTTTCCTGGT454 EVD-I qF TTCAGTTTCATTAATAAGAACCAGGTTAG454 EVD-J qF AAAACCTGGTACCTTCTCTGTGA454 EVD-J qR TCAAAAACATTCAGAAGGTCAGAG439 EVD-ChI qF AAAGCAGTCGAGAATAAGAACCA439 EVD-ChI qR AGATCTTCTGTTTTGATGCAAGTC439 EVD-ChIII qF TGAGGAGTTGAATTCTGTGATGA439 EVD-ChIII qR CTTGAGCCCCAAGGAAGTC439 EVD-ChV qF CCAGTTTGACGAAATTAAAACAA439 EVD-ChV qR CGTAACCAAACATAAGGAGAACC
RPP4-qRT F ACATGCGAGGACAAACCAGAGGRPP4-qRT R TTCAATTCCGACGAAGTCACC
AT1G68420 qF GTCTCACACTTCAAGGCATCTTGCAT1G68420 qR TTCATAGCTTCCTCTACACCGGCAAT5G25550 qF GTTACAAGCGTGGAAACGTGCCATAT5G25550 qR TGTAGTCGCACACACGTGATCCAAACT2 qF GCACCCTGTTCTTCTTACCGACT2 qR AACCCTCGTAGATTGGCACAAT5G13440 qF ACAAGCCAATTTTTGCTGAGC AT5G13440 qR ACAACAGTCCGAGTGTCATGGTAT2G36060 qF TGAAGTCGTGAGACAGCGTTGAT2G36060 qR GGGCTTCTCCATTGTTGGTCAT4G29130 qF GGCGTTTTCTGATAGCGAAAAAT4G29130 qR ATGGATCAGGCATTGGAGCT
EVD-5GAG BS F ATTTTAAGGGGGAAATTATTTAATTTGGTTAAGAATAAT(ACTTCAAGGGGGAAACTACCTAACTTGGTCAAGAACAAC)
EVD-5GAG BS F ACCTAAACTTACCAAAATATTTAATAAACTCTAAATCTTCTTA(ACCTGAACTTACCAAAGTGTTTAGTAAACTCTAAGTCTTCTTG)
RPP4 qRT-PCR
5'GAGBisulfite PCR
3'GAGBisulfite PCR
EVD-3GAG BS F GTGTTTGAAGTGGAAGAAGGYGATTAAYGAATTAA (GTGTTTGAAGTGAAGAAGGCGATTAACGAACTAA)
EVD-3GAG BS F ATAACCCRACTTAACTTTRCTCCTCATAAATTTCTTAAA(ATGACCCGGCTTGACTTTGCTCCTCATGAGTTTCTTGAG)
AT1G68420qRT-PCR
RPP4-SLTR BS F GGGAGGYGTTTAGGTAATTTATAYGTATATTATATATTGATT (GGGAAGGCGTCCAGGTAATTCATACGTATATTATATATTGATC)
RPP4-SLTR BS R TTCCTCTACAAAAATATTTTAAAAACACAAATTTTATAATTATCAA(TTCCTCTACAAAGATATTTTGAAAACACAAATTTTGTGGTTATCAG)
AT5G25550qRT-PCR
qRT-PCRControls
epi454/439 EVDinsertionszygosity
epi454/439 EVDinsertion
specific McRB-qPCRmethylation analysis
RPP4 SOLO LTRBisulfite PCR
Nature Genetics: doi:10.1038/ng.2703