research-repository.uwa.edu.au · web viewgenome wide identification and comparative analysis of...

61
Genome wide identification and comparative analysis of NBS-LRR resistance genes in Brassica napus Salman Alamery A B , Soodeh Tirnaz D , Philipp Bayer A D , Reece Tollenaere A , Boulos Chaloub C , David Edwards A D and Jacqueline Batley A D E A School of Agriculture and Food Sciences, University of Queensland, St Lucia, Queensland, 4072, Australia B Department of Biochemistry, College of Science, King Saud University, Riyadh, Saudi Arabia C URGV (Institut National de la Recherche Agronomique, Université Evry Val d'Essonne), Evry, France D School of Biological Sciences, University of Western Australia, Crawley, Western Australia, 6009, Australia E Corresponding author. Email: [email protected] 1 1 2 3 4 5 6 7 8 9 10 11 12 13 14

Upload: others

Post on 25-Dec-2019

5 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: research-repository.uwa.edu.au · Web viewGenome wide identification and comparative analysis of NBS-LRR resistance genes in Brassica napus Salman AlameryA B, Soodeh TirnazD, Philipp

Genome wide identification and comparative analysis of NBS-LRR

resistance genes in Brassica napus

Salman AlameryA B, Soodeh TirnazD, Philipp BayerA D, Reece TollenaereA, Boulos ChaloubC,

David EdwardsA D and Jacqueline Batley A D E

A School of Agriculture and Food Sciences, University of Queensland, St Lucia, Queensland, 4072, Australia

B Department of Biochemistry, College of Science, King Saud University, Riyadh, Saudi Arabia

C URGV (Institut National de la Recherche Agronomique, Université Evry Val d'Essonne), Evry, France

D School of Biological Sciences, University of Western Australia, Crawley, Western Australia, 6009, Australia

E Corresponding author. Email: [email protected]

1

1

2

3

4

5

6

7

8

9

Page 2: research-repository.uwa.edu.au · Web viewGenome wide identification and comparative analysis of NBS-LRR resistance genes in Brassica napus Salman AlameryA B, Soodeh TirnazD, Philipp

Abstract

Plant disease resistance genes play a critical role in providing resistance against pathogens.

The largest family of resistance genes are the nucleotide binding site (NBS) and leucine rich

repeat (LRR) genes (NBS-LRRs). They are classified into two major subfamilies,

toll/interleukin-1 receptor-NBS-LRR (TNL) proteins and coiled-coil CC-NBS-LRR (CNL)

proteins. Here, weWe have identified and characterized the complete set of 641, 249 and 443

NBS-LRR genes in Brassica napus, B. rapa and B. oleracea, respectively. A ratio of 1:2 of

CNL: TNL genes was found in the three species. DFurther domain structure analysis revealed

that 57% of the NBS- LRR genes are typical resistance genes and complete withcontain all

three domainsdomains (TIR/CC, NBS, LRR), whereas the remaining genes are partially

deleted or truncated. Fifty nine percent of the NBS-LRR genes were found to be physically

clustered, and individual genes involved in clusters were more polymorphic than those not

clustered. Fifty percent of the NBS-LRR genes in B. napus were identified as duplicates,

reflecting a high level of genomic duplication and rearrangement. The comparative analysis

between B. napus and its progenitor species indicated that more than 60% of NBS-LRR

genes arewere conserved in B. napus. This study provides a valuable resource for the

identification and characterization of candidate NBS-LRR genes.

Additional keywords:

Brassica, comparative genomics, disease resistance, gene cluster, gene duplication

Running head: Identification of resistance genes in Brassica napus

Conflict of interest: The authors declare no conflicts of interest

2

1

2

3

4

5

6

7

8

9

10

11

12

13

14

15

16

17

18

19

20

21

22

Page 3: research-repository.uwa.edu.au · Web viewGenome wide identification and comparative analysis of NBS-LRR resistance genes in Brassica napus Salman AlameryA B, Soodeh TirnazD, Philipp

Introduction

Plants have developed Resistance (R) genes as a crucial component in the immune system,

which plays an effective role in the resistance towards a wide spectrum of pathogens, such as

fungi, bacteria, viruses, oomycetes and nematodes (Wan et al. 2012; Dangl et al. 2013). The

understanding of the molecular structure and function of disease resistance genes (R genes)

has been crucial for plant resistance research.(Wan et al. 2012). Till now, over 140 R genes

have been cloned in different plant species (Zhang et al. 2016). According to Spoel and Dong

(2012), in the plant immune system, R genes encode proteins monitoring the perturbations of

self- molecules trigged by pathogen effectors (Spoel and Dong 2012). In this way, a few

hundred R genes could be sufficient to interact with a larger range of pathogen-encoded

effectors. These R genes are diverse in terms of their structure, function and evolution,

however, they can be grouped into five different classes, based on structural similarities of

their predicted protein products (Staskawicz et al. 1995; Liu et al. 2007). The largest class

includes proteins with putative nucleotide binding site (NBS) and leucine-rich repeats (LRR).

The NBS-LRR resistance genes appear to code for intracellular receptors that are composed

of a variable N terminal domain followed by the central NBS domain and C-terminal LRR

domains. The LRRs may be the main determinant in recognition specificity of the avirulence

gene product and as components of a signal transduction pathway (Ellis et al. 2000).

Sequence analyses revealed that NBS domains share a high degree of sequence identity and

have a number of conserved motifs. This characteristic specialty can be used to identify NBS-

LRR genes (Meyers et al. 1999; Wan et al. 2012).

Based on the presence of a toll/interleukin-1 receptor (TIR) domain at the N-terminusal,

NBS-LRR genes can be divided into TIR-NBS-LRR (TNL) and non- TIR-NBS-LRR (nTNL)

(Cannon et al. 2002). As the proteins encoded by nTNL genes always contain a coiled-coil

3

1

2

3

4

5

6

7

8

9

10

11

12

13

14

15

16

17

18

19

20

21

22

23

24

Page 4: research-repository.uwa.edu.au · Web viewGenome wide identification and comparative analysis of NBS-LRR resistance genes in Brassica napus Salman AlameryA B, Soodeh TirnazD, Philipp

domain at the N-terminalterminus, nTNL genes are also named referred to as CC-NBS-LRR

(CNL) (Meyers et al., 2003). These two major subfamilies of plant NBS-LRR proteins are

involved in defence against pathogens in different signalling pathways. They are distinct in

their NBS conserved sequence motifs and are therefore found in separate clusters in

phylogenetic analyses (McHale et al. 2006). In addition, a small group of nTNL genes with

RPW8 (RESISTANCE TO POWDERY MILDEW8) domains at the N-terminal terminus

have been identified (Xiao et al. 2004; Collier et al. 2011). Recently the RPW8-NBS-LRR

(RNL) genes were revealed as sister to all CNL genes, (Shao et al. 2014). According to

Collier et al. (2011) suggested that, the RNL genes do not specifically respond to different

pathogensdidn’t show specific response to different pathogens. They work by assisting TNL

and CNL genes, therefore (Bonardi et al. 2011) described them as ‘Helper NBS-LRRs’. With

this specific function, RNL genes are not directly involved in the recognition of rapidly

changing pathogensdo not evolve rapidly to respond to the quick change of pathogens, while

TNL genes and CNL genes’ function depends on the evolve rapid pathogen recognition. ly in

a pathogen-specific way (Zhang et al. 2016).

Plant genome sequencing has facilitated genome wide analysis of NBS-LRR genes in many

species including rice (Monosi et al. 2004), Medicago (Ameline-Torregrosa et al. 2008),

sorghum (Paterson et al. 2009), Arabidopsis (Meyers et al. 2003; Tan et al. 2007), papaya

(Porter et al. 2009), melon (Garcia-Mas et al. 2012), peach (Verde et al. 2013) and soybean

(Zhou et al. 2016). These studies showed that the NBS-LRR class of genes is abundant and

widely distributed throughout the genome. Moreover, it has been demonstrated that the

majority of NSB-LRR genes are present in gene clusters (Hulbert et al. 2001). The clustered

arrangement of these genes may be a critical attribute for the generation of novel resistance

4

1

2

3

4

5

6

7

8

9

10

11

12

13

14

15

16

17

18

19

20

21

22

23

24

Page 5: research-repository.uwa.edu.au · Web viewGenome wide identification and comparative analysis of NBS-LRR resistance genes in Brassica napus Salman AlameryA B, Soodeh TirnazD, Philipp

specificities (Meyers et al. 2003). With the possibility that NBS genes expanded prior to

familial divergence, the comparative analysis of NBS-LRR genes may help with

understanding the evolution and pathogen resistance mechanism of different species, or the

mining of novel R genes (Shao et al. 2016b; Zhang et al. 2016).

Brassica is one of the most important genera in the Brassicaceae family due to its agricultural

and commercially importantceeconomic importance. B. napus is the most important

commercial Brassica species and is grown primarily for its seed which yields approximately

40% oil. B. napus originated from interspecific hybridisation between the diploids B. rapa (A

genome) and B. oleracea (C genome) resulting in an amphidiploid genome, containing the A

and C sub-genomes A and C genome (AACC, n=19). Sequencing of Brassica genomes over

the last decade has resulted in reference genome sequences for B. napus (Chalhoub et al.

2014), B. rapa (Wang et al. 2011) and B. oleracea (Liu et al. 2014; Parkin et al. 2014)

permitting a comprehensive study of R genes in these Brassica species.

In a previous study of R genes in the incomplete B. rapa genome, a lower number of NBS-

LRR genes were predicted than found in other sequenced crops (Mun et al. 2009). However,

these authorsey estimated the number of NBS-LRR genes in the B. rapa genome should be

higher than in Arabidopsis (Mun et al. 2009). This estimation has been confirmedevidenced

by Hofberger et al. (2014) and Zhang et al. (2016b). Both studies performed the NBS-LRR

identification in five Brassicaceae genomes, in which more NBS-LRR genes were detected in

B. rapa than Arabdopsis. However, the difference was not significant (Fourmann et al. 2001;

Vicente and King 2001; Mun et al. 2009; Hofberger et al. 2014). Yu et al.(2014) has

identified 206, 176 and 157 NBS-LRR genes in B. rapa, B. oleracea and Arabidopsis,

respectively (Yu et al. 2014). These authorsy found that the number of NBS-LRR genes in

these three species was similarclose despite the differences in genome size and complexity.

5

1

2

3

4

5

6

7

8

9

10

11

12

13

14

15

16

17

18

19

20

21

22

23

24

Page 6: research-repository.uwa.edu.au · Web viewGenome wide identification and comparative analysis of NBS-LRR resistance genes in Brassica napus Salman AlameryA B, Soodeh TirnazD, Philipp

This is surprising as B. rapa and B. oleracea would be expected to host morea greater

numbers of NBS-LRR genes than Arabidopsis because the Brassica genomes have

undergone triplication following divergence from their common ancestor with Arabidopsis.

Similarly, wWith the similarclose genome size without genome duplication, the number of

NBS-LRR genes of peach is nearly three times more than that of melon, with 408 and 104

respectively (Garcia-Mas et al. 2012; Verde et al. 2013). As Jacob et al. (20135) concluded,

the number and distribution of NBS-LRR genes varied among plant species the variationety

among different species is dramatic regardless ofno matter with the genome size, and the

Brassicaceae family has moderate rich NBS-LRR genes detected . Furthermore, almost 50%

of NBS family members were detected as tandem arrays within homogenous clusters,

suggesting tandem duplication in combination with polyploidy after the separation from the

same ancestor played an important role in the expansion of NBS-LRR genes in the Brassica

genome (Fourmann et al. 2001; Vicente and King 2001; Mun et al. 2009).

The objectives of this study were to identify and characterize the structure and distribution of

NBS-LRR genes in B. napus and compare these with genes from B. rapa and B. oleracea.

Further association between NBS-LRR genes and resistance QTL can help to accelerate

isolation of disease resistance genestics.

Materials and methods

Brassica reference genomes

The genome sequence and annotation of B. napus Darmor was used as the reference genome

(Chalhoub et al. 2014). The genome sequences of B. rapa v3.0 (Wang et al. 2011) and B.

oleracea To1000 v1.0 (Parkin et al. 2014) were also used for comparative analysis.

Identification of NBS-LRR genes by MEME/ MAST and InterProScan analysis

6

1

2

3

4

5

6

7

8

9

10

11

12

13

14

15

16

17

18

19

20

21

22

23

Page 7: research-repository.uwa.edu.au · Web viewGenome wide identification and comparative analysis of NBS-LRR resistance genes in Brassica napus Salman AlameryA B, Soodeh TirnazD, Philipp

The MAST/MEME (Motif Alignment Search Tool/Multiple Em for Motif Elicitation) suite

of software was used to identify predicted genes that contain motif homology to known

disease resistance genes (Bailey et al. 2009). NBS-LRR “positive” and non NBS-LRR

“negative” sequence training sets, a consensus of 20 amino acid motifs derived from MEME

analysis (Jupe et al. 2012) were used as queries in a MAST search against the predicted genes

of the B. napus, B. oleracea and B. rapa genomes. Predicted genes were considered to be

candidate CNLC or TNL NBS -LRRs if the reported MAST E values were ≤ E-24.

InterProSacan (Jones et al. 2014) was used for further analysis to identify RNLs, another

group of nTNLs., Tthe protein sequences were subjected to two databases, Pfam for detecting

TIR, RPW8, NBS-LRR domains and Coils for detecting CC domain.

Identification of NBS-LRR genes by CNLs and TNLs for validation

To further validate the results from the MAST output, consensus sequences of plant CNLs

and TNLs (Cannon et al. 2002; Cannon et al. 2004) were obtained (Ameline-Torregrosa et

al. 2008). Candidate genes containing NBS domains were identified using tBLASTn and

BLASTp (maximum E- value 1E-5) (Altschul et al. 1997) performed using these consensus

sequences against the B. napus, B. oleracea and B. rapa genomes. Candidate NBS-LRR

proteins were provisionally assigned to either the CNL or TNL groups on the basis of

sequence similarity. Then, the results of the two different analyses (BLAST and MAST) were

compared NBS-LRR candidate genes identified from the MAST output were compared with

CNL and TNL BLAST results to validate the NBS- LRR genes. Any candidate NBS-LRR

genes that were not identified by both approaches that did not correlate between the two

approaches was were subject to further searches against the NCBI non-redundant nucleotide

database.

Characterization of candidate NBS-LRR genes

7

1

2

3

4

5

6

7

8

9

10

11

12

13

14

15

16

17

18

19

20

21

22

23

24

Page 8: research-repository.uwa.edu.au · Web viewGenome wide identification and comparative analysis of NBS-LRR resistance genes in Brassica napus Salman AlameryA B, Soodeh TirnazD, Philipp

All predicted NBS-LRR genes were surveyed to further determine their domain structure and

whether they contained TIR, CC, NBS, and/or LRR domains. The predicted genes were

searched for similarity to known proteins using BLASTp (Altschul et al. 1997) with a

threshold of E < -10 against the NCBI non-redundant (nr) protein database.

Multiple alignment and phylogenetic analysis

Alignment and phylogenetic analyses were performed with all NBS-containing proteins. The

NBS region was defined as the region extending from the P-loop to the MHDV motif, which

contains about 300 amino acids, using the corresponding conserved motifs from B. rapa

(Mun et al. 2009).

Genes with ≤ 50% of total NBS domain length, such as genes with partial NBS domains, with

length less than 150 aa or genes lacking NBS domains such as TIR genes, TIR-LRR genes

and linker LRR genes were excluded from phylogenetic analysis.

The NBS sequences were then aligned using CLUSTAL W (Thompson et al. 1994). A

phylogenetic tree was constructed using the neighbour-joining method (Saitou and Nei 1987)

with bootstrap multiple alignment resampling set at 1000. These functions were performed

using Molecular Evolutionary Genetics Analysis (MEGA) software version 6.0 (Tamura et

al. 2013).

NBS-LRR gene cluster and duplication analyses

NBS-LRR gene clusters were determined by their physical position on each chromosome

according to criteria previously published and widely used (Holub 2001; Richly et al. 2002;

Meyers et al. 2003). The parameter to define a cluster was three or more NBS genes that

occurred within a maximum of eight non-NBS genes. Furthermore, the distance between

neighbouring NBS-LRRs was required to be ≤ 200 kb.

8

1

2

3

4

5

6

7

8

9

10

11

12

13

14

15

16

17

18

19

20

21

22

23

Page 9: research-repository.uwa.edu.au · Web viewGenome wide identification and comparative analysis of NBS-LRR resistance genes in Brassica napus Salman AlameryA B, Soodeh TirnazD, Philipp

Gene duplication was defined in accordance with the criteria previously published and widely

used (Zhou et al. 2004): (1) the alignment covered more than 70% of the longer gene and (2)

The aligned region had an identity >70%. A block of duplications was defined as having

more than one gene in a gene cluster involved in the duplication if more than one gene in a

gene cluster was involved in the duplication. Homologous or paralogous genes were

confirmed by the BLASTp comparison of all the predicted NBS-LRR proteins against each

other with E-value of E-20.

Reciprocal Best BLAST of NBS-LRRs for comparative analysis between of B. napus, B. rapa

and B. oleracea

A reciprocal best BLAST (RBB) analysis was used to compare which of the NBS-LRR genes

predicted across the three genomes; B. napus, B. rapa and B. oleracea, were conserved. The

NBS- LRR genes from B. rapa (v3.0) were compared to with the identified NBS-LRR genes

identified from the B. napus cv. Darmor (A genome) and vice versa using BLAST. The

identified NBS-LRR genes from B. oleracea (To1000 v1.0) were also compared with

identifiedto NBS-LRR genes identified from the B. napus cv Darmor (C genome), and vice-

versa using BLAST.

Results

Genome wide identification of B. napus NBS-LRR genes

In total, 641 candidate NBS-LRR genes were identified in B. napus, consisting of 615 genes

from the MAST output with reported E value of < -24, plus additional genes identified from

consensus sequences of TNLs and CNLs (Table 1, Supplementary Table 1), with

approximately 80% of the NBS-LRR genes identified by both two methods. Genes that were

9

1

2

3

4

5

6

7

8

9

10

11

12

13

14

15

16

17

18

19

20

21

22

23

Page 10: research-repository.uwa.edu.au · Web viewGenome wide identification and comparative analysis of NBS-LRR resistance genes in Brassica napus Salman AlameryA B, Soodeh TirnazD, Philipp

not correlated between the two methods were truncated by either a premature stop codon or a

frameshift mutation causing them to fall under the level of significance of MAST or

CNL/TNL analysis, suggesting that they are pseudogenes. The 20 putative conserved motifs

are presented in Table 2 and Figure 1. These motifs have distinct patterns depending on

whether they are present in the TNL or CNL groups.

Of the 641 candidate NBS-LRR genes, 366 (57%) are typical or regular NBS-LRR genes,

with 124 CNLs and 242 TNLs (Table 2). These 366 genes show highly conserved NBS

regions and complete open reading frames. The remaining 188 genes (29%) were classified

as non–regular genes because of the lack of specific domains. These genes were classified

into three distinct groups for TNLs and CNLs. Non-regular TNLs were classified as TN (74),

N (9) and NL (43) whereas, non-regular CNLs were classified as CN (24), N (8) and NL (23).

These non-regular genes were described as being partial or truncated within the N-terminal

domains and/or have an absence of LRR domains. In addition, 87 TIR genes thatwhich lack

both NBS and LRR domains were also identified in B. napus. Furthermore, 73 NBS-LRR

genes (43 CNL and 30 TNL (21 TIR + 9 TNL) were encodedhad by a single reading frame

without introns (single exon) (Supplementary Table 1). ToFor identifying another group of

nTNL, named RNL, further analysis was performed using InterProScan which revealed 17

genes containing the RPW8 domain; whereby, 10 of them contained RPW8 and NBS-LRR

domains (6 genes on the A genome and 4 genes on the C genome) and the remaining 7 of

them contained CC, RPW8 and NBS-LRR domains.

Alignment and phylogenetic analysis

Phylogenetic analyseis of 176 CNL genes and 366 TNL genes were performed separately to

identify the similarity and relationship between genes (Figures 2 and 3). Genes with more

than 50% deletion in the NBS domain or with a lacking of the NBS domain (such as TIR

10

1

2

3

4

5

6

7

8

9

10

11

12

13

14

15

16

17

18

19

20

21

22

23

24

Page 11: research-repository.uwa.edu.au · Web viewGenome wide identification and comparative analysis of NBS-LRR resistance genes in Brassica napus Salman AlameryA B, Soodeh TirnazD, Philipp

genes) were excluded from the analysis. The trees were divided into clades on the basis of

clade rooting resulting in 16 clades for CNLs and 20 clades for TNLs (Figures 2 and 3).

Analysis of NBS-LRR genes on chromosomes A7 and C6 in B. napus and their homologs in

the diploid progenitor species,: B. rapa and B. oleracea, separates the TNL and CNL genes

into two clades (Figure 4). TNL and CNL groups were further divided into distinct subgroups

represented by conserved and orthologous genes from all three species. Further comparison

showed, TNL and CNL genes on homeologous chromosomes, such as A7 and C6, were

located in the same clade.

NBS-LRR gene clustering

The NBS-LRR genes were found to be distributed across the genome and are present on all

19 chromosomes (Table 34 and Supplementary Table 1), with 59% (378) of genes residing in

clusters. A total of 72 clusters were observed, containing up to 11 genes and with an average

of 5 genes. There were a greater number of TNL clusters (51) than CNL clusters (14), with

eight mixed clusters containing both TNLs and CNLs. Out of the total 72 clusters, 32 were

located in monophyletic clades and shared a recent common ancestor. These were considered

to be homogeneous, while the remaining 40 clusters are heterogeneous as they contained

more distantly related NBS-LRR genes.

NBS-LRR gene duplication

A total of 319 (50%) of the NBS-LRR genes were identified as duplicates (Table 34), with

more TNL (216) than CNL duplicates (103). The number of duplicated genes was consistent

between the A and C two sub-genomes, and 171 (53%) of duplicates were in gene clusters

(Table 34). Two types of genomic duplication were observed (Figure 5), intra-genomic and

inter-genomic. There were 41 duplication events between the A and C genomes, many of

11

1

2

3

4

5

6

7

8

9

10

11

12

13

14

15

16

17

18

19

20

21

22

23

Page 12: research-repository.uwa.edu.au · Web viewGenome wide identification and comparative analysis of NBS-LRR resistance genes in Brassica napus Salman AlameryA B, Soodeh TirnazD, Philipp

which showed collinearity between homeologous chromosomes. For example, 14 genes on

chromosome A1 were collinear with chromosome C1 (Figure 5). However, duplications were

also observed between non-homeologous chromosomes. For example, five genes in

chromosome A1 were collinear with genes on chromosome C3 (BnC3-49-CNL-NL), C5

(BnC5-2-TNL-TIR), C6 (BnC6-11-CNL-CN), C7 (BnC7-37-TNL-TNL) and C9 (BnC9-37-

TNL-TIR) (Figure 5).

12

1

2

3

4

5

6

7

Page 13: research-repository.uwa.edu.au · Web viewGenome wide identification and comparative analysis of NBS-LRR resistance genes in Brassica napus Salman AlameryA B, Soodeh TirnazD, Philipp

Comparative and conservation analysis of B. napus NBS-LRR genes with those ofits related

diploid species B. rapa and B. oleracea

In addition to the 641 NBS-LRR genes identified in B. napus, we identified a total of 249

genes in B. rapa and 443 genes in B. oleracea (Tables 3 and 5). The proportion of CNLs to

TNLs was consistent across the three species, with approximately 70% of R genes

comprising TNLs.

Many of the NBS-LRR genes showed highdemonstrated synteny between the genomes, with

260 and 149 B. napus NBS-LRR genes haveing orthologous genes in B. oleracea and B.

rapa, respectively (Table 45). Compared to the diploids, 100 NBS-LRR genes were missing

in the B. napus A genome and 183 in the C genome. NBS-LRR genes were also identified in

B. napus which wereare absent in B. rapa (103) and B. oleracea (121) respectively.

Approximately 324 (50%) of NBS-LRR in the A and C genomes of B. napus were found to

be syntenic and collinear to their respective progenitor genomes (Table 34). In addition, 72%

of these syntenic and collinear genes in B. napus were located within a gene cluster (Table

34).

Comparative analysis of chromosome A7 and C6 in B. napus and Rlm disease resistance

association

Comparative analysis of gene content of chromosomes A7 and C6  of Analysis of A7 and C6

between B. napus and the diploids showed that gene content was similar, with 19 NBS-LRR

genes on chromosome A7 of both B. rapa and B. napus, and 44 NBS-LRR genes on

chromosome C6 in B. napus compared to 45 genes in B. oleracea. We also found

14Fourteen NBS-LRRs wereinvolved in homeologous exchanges between B. napus

chromosomes A7 and C6 (Figure 6). Moreover, we found four genes were absent in the

13

1

2

3

4

5

6

7

8

9

10

11

12

13

14

15

16

17

18

19

20

21

22

23

Page 14: research-repository.uwa.edu.au · Web viewGenome wide identification and comparative analysis of NBS-LRR resistance genes in Brassica napus Salman AlameryA B, Soodeh TirnazD, Philipp

syntenic region of B. rapa and two genes showed substantial differences in sequence identity

(Figure 6).

Discussion

Genome wide identification distribution of B. napus NBS-LRR genes

The sequencing and assembly of Brassica genomes provides a valuable resource to study the

gene structure and evolution. Here we used the three public Brassica genomes for the

identification and characterization of candidate NBS-LRR genes to examine their

distribution, domain structure, clustering, duplication and conservation.

A combination of MAST and CNL/TNL analyses was applied. This enabled 20 NBS-LRR

specific motifs to be distinguished between TNL and CNL subfamilies. The MEME/MAST

analysis is highly dependent on accurate gene prediction and annotation. In contrast,

consensus CNL and TNL analysis using either tBLASTn or BLASTp can identify any gene

or protein sequence with sequence identity with known NBS-LRR motifs.

The 641 NBS-LRR genes in the B. napus was less than the sum of 249 genes in B. rapa and

443 genes in B. oleracea reflecting gene loss that may occur during polyplidization and

interspecific hybridization processes. The polyploidization events occur during divergence of

B. rapa, B. oleracea and B. nigra from a common hexaploid ancestor. Evidence for such

polyploidization events include genome replication and rearrangement (Lukens et al. 2004).

As a result of these events, novel genetic variation yet with high genome similarity between

each of the Brassica diploid species is created, despite difference in chromosome number

(Lagercrantz and Lydiate 1996; Lagercrantz 1998; Babula et al. 2003). Gene loss is reported

to be common during polyploidization event (Town et al. 2006) and interspecific hybridiation

14

1

2

3

4

5

6

7

8

9

10

11

12

13

14

15

16

17

18

19

20

21

22

23

Page 15: research-repository.uwa.edu.au · Web viewGenome wide identification and comparative analysis of NBS-LRR resistance genes in Brassica napus Salman AlameryA B, Soodeh TirnazD, Philipp

between B. rapa and B. oleracea (Zhang et al. 2016). The proportion of genes encoding

NBS-LRR in all three species is consistent with estimates for other plant species, which range

between 0.6 and 1.8% (Meyers et al. 2003; Monosi et al. 2004; Ameline-Torregrosa et al.

2008; Kohler et al. 2008; Yang et al. 2008).

The number of NBS-LRR genes identified here is higher than that identified by Chalhoub et

al. (2014) where 425, 211 and 274 genes were identified in B. napus, B. rapa and B.

oleracea, respectively. This difference is likely to be caused by a combination of a less

stringent MAST E value used here and manually checking for genes with partial NBS

domains. Their focus was also on the evolutionary relationship of NBS-LRR genes between

B. napus and its progenitor species, B. rapa and B. oleracea, whereas our focus was to study

the structure of NBS-LRR genes. We aimed to identify more genes with NBS-LRR

characteristics that could be potential candidates for disease resistance gene

candidatesidentification. Other studies, using different approaches: HMMER and Pfam, also

reported fewer number of NBS-LRR genes in B. rapa and B. oleracea (Yu et al. 2014; Zhang

et al. 2016) than in B.napus. Our study also identified linker-LRRs, this linker region lies

between the NB and LRR domains of a TNL and CNL, which is also a functional region for

defence mechanism (Eitas and Dangl 2010). The proportion of genes encoding NBS-LRR in

B. napus, B. rapa and B. oleracea is consistent with estimates for other plant species, which

range between 0.6 and 1.8% (Meyers et al. 2003; Monosi et al. 2004; Ameline-Torregrosa et

al. 2008; Kohler et al. 2008; Yang et al. 2008).

Approximately 70% of the NBS-LRR genes in B. napus, B. rapa and B. oleracea genes are

TNLs. This is consistent with B. rapa and B. oleracea. The greater number and diversity of

TNL compared to CNL and RNL genes has been previously reported in B. rapa (Mun et al.

15

1

2

3

4

5

6

7

8

9

10

11

12

13

14

15

16

17

18

19

20

21

22

23

24

Page 16: research-repository.uwa.edu.au · Web viewGenome wide identification and comparative analysis of NBS-LRR resistance genes in Brassica napus Salman AlameryA B, Soodeh TirnazD, Philipp

2009; Yu et al. 2014), Arabidopsis (Yu et al., 2014, Meyers et al., 2003), M. truncatula

(Ameline-Torregrosa et al. 2008), Populus trichocarpa (poplar) (Kohler et al. 2008) and

Linum usitatissimum (linseed) (Kale et al. 2012). The greater number of TNL is unusualique

but not limited to the Brassicaceae family, as compared to other angiosperm lineages such as

in the Fabaceae, Solanaceae and Poaceae families (Shao et al. 2016b). This distinct pattern of

TNL abundance in the Brassicaceae suggestsimplicated that the TIR domain is more

functionally active than the CC (Zhang et al. 2016). The ration of TNLs and CNLs is variable

among species thatand could reflect the adaptation of the R genes to the predominant

pathogens The predominance of TNLs or CNLs in a genome may be due to the nature of

pathogens that infect the plant species or another evolutionary basis (Leister 2004; Lozano et

al. 2012).

In many studies, nTNL genes are commonly named as CNL genes, because a CC domain is

frequently detected at the N-terminal terminus of the NBS-LRR genes. Recently, another

group of nTNL hasve been revealed with a distinct RPW8 domain, which indicates that

nTNL genes have a heterogeneous domain composition (Bonardi et al. 2011; Collier et al.

2011; Zhang et al. 2016). Phylogenetic analysis of NBS genes in some legume genomes

hasve identified the RNL subclass as a sister to all CNL genes, which indicates that the RNL

could be an accidently diverged NBS subclass (Shao et al. 2014). The phylogenetic analysis

for nTNL genes in five Brassicaceae species located all RNL genes in a clade and all CNL

genes fell into a sister clade (Zhang et al. 2016). In this study, among 17 genes with the

RPW8 domain, 7 genes contained both CC and RPW8 domains suggesting the RNL genes

are derived from the CNL lineage. The function of a few number of RNL genes is still

unclear, it has been suggested that they maintain their role in defence signal transduction

(Shao et al. 2016a) or they may help several or specific subclasses of NBS genes to achieve

their function (Collier et al. 2011). These genes are not directly involved in the response to

16

1

2

3

4

5

6

7

8

9

10

11

12

13

14

15

16

17

18

19

20

21

22

23

24

25

Page 17: research-repository.uwa.edu.au · Web viewGenome wide identification and comparative analysis of NBS-LRR resistance genes in Brassica napus Salman AlameryA B, Soodeh TirnazD, Philipp

the pathogen and are not necessarily duplicated in large numbers (Zhang et al. 2016). This

could support the finding of fewer numbers of RNL genes in the genome compared towith

the TNL and CNL genes.

The NBSSB-LRR genes were unevenly distributed across the genomes where chromosomes

A9 and C9 harbour the most genes while chromosomes A5 and C4 contain the least in the

respective genomes (Table 34). This uneven distribution pattern appears to be common in

plants (Meyers et al. 2003; Zhou et al. 2004; Kohler et al. 2008; Yang et al. 2008). Many

(59%) NBS-LRR genes are physically clustered in B. napus, which corresponds is a similar

proportion to that found in other plants including Arabidopsis (61%), M. truncatula (50%)

(Meyers et al. 2003; Ameline-Torregrosa et al. 2008) and potato (58%) (Jupe et al. 2012).

The B. napus NBS-LRR genes were divided into regular and non-regular NBS-LRR genes.

The regular NBS-LRR genes have all domains and complete open reading frames, whereas

the non-regular NBS-LRR resistance genes have partial motifs of the NBS domain or lack

CC, TIR or LRR domains. These were considered to be truncated or pseudogenes. Truncated

NBS-LRR genes have been described previously in plant genomes and they often surround

regular NBS-LRR genes (Meyers et al. 2003; Ameline-Torregrosa et al. 2008; Jupe et al.

2012). Non-regular NBS-LRR genes may be arise due to the partial deletion of redundant

NBS-LRR genes following whole genome duplication. Here we reported genes that encode

for only a TIR domain, lacking both NBS and LRR domains. These may be truncated NBS-

LRR genes or represent novel TIR containing proteins (Meyers et al. 2002; Yu et al. 2014).

Phylogenetic analysis

Different evolutionary patterns between TIR-NBS and non-TIR-NBS gene types during the

evolution of angiosperms has been suggested (Yang et al. 2008; Shao et al. 2016b). The

average number of genes per clade was similar for both TNLs and CNLs, despite there being

17

1

2

3

4

5

6

7

8

9

10

11

12

13

14

15

16

17

18

19

20

21

22

23

24

Page 18: research-repository.uwa.edu.au · Web viewGenome wide identification and comparative analysis of NBS-LRR resistance genes in Brassica napus Salman AlameryA B, Soodeh TirnazD, Philipp

a greater number of TNLs. This suggests that the NBS domains in CNLs are more divergent,

consistent with observations in other species (Bai et al. 2002; Cannon et al. 2002; Wan et al.

2013) and supporting the hypothesis that the CNL family is highly diverse and originated

prior to the split between gymnosperms and angiosperms (Wan et al. 2013). A phylogenetic

study of the N-terminus domain (TIR or CC) revealed that both TIR and CC play a role in a

distinct signalling pathway, where the genes evolved independently since the split between

monocots and dicots. The TIR domain was also reported to be more ancient than CC when

the TIR homolog was found in ancient trees, such as Pinus radiata (Pan et al. 2000). Another

study supporting this theory is that both monocots (rice) and dicots (Arabidopsis) share high

sequence homology of TIR-NBS genes in their genome, suggesting that the TIR domain is

was contained in the common ancestor before the both genomesmonocots and dicots split

(Bai et al. 2002). However, there is also evidence that CC evolved even earlier, before

gymnosperms and angisosperms split (Bai et al. 2002; Cannon et al. 2002; Wan et al. 2013).

The higher diversity of the CNL family in monocots is supported by the random number and

position of intron position compared to a more conserved intron position of TNLs in dicots

(Hammond-Kosack and Jones 1997; Meyers et al. 1999). HoweverYet, CNLs are less diverse

than TNLs in legumes (Zheng et al. 2016). This occurrence could be attributed to the lower

or slower evolutionary rate of CNLs less involvement of evolutionary events, such as gene

duplication, on the CNL group or the events happen at a slower rate (Song and Chen 2015;

Zheng et al. 2016). The TNL family is more homogeneous where members within a clade are

more related to each other in terms of gene organization and sequence identity (Lozano et al.

2015), and are only found in dicotyledonous species, reflecting their more recent evolution

(Tarr and Alexander 2009; Wan et al. 2013).

CAs compared to TNL, the CNL proteins are commonly encoded by single exons (Bai et al.

2002; Ameline-Torregrosa et al. 2008). In this study we also found that TNL genes showed

18

1

2

3

4

5

6

7

8

9

10

11

12

13

14

15

16

17

18

19

20

21

22

23

24

25

Page 19: research-repository.uwa.edu.au · Web viewGenome wide identification and comparative analysis of NBS-LRR resistance genes in Brassica napus Salman AlameryA B, Soodeh TirnazD, Philipp

complex structures, and various domains (TIR, NBS, and LRR) were often separated by

introns, with many short exons in the LRR region., Ssimilar results were also found in

Arabidopsis (Meyers et al. 2003).

Based on the phylogenetic analysis the NBS-LRR genes located on homeologous

chromosomes of B. napus were most likely found in the same clade. For example, in this

study, BnA1-1CNL and BnC1-2CNL, BnA1-5TNL and BnC1-6TNL were found in the same

clade. A comparison between TNL genes on chromosomes A7 and C6 were found them in

the same clade suggesting that these genes have been maintained without substantial

divergence or selection during the formation of B. napus from its progenitor species. These

TNL genes are also highly conserved between A and C genome, suggesting effects of whole

genome duplication and/or tandem duplication from a common ancestral source of NBS-LRR

genes. We also found mixed clades dominated by sequences from one chromosome, and

usually from one or a small number of genomic clusters, but many clades also contained

genes from other chromosomes. This could arise through chromosomal rearrangements or by

large-scale genomic duplication suggesting that gene duplication in B. napus may play a role

in the expansion and diversity of NBS-LRR genes.

NBS-LRR gene clustering

The percentage of NBS-LRR genes appearing in clusters is higher in the C genome (B.

oleracea, Table 34, 62%) compared to the A genome (B. rapa, 54%) in B. napus, which is

consistent in line with the previous comparison of B. oleracea, B. rapa and Arabidopsis. (Yu

et al. 2014). ’s study comparing B. oleracea, B. rapa and Arabidopsis. The TNL genes were

commonly found in clusters whereas most of the CNLs were not in clusters, as was also

reported in the potato genome (Jupe et al. 2012), possibly reflecting the greater abundance of

TNL genes.

19

1

2

3

4

5

6

7

8

9

10

11

12

13

14

15

16

17

18

19

20

21

22

23

24

Page 20: research-repository.uwa.edu.au · Web viewGenome wide identification and comparative analysis of NBS-LRR resistance genes in Brassica napus Salman AlameryA B, Soodeh TirnazD, Philipp

Single A single functional gene locusi denotes a stable gene that is not under high

selectionevolution pressure (Guo et al. 2011). Examples of single-copy genes include RPM1

and Rps2, which are both conserved in B. rapa and B. oleracea (Yu et al. 2014). Whereas,

NBS-LRR genes that form clusters tend to be more polymorphic and more prone to

evolutionary process. In this study, the majority of conserved NBS-LRR genes between B.

napus and its related species are not located in clusters but as a singletons.

Clustering of NBS-LRR genes with members having high sequence homology arising from

tandem gene duplication can confer gain of resistance function (Grant et al. 1998). This

situation is compared to heterogeneous clusters where genes involved in the cluster are

distantly related to R genes. These clusters of mixed R-gene related and unrelated genes areis

commonly found in higher plants such as M. truncatula (Zhu et al. 2002) and could have

arisen as a result of either evolutionary mechanism or random co-localization in the genome

(Richly et al. 2002; Meyers et al. 2003). Moreover, clusters contain genes from non- regular

NBS-LRR genes often located adjacent to regular NBS-LRR genes in gene clusters and this

is consistent with phylogenetic lineages. Therefore, the clustered organisation and

arrangement of these genes may be a critical attribute allowing the generation of novel

resistance specificities via intergenic rearrangement, duplication especially tandem

duplication or gene conversion (Hulbert et al. 2001).

NBS-LRR gene duplication

The level of gene duplication in the B. napus genome is thought to be significant with high

levels of genomic redundancy (Parkin et al. 2003; Rana et al. 2004; Parkin et al. 2005). Thus,

gene duplication, including tandem and whole-genome duplication (WGD) could have

resulted in a substantial increase of the gene number to compensate the gene losses during

formation of B. napus.

20

1

2

3

4

5

6

7

8

9

10

11

12

13

14

15

16

17

18

19

20

21

22

23

24

Page 21: research-repository.uwa.edu.au · Web viewGenome wide identification and comparative analysis of NBS-LRR resistance genes in Brassica napus Salman AlameryA B, Soodeh TirnazD, Philipp

In this study, we found that 50% of B. napus NBS-LRRs were duplicated or exist in pairs of

paralogs. The distribution of duplicate genes was random which could be as a result of

genome rearrangements or gene loss/gain occurring after whole genome duplication events.

Intra-genomic and inter-genomic duplication events between B. napus NBS-LRR genes has

been observed suggesting these genes possibly originated via tandem duplication where

duplicated genes are contiguous with its original copy. There was considerable collinearity

between duplicated genes in homologous chromosomes with the exception of some showing

duplication between non homologous chromosomes due to possible translocation or genome

mis-assembly.

It has been suggested that duplication in combination with divergent selection acting on

duplicated genes, influences the generation of gene diversity within existing gene clusters

(Parniske and Jones 1999; Richly et al. 2002; Wei et al. 2013). We found 38% of duplicated

genes were involved in formation of gene clusters suggesting that gene clusters could be

chromosomal hot spots in which the NBS-LRR genes are duplicated with the gene clusters

also expanded by tandem duplication followed byas a result of selection pressure (Wei et al.

2013). Indeed, previous studies revealed that NBS-LRR genes are often present as tightly

linked genes with high homology and are prone to gene duplication and recombination, and

thus evolve more rapidly than the rest of genome (Grant et al. 1998). Therefore, the clustered

distribution of NBS-LRR genes provides a reservoir of genetic variation by which new

specificities to pathogens can evolve via gene duplication (Michelmore and Meyers 1998;

Yang et al. 2008) . For instance, duplicated genes could be allelic variants that show

specificity for different pathogen strains. Together, these findings suggested that tandem

duplication in combination with recombination and selection pressure played a major role in

NBS-LRR genes expansion and diversity in B. napus.

21

1

2

3

4

5

6

7

8

9

10

11

12

13

14

15

16

17

18

19

20

21

22

23

24

Page 22: research-repository.uwa.edu.au · Web viewGenome wide identification and comparative analysis of NBS-LRR resistance genes in Brassica napus Salman AlameryA B, Soodeh TirnazD, Philipp

The duplicate genes were classified as resulting from polyploidization or post-speciation.

Based on their conservation and synteny, it can be difficult to distinguish among orthologs

and paralogs without extensive genome wide analysis. However, the results indicatedgave

indication that a large number of duplicated genes were originated via polyploidization from

the ancestral species: B. rapa and B. oleracea, while the tandemly duplicated genes were

more likely produced by post speciation duplications.

Comparative analysis of NBS-LRR genes on chromosome A7 and C6 in B. napus and Rlm

disease resistance association

Genetic and genomic studies have identified that a cluster of five linked genes on

chromosome A7 in B. napus are a major source of blackleg resistance genes (Delourme et al.

2004; Mayerhofer et al. 2005). An environmentally stable QTL for resistance against L.

maculans has also been identified on chromosome A7 in DH lines of B. napus (Huang et al.

2016), which confirms that this chromosome contains valuable QTLs for selection against L.

maculans.

Preliminary data suggests that there is duplication and clustering of tightly linked genes in the

QTL region containing NBS-LRR genes on chromosome A7 (Raman et al. 2012) and

suggests that map-based cloning of an Rlm gene may also identify additional blackleg

resistance genes (Raman et al. 2012; Tollenaere et al. 2012). The only resistance genes

which have been cloned in B. napus are LepR3 and Rlm2 (Larkan et al. 2014; Larkan et al.

2015), allelic variants of the same gene, located on chromosome A10. These genes belong

to the leucine-rich repeat receptor-like proteins (LRR-RLPs) family and have an important

role in facilitating defence signalling mechanisms. This family of genes was not analysed

within our study, and therefore the identification and role of RLPs should be included in

future analyses. This study identified several candidate NBS-LRR underlying the QTL

22

1

2

3

4

5

6

7

8

9

10

11

12

13

14

15

16

17

18

19

20

21

22

23

24

Page 23: research-repository.uwa.edu.au · Web viewGenome wide identification and comparative analysis of NBS-LRR resistance genes in Brassica napus Salman AlameryA B, Soodeh TirnazD, Philipp

region of blackleg resistance in B. napus. We found that the majority of candidate NBS-LRR

genes linked to blackleg resistance shared a duplicated copy on both chromosomes A7 and

C6 suggesting chromosomal duplication or rearrangements of this region. Furthermore,

several members of an NBS-LRR cluster on A7 that could confer resistance to blackleg

disease in B. napus had conserved orthologous genes in B. rapa. Therefore, the fully

sequenced B. napus genome has enabled us to identify blackleg candidate resistance genes on

chromosome A7 and targeting these genes for validation is underway.

Acknowledgements

The authors acknowledge funding support from Australian Research Council Projects DP0985953,

FT130100604, DP1601004497, LP140100537, LP160100030

23

1

2

3

4

5

6

7

8

9

10

11

Page 24: research-repository.uwa.edu.au · Web viewGenome wide identification and comparative analysis of NBS-LRR resistance genes in Brassica napus Salman AlameryA B, Soodeh TirnazD, Philipp

References

Altschul, SF, Madden, TL, Schäffer, AA, Zhang, J, Zhang, Z, Miller, W, Lipman, DJ (1997) Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Research 25, 3389-3402.

Ameline-Torregrosa, C, Wang, B-B, O'Bleness, MS, Deshpande, S, Zhu, H, Roe, B, Young, ND, Cannon, SB (2008) Identification and Characterization characterization of Nucleotide-Binding Site-Leucine-Rich Repeat Genes genes in the Model model Plant plant Medicago truncatula. Plant Physiology 146, 5-21.

Babula, D, Kaczmarek, M, Barakat, A, Delseny, M, Quiros, CF, Sadowski, J (2003) Chromosomal mapping of Brassica oleracea based on ESTs from Arabidopsis thaliana: complexity of the comparative map. Molecular Genetics and Genomics 268, 656-665.

Bai, J, Pennill, LA, Ning, J, Lee, SW, Ramalingam, J, Webb, CA, Zhao, B, Sun, Q, Nelson, JC, Leach, JE, Hulbert, SH (2002) Diversity in Nucleotide Binding Site–Leucine-Rich Repeat Genes genes in Cerealscereals. Genome Research 12, 1871-1884.

Bailey, TL, Boden, M, Buske, FA, Frith, M, Grant, CE, Clementi, L, Ren, J, Li, WW, Noble, WS (2009) MEME Suite: tools for motif discovery and searching. Nucleic Acids Research 37, W202-W208.

Bonardi, V, Tang, S, Stallmann, A, Roberts, M, Cherkis, K, Dangl, JL (2011) Expanded functions for a family of plant intracellular immune receptors beyond specific recognition of pathogen effectors. Proceedings of the National Academy of Sciences 108, 16463-16468.

Cannon, S, Mitra, A, Baumgarten, A, Young, N, May, G (2004) The roles of segmental and tandem gene duplication in the evolution of large gene families in Arabidopsis thaliana. BMC plant biology 4, 10.

Cannon, S, Zhu, H, Baumgarten, A, Spangler, R, May, G, Cook, D, Young, N (2002) Diversity, distribution, and ancient taxonomic relationships within the TIR and non-TIR NBS-LRR resistance gene subfamilies. J Mol Evol 54, 548 - 562.

Chalhoub, B, Denoeud, F, Liu, S, Parkin, IAP, Tang, H, Wang, X, Chiquet, J, Belcram, H, Tong, C, Samans, B, Corréa, M, Da Silva, C, Just, J, Falentin, C, Koh, CS, Le Clainche, I, Bernard, M, Bento, P, Noel, B, Labadie, K, Alberti, A, Charles, M, Arnaud, D, Guo, H, Daviaud, C, Alamery, S, Jabbari, K, Zhao, M, Edger, PP, Chelaifa, H, Tack, D, Lassalle, G, Mestiri, I, Schnel, N, Le Paslier, M-C, Fan, G, Renault, V, Bayer, PE, Golicz, AA, Manoli, S, Lee, T-H, Thi, VHD, Chalabi, S, Hu, Q, Fan, C, Tollenaere, R, Lu, Y, Battail, C, Shen, J, Sidebottom, CHD, Wang, X, Canaguier, A, Chauveau, A, Bérard, A, Deniot, G, Guan, M, Liu, Z, Sun, F, Lim, YP, Lyons, E, Town, CD, Bancroft, I, Wang, X, Meng, J, Ma, J, Pires, JC, King, GJ, Brunel, D, Delourme, R, Renard, M, Aury, J-M, Adams, KL, Batley, J, Snowdon, RJ, Tost, J, Edwards, D, Zhou, Y, Hua, W, Sharpe, AG, Paterson, AH, Guan, C, Wincker, P (2014) Early allopolyploid evolution in the post-Neolithic Brassica napus oilseed genome. Science 345, 950-953.

Collier, SM, Hamel, L-P, Moffett, P (2011) Cell death mediated by the N-terminal domains of a unique and highly conserved class of NB-LRR protein. Molecular Plant-Microbe Interactions 24, 918-931.

Dangl, JL, Horvath, DM, Staskawicz, BJ (2013) Pivoting the plant immune system from dissection to deployment. Science 341, 746-751.

Delourme, R, Pilet-Nayel, ML, Archipiano, M, Horvais, R, Tanguy, X, Rouxel, T, Brun, H, Renard, M, Balesdent, MH (2004) A cluster of major specific resistance genes to Leptosphaeria maculans in Brassica napus. Phytopathology 94, 578-583.

Eitas, TK, Dangl, JL (2010) NB-LRR proteins: pairs, pieces, perception, partners, and pathways. Current opinion in plant biology 13, 472-477.

Ellis, J, Dodds, P, Pryor, T (2000) Structure, function and evolution of plant disease resistance genes. Current Opinion in Plant Biology 3, 278-284.

24

1

23456789

10111213141516171819202122232425262728293031323334353637383940414243444546474849

Page 25: research-repository.uwa.edu.au · Web viewGenome wide identification and comparative analysis of NBS-LRR resistance genes in Brassica napus Salman AlameryA B, Soodeh TirnazD, Philipp

Fourmann, M, Charlot, F, Froger, N, Delourme, R, Brunel, D (2001) Expression, mapping, and genetic variability of Brassica napus disease resistance gene analogues. Genome 44, 1083-1099.

Garcia-Mas, J, Benjak, A, Sanseverino, W, Bourgeois, M, Mir, G, González, VM, Hénaff, E, Câmara, F, Cozzuto, L, Lowy, E (2012) The genome of melon (Cucumis melo L.). Proceedings of the National Academy of Sciences 109, 11872-11877.

Grant, MR, McDowell, JM, Sharpe, AG, De Torres Zabala, M, Lydiate, DJ, Dangl, JL (1998) Independent deletions of a pathogen-resistance gene in Brassica and Arabidopsis. Proceedings of the National Academy of Sciences of the United States of America 95, 15843-15848.

Guo, Y-L, Fitz, J, Schneeberger, K, Ossowski, S, Cao, J, Weigel, D (2011) Genome-Wide wide Comparison comparison of Nucleotide-Binding Site-Leucine-Rich Repeat-Encoding Genes genes in Arabidopsis. Plant Physiology 157, 757-769.

Hammond-Kosack, KE, Jones, JDG (1997) Plant disease resistance genes. Annual review Review of plant Plant biology Biology 48, 575-607.

Hofberger, JA, Zhou, B, Tang, H, Jones, JD, Schranz, ME (2014) A novel approach for multi-domain and multi-gene family identification provides insights into evolutionary dynamics of disease resistance genes in core eudicot plants. BMC genomics Genomics 15, 966.

Holub, EB (2001) The arms race is ancient history in Arabidopsis, the wildflower. Nature Reviews Genetics 2, 516-527.

Huang, YJ, Jestin, C, Welham, SJ, King, GJ, Manzanares-Dauleux, MJ, Fitt, BDL, Delourme, R (2016) Identification of environmentally stable QTL for resistance against Leptosphaeria maculans in oilseed rape (Brassica napus). Theoretical and Applied Genetics 129, 169-180.

Hulbert, S, Webb, C, Smith, S, Sun, Q (2001) Resistance gene complexes: evolution and utilization. Annual Review of Phytopathology 39, 285 - 312.

Jacob, F, Vernaldi, S, Maekawa, T (2013) Evolution and conservation of plant NLR functions. Frontiers in iImmunology 4,

Jones, P, Binns, D, Chang, H-Y, Fraser, M, Li, W, McAnulla, C, McWilliam, H, Maslen, J, Mitchell, A, Nuka, G (2014) InterProScan 5: genome-scale protein function classification. Bioinformatics 30, 1236-1240.

Jupe, F, Pritchard, L, Etherington, G, MacKenzie, K, Cock, P, Wright, F, Sharma, SK, Bolser, D, Bryan, G, Jones, J, Hein, I (2012) Identification and localisation of the NB-LRR gene family within the potato genome. BMC gGenomics 13, 75.

Kale, SM, Pardeshi, VC, Barvkar, VT, Gupta, VS, Kadoo, NY (2012) Genome-wide identification and characterization of nucleotide binding site leucine-rich repeat genes in linseed reveal distinct patterns of gene structure. Genome 56, 91-99.

Kohler, A, Rinaldi, C, Duplessis, S, Baucher, M, Geelen, D, Duchaussoy, F, Meyers, B, Boerjan, W, Martin, F (2008) Genome-wide identification of NBS resistance genes in Populus trichocarpa. Plant Molecular Biology 66, 619-636.

Lagercrantz, U (1998) Comparative mapping between Arabidopsis thaliana and Brassica nigra indicates that Brassica genomes have evolved through extensive genome replication accompanied by chromosome fusions and frequent rearrangements. Genetics 150, 1217-1228.

Lagercrantz, U, Lydiate, DJ (1996) Comparative genome mapping in Brassica. Genetics 144, 1903-1910.

Larkan, NJ, Lydiate, DJ, Yu, F, Rimmer, SR, Borhan, MH (2014) Co-localisation of the blackleg resistance genes Rlm2 and LepR3 on Brassica napus chromosome A10. BMC pPlant biology Biology 14, 387.

Larkan, NJ, Ma, L, Borhan, MH (2015) The Brassica napus receptor like protein RLM2 is encoded by a‐ second allele of the LepR3/Rlm2 blackleg resistance locus. Plant Bbiotechnology journal Journal 13, 983-992.

25

123456789

1011121314151617181920212223242526272829303132333435363738394041424344454647484950

Page 26: research-repository.uwa.edu.au · Web viewGenome wide identification and comparative analysis of NBS-LRR resistance genes in Brassica napus Salman AlameryA B, Soodeh TirnazD, Philipp

Leister, D (2004) Tandem and segmental gene duplication and recombination in the evolution of plant disease resistance genes. Trends in gGenetics : TIG 20, 116-122.

Liu, J, Liu, X, Dai, L, Wang, G (2007) Recent Progress progress in Elucidating elucidating the Structurestructure, Function function and Evolution evolution of Disease disease Resistance resistance Genes genes in Plantsplants. Journal of Genetics and Genomics 34, 765-776.

Liu, S, Liu, Y, Yang, X, Tong, C, Edwards, D, Parkin, IAP, Zhao, M, Ma, J, Yu, J, Huang, S, Wang, X, Wang, J, Lu, K, Fang, Z, Bancroft, I, Yang, T-J, Hu, Q, Wang, X, Yue, Z, Li, H, Yang, L, Wu, J, Zhou, Q, Wang, W, King, GJ, Pires, JC, Lu, C, Wu, Z, Sampath, P, Wang, Z, Guo, H, Pan, S, Yang, L, Min, J, Zhang, D, Jin, D, Li, W, Belcram, H, Tu, J, Guan, M, Qi, C, Du, D, Li, J, Jiang, L, Batley, J, Sharpe, AG, Park, B-S, Ruperao, P, Cheng, F, Waminal, NE, Huang, Y, Dong, C, Wang, L, Li, J, Hu, Z, Zhuang, M, Huang, Y, Huang, J, Shi, J, Mei, D, Liu, J, Lee, T-H, Wang, J, Jin, H, Li, Z, Li, X, Zhang, J, Xiao, L, Zhou, Y, Liu, Z, Liu, X, Qin, R, Tang, X, Liu, W, Wang, Y, Zhang, Y, Lee, J, Kim, HH, Denoeud, F, Xu, X, Liang, X, Hua, W, Wang, X, Wang, J, Chalhoub, B, Paterson, AH (2014) The Brassica oleracea genome reveals the asymmetrical evolution of polyploid genomes. Nature Communications 5,

Lozano, R, Hamblin, MT, Prochnik, S, Jannink, J-L (2015) Identification and distribution of the NBS-LRR gene family in the Cassava genome. BMC genomics Genomics 16, 1.

Lozano, R, Ponce, O, Ramirez, M, Mostajo, N, Orjeda, G (2012) Genome-Wide wide Identification identification and Mapping mapping of NBS-Encoding encoding Resistance resistance Genes genes in Solanum tuberosum Group Phureja. PLoS ONE 7, e34775.

Lukens, LN, Quijada, PA, Udall, J, Pires, JC, Schranz, M, Osborn, TC (2004) Genome redundancy and plasticity within ancient and recent Brassica crop species. Biological Journal of the Linnean Society 82, 665-674.

Mayerhofer, R, Wilde, K, Mayerhofer, M, Lydiate, D, Bansal, VK, Good, AG, Parkin, IAP (2005) Complexities of chromosome landing in a highly duplicated genome: toward map-based cloning of a gene controlling blackleg resistance in Brassica napus. Genetics 171, 1977-1988.

McHale, L, Tan, X, Koehl, P, Michelmore, RW (2006) Plant NBS-LRR proteins: adaptable guards. Genome biology Biology 7, 1.

Meyers, B, Dickerman, A, Michelmore, R, Sivaramakrishnan, S, Sobral, B, Young, N (1999) Plant disease resistance genes encode members of an ancient and diverse protein family within the nucleotide-binding superfamily. The Plant Journal 20, 317 - 332.

Meyers, B, Kozik, A, Griego, A, Kuang, H, Michelmore, R (2003) Genome-wide analysis of NBS-LRR-encoding genes in Arabidopsis. The Plant Cell 15, 809 - 834.

Meyers, BC, Morgante, M, Michelmore, RW (2002) TIR-X and TIR-NBS proteins: two new families related to disease resistance TIR-NBS-LRR proteins encoded in Arabidopsis and other plant genomes. The Plant Journal 32, 77-92.

Michelmore, R, Meyers, B (1998) Clusters of resistance genes in plants evolve by divergent selection and a birth-and-death process. Genome Research 8, 1113 - 1130.

Monosi, B, Wisser, RJ, Pennill, L, Hulbert, SH (2004) Full-genome analysis of resistance gene homologues in rice. Theoretical and Applied Genetics 109, 1434-1447.

Mun, J, Yu, H, Park, S, Park, B (2009) Genome-wide identification of NBS-encoding resistance genes in Brassica rapa. Molecular Genetics and Genomics 282, 617 - 631.

Pan, Q, Wendel, J, Fluhr, R (2000) Divergent evolution of plant NBS-LRR resistance gene homologues in dicot and cereal genomes. Journal of molecular Molecular evolution Evolution 50, 203-213.

Parkin, IAP, Gulden, SM, Sharpe, AG, Lukens, L, Trick, M, Osborn, TC, Lydiate, DJ (2005) Segmental structure of the Brassica napus genome based on comparative analysis with Arabidopsis thaliana. Genetics 171, 765-781.

Parkin, IAP, Koh, C, Tang, H, Robinson, SJ, Kagale, S, Clarke, WE, Town, CD, Nixon, J, Krishnakumar, V, Bidwell, SL (2014) Transcriptome and methylome profiling reveals relics of genome dominance in the mesopolyploid Brassica oleracea. Genome biology 15, 1.

26

123456789

101112131415161718192021222324252627282930313233343536373839404142434445464748495051

Page 27: research-repository.uwa.edu.au · Web viewGenome wide identification and comparative analysis of NBS-LRR resistance genes in Brassica napus Salman AlameryA B, Soodeh TirnazD, Philipp

Parkin, IAP, Sharpe, AG, Lydiate, DJ (2003) Patterns of genome duplication within the Brassica napus genome. Genome 46, 291-303.

Parniske, M, Jones, JDG (1999) Recombination between diverged clusters of the tomato Cf-9 plant disease resistance gene family. Proceedings of the National Academy of Sciences 96, 5850-5855.

Paterson, AH, Bowers, JE, Bruggmann, R, Dubchak, I, Grimwood, J, Gundlach, H, Haberer, G, Hellsten, U, Mitros, T, Poliakov, A, Schmutz, J, Spannagl, M, Tang, H, Wang, X, Wicker, T, Bharti, AK, Chapman, J, Feltus, FA, Gowik, U, Grigoriev, IV, Lyons, E, Maher, CA, Martis, M, Narechania, A, Otillar, RP, Penning, BW, Salamov, AA, Wang, Y, Zhang, L, Carpita, NC, Freeling, M, Gingle, AR, Hash, CT, Keller, B, Klein, P, Kresovich, S, McCann, MC, Ming, R, Peterson, DG, Mehboob Ur, R, Ware, D, Westhoff, P, Mayer, KFX, Messing, J, Rokhsar, DS (2009) The Sorghum bicolor genome and the diversification of grasses. Nature 457, 551-556.

Porter, BW, Paidi, M, Ming, R, Alam, M, Nishijima, WT, Zhu, YJ (2009) Genome-wide analysis of Carica papaya reveals a small NBS resistance gene family. Molecular Genetics and Genomics 281, 609-626.

Raman, R, Taylor, B, Marcroft, S, Stiller, J, Eckermann, P, Coombes, N, Rehman, A, Lindbeck, K, Luckett, D, Wratten, N, Batley, J, Edwards, D, Wang, X, Raman, H (2012) Molecular mapping of qualitative and quantitative loci for resistance to Leptosphaeria maculans causing blackleg disease in canola (Brassica napus L.). Theoretical and Applied Genetics 125, 405-418.

Rana, D, van den Boogaart, T, O'Neill, CM, Hynes, L, Bent, E, Macpherson, L, Park, JY, Lim, YP, Bancroft, I (2004) Conservation of the microstructure of genome segments in Brassica napus and its diploid relatives. The Plant Journal 40, 725-733.

Richly, E, Kurth, J, Leister, D (2002) Mode of amplification and reorganization of resistance genes during recent Arabidopsis thaliana evolution. Mol Biol Evol 19, 76 - 84.

Saitou, N, Nei, M (1987) The neighbor-joining method: a new method for reconstructing phylogenetic trees. Molecular Biology and Evolution 4, 406-425.

Shao, Z-Q, Wang, B, Chen, J-Q (2016a) Tracking ancestral lineages and recent expansions of NBS-LRR genes in angiosperms. Plant Signaling & Behavior 170, 00-00.

Shao, Z-Q, Xue, J-Y, Wu, P, Zhang, Y-M, Wu, Y, Hang, Y-Y, Wang, B, Chen, J-Q (2016b) Large-scale analyses of angiosperm nucleotide-binding site-leucine-rich repeat (NBS-LRR) genes reveal three anciently diverged classes with distinct evolutionary patterns. Plant physiology Physiology pp. 01487.2015.

Shao, Z-Q, Zhang, Y-M, Hang, Y-Y, Xue, J-Y, Zhou, G-C, Wu, P, Wu, X-Y, Wu, X-Z, Wang, Q, Wang, B (2014) Long-term evolution of nucleotide-binding site-leucine-rich repeat genes: understanding gained from and beyond the legume family. Plant physiology Physiology 166, 217-234.

Song, Q, Chen, ZJ (2015) Epigenetic and developmental regulation in plant polyploids. Current opinion in plant biology 24, 101-109.

Spoel, SH, Dong, X (2012) How do plants achieve immunity? Defence without specialized immune cells. Nature Reviews Immunology 12, 89-100.

Staskawicz, BJ, Ausubel, FM, Baker, BJ, Ellis, JG, Jones, JDG (1995) Molecular genetics of plant disease resistance. Science 268, 661-667.

Tamura, K, Stecher, G, Peterson, D, Filipski, A, Kumar, S (2013) MEGA6: Molecular Evolutionary Genetics Analysis Version 6.0. Molecular Biology and Evolution 30, 2725-2729.

Tan, X, Meyers, BC, Kozik, A, West, MA, Morgante, M, St Clair, DA, Bent, AF, Michelmore, RW (2007) Global expression analysis of nucleotide binding site-leucine rich repeat-encoding and related genes in Arabidopsis. BMC plant Plant biology Biology 7,

Tarr, D, Alexander, H (2009) TIR-NBS-LRR genes are rare in monocots: evidence from diverse monocot orders. BMC Res Notes 2, 197.

27

123456789

10111213141516171819202122232425262728293031323334353637383940414243444546474849

Page 28: research-repository.uwa.edu.au · Web viewGenome wide identification and comparative analysis of NBS-LRR resistance genes in Brassica napus Salman AlameryA B, Soodeh TirnazD, Philipp

Thompson, JD, Higgins, DG, Gibson, TJ (1994) CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice. Nucleic Acids Research 22, 4673-4680.

Tollenaere, R, Hayward, A, Dalton-Morgan, J, Campbell, E, Lee, JRM, Lorenc, MT, Manoli, S, Stiller, J, Raman, R, Raman, H, Edwards, D, Batley, J (2012) Identification and characterization of candidate Rlm4 blackleg resistance genes in Brassica napus using next-generation sequencing. Plant bBiotechnology jJournal 10, 709-715.

Town, CD, Cheung, F, Maiti, R, Crabtree, J, Haas, BJ, Wortman, JR, Hine, EE, Althoff, R, Arbogast, TS, Tallon, LJ (2006) Comparative genomics of Brassica oleracea and Arabidopsis thaliana reveal gene loss, fragmentation, and dispersal after polyploidy. The Plant Cell 18, 1348-1359.

Verde, I, Abbott, AG, Scalabrin, S, Jung, S, Shu, S, Marroni, F, Zhebentyayeva, T, Dettori, MT, Grimwood, J, Cattonaro, F (2013) The high-quality draft genome of peach (Prunus persica) identifies unique patterns of genetic diversity, domestication and genome evolution. Nature gGenetics 45, 487-494.

Vicente, JG, King, GJ (2001) Characterisation of disease resistance gene-like sequences in Brassica oleracea L. Theoretical and Applied Genetics 102, 555-563.

Wan, H, Yuan, W, Bo, K, Shen, J, Pang, X, Chen, J (2013) Genome-wide analysis of NBS-encoding disease resistance genes in Cucumis sativus and phylogenetic study of NBS-encoding genes in Cucurbitaceae crops. BMC gGenomics 14, 109.

Wan, H, Yuan, W, Ye, Q, Wang, R, Ruan, M, Li, Z, Zhou, G, Yao, Z, Zhao, J, Liu, S, Yang, Y (2012) Analysis of TIR- and non-TIR-NBS-LRR disease resistance gene analogous in pepper: characterization, genetic variation, functional divergence and expression patterns. BMC gGenomics 13, 502.

Wang, X, Wang, H, Wang, J, Sun, R, Wu, J, Liu, S, Bai, Y, Mun, J-H, Bancroft, I, Cheng, F, Huang, S, Li, X, Hua, W, Wang, J, Wang, X, Freeling, M, Pires, JC, Paterson, AH, Chalhoub, B, Wang, B, Hayward, A, Sharpe, AG, Park, B-S, Weisshaar, B, Liu, B, Li, B, Liu, B, Tong, C, Song, C, Duran, C, Peng, C, Geng, C, Koh, C, Lin, C, Edwards, D, Mu, D, Shen, D, Soumpourou, E, Li, F, Fraser, F, Conant, G, Lassalle, G, King, GJ, Bonnema, G, Tang, H, Wang, H, Belcram, H, Zhou, H, Hirakawa, H, Abe, H, Guo, H, Wang, H, Jin, H, Parkin, IAP, Batley, J, Kim, J-S, Just, J, Li, J, Xu, J, Deng, J, Kim, JA, Li, J, Yu, J, Meng, J, Wang, J, Min, J, Poulain, J, Wang, J, Hatakeyama, K, Wu, K, Wang, L, Fang, L, Trick, M, Links, MG, Zhao, M, Jin, M, Ramchiary, N, Drou, N, Berkman, PJ, Cai, Q, Huang, Q, Li, R, Tabata, S, Cheng, S, Zhang, S, Zhang, S, Huang, S, Sato, S, Sun, S, Kwon, S-J, Choi, S-R, Lee, T-H, Fan, W, Zhao, X, Tan, X, Xu, X, Wang, Y, Qiu, Y, Yin, Y, Li, Yet al. (2011) The genome of the mesopolyploid crop species Brassica rapa. Nature Genetics 43, 1035-1039.

Wei, H, Li, W, Sun, X, Zhu, S, Zhu, J (2013) Systematic Analysis analysis and Comparison comparison of Nucleotide-Binding Site Disease disease Resistance resistance Genes genes in a Diploid diploid Cotton cotton Gossypium raimondii. PLoS ONE 8, e68435.

Xiao, S, Emerson, B, Ratanasut, K, Patrick, E, O'Neill, C, Bancroft, I, Turner, JG (2004) Origin and maintenance of a broad-spectrum disease resistance locus in Arabidopsis. Molecular bBiology and eEvolution 21, 1661-1672.

Yang, S, Zhang, X, Yue, J-X, Tian, D, Chen, J-Q (2008) Recent duplications dominate NBS-encoding gene expansion in two woody species. Molecular Genetics and Genomics 280, 187-198.

Yu, J, Tehrim, S, Zhang, F, Tong, C, Huang, J, Cheng, X, Dong, C, Zhou, Y, Qin, R, Hua, W, Liu, S (2014) Genome-wide comparative analysis of NBS-encoding genes between Brassica species and Arabidopsis thaliana. BMC gGenomics 15, 3.

Zhang, YM, Shao, ZQ, Wang, Q, Hang, YY, Xue, JY, Wang, B, Chen, JQ (2016) Uncovering the dynamic evolution of nucleotide binding site leucine rich repeat (NBS LRR) genes in Brassicaceae.‐ ‐ ‐ ‐ Journal of iIntegrative pPlant bBiology 58, 165-177.

28

123456789

10111213141516171819202122232425262728293031323334353637383940414243444546474849

Page 29: research-repository.uwa.edu.au · Web viewGenome wide identification and comparative analysis of NBS-LRR resistance genes in Brassica napus Salman AlameryA B, Soodeh TirnazD, Philipp

Zheng, F, Wu, H, Zhang, R, Li, S, He, W, Wong, F-L, Li, G, Zhao, S, Lam, H-M (2016) Molecular phylogeny and dynamic evolution of disease resistance genes in the legume family. BMC Genomics 17, 1.

Zhou, F, Guo, Y, Qiu, L-J (2016) Genome-wide identification and evolutionary analysis of leucine-rich repeat receptor-like protein kinase genes in soybean. BMC pPlant bBiology 16, 1.

Zhou, T, Wang, Y, Chen, JQ, Araki, H, Jing, Z, Jiang, K, Shen, J, Tian, D (2004) Genome-wide identification of NBS genes in japonica rice reveals significant expansion of divergent non-TIR NBS-LRR genes. Molecular Genetics and Genomics 271, 402-415.

Zhu, H, Cannon, SB, Young, ND, Cook, DR (2002) Phylogeny and genomic organization of the TIR and Non-TIR NBS-LRR resistance gene family in Medicago truncatula. Molecular Plant-Microbe Interactions 15, 529-539.

29

123456789

1011

12

Page 30: research-repository.uwa.edu.au · Web viewGenome wide identification and comparative analysis of NBS-LRR resistance genes in Brassica napus Salman AlameryA B, Soodeh TirnazD, Philipp

Tables

Table 1: The total number of predicted NBS-LRR genes identified in the three genomes

of Brassica species. The comparison between methods used; MEME / MAST and CNL

and TNL BLAST.

Species B. napus B. oleracea B. rapa

Genome size (Mbp) 1284 650 552

Gene content 111,479 59,225 41,019

Total candidate NBS-LRR genes 641 443 249

Proportion of NBS-LRR genes 0.58% 0.74% 0.60%

Identified in MAST output 615 437 248

Common to MAST and CNL/TNL

analysis509 379 208

Agreement % 79% 85% 83%

30

1

2

3

4

5

6

7

8

Page 31: research-repository.uwa.edu.au · Web viewGenome wide identification and comparative analysis of NBS-LRR resistance genes in Brassica napus Salman AlameryA B, Soodeh TirnazD, Philipp

Table 2: NBS-LRR specific motifs sequence identified with psp-gen MEME (Jupe et al., 2012).

Motif No. Length (bp) Motif domain Group or

class Motif sequence

1 21 P-loop kinase 1 NB-ARC CNL/TNL PIWGMGGVGKTTLARAVYNDP

2 21 GLPL NB-ARC CNL/TNL CGGLPLAIKVWGGMLAGKQKT

3 15 Kinase 2 NB-ARC CNL/TNL YLVVLDDVWDTDQWD

4 15 RNBS-B NB-ARC CNL/TNL GSRIIITTRNKHVAN

5 15 RNBS-D NB-ARC CNL LKPCFLYCAIFPEDY

6 21 MHDV NB-ARC CNL/TNL CRMHDMMHDMCWYKAREQNFV

7 15 RNBS-A NB-ARC CNL HFDCRAWVCVSQQYD

8 21 -- Linker CNL/TNL MEDVGEYYFNELINRSMFQPI

9 21 LRR LRR CNL/TNL LIHLRYLNLSGTNIKHLPASI

10 19 RNBA-C NB-ARC CNL/TNL YHMQFLSHEESWQLFHKHA

11 19 LRR LRR CNL/TNL MPNLETLDIRNCPNLEEIP

12 21 -- Pre-NB CNL IDRNKLIWLWMAEGFVPHENG

13 15 -- NB-ARC CNL/TNL NEIMPILRLSYHHLP

14 25 TIR 3 TIR TNL QIVIPIFYDVDPSDVRHQTGSFGEA

15 17 -- Pre-NB CNL DAAYDAEDVIDSFKYHA

16 50 -- Monocot -- AIKDIQEQLQKVADRRDRNKVFVPHPTRPIAIDPCLRALYAEATELVGIY

17 21` TIR 2 TIR TNL KNYATSRWCLNELVKIMECKE

18 50 EDVID Monocot CNL ETSSFELMDLLGERWVPPVHLREFKSFMPSQLSALRGWIQRDPSHLSNLS

19 27 LRR LRR -- FLDIACFFRGRKKDYVMQILESCDFGA

20 21 TIR 1 TIR TNL KYDVFLSFRGPDTRKTFTSHL

31

1

Page 32: research-repository.uwa.edu.au · Web viewGenome wide identification and comparative analysis of NBS-LRR resistance genes in Brassica napus Salman AlameryA B, Soodeh TirnazD, Philipp

Table 32: The number of NBS-LRR genes classified into different classes and their

subfamilies and domain compositions

CNL 180 TNL 461 Total 641

CNL-NBS-LRR 124 TIR-NBS-LRR 242Regular NBS-LRR

gene366(57%)

NBS-LRR 23 NBS-LRR 43

Non-regular NBS-

LRR gene188(29%)

NBS 8 NBS 9

CC-NBS 24 TIR-NBS 74

LINKER-LRR 1 LINKER-LRR 3

TIR-LRR 3

TIR 87 87(14%)

32

1

2

3

4

Page 33: research-repository.uwa.edu.au · Web viewGenome wide identification and comparative analysis of NBS-LRR resistance genes in Brassica napus Salman AlameryA B, Soodeh TirnazD, Philipp

Table 43: The distribution and cComparative analysis of candidate NBS-LRR genes

in B. napusCh

rom

osom

e

Tota

l

TNL

CNL

No.

of g

ene

clus

ters

No.

of N

BS-

LRR

gene

s in

clus

ters

No.

of

dupl

icat

ed

gene

s (T

NL+

CNL)

No.

of

dupl

icat

ed

gene

s in

Cons

erve

d

Synt

enic

ge

nes

Synt

enic

ge

nes i

n cl

uste

r

A ge

nom

e B.

nap

us D

arm

or

A1 28 17 11 2 7 18(10+8) 4 17 13 5A2 44 33 11 5 31 22(20+2) 15 26 20 16A3 24 20 4 5 18 16(13+3) 6 15 13 10A4 13 12 1 1 7 5(5+0) 3 6 3 2A5 10 4 6 1 4 9(3+6) 5 5 4 1A6 21 7 14 2 6 16(5+11) 5 11 7 2A7 19 16 3 2 10 14(14+0) 8 12 10 4A8 24 17 7 3 12 13(9+4) 3 12 10 5A9 58 38 20 10 43 38(23+5) 24 40 35 30

A10 11 7 4 0 0 6(4+2) 0 5 4 1

Total 252 171 81 31 138(54%)

157 (106+51)(62%)

73 (28%) 149 119 76

C ge

nom

e B.

nap

us D

arm

or

C1 37 24 13 3 21 16(9+7) 10 25 21 12C2 51 47 4 5 34 21(18+3) 13 32 15 12C3 56 46 10 7 35 20(14+6) 12 33 26 20C4 18 12 6 1 4 5(3+2) 2 11 8 5C5 19 11 8 4 13 13(8+5) 7 14 14 10C6 44 33 11 4 30 28(20+8) 17 33 29 23C7 38 30 8 5 25 14(10+4) 8 32 31 25C8 39 21 18 4 23 19(9+10) 11 24 19 18C9 79 62 17 8 55 26(19+7) 18 54 42 33

Total 381 286 95 41 240(62%)

162 (110+52)(42%)

98 (25%) 258 205 158

Unassigned 8 4 4 0 0 0 0 0 0 0

Total 641 461 180 72 378(59%)

319(50%)

171(53%)

409(63.8%)

324 (50%)

234(72%)

33

1

2

3

4

Page 34: research-repository.uwa.edu.au · Web viewGenome wide identification and comparative analysis of NBS-LRR resistance genes in Brassica napus Salman AlameryA B, Soodeh TirnazD, Philipp

Table 54: Comparative analysis of the number of NBS-LRR genes in the three

Brassica species and the reciprocal best BLAST hits

Genome B. oleracea

C genome

B. napus

Darmor

C genome

B. rapa

A genome

B. napus

Darmor

A genome

Total NBS LRR 443 381 249 252

Total TNL and CNL 319 , 124 286 , 95 168 , 81 171 , 81

Reciprocal BLAST hits conserved

genes “retained”260 (60%) 260 (75%) 149 (60%) 149 (60%)

Conserved TNL and CNL 187 , 73 192 , 68 107 , 42 108 , 41

% Conserved TNL and CNL 58% , 58% 67% , 71% 63% , 52% 63% , 52%

No. of conserved genes in cluster 160 (61%) 75 (50%)

Not conserved 183 121 100 103

missing 183 0 100 0

Gained 0 121 0 103

34

1

2

3

4

Page 35: research-repository.uwa.edu.au · Web viewGenome wide identification and comparative analysis of NBS-LRR resistance genes in Brassica napus Salman AlameryA B, Soodeh TirnazD, Philipp

Supplementary Table 1: A list of NBS-LRR genes identified in the study, including

genome position, gene type and number of exons

35

1

2

3

Page 36: research-repository.uwa.edu.au · Web viewGenome wide identification and comparative analysis of NBS-LRR resistance genes in Brassica napus Salman AlameryA B, Soodeh TirnazD, Philipp

Figure captions 

Figure 1: Example of representation of motif patterns in NBS-LRR genes in B. napus. The

different coloured boxes indicated distinct motifs as shown in boxes below identified by

MEME/MAST program. Refer to Table 2 for the type of each motif.

Figure 2: A phylogenetic tree of NBS domains in 176 CNL- encoding genes of B. napus

for NBS domains based on the neighbour-joining method using MEGA 6.0 software. The

numbers on the branches indicate the percentage of 1000 bootstrap replicates. The tree

only shows values > 50%. Gene names are intended to represent chromosome and domain

configurations. The blue and red coloured branches are to separate the clades. The closed

green triangle represent a homogenous cluster on chromosome C6 where all genes

sequence are related and located within a clade as well as their relationship with genes on

homologous chromosome A7 as represented by unfilled green triangles.

Figure 3: A phylogenetic tree of NBS domains in 366 TNL- encoding genes of B. napus

for NBS domains based on the neighbour-joining method using MEGA 6.0 software. The

numbers on the branches indicate the percentage of 1000 bootstrap replicates. The tree

only shows values > 50%. Gene names are intended to represent chromosome and domain

configurations. The blue and red coloured branches are to separate the clades. The closed

green triangles represent a homogenous cluster on chromosome C6 where all genes

sequence are related and located within a clade as well as their relationship with genes on

homologous chromosome A7, as represented by unfilled green triangles.

36

1

2

3

4

5

6

7

8

9

10

11

12

13

14

15

16

17

18

19

20

21

22

23

Page 37: research-repository.uwa.edu.au · Web viewGenome wide identification and comparative analysis of NBS-LRR resistance genes in Brassica napus Salman AlameryA B, Soodeh TirnazD, Philipp

Figure 4: A phylogenetic tree of TNL and CNL genes located on chromosomes A7 and C6

in B. napus and the diploid species B. rapa and B. oleracea based on a neighbour-joining

method using MEGA 6.0 software. The numbers on branches indicate the percentage of

1000 bootstrap replicates. The coloured squares indicate the gene origin: red from B.

napus, blue from B. rapa and green from B. oleracea.

Figure 5: Inter-genomic and intra-genomic duplication relationship of NBS-LRR -genes

between the two sub-genomes of B. napus: the A and C genomes. Coloured lines represent

the relationship of paralogous gene pairs between homeologous chromosomes. Red lines

represent the gene duplication between chromosomes within a sub-genome. Black lines

represent the gene duplication between homologous chromosomes with collinearity. Blue

lines represent the gene duplication between non homeologous chromosomes. The green

block represents a duplicated block where there is more than one gene involved in

duplication. Chromosomes are scaled based on the number of NBS-LRR genes

Figure 6: Comparative chromosomal map of NBS-LRR genes on chromosomes A7 and

C6 between B. napus (Bn), B. rapa (Br) and B. oleracea (Bo). The squares and triangles

represent NBS-LRR type: TNL and CNL, respectively. The Rlm QTL refers to Huang et

al. (2016). Genes in syntenic relationships are connected by lines. Red lines show inter-

genomic duplications between B. napus A7 and C6. Closed bars are orthologous genes.

Genes within green squares are in a gene cluster. The numbers represent the amino acid

alignments with alignment coverage in bracket.

37

1

2

3

4

5

6

7

8

9

10

11

12

13

14

15

16

17

18

19

20

Page 38: research-repository.uwa.edu.au · Web viewGenome wide identification and comparative analysis of NBS-LRR resistance genes in Brassica napus Salman AlameryA B, Soodeh TirnazD, Philipp

Figure 1

38

1

2

Page 39: research-repository.uwa.edu.au · Web viewGenome wide identification and comparative analysis of NBS-LRR resistance genes in Brassica napus Salman AlameryA B, Soodeh TirnazD, Philipp

Figure 2

39

1

2

3

4

5

6

Page 40: research-repository.uwa.edu.au · Web viewGenome wide identification and comparative analysis of NBS-LRR resistance genes in Brassica napus Salman AlameryA B, Soodeh TirnazD, Philipp

Figure 3

40

1

2

3

4

Page 41: research-repository.uwa.edu.au · Web viewGenome wide identification and comparative analysis of NBS-LRR resistance genes in Brassica napus Salman AlameryA B, Soodeh TirnazD, Philipp

Figure 4

41

1

2

Page 42: research-repository.uwa.edu.au · Web viewGenome wide identification and comparative analysis of NBS-LRR resistance genes in Brassica napus Salman AlameryA B, Soodeh TirnazD, Philipp

Figure 5

42

1

2

3

Page 43: research-repository.uwa.edu.au · Web viewGenome wide identification and comparative analysis of NBS-LRR resistance genes in Brassica napus Salman AlameryA B, Soodeh TirnazD, Philipp

Figure 6

43

1

2

3

4