a gdna microarray for genotyping salvia species

14
RESEARCH A gDNA Microarray for Genotyping Salvia Species Alexandra Olarte Nitin Mantri Gregory Nugent Hans Wohlmuth Chun Guang Li Charlie Xue Edwin Pang Published online: 27 December 2012 Ó Springer Science+Business Media New York 2012 Abstract Salvia is an important genus from the Lamia- ceae with approximately 1,000 species. This genus is dis- tributed globally and cultivated for ornamental, culinary, and medicinal uses. We report the construction of the first fingerprinting array for Salvia species enriched with poly- morphic and divergent DNA sequences and demonstrate the potential of this array for fingerprinting several eco- nomically important members of this genus. In order to generate the Salvia subtracted diversity array (SDA) a suppression subtractive hybridization (SSH) was per- formed between a pool of Salvia species and a pool of angiosperms and non-angiosperms to selectively isolate Salvia-specific sequences. A total of 285-subtracted geno- mic DNA (gDNA) fragments were amplified and arrayed. DNA fingerprints were obtained for fifteen Salvia geno- types including three that were not part of the original subtraction pool. Hierarchical cluster analysis indicated that the Salvia-specific SDA was capable of differentiating S. officinalis and S. miltiorrhiza from their closely related species and was also able to reveal genetic relationships consistent with geographical origins. In addition, this approach was capable of isolating highly polymorphic sequences from chloroplast and nuclear DNA without preliminary sequence information. Therefore, SDA is a powerful technique for fingerprinting non-model plants and for identifying new polymorphic loci that may be devel- oped as potential molecular markers. Keywords Microarray Genotyping Salvia Diversity Introduction Salvia (Lamiaceae) is an important genus with approxi- mately 1,000 species distributed widely in many regions of the world including the Mediterranean area, southern Africa, Central and South America, and Asia [1]. This genus is largely cultivated for ornamental, culinary, and medicinal uses [2, 3]. For example, S. officinalis L. is used to preserve foods and employed as a spice for flavoring [3]. Its essential oil is used in perfumery and cosmetics [4] and the plant and its derivates are known to have a wide-range of biological activities, such as antibacterial, antioxidative, anti-inflammatory, hypoglycemic activity [3, 5]. Another example is the roots and rhizome of Salvia miltiorrhiza Bunge, known as danshen, which is officially listed in the Chinese Pharmacopeia to be used for the treatment of menstrual disorders, menorrhalgia, insomnia, menostasis, A. Olarte N. Mantri (&) G. Nugent E. Pang Health Innovations Research Institute, School of Applied Sciences, RMIT University, Melbourne, VIC 3000, Australia e-mail: [email protected] A. Olarte e-mail: [email protected] G. Nugent e-mail: [email protected] E. Pang e-mail: [email protected] H. Wohlmuth Southern Cross Plant Science and Medicinal Plant Herbarium, Southern Cross University, Lismore, NSW 2480, Australia e-mail: [email protected] C. G. Li C. Xue Health Innovations Research Institute, School of Health Sciences, RMIT University, Melbourne, VIC 3000, Australia e-mail: [email protected] C. Xue e-mail: [email protected] 123 Mol Biotechnol (2013) 54:770–783 DOI 10.1007/s12033-012-9625-5

Upload: scu-au

Post on 26-Apr-2023

0 views

Category:

Documents


0 download

TRANSCRIPT

RESEARCH

A gDNA Microarray for Genotyping Salvia Species

Alexandra Olarte • Nitin Mantri • Gregory Nugent •

Hans Wohlmuth • Chun Guang Li •

Charlie Xue • Edwin Pang

Published online: 27 December 2012

� Springer Science+Business Media New York 2012

Abstract Salvia is an important genus from the Lamia-

ceae with approximately 1,000 species. This genus is dis-

tributed globally and cultivated for ornamental, culinary,

and medicinal uses. We report the construction of the first

fingerprinting array for Salvia species enriched with poly-

morphic and divergent DNA sequences and demonstrate

the potential of this array for fingerprinting several eco-

nomically important members of this genus. In order to

generate the Salvia subtracted diversity array (SDA) a

suppression subtractive hybridization (SSH) was per-

formed between a pool of Salvia species and a pool of

angiosperms and non-angiosperms to selectively isolate

Salvia-specific sequences. A total of 285-subtracted geno-

mic DNA (gDNA) fragments were amplified and arrayed.

DNA fingerprints were obtained for fifteen Salvia geno-

types including three that were not part of the original

subtraction pool. Hierarchical cluster analysis indicated

that the Salvia-specific SDA was capable of differentiating

S. officinalis and S. miltiorrhiza from their closely related

species and was also able to reveal genetic relationships

consistent with geographical origins. In addition, this

approach was capable of isolating highly polymorphic

sequences from chloroplast and nuclear DNA without

preliminary sequence information. Therefore, SDA is a

powerful technique for fingerprinting non-model plants and

for identifying new polymorphic loci that may be devel-

oped as potential molecular markers.

Keywords Microarray � Genotyping � Salvia � Diversity

Introduction

Salvia (Lamiaceae) is an important genus with approxi-

mately 1,000 species distributed widely in many regions of

the world including the Mediterranean area, southern

Africa, Central and South America, and Asia [1]. This

genus is largely cultivated for ornamental, culinary, and

medicinal uses [2, 3]. For example, S. officinalis L. is used

to preserve foods and employed as a spice for flavoring [3].

Its essential oil is used in perfumery and cosmetics [4] and

the plant and its derivates are known to have a wide-range

of biological activities, such as antibacterial, antioxidative,

anti-inflammatory, hypoglycemic activity [3, 5]. Another

example is the roots and rhizome of Salvia miltiorrhiza

Bunge, known as danshen, which is officially listed in the

Chinese Pharmacopeia to be used for the treatment of

menstrual disorders, menorrhalgia, insomnia, menostasis,

A. Olarte � N. Mantri (&) � G. Nugent � E. Pang

Health Innovations Research Institute, School of Applied

Sciences, RMIT University, Melbourne, VIC 3000, Australia

e-mail: [email protected]

A. Olarte

e-mail: [email protected]

G. Nugent

e-mail: [email protected]

E. Pang

e-mail: [email protected]

H. Wohlmuth

Southern Cross Plant Science and Medicinal Plant Herbarium,

Southern Cross University, Lismore, NSW 2480, Australia

e-mail: [email protected]

C. G. Li � C. Xue

Health Innovations Research Institute, School of Health

Sciences, RMIT University, Melbourne, VIC 3000, Australia

e-mail: [email protected]

C. Xue

e-mail: [email protected]

123

Mol Biotechnol (2013) 54:770–783

DOI 10.1007/s12033-012-9625-5

blood circulation diseases, and other cardiovascular dis-

eases [6].

One of the fundamental requirements for the commer-

cialization of Salvia is the accurate identification of com-

mercially important species from its close relatives.

Traditionally, the correct species identification was per-

formed by morphological or chemical techniques [6–8].

Nowadays, molecular biology techniques have given rise to

new possibilities for identification based on the genetic

composition of the plant which is unaffected by environ-

mental, developmental, and biological factors [9, 10].

However, DNA-based analyses of important medicinal

herbs such as Salvia has several challenges since they are

not model plants, their genome sizes are usually unknown

and there is a lack of molecular-marker-based approaches

for plant improvement [11]. PCR-based methods such as

random amplified polymorphic DNA (RAPD) and ampli-

fied fragment length polymorphism (AFLP) have been

commonly used to fingerprint different species of Salvia

[2, 4, 12–14]. However, the PCR-based methods are based

on gel electrophoresis, which is time consuming, and cor-

relating bands on gels for the allelic variants is difficult and

can lead to inaccurate interpretations [15]. Sequencing-

based methods have also been used for species identi-

fication of these genera. For instance, chloroplast and

mitochondrial DNA regions have been amplified and

restriction digested (polymerase chain reaction–restriction

fragment length polymorphism; PCR–RFLP) to genotype

Salvia species endemic to the Mediterranean region [16].

Furthermore, sequences of the internal transcribed spacer

(ITS) region have been amplified, sequenced and aligned in

order to genotype different Salvia [17, 18]. However,

chloroplast and ITS regions have not always been found to

be polymorphic among closely related Salvia species. For

example, high sequence similarities were found among

these DNA regions of some Japanese Salvia, which made it

difficult to determine evolutionary relationships among

them [19]. In addition, it is very unlikely that any chloro-

plast loci could be linked to a gene responsible for an

important agronomic trait, since these genes are usually

found in nuclear genome. Therefore, there is a need for

genetic markers in Salvia that could be useful not only for

the accurate identification of closely related species but that

could also be linked to desirable agricultural traits.

DNA microarrays can overcome many of the limitations

of the PCR-based fingerprinting methods. For instance,

they provide fixed data features, removing the positional

variation inherent in gel fingerprints. In addition, this

technique is based on hybridization, ensuring that common

elements are identical and genetically informative [20]. A

technology called diversity array technology (DArT) uses

microarrays to detect DNA polymorphism at several hun-

dred genomic loci with the advantage that no prior

knowledge of sequence information is needed [15]. The

main disadvantage of DArT is the method of genome

complexity reduction used, since a large number of

homologous/monomorphic sequences remain present in the

array, which will affect the level of polymorphism detected

[21].

In 2007, we reported a novel DNA-fingerprinting tech-

nique called subtracted diversity array (SDA) [22]. This

technique uses an alternative subtraction suppression

hybridization (SSH) that removes the highly conserved

genomic DNA from the DNA representations that are used

to develop the array. Previously, an SDA was constructed

using gDNA representations from angiosperms and non-

angiosperms to isolate sequences specific to flowering

plants. The array successfully genotyped different flower-

ing plants to the clade, order and family level [23]. In order

to increase the discriminatory power of the angiosperm

SDA, an Asterid-specific SDA was also constructed. This

SDA was capable of successfully fingerprinting 25 Asterid

species representing 20 families and 9 orders [24]. The

broad subtraction approach (between clades) conducted to

produce the Asterid-specific SDA was able to isolate highly

polymorphic regions which were found to be novel species-

specific loci and chloroplast regions that have been

extensively used in phylogenetic analyses. However, its

discriminatory power was not enough to differentiate

among species of the same genera (data not shown). In this

article, we report the construction of the first genera-

specific SDA for Salvia species employing a narrow sub-

traction approach and demonstrate its fingerprinting

potential. In addition, we report the identification of highly

discriminatory and species-specific sequences that may

possibly be developed as molecular markers.

Materials and Methods

Plant Material

In order to obtain a genomic DNA (gDNA) representation

for the subtraction, a total of 151 species including

angiosperms and non-angiosperms were sourced (Table 1).

Non-angiosperms were collected from Toolangi State Park,

Victoria (Australia) and identified [25]. Angiosperms were

obtained only from verified nursery species; a total of 126

species were sourced to represent all angiosperm clades. In

addition, a total of ten Salvia species (42 plants) were

sourced to represent the different centres of diversity

around the world. Salvia miltiorrhiza and S. sinica Migo

plants were obtained from seeds which were used in a

previous study [26]. The other Salvia species were obtained

from verified specimens from various plant nurseries

(Table 1).

Mol Biotechnol (2013) 54:770–783 771

123

Table 1 Description of the angiosperm and nonangiosperm species used for DNA extraction and development of genome representations for

preparing the Salvia-specific SDA

Representations Species

Nonangiosperms (25 species) Adiantum raddianum Dicksonia antarctica Riccardia eriocaula

Azolla sp. Equisetum hyemale Selaginella sp.

Blechnum chambersii Ginkgo biloba Sphagnum australe

Blechnum fluviatile Grammitis billardieri Sticherus tener

Bryum billardieri Hymenophyton flabellatum Thuidium sp.

Catagonium nitens Marchantia sp. Weymouthia cochlearifolia

Cyathea cooperi Microsorum pustulatum Wollemia nobilis

Cyathophorum sp. Polystichum proliferum

Dawsonia superba Racopilum cuspidigerumvar.convolutaceum

Magnoliids (6 species) Cinnamomum verum Illicium anisatum Nymphaea gigantea

Houttuynia cordata Magnolia denudate Peumus boldus

Monocots (23 species) Acorus calamus Dioscorea polystacha Polygonatum multiflorum

Acorus gramineus Fritillaria thunbergii Ruscus aculeatus

Aloe vera Iris domestica (syn. Belamcanda chinensis) Serenoa repens

Bambusa beecheyana Iris versicolor Trachycarpus fortunei

Bletilla striata Lilium longiflorum Zea mays

Coix lacryma-jobi Lomandra longifolia Zephyranthes sp.

Colocasia esculenta Ophiopogon japonicus Zingiber officinale

Curcuma longa Pinellia cordata

Eudicots not placed in either the Rosids

or Asterids subclades (19 species)

Aconitum carmichaelii Chelidonium majus Eschscholzia californica

Aquilegia sp. Clematis hexapetala Grevillea robusta

Berberis fortunei Clematis montana Gypsophila oldhamiana

(syn. Mahonia fortunei) Clematis serratifolia Hamamelis virginiana

Berberis japonica Clematis songarica Phytolacca acinosa

(syn. Mahonia japonica) Dianthus caryophyllus Ranunculus sp.

Buxus sempervirens Dianthus superbus Rumex crispus

Rosids (33 species) Abutilon theophrasti Citrus aurantium Isatis tinctoria

Agrimonia eupatoria Citrus reticulata Oenothera biennis

Agrimonia pilosa Crataegus monogyna Oenothera odorata

Albizia julibrissin Dichroa febrifuga Oxalis pes-caprae

Alchemilla xanthochlora Filipendula ulmaria Passiflora edulis

Althaea officinalis Firmiana simplex Pelargonium sp.

Armoracia rusticana Glycyrrhiza glabra Poncirus trifoliata

Astragalus membranaceus Glycyrrhiza uralensis Rosa rugosa

Baptisia tinctoria Gynostemma pentaphyllum Ruta graveolens

Allocasuarina sp. Humulus lupulus Sophora flavescens

Catha edulis Hypericum perforatum Urtica dioica

Asterids (45 species) excluding

Lamiaceae

Achillea millefolium Codonopsis thalictrifolia Sambucus nigra

Adenophora potaninii Codonopsis pilosula Scrophularia ningpoensis

Angelica archangelica Coffea arabica Scrophularia nodosa

Angelica dahurica Cynara scolymus Solidago canadensis

Aralia chinensis Digitalis purpurea Symphytum officinale

Artemisia abrotanum Eupatorium perfoliatum Syringa vulgaris

Artemisia absinthium Ilex paraguariensis Tanacetum cinerariifolium

Artemisia lactiflora Impatiens sp. Tanacetum parthenium

Artemisia pontica Inula helenium Taraxacum officinale

Atropa belladonna Lycium barbarum Tetrapanax papyrifer

772 Mol Biotechnol (2013) 54:770–783

123

DNA Extraction and Development of Pools

Total DNA was extracted from fresh leaves using a mod-

ification of the standard CTAB procedure [27]. Approxi-

mately 0.5 g of leaves was ground with liquid nitrogen to a

fine powder. The powder was dissolved in 5 ml of CTAB

Buffer (3 % CTAB, 100 mM Tris–HCL pH 8.0, 20 mM

EDTA pH 8.0, 1.4 M NaCl), 1 ml of 10 % PVP and 1.2 ml

of 10 % CTAB. Subsequently, the mixture was incubated

at 60 8C for 1 h and centrifuged at 13,000 rpm for 10 min.

The supernatant obtained was further purified with a dou-

ble chloroform extraction followed by precipitation with

7.5 M ammonium acetate and 100 % ethanol. The pre-

cipitated DNA was resuspended in sterile water and sub-

sequent cleanup was performed by using the DNeasy�

column of the DNeasy� Plant Mini Kit (Qiagen) following

the protocol in the user manual.

All DNA samples were pooled based on the Angiosperm

Phylogeny Group (2009) classification [28] in order to obtain

representations of the following seven groups: Salvia (Tester

pool), non-angiosperms, Monocots, Magnoliids, Rosids,

Asterids (excluding Lamiaceae), and Eudicots not placed

in either Rosids or Asterids subclades (Eudicots and Core

Eudicots) (Table 1). About 10 lg of DNA was bulked for

each representation, with each pool having equal amounts of

gDNA per species. Subsequently, each pool was separately

concentrated using the DNeasy� column of the DNeasy�

Plant Mini Kit (Qiagen). The concentration and purity of the

DNA pools were evaluated spectrophotometrically.

Subtraction, Library, and SDA Construction

The method used for subtraction, library and microarray

construction was prepared using our previously described

method [22].

Subtraction was performed using the PCR-SelectTM

cDNA Subtraction Kit (Clontech). The Salvia pool (tester)

was represented by equal amounts of DNA extracted from

42 different plants of which 25 were Salvia miltiorrhiza

(five plants from five different lines). The driver pool was

obtained by bulking 700 ng of each representation with the

exception of the Salvia pool (Table 1).

The Salvia-specific DNA obtained after the subtraction

was purified using the QIAquick PCR Purification Kit

(Qiagen). Then, approximately 100 ng of the purified PCR

products were ligated into the pGEM�-T Easy vector

(Promega) and transformed into heat-shock competent

Escherichia coli JM109 (Promega) according to the user

manual. Positive transformation was determined by PCR

amplification of the cloned insert using nested primers 1

and 2R (Clontech). Plasmids containing cloned inserts

Table 1 continued

Representations Species

Asterids Bacopa monnieri Petroselinum crispum Trachelospermum jasminoides

Camellia sinensis Physalis peruviana Tussilago farfara

Centella asiatica Plantago major Valeriana officinalis

Chamaemelum nobile Platycodon grandiflorus ‘Apoyama’ Verbascum thapsus

Clerodendrum trichotomum Platycodon grandiflorus Withania somnifera

Salvia (13 species) Subtraction pool (total of 42

plants)

S. sinicag(five plants from Zhejiang

province)

Not included in the SDA

development

S. przewalskiig S. lanceolataf

S. officinalisa (five plants) S. microphyllac

S. lyratab S. miltiorrhizag S. fruticosad,a,e

S. elegansc (5 plants from 5 different populations)

S. sclaread,a,e Shandong province

S. mexicanac Shanxi province

S. runcinataf Henan province

S. lavandulifoliaa Hebei province

S. miltiorrhiza f. alba (Shandong)

a Native to the Mediterraneanb Native to North Americac Native to Central Americad Native to North Africae Native to Central and western Asiaf Native to South Africag Native to China

Mol Biotechnol (2013) 54:770–783 773

123

which showed a single band were isolated from subcultured

transformed cells using the DirectPrep 96 Miniprep Kit

(Qiagen).

The cloned inserts were subsequently PCR amplified

from the corresponding plasmid using the nested primers.

After that, PCR products were precipitated in 96 % ethanol

and 3 M sodium acetate (pH 5.2). The precipitation was

carried out at -20 8C overnight. The pellets obtained were

washed with 70 % ethanol, air dried and resuspended in

10 ll of 50 % DMSO. Finally, a total of 285 clones were

transferred individually into a 384-well plate (Genetix,

Hampshire, UK).

The 285 clones together with the controls were printed on

aminosilane-coated slides using a BioRobotics� MicroGrid

II Compact arrayer (Genomic Solutions) at RMIT Univer-

sity, Australia. Eight subarrays were gridded on a Corning�

GAPS II coated slide (Corning Incorporated Life Sciences,

Acton, MA). Each subarray was composed of 285 samples

and 15 controls. Among the positive controls were three

housekeeping genes (ribulose-1,5-bisphosphate carboxyl-

ase/oxygenase, ribosomal RNA and chlorophyll a/b-binding

protein) sourced from Cicer arietinum [29]. A single printed

slide was used to perform two hybridization experiments,

where each hybridization reaction was tested with 4

subarrays.

Labeling of Target DNA and SDA Hybridization

Target synthesis and hybridization of the microarray slides

were performed by modifying our previously described

method [22]. Only 30 ng of biotin-labeled sample was

mixed with 17.5 ll of 2 9 hybridization buffer (5 9 SSC,

0.2 % SDS, 50 % formamide); 0.5 ll of 1 mg/ml Human

Cot1 DNA (Invitrogen); 0.5 ll of 5 mg/ml PolyA (Sigma-

Aldrich) and 0.5 ll of 10 mg/ml salmon sperm DNA

(Sigma-Aldrich). After that, the mixture (made up to 35 ll

with sterile water) was denatured at 100 8C for 2 min and

applied to the array under a 22 9 22 mm lifter slip (Grale

Scientific, Australia). Hybridization was performed over-

night in a water bath at 42 8C. All hybridizations were

performed with four technical replicates (subarrays) and

two biological replicates, for a total of eight data points per

array feature.

After hybridization, the coverslips were removed and

the slides were washed twice in 19 SSC, 0.1 % SDS at

40 8C for 8 min, once in 0.19 SSC, 0.1 % SDS at 40 8Cfor 8 min and once in 0.19 SSC at room temperature for

5 min. Then, detection of the biotinylated DNA targets

bound on the array was performed by a protocol modified

from Mirus Label IT� lArray� Biotin Labeling Kit [24].

In brief, the slides were washed once in 69 SSPE-T buffer

(0.9 M NaCl, 0.06 M NaH2PO4�H2O, 0.006 M EDTA,

0.005 % Triton X-100, pH 7.4) at room temperature for

5 min. Subsequently, the detection solution (0.8 ll of

25 lg/ul of BSA, 0.5 ll of 0.8 lg/ll streptavidin-labeled

CyTM3 dye (Amersham Pharmacia, UK), made to 200 ll

with 69 SSPE-T) was applied to the wet surface of the

slide and covered by a 25 9 60 mm lifter coverslip (Grale

Scientific) to evenly distribute the solution. The slides were

incubated at 37 8C for 40 min in a waterproof hybridiza-

tion chamber in the dark. Finally, the slides were washed

three times in 69 SSPE-T buffer for 5 min, rinsed with

deionized water and dried with an air gun.

Data and Statistical Analysis

Slides were scanned at 10 lm resolution and gain of 50

PMT using a Perkin Elmer array scanner. Images were

captured and quantified with the ScanArray Express�

Microarray Analysis System. The program quantified the

signal intensity of each spot using adaptive circle method

and LOWESS function. Local background was subtracted

during quantitation. The signal-to-noise ratio obtained for

each spot by ScanArray Express� was defined as:

Signal-to-noise ratio

¼ Mean foreground�Mean background

Standard deviation of background

This signal value was considered to have the most accurate

background correction since it also accounted for varia-

tions in background intensity over the array. Quantified

data was exported to Microsoft Excel (Microsoft) and

abnormal spots that were not automatically flagged by the

software were flagged manually.

The following series of data transformations and

subsequent statistical analyses that were performed with

the raw signal intensity (signal-to-noise ratio) are sum-

marized in Fig. 1. In brief, a mean of the signal intensity

was obtained for each feature between the four technical

replicates. The data was then normalized across the

slides using the mean signal intensity for all 285 features

in all hybridizations performed. After normalization, the

data of the biological replicates was combined to pro-

duce a single value per feature. This data was transferred

to PASW Statistics 18 to perform a hierarchical cluster

analysis of the 15 Salvia genotypes. The dissimilarity

dendrogram was generated using the average distance

linkage between-groups and Square Euclidian metrics.

In addition, the normalized mean data was used for

principal component analyses (PCA) and correlate bivar-

iate analysis using MINITAB� Release 14.1 and PASW

Statistics 18, respectively. These analyses were able to

distinguish the most discriminatory features useful for

fingerprinting.

774 Mol Biotechnol (2013) 54:770–783

123

Sequencing of Selected most Discriminatory

and Species-Specific Features

The cloned inserts were re-amplified from the corre-

sponding isolated plasmid using SP6/T7 primers. Ampli-

fication products were bi-directionally sequenced by

Macrogen Inc. (Korea). Vector and primer sequences were

removed and nucleic acid and protein homology searches

were performed using blastN and blastX programs through

the National Center of Biotechnology Information [30].

All sequences have been deposited in EMBL Nucleo-

tide Sequence Database (Accession number HE96289 to

HE962398).

Results and Discussion

The first step to generate the Salvia-SDA was to use a

suppression subtractive hybridization (SSH) between a

gDNA pool of ten Salvia species (tester) and a pool of 151

non-Lamiaceae angiosperm and non-angiosperm species

(driver) to eliminate DNA fragments common to these two

pools (Fig. 2). The species belonging to the Lamiaceae

were not included in the driver pool since this would have

produced an over-subtracted library resulting in fewer

polymorphic sequences between the Salvia. Therefore, a

very conservative approach was employed for the sub-

traction. During SSH, the tester (Salvia) DNA was divided

in two parts and different adaptors were ligated to each part

followed by two hybridization steps between the tester and

the driver. In the first hybridization, the homologous

sequences between the driver and the tester were hybrid-

ized, leaving almost all of the Salvia-specific DNA single

stranded. Subsequently, in the second hybridization,

Salvia-specific DNA should have been the only double

stranded DNA with a different adaptor sequence allowing

exponential amplification of these sequences by PCR.

These potential Salvia-specific sequences were then iso-

lated by cloning, amplified and used to construct the

Salvia-specific SDA. Once constructed, the Salvia-specific

SDA was validated and tested for its ability to successfully

differentiate (fingerprint) different Salvia species. Each

fingerprint was obtained by hybridizing a biotin-labeled

genomic DNA of each Salvia species to the SDA (Fig. 3).

Finally, the hybridization patterns (fingerprints) were sub-

sequently compared to identify the most discriminatory

features and species-specific features that may be devel-

oped as potential molecular markers.

Subtraction Efficiency and Validation of the Microarray

The Salvia-specific microarray was first validated by

determining the subtraction efficiency, i.e., if the con-

structed array contained only Salvia-specific sequences. In

order to determine this efficiency, the tester and driver pool

were used as targets and separately hybridized onto the

array. Theoretically, a complete subtraction should result in

the absence of any hybridization with the driver pool as all

driver sequences are supposed to be eliminated. However,

thirty-four (12 %) positive features were observed after

hybridizing the driver target, indicating 88 % efficiency.

This subtraction efficiency was lower than that obtained for

the angiosperm [22] and the Asterids [24] SDAs where 97

and 99 % of efficiency was achieved, respectively.

During this subtraction, we used 30-fold excess of driver

(1:30 tester:driver ratio) as recommended by the kit man-

ufacturer (Clontech, CA). This same ratio was used for the

rather broad subtractions performed to obtain the angio-

sperm and Asterids SDAs which effectively eliminated the

common sequences between tester and driver. However,

the Salvia subtraction was more stringent than that of

previous work since the subtraction needed to be per-

formed to the genus level. It may be possible that a higher

concentration of driver DNA may be needed to remove

highly and partially homologous sequences from the tester

Fig. 1 Flow-chart of data analyses. The input of these analyses was

the signal-to-noise ratio obtained after scanning and quantitation of

spot intensities

Mol Biotechnol (2013) 54:770–783 775

123

pools. Mathematical models for suppression subtractive

hybridization have shown that by increasing the excess

ratios, the likelihood for enriching for highly specific tester

sequences is increased [31]. Therefore, it is possible that

higher subtraction efficiencies are achieved using higher

concentration of driver. However, the results of 88 %

efficiency are satisfactory for a subtraction with this high

level of stringency where genera-specific sequences are

isolated.

Microarray Scoring

A new scoring method was developed using the raw data of

the signal intensity since the binomial data scoring repre-

sented clear limitations in the analysis. Initially, the scoring

of the array was performed by converting the signal-

to-noise ratio of each feature to binomial using the same

analysis as our previously described method [22]. However,

we discovered that several of the features were giving signal

for all Salvia species though with different intensities. This

made us realize that since the species we assayed belonged

to the same genera; they shared highly similar gDNA

sequences. These sequences from closely related species

would potentially differ from each other by less than 10 %

thus generating variable signal. Therefore, due to the limi-

tations represented and possible misleading effects of the

binomial scoring, all the subsequent analyses were per-

formed with the raw signal intensity.

Previous array-based studies have also found that

binomial scoring may present difficulties in the analysis of

out-breeding species. For instance, a study based on DArT

found that the bi-allelic assessment limited the range of

possible analysis in 17 Eucalyptus individuals [32]. Fur-

thermore, another DArT study on Arabidopsis thaliana

successfully identified segregating markers in a F2 popu-

lation based on hybridization signal intensity differences

Fig. 2 The process of

construction of the Salvia SDA.

The genomic representations of

tester and driver were used to

perform a suppression

subtractive hybridization (SSH)

in order to eliminate the DNA

fragments common to these two

pools. Salvia-specific sequences

were then isolated by cloning,

amplified and used to construct

the Salvia-specific SDA

776 Mol Biotechnol (2013) 54:770–783

123

[33]. One common feature between the current study and

the previous Eucalyptus DArT study is that the species

analyzed are mainly cross-pollinated [34, 35]. Therefore,

the cross-pollination in these species could have increased

the level of heterozygosity, which will generate interme-

diate signal intensities that could have complicated the

analysis of binomial data. Consequently, the direct com-

parison between signal intensities is the preferred option

for a cross-pollinated genus such as Salvia, since the

assessment based on the presence and absence (dominant)

could have potentially misleading effects due to the higher

heterozygosity expected in this genus.

Evaluation of the Fingerprinting Potential of the SDA

Fingerprints for fifteen Salvia genotypes were obtained,

which included fingerprints for thirteen species and two

accessions of Salvia officinalis. Out of these thirteen, three

fingerprints were of S. lanceolata Lam., S. microphylla

Kunth, and S. fruticosa Mill., which were species not used

in the construction of the SDA (Table 1). The entire data

set from these experiments has been deposited in Gene

Expression Omnibus (GSE39403).

The relationship between the Salvia genotypes was

examined by a hierarchical cluster analysis constructed

based on the 285 features (Fig. 4). This dendrogram

grouped the 15 genotypes into three distinctive clusters.

Cluster A contained species native to the Mediterranean

region. Cluster B grouped all species native to Africa or the

Americas together with S. sclarea L. and S. fruticosa,

which are native to Europe, North Africa and Asia, and

Cluster C grouped all Salvia species from China.

The hierarchical cluster analysis clearly differentiated

important commercial Salvia species and accessions within

species. For instance, it was possible to differentiate Salvia

officinalis from its closely-related species S. fruticosa. These

Fig. 3 Process of biotin

labeling and hybridization. Each

fingerprint was obtained by

hybridizing a biotin-labeled

genome representation of each

Salvia species to the SDA.

Then, the biotinylated targets

bound to the array were labeled

with streptavidin-CyTM3 dye

Mol Biotechnol (2013) 54:770–783 777

123

two species have a high morphological similarity [7], which

has led to adulteration in commercial products. For example,

most of the commercial dried sage imported into North

America not only consists of S. officinalis but it is often

mixed with S. fruticosa since S. officinalis has a slow growth

rate in winter months [36]. In the cluster analysis, the three

accessions of S. officinalis were found to be closely related to

S. lavandulifolia Vahl, in particular to S. officinalis var.

purple, while S. fruticosa was located in a separate cluster

(Fig. 4). These results are in agreement with a recent taxo-

nomical classification [7], where S. lavandulifolia was found

to be a subspecies of S. officinalis and S. fruticosa was clearly

differentiated from these two taxa. Furthermore, the SDA

could also effectively differentiate S. miltiorrhiza from its

close relatives, S. przewalskii Maxim. and S. sinica. Roots of

S. przewalskii and S. sinica are often sold as Danshen (roots

of S. miltiorrhiza), even though chemical fingerprinting by

high performance liquid chromatography (HPLC) have

found that the levels of total phenolic acids and diterpenoids

in the roots of S. miltiorrhiza are generally higher [37, 38].

The above results indicate that this Salvia-specific SDA

could be a useful tool for fingerprinting closely related spe-

cies and accessions that have been used in the construction of

the array.

The results also demonstrate the ability of the SDA to

fingerprint species that were not used in its construction.

For instance, it was possible to fingerprint S. lanceolata,

S. microphylla, and S. fruticosa, which grouped with other

Salvia species of similar geographical origin in the hier-

archical cluster analysis (Fig. 4), even though they were

not part of the original subtraction pool. Similar results

were found for the angiosperm and Asterid-specific SDAs

which were able to fingerprint species outside the initial

subtraction pool [23, 24]. Therefore, this SDA may have a

wider applicability in fingerprinting other species of Salvia

apart from the ones used to construct the array, with the

advantage of not requiring prior sequence information.

Further, the hierarchical analysis revealed genetic rela-

tionships consistent with geographical origins (Fig. 4). The

dendrogram shows three major clusters. The first cluster

included the European Salvia, the second cluster had the

native American and African species together with S. sclarea

and S. fruticosa, and the third clustered the Chinese Salvia.

However, these results differ from previous genotyping

studies. For instance, recent analyses of 51 worldwide

collected accessions of Salvia with AFLP have grouped the

genotypes into two main clusters [14]. The first one

included all species form Central and South America and

the second one contained species form Europe, Africa and

Asia. This clear differentiation between African and

American Salvia was not found in the present study

(Fig. 4). Interestingly, the AFLP and the SDA-based clas-

sification also presented differences with previous phylo-

genetic studies performed in the tribe Mentheae and in the

genus Salvia using the sequences of the amplified nuclear

rDNA ITS and chloroplast DNA regions [1, 39]. It is

important to note that the main aim of this study was to

develop a microarray for fingerprinting Salvia species and

not to perform a phylogenetic analysis of this genus.

During the construction of the SDA, the Salvia pool was

enriched with S. miltiorrhiza, S. sinica, and S. officinalis.

Therefore, the SDA could be overrepresented with

Fig. 4 Dissimilarity

dendrogram for the SDA

hybridization patterns of fifteen

Salvia genotypes using the 285

features (Squared Euclidian

distance, between groups

linkage). The steps of the

dendrogram show the combined

clusters and the values of the

distance coefficients at each

step; the values have been

rescaled to numbers between 0and 25, preserving the ratio of

the distances between the steps

778 Mol Biotechnol (2013) 54:770–783

123

sequences from these three species, and as a result the

phylogenetic analyses obtained from this array could be

biased on the distances given across the species and major

clusters. Previous SDA studies have shown the utility of

the SDA for inferring genetic relationships consistent with

the Angiosperm Phylogeny Group (2009) classification

[28]. For instance, the angiosperms-specific SDA has

shown to be useful in classifying different families with the

respective clade and species within their correspondent

families [23]. However, in order to apply this technique for

phylogenetic analyses in Salvia, a more comprehensive

array would have to be constructed with equal genomic

representations from all Salvia subgenera. In addition, a

wider range of species would have to be genotyped in order

to obtain a more detailed phylogenetic and evolutionary

analysis of the genus.

Identification of most Discriminatory

and Species-Specific Features

Identification of the most discriminatory and species-

specific features was performed by a series of statistical

analyses (Fig. 1). Firstly, principal components analysis

(PCA) was performed in order to identify the most dis-

criminatory DNA fragments capable of generating a fully

resolved phylogeny of the genotypes analyzed. As it can be

seen in Fig. 5, a high percentage of variation (80.5 %) may

be explained by the first two components. Based on this

analysis, only the four most distant features from zero on

the X axis were chosen as the first component explained

most of the variation (71.1 %). Upon examination of these

four features (A16, I7, N6, P4) it was observed that they

were the only features that had a high variance and a high

mean across the fingerprints; however, none of them

showed any species specificity.

In order to find species-specific features, a second

analysis based on the magnitude of the variance for the

signal intensity of each feature across the 15 genotypes was

performed. Ten species-specific features (E13, F5, G4,

G13, H17, J9, N7, N12, N13, O1) were identified which

were not previously detected by PCA. However, more

than one feature showed specificity to the same species,

implying there were features with the same patterns of

variation. Pearson bivariate correlation performed among

the 14 features (four chosen by PCA and ten species-

specific), indicated that there were positive significant

correlations between H17 and J9 (r = 0.98, P \ 0.01),

H17 and G4 (r = 0.99, P \ 0.01), G13 and N7 (r = 0.99,

P \ 0.01) and between N6 and I7 (r = 0.83, P \ 0.01).

Therefore, J9, G4, N7, and I7 were eliminated and only ten

features (Table 2) were selected as they were the most

discriminatory and each had unique patterns of variation

among the 15 genotypes.

Finally, a second hierarchical cluster was performed using

only the ten features selected above (three chosen by PCA and

seven species-specific) (Fig. 6). It was found that the clus-

tering of the species was consistent with the major clusters

obtained using the full set (Fig. 4). However, five of the spe-

cies in Cluster B were displaced relative to the original den-

drogram. In the second dendrogram, S. microphylla clustered

with S. lanceolata and S. lyrata L., whilst S. mexicana L. and

S. runcinata L.f. appeared to be more distantly related. In

contrast, in the original dendrogram, S. microphylla and

S. mexicana clustered together, whilst S. runcinata,

S. lanceolata and S. lyrata were closely related (Fig. 4). Based

on the above results, it was inferred that these ten features were

among the most useful features for fingerprinting these fifteen

Salvia genotypes. Among this subset, species-specific features

were found for S. officinalis, S. sclarea, S. przewalskii,

S. elegans Vahl, and S. runcinata (Table 2; Fig. 6). Also

specific features which discriminated between species were

found, for instance feature N12 was specific for S. sinica,

S. miltiorrhiza, and S. przewalskii and H17 can differentiate

S. officinalis and S. lavandulifolia from the other species.

Sequence Identity of the most Discriminatory

and Species-Specific Features

The species-specific features together with three features

chosen by PCA were sequenced and analyzed using blastN

and blastX [30] (Table 2). Four out of the ten features

matched with homologous sequences in GenBank. Features

E13 [EMBL: HE962398] and P4 [EMBL: HE962389]

corresponded to known chloroplast loci. Feature N12

[EMBL: HE962396] matched to an uncharacterized geno-

mic DNA, F5 [EMBL: HE962397] corresponded to an

uncharacterized gene and the other six features were not

Fig. 5 Principal component analysis plot for the 285 features. The

first principal component accounts for 71.1 % of variation and the

second component explained only 9.4 % of variation. The squaresrepresent features that account for most of the variability found across

the genotypes

Mol Biotechnol (2013) 54:770–783 779

123

Table 2 Predicted locus/function of the ten sequenced SDA features using blastN program [30]

Feature

ID

Length

(bp)

Specific to target Matching

database

entry

Putative identity E value

A16a 556 – No hits NA

E13b 372 Differentiate

accessions of S.officinalis

DQ673256.1 Forsythia europaea psaA-psbB fragment, chloroplast. 7e-151

GQ996975.1 Antirrhinum majus cpl protease proteolytic subunit protein (cplP) gene,

complete cds, chloroplast

8e-150

F5b 218 S. przewalskii DY322087.1 Ocimum basilicum uncharacterized cDNA sequence 6e-14

G13b 145 S. elegans – No hits NA

H17b 612 S. officinalis – No hits NA

S. lavandulifolia

N12b 423 S. miltiorrhiza FJ148301.1 Daucus carota subsp. sativus uncharacterized sequence 2e-06

S. sinica

S. przewalskii

N13b 556 S. runcinata – No hits NA

N6a 250 – No hits NA

O1b 118 S. sclarea – No hits NA

P4a 526 DQ673256.1 Forsythia europaea psaA-psbB fragment, chloroplast. 2e-77

DQ983917.1 Jacobaea uniflora isolate SGO10 tRNA-Met (trnM) gene, partial

sequence; ATP synthase epsilon subunit (atpE) and ATP synthase beta

subunit (atpB) genes, complete sequence; and ribulose 1,5-bisphosphate

carboxylase/oxygenase large subunit (rbcL) gene, partial sequence;

chloroplast

2e-65

The best match is shown as the putative identity for each sequence. E value regarded as significant if \1e–10. NA indicates the absence of

significant dataa Features that were chosen by PCAb Features that are part of the seven species-specific features

Fig. 6 Dissimilarity

dendrogram for the 15

genotypes using only the ten

most discriminatory and

species-specific features

(Squared Euclidian distance,

between groups linkage). The

steps of the dendrogram show

the combined clusters and the

values of the distance

coefficients at each step; the

values have been rescaled to

numbers between 0 and 25,

preserving the ratio of the

distances between the steps. The

black shaded circles in the

dendrogram represent species-

specific features which

discriminated across the species

780 Mol Biotechnol (2013) 54:770–783

123

recognized as known DNA sequences or proteins [EMBL:

HE962390, HE962391, HE962392, HE962393, HE962394,

HE962395].

Features E13 [EMBL: HE962398] and P4 [EMBL:

HE962389], had a significant alignment with the same

psaA-psbB chloroplast region [GeneBank:DQ673256.1], as

shown in Table 2. However, these two polymorphic fea-

tures did not overlap each other since they aligned to dif-

ferent parts of the psaA-psbB region which is 32,412 bp

long. Interestingly, this same region has been found to be

polymorphic among members of the Oleaceae family

(Jasminum and Menodora), where a series of inversions

have been identified [40]. This high level of polymorphism

found in the psaA-psbB region among different members of

the Lamiales (Oleaceae and Lamiaceae family), could

make it a source of potential markers for the Lamiales

species. It is important to note that in this study this

sequence was isolated by the subtraction process, indicat-

ing that this technique is capable of identifying highly

polymorphic sequences that are highly specific for the taxa

under study.

The six unknown and the two uncharacterized sequences

are almost certainly part of nuclear DNA, since if they

were part of chloroplast or mitochondrial DNA they would

have matched to GenBank accessions (which has an

extensive amount of chloroplast and mitochondrial data-

base entries). The presence of these polymorphic nuclear

DNA features suggests that the SDA may be a more reli-

able approach for fingerprinting closely related species and

hybrids. The reason for this may be that the detection of

hybridization/introgression is not reliably accomplished by

examination of chloroplast DNA since it is uniparentally

inherited [41]. In addition, dependence on a single nuclear

locus could have misleading results since the hybrid could

be homozygous at many loci [42]. Selection of nuclear loci

have also been performed using sequence databases as a

framework; however, this approach is dependent on the

phylogenetic proximity of the taxa under study to a number

of DNA sequences being available in the databases [43].

Therefore, the narrow subtraction approach conducted to

produce the SDA is a good alternative to identify new

nuclear polymorphic loci for taxa that do not have closely

related species with available genomic libraries.

Furthermore, the potential nuclear sequences identified

in the array could be associated or linked to a gene

responsible for an important agronomic trait, since these

genes are usually found in the nuclear genome. Previous

studies have already found relationships between essential

oil content and genotyping profiles in S. officinalis [12],

S. fruticosa [13], and other Mediterranean Salvia [16].

However, as of date no significant relationship has been

found between important bioactive components and geno-

typing profiles in several other species. For instance,

previous studies on S. miltiorrhiza have been unable to

correlate molecular profiles obtained with ITS sequences

with HPLC profiles of important bioactive components

[17, 18]. Current genotyping studies on ten S. miltiorrhiza

populations using the SDA have found significant corre-

lation between the signal intensity of two SDA features and

previous chemical analyses on the same geographical

populations (data not shown). Preliminary results of this

work can be found in a published progress report available

online [44]. Therefore, it is expected that upcoming reports

would corroborate the importance of this technique to

identify potential molecular markers which could assist in

future breeding programs.

Advantages and Limitations of the SDA

The SDA technique has shown potential advantages over

other fingerprinting techniques.

(i) The narrow subtraction approach allowed us to

selectively enrich the SDA with Salvia-specific DNA

sequences, which at the same time allowed the

fingerprint of Salvia species that were not used to

construct the array. This indicates that a SDA can be

constructed using a representative pool of individuals

instead of all members of the group under study. The

Salvia-specific SDA was constructed with only ten

species and it has the potential to fingerprint other

Salvia species. This is especially advantageous for

fingerprinting a broad genus like Salvia, which has

approximately 1,000 species with wide morphological

and ecological variation.

(ii) The ability of the SDA to fingerprint based on

polymorphic regions from chloroplast and nuclear

DNA increases the possibility of differentiating closely

related species and hybrids. However, sequences from

ribosomal and chloroplast regions have been success-

fully used to genotype Salvia species [16, 45, 46]. In

addition, intergenic spacers between trnH-psbA and

trnL-trnF together with rbcL and the nuclear ITS

region have been used to conduct molecular phyloge-

netic analyses in the tribe Mentheae and in the genus

Salvia [1, 19, 39]. Although, it could be argued that

SDA does not offer a significant advantage over

sequencing-based methods to genotype Salvia, it is

important to note that it is very unlikely that any

chloroplast or ITS region could be linked to a gene

responsible for an important agronomic trait. Conse-

quently, SDA offers a significative advantage since the

polymorphic nuclear sequences identified in this study

could potentially be employed not only for genotyping

but also they could be associated with genes respon-

sible for important agronomical traits.

Mol Biotechnol (2013) 54:770–783 781

123

(iii) The microarray scoring based on the direct compar-

ison between signal intensity is more appropriate

than a binomial scoring to analyze a cross-pollinated

genus. Therefore, SDA offers and advantage over

dominant markers such as RAPD and AFLP analysis

which have been commonly used to genotype Salvia

species. Co-dominant microsatellites could also offer

a good option for genotyping out-breeding species.

For instance, SSR derived from S. miltiorrhiza EST

sequences were able to detect genetic diversity

among test samples of S. miltiorrhiza and distinguish

it from other Salvia plants [47]. However, it is

important to note that microsatellites need previous

sequence knowledge and are low-throughput.

(iv) The SDA has proven to be a powerful tool to identify

species-specific sequences that may be employed for

unequivocal identification of commercial species

such as S. officinalis and S. miltiorrhiza. However,

the application of this technique for authentication of

commercial species would involve an investment in

the equipment necessary to print, hybridize, and scan

the SDAs. Furthermore, it will require good quality

scans of several species that can serve as references,

followed by standardization of data and statistical

analysis. Therefore, it would not be advisable to use

the entire array to identify only one or a couple of

commercial species since it would be more time and

consuming expensive than PCR. Instead, specific

primers could be developed for the amplification of

the species-specific features previously identified.

Then, variations in the amplification products or

sequence polymorphisms identified between the

related species could be employed for the unequiv-

ocal identification of the species of interest. Once

polymorphic PCR-based markers are developed from

these sequences, a fast and easy protocol for routine

quality control could be develop in order to prevent

substitution and misidentification of commercial

species of Salvia.

Conclusions

To the best of authors’ knowledge, this is the first finger-

printing array constructed for Salvia species and the first

genus-specific SDA. Using this array it was possible to fin-

gerprint 15 Salvia genotypes and to construct a hierarchical

cluster which was found to be consistent with the geo-

graphical origin and was able to differentiate S. officinalis

and S. miltiorrhiza from their closely related species.

Based on the above results it is possible to conclude that

the narrow subtraction approach conducted to produce the

SDA is a good alternative to isolate highly variable DNA

sequences specific to a genus as large as Salvia without

preliminary sequence information. Therefore, SDA is a

powerful technique to fingerprint non-model plants where

little genomic sequence information and few markers are

available.

Acknowledgments The authors gratefully acknowledge the support

from the Rural Industries Research and Development Corporation,

RMIT University and the Australian Postgraduate Scholarship awar-

ded to Alexandra Olarte. We acknowledge the technical support from

A/Prof. Reg Lehmann from MediHerb Australia and Claudia Salazar

for their assistance with the graphics in this study.

References

1. Walker, J. B., Sytsma, K. J., Treutlein, J., & Wink, M. (2004).

Salvia (Lamiaceae) is not monophyletic: Implications for the

systematics, radiation, and ecological specializations of Salviaand tribe Mentheae. American Journal of Botany, 91, 1115–1125.

2. Bruna, S., Giovannini, A., Benedetti, L. D., Principato, M. C., &

Ruffoni, B. (2006). Molecular analysis of Salvia Spp. through

RAPD markers. ISHS Acta Horticulturae, 723, 157–160.

3. Topcu, G. (2006). Bioactive triterpenoids from Salvia species.

Journal of Natural Products, 69, 482–487.

4. Echeverrigaray, S., & Agostini, G. (2006). Genetic relationships

between commercial cultivars and Brazilian accessions of Salviaofficinalis L. based on RAPD markers. Revista Brasileira dePlantas Medicinais, 8, 13–17.

5. Longaray-Delamare, A. P., Moschen-Pistorello, I. T., Artico, L.,

Atti-Serafini, L., & Echeverrigaray, S. (2007). Antibacterial

activity of the essential oils of Salvia officinalis L. and Salviatriloba L, cultivated in south Brazil. Food Chemistry, 100,

603–608.

6. Cai, Z., Lee, F. S. C., Wang, X. R., & Yu, W. J. (2002). A capsule

review of recent studies on the application of mass spectrometry

in the analysis of Chinese medicinal herbs. Journal of MassSpectrometry, 37, 1013–1024.

7. Reales, A., Rivera, D., Palazon, J. A., & Obon, C. (2004).

Numerical taxonomy study of Salvia sect. Salvia (Labiatae).

Botanical Journal of the Linnean Society, 145, 353–371.

8. Zhong, G. X., Li, P., Zeng, L. J., Guan, J., Li, D. Q., & Li, S. P.

(2009). Chemical characteristics of Salvia miltiorrhiza (Danshen)

collected from different locations in China. Journal of Agricul-ture and Food Chemistry, 57, 6879–6887.

9. Kiran, U., Khan, S., Mirza, K. J., Ram, M., & Abdin, M. Z.

(2010). SCAR markers: A potential tool for authentication of

herbal drugs. Fitoterapia, 81, 969–976.

10. Joshi, K., Chavan, P., Warude, D., & Patwardhan, B. (2004).

Molecular markers in herbal drug technology. Current ScienceIndia, 87, 159–165.

11. Canter, P. H., Thomas, H., & Ernst, E. (2005). Bringing medic-

inal plants into cultivation: Opportunities and challenges for

biotechnology. Trends in Biotechnology, 23, 180–185.

12. Boszormenyi, A., Hethelyi, E., Farkas, A., Horvath, G., Papp, N.,

Lemberkovics, E., et al. (2009). Chemical and genetic relation-

ships among sage (Salvia officinalis L.) cultivars and Judean sage

(Salvia judaica Boiss.). Journal of Agriculture and Food Chem-istry, 57, 4663–4667.

13. Skoula, M., El Hilali, I., & Makris, A. M. (1999). Evaluation of

the genetic diversity of Salvia fruticosa Mill. clones using RAPD

markers and comparison with the essential oil profiles. Bio-chemical Systematics and Ecology, 27, 559–568.

782 Mol Biotechnol (2013) 54:770–783

123

14. Braglia, L., Casabianca, V., De Benedetti, L., Pecchioni, N.,

Mercuri, A., Cervelli, C., et al. (2011). Amplified fragment length

polymorphism markers for DNA fingerprinting in the genus

Salvia. Plant Biosystems, 145, 274–277.

15. Jaccoud, D., Peng, K., Feinstein, D., & Kilian, A. (2001). Diversity

arrays: A solid state technology for sequence information inde-

pendent genotyping. Nucleic Acids Research, 29, E25.

16. Karaca, M., Ince, A. G., Ay, S. T., Turgut, K., & Onus, A. N.

(2008). PCR-RFLP and DAMD-PCR genotyping for Salviaspecies. Journal of the Science of Food and Agriculture, 88,

2508–2516.

17. Xu, H., Wang, Z. T., Cheng, K. T., Wu, T., Gu, L. H., & Hu, Z. B.

(2009). Comparison of rDNA ITS sequences and tanshinones

between Salvia miltiorrhiza populations and Salvia species.

Botanical Studies, 50, 127–135.

18. Han, J. P., Liu, C., Li, M. H., Shi, L. C., Song, J. Y., Yao, H.,

et al. (2010). Relationship between DNA barcoding and chemical

classification of Salvia medicinal herbs. Chinese Herbal Medi-cines, 2, 16–29.

19. Takano, A., & Okada, H. (2010). Phylogenetic relationships

among subgenera, species, and varieties of Japanese Salvia L.

(Lamiaceae). Journal of Plant Research, 124, 245–252.

20. Kingsley, M. T., Straub, T. M., Call, D. R., Daly, D. S., Wun-

schel, S. C., & Chandler, D. P. (2002). Fingerprinting closely

related Xanthomonas pathovars with random nonamer oligonu-

cleotide microarrays. Applied and Environment Microbiology, 68,

6361–6370.

21. Li, T. -X., Wang, J. -K., Bai, Y. -F., & Lu, Z. -H. (2006).

Diversity suppression-subtractive hybridization array for profiling

genomic DNA polymorphisms. Journal of Integrative PlantBiology, 48, 460–467.

22. Jayasinghe, R., Kong, S., Coram, T. E., Kaganovitch, J., Xue, C.

C., Li, C. G., et al. (2007). Construction and validation of a

prototype microarray for efficient and high-throughput genotyp-

ing of angiosperms. Plant Biotechnology Journal, 5, 282–289.

23. Jayasinghe, R., Hai Niu, L., Coram, T. E., Kong, S., Kaganovitch,

J., Xue, C. C. L., et al. (2009). Effectiveness of an innovative

prototype subtracted diversity array (SDA) for fingerprinting plant

species of medicinal importance. Planta Medica, 75, 1180–1185.

24. Mantri, N., Olarte, A., Li, C. G., Xue, C., & Pang, E. C. K.

(2012). Fingerprinting the Asterid species using subtracted

diversity array reveals novel species-specific sequences. PLoSONE, 7, e34873.

25. Duncan, B. D., & Isaac, G. (1994). Ferns and allied plants of

Victoria, Tasmania and South Australia. Melbourne University

Press in association with Monash University, Carlton.

26. Li, C. G., Sheng, S. J., Pang, E. C., Marriot, P., May, B., Zhou, S.

F., et al. (2009). Cultivar variations of Australian-grown Danshen

(Salvia miltiorrhiza): Bioactive markers and root yields. Chem-istry & Biodiversity, 6, 170–181.

27. Doyle, J. J., & Doyle, J. L. (1987). A rapid DNA isolation pro-

cedure for small quantities of fresh leaf tissue. PhytochemicalBulletin, 19, 1–15.

28. Bremer, B., Bremer, Kr., Chase, M., Fay, M., Reveal, J., Soltis,

D., et al. (2009). An update of the Angiosperm Phylogeny Group

classification for the orders and families of flowering plants: APG

III. Botanical Journal of the Linnean Society, 161, 105–121.

29. Coram, T. E., & Pang, E. C. K. (2005). Isolation and analysis of

candidate ascochyta blight defence genes in chickpea. Part I. Gen-

eration and analysis of an expressed sequence tag (EST) library.

Physiological and Molecular Plant Pathology, 66, 192–200.

30. National Center of Biotechnology Information (www.ncbi.nlm.

nih.gov).

31. Gadgil, C., Rink, A., Beattie, C., & Hu, W. S. (2002). A math-

ematical model for suppression subtractive hybridization. Com-parative and Functional Genomics, 3, 405–422.

32. Lezar, S., Myburg, A. A., Berger, D. K., Wingfield, M. J., &

Wingfield, B. D. (2004). Development and assessment of

microarray-based DNA fingerprinting in Eucalyptus grandis.

Theoretical and Applied Genetics, 109, 1329–1336.

33. Wittenberg, A. H. J., van der Lee, T., Cayla, C., Kilian, A.,

Visser, R. G. F., & Schouten, H. J. (2005). Validation of the high-

throughput marker technology DArT using the model plant

Arabidopsis thaliana. Molecular Genetics and Genomics, 274,

30–39.

34. Claßen-Bockhoff, R. (2007). Floral construction and pollination

biology in the Lamiaceae. Annals of Botany, 100, 359–360.

35. Pound, L. M., Wallwork, M. A. B., Potts, B. M., & Sedgley, M.

(2002). Self-incompatibility in Eucalyptus globulus ssp. globulus(Myrtaceae). Australian Journal of Botany, 50, 365–372.

36. Dudai, N., Lewinsohn, E., Larkov, O., Katzir, I., Ravid, U.,

Chaimovitsh, D., et al. (1999). Dynamics of yield components

and essential oil production in a commercial hybrid sage (Salviaofficinalis x Salvia fruticosa cv. Newe Ya’ar no. 4). Journal ofAgriculture and Food Chemistry, 47, 4341–4345.

37. Cao, J., Wei, Y. J., Qi, L. W., Li, P., Qian, Z. M., Luo, H. W.,

et al. (2008). Determination of fifteen bioactive components in

Radix et Rhizoma Salviae Miltiorrhizae by high-performance

liquid chromatography with ultraviolet and mass spectrometric

detection. Biomedical Chromatography, 22, 164–172.

38. Li, M. H. (2008). Investigation of Danshen and related medicinal

plants in China. Journal of Ethnopharmacology, 120, 419–426.

39. Walker, J. B., & Sytsma, K. J. (2007). Staminal evolution in the

genus Salvia (Lamiaceae): Molecular phylogenetic evidence for

multiple origins of the staminal lever. Annals of Botany, 100,

375–391.

40. Lee, H. L., Jansen, R. K., Chumley, T. W., & Kim, K. J. (2007).

Gene relocations within chloroplast genomes of Jasminum and

Menodora (Oleaceae) are due to multiple, overlapping inversions.

Molecular Biology and Evolution, 24, 1161–1180.

41. Fazekas, A. J., Kesanakurti, P. R., Burgess, K. S., Percy, D. M.,

Graham, S. W., Barrett, S. C. H., et al. (2009). Are plant species

inherently harder to discriminate than animal species using DNA

barcoding markers? Molecular Ecology Resources, 9, 130–139.

42. Chase, M. W., Salamin, N., Wilkinson, M., Dunwell, J. M.,

Kesanakurthi, R. P., Haidar, N., et al. (2005). Land plants and

DNA barcodes: Short-term and long-term goals. PhilosophicalTransactions of the Royal Society of London, Series B: BiologicalSciences, 360, 1889–1895.

43. Alvarez, I., Costa, A., & Feliner, G. N. (2008). Selecting single-

copy nuclear genes for plant phylogenetics: A preliminary anal-

ysis for the Senecioneae (Asteraceae). Journal of MolecularEvolution, 66, 276–291.

44. Pang, E. (2011). Using frontier technologies for the qualityassurance of medicinal herbs (pp. 1–21). RIRDC publications.

RMIT University, Melbourne.

45. Bertea, C. M., Luciano, P., Bossi, S., Leoni, F., Baiocchi, C.,

Medana, C., et al. (2006). PCR and PCR-RFLP of the 5S-rRNA-

NTS region and salvinorin A analyses for the rapid and

unequivocal determination of Salvia divinorum. Phytochemistry,67, 371–378.

46. De Mattia, F., Bruni, I., Galimberti, A., Cattaneo, F., Casiraghi,

M., & Labra, M. (2011). A comparative study of different DNA

barcoding markers for the identification of some members of

Lamiaceae. Food Research International, 44, 693–702.

47. Deng, K. J., Zhang, Y., Xiong, B. Q., Peng, J. H., Zhang, T.,Zhao, X. N., et al. (2009). Identification, characterization and

utilization of simple sequence repeat markers derived from Salviamiltiorrhiza expressed sequence tags. Yao Xue Xue Bao [ActaPharmaceutica Sinica], 44, 1165–1172.

Mol Biotechnol (2013) 54:770–783 783

123