development and application of microsatellite markers for diversity analysis...

DEVELOPMENT AND APPLICATION OF

MICROSATELLITE MARKERS FOR DIVERSITY

ANALYSIS IN JATROPHA CURCAS L.

THESIS SUBMITTED

TO THE

UNIVERSITY OF LUCKNOW

FOR THE DEGREE

OF

DOCTOR OF PHILOSOPHY

IN

BOTANY

BY

RAMANUJ MAURYA M.Sc. (Botany)

CSIR-NATIONAL BOTANICAL RESEARCH INSTITUTE

RANA PRATAP MARG, LUCKNOW-226001 (U.P.), INDIA

AND

DEPARTMENT OF BOTANY

UNIVERSITY OF LUCKNOW

LUCKNOW- 226007 (U.P.), INDIA

2014

Dedicated To My Beloved Parents

& Late Grandparents

CERTIFICATE

This is to certify that the work embodied in this thesis entitled

“DEVELOPMENT AND APPLICATION OF MICROSATELLITE MARKERS FOR

DIVERSITY ANALYSIS IN JATROPHA CURCAS L.” has been carried out by

Mr. RAMANUJ MAURYA, M.Sc. (Botany) under our supervision. He has

fulfilled all the requirements for the degree of DOCTOR OF PHILOSOPHY IN

BOTANY of the University of Lucknow, Lucknow regarding the nature and

prescribed period of work. The work included in this thesis is all original,

unless otherwise stated.

(Dr. Hemant Kumar Yadav) Co-Supervisor

Senior Scientist

Genetics and Plant Breeding

CSIR-National Botanical Research Institute

Lucknow-226 001 (U. P.), India

(Dr. Rantna Katiyar) Supervisor

Associate Professor Department of Botany

University of Lucknow

Lucknow-226 007 (U. P.), India

Acknowledgements

ACKNOWLEDGEMENTS

I am grateful to almighty and my parents Mrs. Bhanumati Maurya and Late Mr. Jagadeesh

Prasad Maurya for blessing me with strong will power, patience and confidence, which helped me in

completing the present work.

I am very much grateful to Dr. Ratna Katiyar, Botany Department, University of Lucknow

for accepting me as a Ph.D. student. Her valuable guidance from time to time helped immensely in

boosting my morale. I deeply appreciate her untiring help and sweet behavior to me.

I express my heartfelt gratitude and thanks to Dr. Hemant Kumar Yadav, Senior Scientist,

Genetics and Plant Breeding Division, C.S.I.R.-N.B.R.I. for giving me an opportunity to join him as a

Ph.D. student and also for excellent guidance, critical suggestions and long scientific discussions.

This work would not have been possible without the guidance, encouragement and appreciation I

received from him. His devotion to the work and untiring efforts has left deep impression in my heart.

I gratefully acknowledge Prof. Y.K. Sharma (HOD, Botany Dept., University of Lucknow) for

allowing me to use the facilities of the university.

I offer my special thanks to the former Director Dr. Rakesh Tuli and the present Director Dr.

Chandra Shekhar Nautiyal, CSIR-N.B.R.I, for providing me facilities and allowing to use the

infrastructure at the institute.

I feel a special gratitude to Dr. Bajarang Singh, Ex-Scientist CSIR- N.B.R.I and Dr. S. A.

Ranade, Chief Scientist CSIR-N.B.R.I. for their constant guidance in planning and completion of this

work and useful advices.

I acknowledge with thanks for fellowship and other financial support provided by C.S.I.R. and

D.B.T in accomplishment of this work.

My sincere thanks and gratitude to my maternal uncle Mr. Krishna Nand Maurya (Technical

Officer, CSIR- N.B.R.I. Lucknow) and Dr. P.P. Gothawal (Chief Scientist CSIR-C.F.T.R.I. Lucknow)

for their useful advice, encouragement, constant support and graciously sharing their vast

knowledge to me.

I would like to convey sincere and heartfelt thanks to Dr. Samir V. Sawant, Dr. P.K. Singh, Dr.

Sribas Roy, Dr. Pravin Chandra Verma, Dr. A.K. Saxena, Dr. S.N. Jena, Dr. C.S. Mohanti, Dr.

Sumit Kumar Bag and Dr. S.K. Raj.

I am especially thankful for the help and support given by Dr. Raju Madanala, and Dr. Anil

Kumar for their cooperation in central Facilities and very friendly behavior.

I am elated with delight to avail of this wonderful opportunity to express my sincere thanks to

my immediate seniors Dr. Neeraj Dubey, Dr. Rahul Singh, Dr. Harsh Singh, Mr. Anukool

Srivastava, Dr. Saurabh Verma, Mr. Sunil Kumar Singh, Mr. K.M. Rai and Dr. Sunil Kumar Snehi,

for their moral support and pleasant company all along.

Very special thanks to my friends Mr. Sidharth , Mr. Omesh Bajpai, Mr. Kasim, Mr. Komal, Mr.

Sachin, Mr. Vrijesh, Mr. Vikas, Mr. Rajiv, Mr. Vikram Rajapure, Mr. Ravi, Mr. Surendra, Miss.

Rinky, Miss. Subhi, Miss Namrata Singh, Mr. Sunil Gupta, Miss Vimlesh Rawat, Miss Pratibha

Maurya, Dr. Karamveer Gautam, Mr. Rameshwar and Mr. Nitesh for their unconditional support in

various aspects of my work.

I would like to thank my dear colleagues like Miss Astha Gupta, Mrs. Chandrawati, Mr.

Umesh Kumar and trainee Miss Parul, Mrs. Shipra and Mr. Suresh Yadav for giving support in my

Ph.D. work.

Special thanks to Mr. Alok Jain, Mr. Jitesh and Mr. Chaitu Ram for their cooperation and

assistance whenever it was needed. I acknowledge the assistance provided by Mr. Krishnakant, Mr.

Ramu, Mr. Hansraj, Mr. Maheshpal and Mr. Sunil Kumar Yadav.

Finally, to be very special and different I find no words in owing my sincere gratitude to my

grandparents Late Mr. Bihari Lal Maurya and Mrs. Chandra Kala and Mrs. Prabhawati and Late

Mr. Babu Ram Maurya whose affection, encouragement and blessings have always been a source of

light to me. I wish to express my cordial appreciation and special thanks to my sister (Mrs. Kavita

Maurya and Mrs. Ranjana Maurya) and maternal aunt Mrs. Shradha Maurya for their constant

support to concentrate on completing this work.

Last but not least acknowledgement to dear Master Apoorv and Ms. Pragya Priyadrshini for

their cheerful smiling faces, which gave me strength during most odd days of my tenure.

Not everyone is mentioned, but none is forgotten, I humbly acknowledge to all.

Dated:

Place: (Ramanuj Maurya)

Contents

1. INTRODUCTION……………………………………………………………..... 1-4

2. REVIEW OF LITERATURE………………………………………………….

2.1 Genetic Markers…………………………………………………………………

2.2 Classification of DNA Markers…………………………………………………

2.3 Simple Sequence Repeats (SSRs) or Microsatellite Markers…………………...

2.4 Discovery and development of SSR markers…………………………………...

2.5 Cross-species amplification of SSRs……………………………………………

2.6 Advantage of SSR analysis…………………………………………………......

2.7 Application of SSR markers…………………………………………………….

2.8 Application of molecular markers in J. curcas………………………………….

5-48

5

7

12

19

25

26

28

33

3.0 MATERIALS AND METHODS……………………………………………...

3.1 Materials……………………………………………………………………......

3.2 Methods……………………………………………………………...

49-67

49

54

4.0 RESULS ……………………………………………………………………….

4.1 Phenotypic characterization of indigenous accessions of J. curcas.....................

4.2 Development of large scale genomic derived SSRs from four microsatellite

enriched genomic libraries (two di-nucleotide and two tri-nucleotide)…………….

4.3 PCR optimization, polymorphism detection and characterization of developed

SSRs for various attributes…………………………………………………………..

4.4 Study of molecular genetic diversity among indigenous and exotic accessions

of J. curcas. …………………………………………………………………………

4.5 Molecular characterization of interspecific hybrid of J. curcas x J. integerrima

4.6 Heterozygosity assessment of J. curcas ……………………………………….

68-132

69

81

88

113

123

131

5.0 DISCUSSION………………………………………………………………….. 133-155

5.1 Phenotypic characterization of indigenous accessions of J. curcas…………….. 134

5.2 Development of large scale genomic derived SSRs from di- and tri-nucleotide

enriched genomic libraries………………………………………………………….

143

5.3 PCR optimization, polymorphism detection and characterization of developed

SSRs for various attributes…………………………………………………………

147

5.4 Study of molecular genetic diversity among indigenous and exotic accessions

of J. curcas L……………………………………………………………………….

149

5.6 Characterization of interspecific hybrid population of J. curcas x J.

integerrima ……………………………………………………………………….....

154

5.7Assessment of heterozygosity in J. curcas L…………………………………..... 155

6.0 SUMMARY…………………………………………………………………… 156-160

7.0 REFERENCES……………………………………………………………….. 161-180

APPENDIX………………………………………………………………………....

LIST OF PUBLICATIONS……………………………………………………….

Abbreviations

Abbreviations

CTAB - Cetyl trimethyl ammonium bromide

EDTA - Ethylene diamine tetra acetic acid

NaCl - Sodium chloride

NaOH - Sodium hydroxide

TAE - Tris-acetate-EDTA

TBE - Tris- borate- EDTA

TE - Tris-EDTA

Tris - Tris (hydroxymethyl) aminomethane

βME - 2Betamercaptoetahnol

dNTP - Deoxy nucleoside triphosphate

LA - Luria agar medium

LB - Luria broth medium

LBA - Luria broth agar

IPTG - Isopropyl D thiogalactopyranoside

Xgal - 5bromo4chloro3indolylDgalactopyranoside

PVP - Polyvinylpyrolidone

P: C: I - Phenol:Chloroform:Isoamyl

RNaseA - RibonucleaseA

rpm - Rotation per minute

gDNA - Genomic DNA

RT - Room temperature

hrs - Hours

M - Molar

mg - Milligram

MgCl2 - Magnesium chloride

DNA - Deoxyribonucleic acid

RNA - Ribonucleic acid

PCR - Polymerase chain reaction

min - Minute

ng - Nanogram

nm - Nanometer

UV - Ultraviolet

µ - Micro

mg - miligram

Kg - Kilogram

SSRs - Simple Sequence Repeats

AFLP - Amplified Fragment Length Polymorphism

RAPD - Random Amplified Polymorphic DNA

ISSR - Inter Simple Sequence Repeats

SNP - Single Nucleotide Polymorphism

EST - Expressed sequence tag

QTL - Quantitative Trait Loci

MAS - Marker Assistant Selection

mm - millimeter

NMR - Nuclear Magnetic Resonance

SSR - Simple sequence repeat

STR - Short tandem repeat

STMS - Sequence tagged microsatellite site

SSLP - Simple sequence length polymorphism

MP-PCR - Microsatellite-primed PCR

SPAR - Single primer amplification reaction

AMP-PCR - Anchored microsatellite primed PCR

ISSR - Inter-simple sequence repeats

ASSR - Anchored simple sequence repeat

RAMP - Random amplified microsatellite polymorphism

RAMPO - Random amplified microsatellite polymorphism

RAHM - Random amplified hybridization microsatellites

RAMS - Randomly amplified microsatellites

SAMPLE - Selective amplification of microsatellite polymorphic loci

REMAP - Retrotransposon-microsatellite amplified polymorphism

List of figures

List of Figures

Figure No. Title Page No.

Figure 2.1 Putative functions/effects of SSRs (Reproduced from Li et al.

2002). 13

Figure 2.2 Slippage during DNA replication. Assume that in the original

DNA molecule there were 5 repeats of the motif, symbolized

by a box. Slippage leads to the formation of new alleles with 6

and 4 repeats, depending on the strand containing the

polymerase error (reproduced from Goldstein and Schlottrer

1999).

15

Figure 2.3 Unequal crossing-over between homologous chromosomes.

Black and blue regions correspond to microsatellite repeat

sequences (Reproduce from Oliveira et al. 2006). 16

Figure 2.4 A general protocol for developing SSR markers with a SSR-

enrichment step (reproduced from Park et al. 2009). 22

Figure 2.5 Microsatellite- a summary of development, distribution,

functions and applications (reproduced from Kalia et al. 2011).

28

Figure 3.1 Map of India showing collection sites of Jatropha curcas from

different states of India. 53

Figure 3.2 A graphical representation of SSR identification and primer

designing.

58

Figure 4.1 Dendrogram of 80 Jatropha curcas accessions derived from

the Wards minimum variance cluster analysis using

Mahalanobis distances.

75

Figure 4.2 Histogram showing frequency of different types of SSRs

repeat motif recovered.

82

Figure 4.3 Histogram showing frequency of different repeat units

recovered for Lib A and Lib B.

83

Figure 4.4 Histogram showing frequency of different repeat motifs

recovered from Lib A and Lib B.

84

Figure 4.5 The average read length and unique sequence length. 85

Figure 4.6 Histogram showing distribution of SSRs based on (a) Repeat

types and (b) Repeat motifs.

87

Figure 4.7 Histogram showing repeat units of different types of SSRs

recovered.

88

Figure 4.8 A representative 1.5% Agarose gel image showing PCR

amplification and non-amplification. Sample from first 3

accessions (out of 8) were loaded for each SSR primers.

Primer 1 from well No. (1-3), Primer 2 (4-6), Primer 3 (7-9),

Primer 4 (10-12), Primer 5 (13-15), Primer 6 (16-18), Primer 7

(19-21), Primer 8 (22-24), Primer 9 (25-27), Primer 10 (28-

30), Primer 11 (31-33) and Primer 12 (34-36). All primers

amplified except primer number 10th

.

89

Figure 4.9 A representative 6% non-denaturing polyacrylamide gel image

showing polymorphism. 90

Figure 4.10 A representative snapshot from GeneMapper showing

polymorphism. Arrow showing polymorphic peak. 90

Figure 4.11 PIC distribution of 248 polymorphic SSR loci (106 from Lib A

and 142 from Lib B) calculated from 7 accessions of J. curcas

including non-toxic accession. 98

Figure 4.12 PIC distribution of polymorphic SSRs (447) based on 7

accessions of J. curcas. 110

Figure 4.13 Annotation of genomic SSRs developed from Lib A and B of

J. curcas. Each bar indicates the percent sequence similarity

with various plant genomes based on BLASTIX. 111

Figure 4.14 Gene Ontology (GO) classification of the SSR containing

genomic sequences derived from microsatellite enriched

libraries of J. curcas. The relative frequencies of GO hits to

functional categories of cellular components, biological

process and molecular functions. 111

Figure 4.15 Annotation of genomic SSRs developed from enriched

libraries of J. curcas. Each pie indicates the percent sequence 112

similarity with various plant genomes based on BLASTX.

Figure 4.16 PIC distributions of 41 polymorphic SSR loci calculated across

96 J. curcas accessions. 115

Figure 4.17 Genetic relationship among 96 accessions of J. curcas based

on NJ tree constructed using genotyping data of 41

polymorphic genomic SSRs. The numbers on branches

indicates bootstrap values based on 1000 replications. 120

Figure 4.18 17 Population structure analysis showing; a) Delta K showing

highest probability of four (K=4) subpopulation and b)

Structural plot of 96 accessions of J. curcas. 122

Figure 4.19 Photograph showing (A) J. curcas (Female parent) (B) J.

integerrima (Male parent) selected for interspecific

hybridization (C) Developing successful fruits after crossing

(D) Seedling of F1 interspecific hybrids in glasshouse, (E)

Transplanted F1 hybrid plants growing in field, (F) Close up

view of F1 hybrid plant (G) Close up view of inflorescence of

hybrid plant and (H) Hybrid plant showing fruits. 124

Figure 4.20 (A) Dorsal side and (B) ventral side showing morphological

variation in leaf shape and pigmentation of parents (J. curcas

and J. integerrima) and hybrid plants. 125

Figure 4.21 A snapshot of GeneMapper showing the allelic pattern of J.

curcas, J. integerrimma and their hybrid. 126

Figure 4.22 Dendrogram showing clustering of 94 interspecific hybrids

alongwith their parental lines J. curcas (JC) and J. integerrima

(JI). 129

Figure 4.23 Two- Dimensional plot by PCA showing clustering of 94

interspecific hybrids alongwith their parental lines J. curcas

(JC) and J. integerrima (JI). 130

List of tables

List of tables

Table No. Title Page No.

2.1 Classification of molecular markers system (adopted from Jones et al. 2009) 9

2.2 Comparison of various aspects of frequently used molecular markers

technique (Modified from Agarwal et al. 2008) 11

2.3 Classification of SSR markers (reproduced from Kalia et al. 2011) 17

3.1 Details of accessions of Jatropha curcas used in microsatellite

characterization and diversity analysis 50

4.1 Range, mean, estimates of variance components, broad sense heritability

and genetic advance in Jatropha curcas 70

4.2 Distribution of 80 accessions of Jatropha curcas in 4 clusters based on their

10 quantitative traits 73

4.3 Intra- (diagonal bold) and inter-cluster Mahalanobis distances for 80

accessions in Jatropha curcas 74

4.4 Cluster means and standard errors of the means of different traits in

Jatropha curcas L. 74

4.5 Loadings of the first four principal components of genetic divergence in 80

accessions of Jatropha curcas

76

4.6 Estimates of genotypic (rG) and Phenotypic (rP) correlation coefficients

among various traits determined in 80 accessions of J. curcas 78

4.7 Path coefficient analyses for seed yield/plant in J. curcas germplasm 80

4.8 Summary of genomic SSRs developed from enriched libraries of J. curcas 81

4.9 Statistical details of 454 sequencing of SSR enriched libraries of J. curcas 85

4.10 Details of SSR search using MISA 86

4.11 Polymorphism screening details of SSRs with different accessions of J.

curcas and mapping populations 91

4.12 Polymorphism details of 1122 SSRs developed from Lib C and D among

two taxa of Jatropha 91

4.13 Polymorphism features of 106 newly developed SSRs (Lib A) in J. curcas 92

4.14 Polymorphism features of 142 newly developed SSRs (Lib B) in J. curcas 95

4.15 Polymorphism features of 247 polymorphic SSRs developed from Lib C and

D in J. curcas 99

4.16 Polymorphism features of 41 SSR markers surveyed over 96 accessions of

J. curcas 113

4.17 Analysis of molecular variance (AMOVA) for 96 J. curcas accessions 116

4.18 Different genetic diversity estimates for two populations (indigenous and

exotic) of J. curcas based on 41 SSR loci 116

4.19 Minimum, maximum and mean of the genetic dissimilarity coefficient of 96

accessions of J. curcas 117

4.20 Crossability success in crosses between J. curcas and J. integerrima 123

4.21 Genotyping details of 15 polymorphic primers with 94 interspecific hybrids

and its parents (J. curcas and J. integerrima) 127

4.22 Polymorphism feature of 18 polymorphic SSRs among 48 progenies of

single plants used for heterozygosity assessment 132

Preface

Preface

Jatropha curcas L. is a non-edible, oil-rich crop which has attracted global attention as a

promising renewable resource of biodiesel production. Before its use as a bio-energy crop,

Jatropha was used for medicinal products and as a live fence around arable land. Considering the

importance of this biofuel plant various genetic studies has been carried out and which are still

going on towards the genetic improvement of J. curcas including traditional and modern

approaches.

Wide genetic variation is required in breeding for major agronomically important traits

like seed yield, oil yield and composition, flowering behavior, tree morphology, disease

resistance and the absence of anti-nutritional factors that currently inhibit the use of Jatropha

seed meal in animal feeding. Plant breeding programs need such genetic variation to be able to

combine positive traits from different parents to provide the required profitable and sustainable

Jatropha varieties of the future.

Traditional approaches of genetic improvement of polygenic traits have mainly relied on

phenotypic and pedigree information which are time and labour intensive especially in

heterozygous perennial crops such as J. curcas. Various molecular tools has been devised and

integrated to assist the traditional breeding strategies for faster and precise improvement

including perennial plants.

The molecular marker techniques is one of the most widely exploited tools for various

molecular breeding related studies such as assessing genetic diversity, phyolgenetics,

linkage/QTL mapping etc. In case of J. curcas several studies have also been conducted using

different class of molecular markers to evaluate the genetic diversity in different sets of

germplasm collections such as RAPD, ISSR, AFLP, SSRs and SNPs markers.

The molecular marker based studies showed contrasting results about level of genetic

diversity varying from low to high and explained accordingly based on number of accessions and

techniques used. Majority of these studies were carried out with a limited number of markers and

accessions. This warrants the enrichment of the gene pool along with development and validation

of a large number of polymorphic markers to better understand the available genetic variability

in J. curcas.

Preface

Till date, most of the markers based studies in J. curcas have been performed with

limited numbers of markers. For better understanding of polygenic traits, construction of dense

map and fine QTL mapping a large number of workable and validated markers are required.

Thus, in view of the previous report of low level of genetic diversity and limited number of

validated markers, there is a need to enrich the genetic pool, develop and validate large number

of polymorphic markers. Therefore, the present investigation is devoted to topic “Development

and Application of Microsatellite Markers for the genetic diversity analysis in the Jatropha

curcas L.”

This thesis incorporates the objectives of work undertaken, techniques used, results obtained,

discussion, summary and the bibliography under the following seven heads:

1. INTRODUCTION: This chapter commences with a brief introduction to the background

and objectives of the problem.

2. REVIEW OF LITERATURE: This provides the in depth studies of genetic markers

development and world-wide genetic diversity analysis using different types of molecular

markers.

3. MATERIALS AND METHODS: The chapter describes the experimental procedure

and techniques that were employed in order to accomplished the objectives of the

dissertation.

4. RESULTS: Details of various observations made and results obtained from the

experiments and that were performed are described in the chapter.

5. DISCUSSIONS: Detail analysis of results obtained and conclusions drawn are described

in this chapter.

6. SUMMARY: This chapter summarizes the work that has been presented in this

dissertation and conclusions drawn from it.

7. REFERENCES: This chapter lists the publications that have been referred to the

dissertation.

Chapter 1 Introduction

1

The Jatropha curcas L. was first described by Swedish botanist Carl Linnaeus in 1753 in

“Species Plantarum”. The name of Jatropha derived from the Greek word ‘Jatros’ means

‘Doctor’ and ‘trophe’ means ‘Nutrition’. J. curcas L. is belonging to the family

Euphorbiaceae having chromosome number 2n=22 (Dehgan 1984) with relatively smaller

genome size of ~416 Mb (Carvalho et al. 2008; Sato et al. 2011). It is a multipurpose small

tree or large shrub of 5-7 m tall with an average life span of upto 50 years and is found

throughout the tropical region. Common vernacular names of J. curcas L. in India are

Ratanjyot, Safed arand, Physic nut, Purging nut, Chandrajyot, Jamalghota etc.

J. curcas L. is a tropical species native to Mexico and Central America, but widely

distributed in other tropical and sub-tropical areas of the world, especially in Africa, India

and South-East Asia (Heller 1996; Sujatha and Prabhakaran 1997; Openshaw 2000; Rao et al.

2008). In India, the Portuguese settlers introduced it in the 16th

century. It occurs in almost all

parts of India including Andman islands and generally grown as live fence. It is well adapted

to arid and semi- arid conditions. There are approximately 170 species of Jatropha across the

world (Brittaine and Lutaladio 2010), of which 9 species are being reported in India viz. J.

curcas, J. gossypifolia, J. glandulifera, J. integerrima, J. nana, J. podagrica, J. multifida, J.

maheshwari and J. vilosa.

The plant has inflorescence which is axillary paniculate polychasial cymes formed

terminally on the branches and are complex, possessing main and co-florescences with

paracladia. Flowers are unisexual, monoecious, greenish yellow colored in terminal long,

peduncle paniculate cymes (Divakara et al. 2010). The male flowers consists of 5 calyx,

nearly equal, elliptic or obviate, corolla is campanulate, 5 lobes, connate, hairy inside,

exceeding the calyx, each lobe bears inside a gland at the base, 10 stamens in two series,

outer 5 filaments free, inner 5 filaments connate, dithecous erect anther with opening by

longitudinal slit. Female flowers consist of sepals up to 18 mm long, persistent, calyx as in

male, 4 corolla scarcely exceeding the calyx lobes united, villous inside, ovary 3-locular,

ellipsoid, 1.5-2 mm in diameter, style bifid, ovules solitary in each locule. Flowering time,

number and male/female flower ratio all varied substantially depending on soil fertility, soil

moisture, precipitation and temperature. Flowering occurs during the wet season often with

two flowering peaks, i.e. during summer and autumn (Raju and Ezradanam 2002).

Numerically, 1-5 female flowers and 25-29 male flowers are produced per inflorescences

with an average male to female ratio of 29:1 (Solomon and Ezradanam 2002).

J. curcas can be both protandrous and protogynous and able to produce seeds through

both self- and cross-pollination (Negussie et al. 2013). The unisexual flowers of Jatropha

depend on pollination by insects, including bees, flies, ant and thrips (Brittaine and Lutaladio

2

2010). Each inflorescence yields a bunch of ovoid fruits. Seeds resemble castor in shape,

ovoid, oblong and black in colour. The seeds mature when the capsule changes from green to

yellow, after two months of fruit setting. Each fruit bears three seeds and seeds contain 25-

35% oil.

It can grow well in different kinds of soils, tolerate drought and other environmental

stress conditions and animals do not browse its leaves (Patil 2004; Gmunder et al. 2012). It

can grow almost everywhere- even or gravely, sandy, acidic and alkaline soils having pH

ranging from 5.5 to 8.5. It can thrive in poorest stony soils. It grows even in the cracks and

crevices of rocks on all types of soil except one subjected to water lodging. Jatropha can

grow between 15 and 40 0C temperature and under a broad spectrum of rainfall regimes from

250 to over 1200 mm per annum and is more altered by lower temperatures than by altitude

or day length (Foidl et al. 1996; Katwal and Soni 2003).

J. curcas is usually propagated on mass scale both by seed as well as stem cutting. J.

curcas respond very well to vegetative propagation through stem cutting. Stem cuttings are

typically prepared with one-year-old terminal branches having 2-3 cm diameter. The

branches are cut into 15-20 cm long pieces and put into polybags filled with rooting media of

soil and sand in equal ratio. The polybags are kept in closed and humid condition to promote

rooting. Cuttings are generally exercised during February – March for better rooting and

survival. The rooting begins after 45 days and generation rate from cuttings ranged from 50-

80% (Li 2005). The roots of the cuttings are not as robust as those of the seedlings (Ye et al.

2009). The benefit of cutting propagation is that it offers the possibility to grow privileged

accessions. The cuttings and seedlings of Jatropha are grown in nurseries for 2-6 months and

thereafter transplanted on the field at the beginning of the wet season. The plant propagated

through cuttings show a lower longevity and possesses a lower drought and disease resistance

than those propagated through seeds (Heller 1996).

The production of J. curcas seed yield ranges from 0.1 to 15 t/ha/year in different

countries and regions (Ong et al. 2011) and the oil yield is reported to be 1590 kg/ha/year

(Ong et al. 2011; Silitonga et al. 2011; Mofijur et al. 2012). Depending on the variety, the oil

content of decorticated seed ranges from 30% to 50% by weight and the kernel ranges from

45% to 60% (Ong et al. 2011; Silitonga et al. 2011; Atabani et al. 2013; Mofijur et al. 2012).

J. curcas seed oil is proven to be toxic to many microorganisms, insects and animals.

Despite its toxicity, Jatropha is not pest and disease resistant. J. curcas is susceptible to many

insects, pests and viral diseases such as root rot, stem borer, fruit damage by Webber and

plant damaged by root rot and fruit Webber. The fruit sap suckers (Sahai et al. 2011; Pandey

et al. 2012), virus infestation and bark eater rodents are quite common problems at different

3

locations. The disease is endemic to Jatropha not transmitted to any other plant (Tewari et al.

2007; Narayana et al. 2006). Gao et al. (2010) identified a new strain of Indian cassava

mosaic virus causing a mosaic disease in J. curcas. Snehi et al. (2011) identified new

begomovirus associated with yellow mosaic disease of J. gossypifolia in India.

J. curcas L. has received much attention because of its immense role in biodiesel

production as-an eco-friendly fuel, biodegradable, renewable and non-toxic in nature as

compared to petro-diesel (Pandey et al. 2012). The oil from Jatropha plant is considered as

the best source of biofuel production among the various plants based fuel resources world

over (Belewu et al. 2010). J. curcas L., a non-domesticated shrub has been popularized as an

unique candidate among renewable energy sources due to its peculiar feature like drought

tolerance (Openshaw 2000), rapid growth and easy propagation, higher oil content than other

oil crops (Achten et al. 2008), small gestation period, wide range of environmental adaptation

(King et al. 2009; Johnson et al. 2011) and the optimum plant size and architecture make it as

a sole candidate for further consideration (Sujatha et al. 2008). J. curcas is identified by the

Indian government as one of most suitable biodiesel feedstock, since it is able to grow on

marginal land and yields high-quality oil suitable for energetic use (Gmunder et al. 2012).

Beside biodiesel, J. curcas has number of uses such as: potential phytoremediator

(Moursy et al. 2014), Soil carbon sequestration (Wani et al. (2012), reduction of

environmental pollutants (Bender 2011), as a live fence (Reubens et al. 2011), Jatropha agro-

forestry (Agbogidi et al. 2013b), Jatropha for enrichment of soil (Wani et al. 2006), Biogas

production (Subramanian et al. 2005), Human consumption and animal feed (Segura et al.

2014), In Industry (Atabani et al. 2013), Medicinal value (Dahake et al. 2012, Costa et al.

2014).

Genetic diversity can be assessed using either morphological traits (Kaushik et al.

2007) or molecular markers. The study of genetic diversity based on morphological traits is

not much reliable as they are highly influenced by environment. However, molecular markers

base studies are independent of environmental factors. There is a surge of interest in

identifying a large number of molecular markers for rapid application in the assessment of

genetic diversity and the selection of desired genotypes. Molecular markers have been

considered to have great potential for plant breeding in enhancing the efficiency of selection

of desirable traits via markers-assisted breeding and understanding the genetic relationships,

evolutionary trends and fingerprinting of varieties. Among the various molecular markers,

PCR-based markers such as RAPD, AFLP, ISSR and SSRs were most preferred.

J. curcas is still considered as an undomesticated or semi- domesticated plant and its

response to yield and oil content is found to be erratic with different agro-climatic zones.

4

There are various drawbacks associated with J. curcas such as (i) Non availability of high

yielding varieties, (ii) Asynchronous maturity of fruits (iii) Lesser female flowers (iv) Seed

toxicity (v) Susceptibility for water lodging, frost and diseases (vi) low diversity and

productivity.

Various studies have been carried out in J. curcas to investigate the genetic variability

in different sets of genetic materials. Most of these studies carried out at morphological and

molecular level reported low level of genetic variability and diversity in J. curcas. However,

limited numbers of molecular markers were used for genetic diversity analysis and

identification of polymorphic markers in J. curcas. The low level of genetic variation

warrants the enrichment of the genetic pool, develop and validate large number of

polymorphic markers for the development of linkage map, diversity assessment, quantitative

trait loci (QTL) map and marker assisted breeding for selection of high oil yielding

accessions of J. curcas.

Keeping all these aspects in view, present investigation was undertaken with the following

objectives:

(i) Phenotypic characterization of indigenous accessions of J. curcas.

(ii) Development of large scale genomic derived SSRs from four microsatellite enriched

genomic libraries.

(iii) PCR optimization, polymorphism detection and characterization of developed SSRs for

various attributes.

(iv) Study of molecular genetic diversity among indigenous and exotic accessions of J.

curcas L.

Chapter 2 Review of Literature

5

2.1 Genetic Markers

In recent years, the molecular markers especially DNA-based markers, have been extensively

used in many areas of research such as gene mapping and tagging (Kliebenstein et al. 2001;

Karp and Edwards 1997), characterization of sex (Flachowsky et al. 2001; Martinez et al.

1999), analysis of genetic diversity (Erschadi et al. 2000; Palacios et al. 1999; Lerceteau and

Szmidt 1999; Godt and Hamrick 1999), genetic relatedness (Mace et al. 1999; Rao et al.

1997; Brookfield 1992), linkage map construction and marker assisted breeding (Kalia et al.

2011). According to Stansfield (1986), the term MARKER is usually used for “LOCUS

MARKER”. Each gene has a particular place along the chromosome called LOCUS. Due to

mutations, gene can be modified in several forms mutually exclusives called ALLELES (or

allelic forms). All the allelic forms of gene occur at the same locus on homologous

chromosomes. When allelic forms of one locus are identical, the genotype is called

HOMOZYGOTE (at this locus), whereas different allelic forms constitute a

HETEROZYGOTE. Thus, a molecular marker is defined as a particular segment of DNA that

is representative of the differences at the genomic level. Molecular markers may or may not

correlate with phenotypic expression of a trait. Molecular markers offer numerous advantages

over conventional phenotype based alternatives as they are stable and detectable in all tissues

regardless of growth, differentiation, development or defense status of the cell and are not

confounded by the environment, pleiotropic and epistatic effects. According to Agarwal et al.

(2008), an ideal molecular marker technique should have the following criteria: (1) be

polymorphic and evenly distributed throughout the genome (2) provide adequate resolution

of genetic differences (3) generate multiple, independent and reliable markers (4) simple,

quick and inexpensive (5) need small amounts of tissue and DNA samples (6) have linkage to

distinct phenotypes and (7) require no prior information about the genome of an organism.

In recent years, the progress made in the development of DNA based marker have

great potential for plant breeding in enhancing the efficiency of selection of desirable traits

via marker-assisted breeding and understanding the genetic relationships, evolutionary trends

and fingerprinting of varieties (Johnson et al. 2011).There are three major types of genetic

markers: (1) morphological markers, which themselves are phenotypic characters of a trait

(2) biochemical markers, which include allelic variants of enzymes called isozymes (3) DNA

(or molecular) markers, which reveal sites of variation in DNA (Winter and Kahl 1995, Jones

et al. 1997). Brief description of different types of genetic markers is given below.

6

2.1.1 Morphological markers

Morphological markers are visually characterized as phenotypic characters such as flower

color, seed shape, growth habits or pigmentation. In conventional plant breeding programs

breeder generally used to select desired plant types based on morphological data and also

tried to correlate them to specific morphological markers. However, the morphological

markers are available in limited numbers and have influence of environmental conditions.

Therefore, these markers are not of much utility in plant breeding programs.

2.1.2 Biochemical marker- allozymes (Isozyme)

In population genetics, protein-based markers (allozymes) were the first markers developed

and widely used (Hamrick and Godt 1990). Allozyme was the first true molecular marker

established to distinguish protein variants in enzymes (Schlotterer 2004). Isozyme analysis

has been used for over 60 years for various purposes in biology, viz. to delineate

phylogenetic relationships, to estimate genetic variability and taxonomy, to study population

genetics and developmental biology, to characterize plant genetic resources and plant

breeding (Bretting and Widrlechner 1995). Isozymes were defined as structurally different

molecular forms of an enzyme with, qualitatively, the catalytic function. Isozymes originate

through amino acid alterations, which cause changes in net charge or the spatial structure

(conformation) of the enzyme molecules and also, therefore, their electrophoretic mobility.

Like morphological marker, the biochemicals markers are are also limited in number and are

influenced by environmental factors or the developmental stage of the plant (Winter and Kahl

1995).

2.1.3 Molecular markers or DNA markers

Molecular markers are the DNA sequence variations that can be readily detected and whose

inheritance can be monitored easily. The uses of molecular markers are based on the naturally

occurring DNA polymorphism, which forms basis for designing strategies to exploit for

applied purposes. A marker must be polymorphic i.e. it must exit in different forms so that

chromosome carrying the mutant genes can be distinguished from it. Genetic polymorphism

is defined as the simultaneous occurrence of a trait in the same population of two variants or

genotypes. DNA markers seem to be the best candidates for efficient evaluation and selection

of plant genetic material. Unlike protein markers, DNA markers segregate as single gene and

are not influenced by the environment. DNA is easily extracted from plant materials of any

7

developmental stage and its analysis can be cost and labour effective (Kumar et al. 2009).

The development and use of molecular markers for the detection and exploitation of DNA

polymorphism is one of the most significant developments in the field of molecular genetics

(Semagn et al. 2006). It has also been proved to be helpful in understanding the genetic

relationships, evolutionary trends and fingerprinting of varieties. Genetic diversity can be

assessed either through morphological traits or by using molecular markers. The assessment

of genetic diversity using molecular marker assumes greater significance as it is a pre-

requisite for any sound breeding program (Surwenshi et al. 2011). DNA marker systems,

which were introduced to genetic analysis in the 1980s, have many advantages over the

traditional morphological and protein markers that are used in genetic and ecological analyses

of plant population: firstly, an unlimited number of DNA markers can be generated;

secondly, DNA marker profiles are not affected by the environment and thirdly DNA

markers, unlike isozyme markers are not constrained by tissue or developmental stage

specificity (Park et al. 2009).

2.2 Classification of DNA markers

The various types of available DNA markers can be classified broadly into following three

groups:

2.2.1 First generation DNA markers (Hybridization based)

The first generation DNA marker system employed southern blot based markers such as

RFLPs (Restriction Fragment Length Polymorphism). The RFLP technique employs

molecular hybridization of cDNA or genomic DNA probes with genomic DNA digested with

restriction enzymes. RFLP is the most widely used hybridization-based molecular marker.

RFLP markers were first used in 1975 to identify DNA sequence polymorphism for genetic

mapping of a temperature- sensitive mutation of adeno-virus serotypes (Grodzicker et al.

1975). It was then used for human genome mapping (Botstein et al. 1980) and later adopted

for plant genomes (Helentjaris et al. 1986; Weber and Helentjaris 1989).

2.2.2 Second generation (PCR- based markers)

The second generation DNA-based molecular markers were driven by the invention of

polymerase chain reaction (PCR) (Mullis et al. 1986). PCR is a molecular biology technique

for enzymatically replicating (amplifying) small quantities of DNA and analyzed in many

individuals without the requirement for cloning or isolating large amounts of ultra-pure

8

genomic DNA. It is used to amplify a short (usually up to 10 kb), well-defined part of a DNA

strand from single gene or just a part of a gene. PCR revolutionized the genetic and

ecological analysis of populations in several ways because it had two major advantages over

southern blot based markers. First, it requires only small amount of DNA to allow analysis at

very early stages, thus reducing the need for plant nurseries. Second, it is inexpensive and

simple enough so that large scale experiments can be carried out rapidly on a large scale

basis. The various types of PCR based markers such as RAPD, AFLP, ISSR, IRAPs and SSR

relies on the use of PCR primers, which binds to multiple or specific sites in the genome.

This can be achieved by using either short PCR primers (Randomly Amplified

Polymorphism, RAPD) (Williams et al. 1990), PCR primers that are complementary to

repetitive elements such as microsatellites (Inter-Simple-Sequence-Repeats, ISSR)

(Zietkiewicz et al. 1994) or retrotransposans (Inter- Retrotransposon amplified

Polymorphism, IRAPs) (Kalendar et al. 1999). Alternatively, restriction fragments could be

amplified by adding linkers and subsequent selective amplification as in case of AFLP

(Zabeau and Vos 1993; Vos et al. 1995).

2.2.3 Third generation DNA markers (DNA sequence based)

In recent years, there has been an emphasis on the development of newer and more efficient

high throughput molecular marker systems involving inexpensive non gel-based assays with

high throughput detection systems i.e. SNPs (Single Nucleotide Polymorphism) (Gupta et al.

2001) and Microarrays (Linman et al. 2009). The polymorphism of single base differences

can be assayed by high-throughput analysis, by hybridization with allele-specific

oligonucleotides (ASO), primer extension, oligonucleotide ligation assays (OLA) and

invasive cleavage. The main advantage of SNPs is their high potential for an automated high-

throughput analysis at moderate cost (Chen and Sullivan 2003)

“….the arrival of DNA manipulation techniques promoted a shift from

enzyme-based to DNA-based markers (Schlotterer 2004).”

9

Tab

le 2.1

Classificatio

n o

f molecu

lar mark

er system

s (adopted

from

Jones et al. 2

009)

Mark

er Syste

m

Ad

van

tages

Disa

dvan

tages

(A) F

irst-gen

eratio

n m

ark

ers based

on

restrictio

n fra

gm

ent d

etection

Restrictio

n frag

men

t length

poly

mo

rphism

(RF

LP

) C

o-d

om

inan

t, hig

hly

repro

ducib

le L

ow

multip

lex ratio

*; h

igh o

n tim

e/labour

(B) S

econ

d g

enera

tion

ma

rkers b

ased

on

PC

R

Cleav

age am

plificatio

n p

oly

morp

hism

(CA

P)

Insen

sitive to

DN

A m

ethylatio

n, n

o

requirem

ent fo

r radio

activity

Pro

duces in

form

ative P

CR

pro

ducts

Ran

dom

amplified

poly

morp

hic D

NA

(RA

PD

) L

ow

on tim

e/labour; m

ediu

m m

ultip

lex

ratio*

Dom

inan

t; low

repro

ducib

ility

Am

plified

Frag

men

t Len

gth

Poly

morp

hism

(AF

LP

) H

igh rep

roducib

ility; h

igh m

ultip

lex

ratio*

Dom

inan

t; moderate tim

e/labour

Seq

uen

ce- specific am

plificatio

n p

oly

morp

hism

(S-

SA

P)

Applicab

le for targ

eting an

y g

ene,

transp

oso

n o

r sequen

ce of in

terest

Seq

uen

ce must b

e know

n to

enab

le desig

n o

f

elemen

t- specific P

CR

prim

ers

Sim

ple seq

uen

ce repeat (m

icrosatellite) (S

SR

) C

o-d

om

inan

t, hig

hly

repro

ducib

le; low

on tim

e and lab

our

Hig

h co

st of d

evelo

pm

ent; lo

w m

ultip

lex ratio

*

Inter-S

imple seq

uen

ce repeat (IS

SR

) T

echnically

simple; n

o p

rior g

enom

ic

info

rmatio

n n

eeded

to rev

eal both

inter-

and in

traspecific v

ariation

Dom

inan

t mark

ers; ban

d stain

ing can

be w

eak

Variab

le num

ber tan

dem

repeat (m

inisatellite)

(VN

TR

)

Num

erous m

ultiallelic lo

ci L

ow

-resolu

tion fin

gerp

rints in

plan

ts

Seq

uen

ce tagged

sites (ST

S)

Co-d

om

inat, u

seful fo

r map

pin

g

Rep

roducib

ility; b

ased o

n so

me d

egree o

f sequ

ence

know

ledge.

Seq

uen

ce characterized

amplificatio

n reg

ion (S

CA

R)

May

be d

om

inan

t or co

-codom

inat;

better rep

roducib

ility th

an R

AP

Ds

More d

ifficult to

repro

du

ce than

RA

PD

s

Seq

uen

ce amplificatio

n o

f micro

satellite poly

morp

hic

loci (S

AM

PL

)

Hig

h m

ultip

lexin

g*; co

-dom

inat

mark

ers; exten

sive p

oly

morp

hism

Som

e blu

rred b

andin

g; stu

tter ban

ds

10

(C) T

hird

- gen

eratio

n m

ark

ers based

on

DN

A seq

uen

cing

Sin

gle n

ucleo

tide p

oly

morp

hism

(SN

P)

Com

mon; ev

enly

distrib

uted

; detectio

n

easily au

tom

ated; h

igh th

roughput; lo

w

assay co

st; usefu

l for asso

ciation

studies; p

oten

tially h

igh m

ultip

lex

ratio*

Usu

ally o

nly

two alleles p

resent

Ex

pressed

sequen

ce tag (E

ST

) E

asy to

collect an

d seq

uen

ce; reveals

novel tran

scripts; g

ood rep

resentatio

n

of tran

scripts

Erro

r-pro

ne co

-dom

inan

t and d

om

inan

t mark

ers,

which

can lead

to co

mplex

ity; n

ull alleles d

etected

directly

Targ

et recognitio

n am

plificatio

n p

roto

col (T

RA

P)

Sim

ple to

use; h

ighly

info

rmativ

e;

pro

duces n

um

erous m

arkers b

y u

sing

existin

g p

ublic E

ST

datab

ase; uses

mark

ers targeted

to sp

ecific gen

e.

Req

uires cD

NA

or E

ST

sequen

ce info

rmatio

n fo

r

prim

er dev

elopm

ent

Micro

arrays (arran

gem

ent o

f small sp

ots o

f DN

A

fixed

to g

lass slides)

Whole g

enom

e scannin

g; h

igh

-

thro

ughput tech

nolo

gy; g

enoty

pe-

phen

oty

pe relatio

nsh

ip; ex

pressio

n

analy

sis of larg

e num

bers o

f gen

es

Ex

pen

sive; n

eeds g

ene seq

uen

ce data; tech

nically

dem

andin

g

Div

ersity arra

y tech

nolo

gy (D

ArT

) N

o seq

uen

ce data req

uired

; hig

h-

thro

ughput; d

etects single b

ase chan

ges

and in

dels; rap

id g

ermplasm

characterizatio

n

Dom

inan

t mark

ers; techn

ically d

eman

din

g

Sin

gle-stran

d co

nfo

rmatio

nal p

oly

morp

hism

(SS

CP

) D

etects DN

A p

oly

morp

hism

and

mutatio

ns at m

ultip

le sites in D

NA

fragm

ents

Tem

peratu

re-dep

enden

t; sensitiv

ity affected

by p

H

Den

aturin

g g

radien

t gel electro

phoresis (D

GG

E)

Sep

arates indiv

idual seq

uen

ce from

a

com

plex

mix

ture o

f micro

bes b

ased o

n

sequen

ce differen

ces

PC

R frag

men

t size limited

to ab

out 5

00 b

p;

difficu

lt to reso

lve frag

men

ts that d

iffer by o

nly

one

or tw

o b

ases

Tem

peratu

re grad

ient g

el electrophoresis (T

GG

E)

Alm

ost id

entical to

DG

GE

; more

reliable; u

ses temperatu

re grad

ient

Tech

nically

dem

andin

g; little u

sed in

plan

ts

*T

he m

ultip

lex ratio

is the n

um

ber o

f indep

enden

t loci d

etected in

the assa

y

11

Tab

le 2.2

Com

pariso

n o

f vario

us asp

ects of freq

uen

tly u

sed m

olecu

lar mark

ers techniq

ue (M

odified

from

Agarw

al et al. 2008)

A

bu

nd

an

ce R

epro

du

cibility

D

egre

e of

poly

morp

hism

Locu

s

specificity

Tech

nica

l

requ

irem

ent

Qu

an

tity o

f

DN

A req

uired

Majo

r ap

plica

tion

RF

LP

H

igh

H

igh

M

ediu

m

Yes

Hig

h

Hig

h

Ph

ysical m

appin

g

RA

PD

H

igh

L

ow

M

ediu

m

No

Low

L

ow

G

ene tag

gin

g

SS

R

Med

ium

M

ediu

m

Med

ium

N

o

Med

ium

L

ow

G

enetic d

iversity

SS

CP

L

ow

M

ediu

m

Low

Y

es M

ediu

m

Low

S

NP

map

pin

g

CA

PS

L

ow

H

igh

L

ow

Y

es H

igh

L

ow

A

llelic div

ersity

SC

AR

L

ow

H

igh

M

ediu

m

Yes

Med

ium

L

ow

G

ene tag

gin

g an

d

ph

ysical m

appin

g

AF

LP

H

igh

H

igh

M

ediu

m

No

Med

ium

M

ediu

m

Gen

e taggin

g

IRA

P/R

EM

AP

H

igh

H

igh

M

ediu

m

Yes

Hig

h

Low

G

enetic d

iversity

RA

MP

O

Med

ium

M

ediu

m

Med

ium

Y

es H

igh

L

ow

G

enetic d

iversity

RF

LP

restriction frag

men

t length

poly

morp

hism

, RA

PD

random

amplifi

ed p

oly

mo

rphic D

NA

, SS

R sim

ple seq

uen

ce repeats, S

SC

P sin

gle stran

d

confo

rmatio

nal

poly

morp

hism

, C

AP

S

cleaved

am

plifi

ed

poly

mo

rphic

sequen

ce, S

CA

R

sequen

ce ch

aracterized

amplifi

ed

regio

n,

AF

LP

Am

plifi

ed

fragm

ent

length

poly

morp

hism

, IR

AP

/RE

MA

P

inter-retro

transp

oso

n

amplifi

ed

poly

morp

hism

/retrotran

sposo

n-m

icrosatellite

amplifi

ed p

oly

mo

rphism

12

2.3 Simple sequence repeats (SSRs) or microsatellite markers

The first widespread markers to avail full advantage of PCR technology was microsatellites

(Litt and Luty 1989; Tautz 1989; Weber and May 1989). The genomes of higher organisms

contain three types of multiple copies of simple repetitive DNA sequences (satellite DNAs,

minisatellites and microsatellites) arranged in arrays of vastly differing size (Armour et al.

1999; Hancock 1999). Microsatellites, variously known as short tandem repeats (STR),

Simple Sequence Repeats (SSRs) or Simple Sequence Length Polymorphism (SSLPs) are

tandem repeats occurs in the form of iterations of repeat units of almost anything from a

single base pair to thousands of base pairs (Litt and Luty 1989). Some researchers (e.g.

Armour et al. 1999) define microsatellites as 2-8 bp repeats, others (e.g., Goldstein and

Pollock 1997 ) as 1-6 or even 1-5 bp repeats (Schlotterer 1998). Mono-, di-, tri- and

tetranucleotide repeats are the main types of microsatellite, but repeats of five (penta-) or six

(hexa-) nucleotides are usually classified as microsatellites as well. The term satellite DNA

originate from the observation in the 1960s of a fraction of shared DNA that showed a

distinct buoyant density, detectable as a ‘satellite peak’ in density gradient centrifugation and

that was subsequently identified as large centromeric tandem repeat.

Microsatellites were first identified in humans in 1981 by sequence analysis of alleles

at the β globin locus (Miesfeld et al. 1981; Spritz 1981) and subsequently found to be

naturally occurring and ubiquitous in prokaryotic and eukaryotic genomes (Tautz and Renz

1984; Jeffreys et al.1985; Tautz 1989; Thoren et al.1995; Toth et al. 2000). The repeats of

longer units form minisatellites or in the extreme case, satellite DNA (Ellegren 2004). The

isolation and sequencing of satellite DNAs revealed repeat motifs of variable length from just

a single base to thousands of base, a typical satellite DNA is a centromeric sequence with a

100bp repeat (Pardue and Gall 1970). Subsequently, satellite of 10-30 bp repeat motifs,

termed minisatellites was isolated in mammals (Jeffreys et al. 1985). Finally, satellites with

even shorter repeat motifs, called microsatellites, were isolated. In 1982, Hamada and his

colleagues showed the existence of dinucleotide repeats of poly (CA) and poly (GT) in

diverse eukaryotic genomes. Furthermore, Weber and May (1989) demonstrated that SSR

polymorphisms (SSRPs) could be easily detected by PCR, using two flanking primers, which

prompted the development of SSRs in various mammalian species and their subsequent

assignment to specific chromosomes. In plants, the presence of SSRs was first demonstrated

by the hybridization of oligonucleotide probes of poly (GT) and poly (AG) on the phage

libraries of tropical tree genomes.

13

A search of published DNA sequences reveals that SSRs are also highly abundant in

diverse plant genomes (Morgante and Olivieri 1993). It has been shown that SSR in exons are

less abundant than in non-coding regions (Hancock 1995) and that different taxa exhibit

different preferences for SSR types (Beckmann and Weber 1992, Lagercrantz et al. 1993;

Tautz and Schlotterer 1994). Microsatellites are highly polymorphic, abundant and fairly

evenly distributed throughout the euchromatic part of the genomes. These properties have

made microsatellites one of the most popular genetic markers for mapping, paternity testing

and population genetics (Goldstein and Schlotterer 1999). SSRs are non-randomly distributed

within expressed sequence tags (ESTs), UTR regions, introns and coding regions. SSR

variations within these regions can cause frame-shift, alteration in gene expression,

inactivation of gene, change of function and eventually phenotypic changes and can cause

neuronal disease, cancers in human and animals (Li et al. 2004). The significant part of SSR

structure are functionally important for gene transcription, translation, chromatin

organization, recombination, DNA replication, DNA MMR system, cell cycle, etc. as shown

in Figure 2.1.

Figure 2.1 Putative functions/effects of SSRs (Reproduced from Li et al. 2002)

SSR functions/effects

Chromatin

organization

Regulation of DNA

metabolic processes

Regulation of gene

activity

Chromosoma

l organization

DNA

structure

Centromere

and

telomere

Transcription Binding

protein

Translation

DNA

replication

Recombination MMR

system Cell cycle

14

2.3.1 Origin/genesis of microsatellites

The genesis/origin of SSRs is an evolutionarily dynamic process and has proven to be

exceedingly complex phenomenon (Ellegren 2004; Pearson et al. 2005). The most accepted

mechanism of microsatellite genesis is based on (i) proto-microsatellite (ii) insertion/deletion

of 2-4 nucleotide and (iii) retro-transposon

(i) Proto-microsatellite

The origin of microsatellites concluded that a minimum number of repeats called as proto-

microsatellite are required before DNA polymerase slippage can extend the number of

repeats (Rose and Falush 1998). It has been shown that in the species that have primates as

their common ancestor (e.g. gorillas, chimpanzees and humans) GA mutations at the ƞ-globin

locus change the sequence ATGTGTGT to ATGTATGT, thus creating a microsatellite

(ATGT)2 which evolved into (ATGT)4 in African monkeys and (ATGT)5 in humans (Messier

et al. 1996).

(ii) Insertion/deletion of 2-4 nucleotides

Zhu et al. (2000) conducted an elegant study on mutated human genes and demonstrated that

more than 70% of all 2 to 4 nucleotide insertions resulted in 2 to 5 new repeats, most of

which are not extensions of pre-existing repeats but new microsatellites originating from

random sequences e.g.

AC

ACGGACG ACGACGACG (ACG) 3

ACGAATCGACG ACGACGACG (ACG) 3

AT

(iii) Retro-transposon

Retro-transposons are repetitive DNA fragments, which are inserted into chromosomes after

they had been reverse-transcribed from any RNA molecule. SSR generation was found to be

accompanied by retro-transposition events by analysis of a portion-sequenced of human and

rice genome DNA (Nadir et al. 1996; Temnykh et al. 2001). According to Arcot et al. (1995),

the Alu SINES (Short Interspersed Nuclear Elements) family is largely dispersed in the

primate genome and is likely to contribute to the genesis of microsatellites due to the

presence of A-rich regions at the 3’ terminal and within the sequence.

15

2.3.2 Mechanism of SSR length variation

The most accepted mechanism of microsatellite length variation is mutational mechanism

including errors during recombination (Levinson and Gutman 1987), unequal crossing-over

(Harding et al. 1992) and polymerase slippage (Wolff et al. 1991; Stephan and Kim 1998)

during DNA replication or repair. When unequal crossing-over occurs, there can be drastic

changes such as the loss or gain of a large number of repeats.

(i) Replication slippage

Length changes in microsatellite DNA generally occur during DNA replication or repair. The

DNA polymerase slippage can occur, in which one DNA strand temporarily dissociates from

the other and rapidly rebinds in a different position, leading to base-pairing errors and

continued lengthening of the new strand and an increase in the number of repeats (i.e.

additions). If the error occurs on the complementary strand or a decreased number of repeats

(i.e. deletions) if the error occurs on the parent strand (Goldstein and Schlottrer 1999,

Ellegren 2004). High rate of slippage have been demonstrated but these appear to lead to only

small changes in the number of repeats. Slippage can destabilize microsatellites either

because there is no effective repair system for DNA loops or because of alteration in DNA

polymerase or its cofactors that result in increased slippage rates.

Figure 2.2 Slippage during DNA replication. Assume that in the original DNA molecule

there were 5 repeats of the motif, symbolized by a box. Slippage leads to the formation of

new alleles with 6 and 4 repeats, depending on the strand containing the polymerase error

(reproduced from Goldstein and Schlottrer 1999)

3' 5'

’ 5' 3'

3' 5'

3’

’'’ 5'

3'

’

5'

’

3'

’

5'

’ 3' 5’

Replication

Slippage

3'

’ 5'

’

3'

’

5

’'

’

3' 5'

New replication cycle

+1 repeat -1 repeat

3' 5'

3' 5'

3' 5'

3' 5'

16

(ii) Recombination

Recombination could potentially change the SSR length by unequal crossing over or by gene

conversion (Brohele and Ellegren 1999; Jakupciak and Wells 2000). Unequal exchanges in

combination with random genetic drift and selection can have a strong effect on the

accumulation of tandem-repetitive sequences in the genome (Charlesworth et al. 1994).

Nonreciprocal recombination (gene conversion) play significant role in destabilization of

tandem repeats for both micro- and minisatellite (Jakupciak and Wells 2000; Richard and

Paques 2000).

Figure 2.3 Unequal crossing-over between homologous chromosomes. Black and blue

regions correspond to microsatellite repeat sequences (Reproduced from Oliveira et al. 2006).

(iii) Interaction of replication slippage and recombination

A very strong interaction was found between mean repeat length and SSR locus distance

from the centromere on the number of alleles and variation in repeat size at SSR loci and

microsatellite diversity study in wild emmer wheat (Li et al. 2003). The interaction of

slippage and recombination, which may happen in heteroduplex DNA tracts, could also affect

SSR stability (Li et al. 2002).

17

2.3.3 Classification of SSRs

Microsatellites are classified according to the type of repeat sequence as perfect, imperfect,

interrupted or compound. In a perfect microsatellite, the repeat sequence is not interrupted by

any base not belonging to the motif (e.g. TATATATATATATATA). While in an imperfect

microsatellite there is a pair of bases between the repeated motifs that does not match the

motif sequence (e.g. TATATATACTATATA). In case of an interrupted microsatellite there

is a small sequence that does not match the motif sequence (e.g.

TATATACGTATATATATA). While in a compound microsatellite the sequence contains

two adjacent distinctive sequence repeats (e.g. TATATATATAGTGTGTGTGT). Depending

upon the arrangement of nucleotides within the repeat motifs, Weber (1990) used the terms

perfect, imperfect and compound to classify microsatellites, whereas Wang et al. (2009a)

coined the terms simple perfect, simple imperfect, compound perfect and compound

imperfect.

Table 2.3 Classification of SSR markers (reproduced from Kalia et al. 2011)

Types Example

(a) Based on the number of nucleotides per repeat

Mononucleotide (A)n

Dinucleotide (CA)n

Trinucleotide (CGT)

Tetranucleotide (CAGA)n

Pentanucleotide (AAATT)n

Hexanucleotide (CTTTAA)n

(b) Based on the arrangement of nucleotides in the repeat motifs

perfect or simple perfect (CA)n

Simple imperfect (AAC)n ACT (AAC)n+1

Compound or simple compound (CA)n (GA)n

Interrupted or imperfect or compound imperfect (CCA)n TT (CGA)n+1

( C) Based on location of SSRs in genome

Nuclear nuSSR

Chloroplastic cpSSRs

Mitochondrial mtSSRs

18

2.3.4 Frequency and distribution of SSRs

Various studies have demonstrated that the SSRs constitute a large fraction of non-coding

DNA. However, recently several reports have shown that a large number of SSRs are also

located in transcribed regions of genomes, including protein coding genes and expressed

sequence tags (ESTs), although, repeat number of SSRs in these regions are comparatively

low (Morgante et al. 2002; Li et al. 2004). For instance, in cereals (maize, wheat, barley,

Sorghum and rice), only 1.5-7.5% SSRs are available in ESTs (Kantety et al. 2002; Thiel et

al. 2003). The dinucleotide repeats are most common in many species, but are much less

frequent in coding region than in non-coding regions (Li et al. 2002; Wang et al. 1994). In

many species, exons have more triplet SSRs than other repeats (Morgante et al. 2002; Li et al.

2004). In plants, the most frequent triplet is AAG (Li et al. 2004), although in cereals, most

common triplet is CCG (Cordeiro et al. 2001; Varshney et al. 2002; Thiel et al. 2003). In

general, frequency of microsatellites is inversely related to the genome size in plants, but the

percentage of repetitive DNA appeared to remain constant in coding regions (Morgante et al.

2002). The genomic distribution, evolutionary dynamics, biological function and practical

utility have been the objective of many researchers, as summarized in several review articles

(Tautz and Schlotterer 1994; Jarne and Lagoda 1996; Schlotterer 1998; Chambers and

MacAvoy 2000; Li et al.2002; Dieringer and Schlotterer 2003; Ellegren 2004; Oliveira et al.

2006; Subirana and Messeguer 2008; Sun et al.2009).

2.3.5 Identification of polymorphism

To identify DNA markers that reveal differences between parents (i.e. polymorphic markers)

it is critical that sufficient polymorphism exists between parents. Generally, cross pollinating

species possess higher levels of DNA polymorphism compared to inbreeding species,

mapping in inbreeding species generally requires the selection of parents that are distantly

related. In many cases, parents that provide adequate polymorphism are selected on the basis

of the level of genetic diversity between parents (Anderson et al. 1993). The choice of DNA

markers used for mapping may depend on the availability of characterized markers or the

appropriateness of particular markers for a particular species. Once polymorphic markers

have been identified, they must be screened across the entire mapping population, including

the parents (and F1 hybrid, if possible). This is known as marker ‘genotyping’ of the

population.

19

2.4 Discovery and development of SSR markers

Highly polymorphic and dispersed molecular markers can be used to study the biological

relatedness of organism, facilitate the mapping of valuable traits and ultimately the cloning of

their genes. In recent years, a number of different molecular marker systems have been

developed, with microsatellite markers proving to be most powerful. However, despite their

usefulness for many applications, the difficulty, expenses and time in obtaining microsatellite

markers are a major hindrance to their use. The traditional method for isolating microsatellite

clones is to create a small-insert, partial genomic library in a plasmid or phage vector and

then screen clones by repeated rounds of filter hybridization using an oligonucleotide repeat

probe. Microsatellite enrichment has also been developed to increase the proportion of clones

in a given library containing the microsatellite motif of interest. Several strategies for

microsatellite enrichment have been reported (Edwards et al. 1996; Fisher and Bachman

1998; Hamilton et al. 1999; Kijas et al. 1994; Koblizkova et al. 1998; Paetkau 1999; Phan et

al. 2000; Zane 2002; Nunome et al. 2006 and 2009). Conventional genomic libraries

construction and subsequent screening is cumbersome, tedious and cost intensive process

which requires high level of expertise. However, once developed, the running cost of these

markers is low enough. AT di-nucleotides, which are the most abundant type of SSRs in

plants are difficult to isolate from libraries because they are palindromic (Powell et al. 1996).

Therefore, several alternative strategies have been devised in order to reduce the time

invested in SSR isolation and to significantly increase yield of SSRs. These methods involve

identification of SSR sequence in RAPD amplicons, screening of available sequenced EST

databases and transferability of markers from related species.

2.4.1 Development of SSRs through enriched small insert genomic library construction

An efficient way to discover new SSR is to construct libraries enriched for specific SSR

motifs (Ostrander et al. 1992; Kijas et al. 1994; Kandpal et al. 1994; Edwards et al.1996;

Fisher and Bachmann 1998; Hamilton et al. 1999; Jakse and Javornik 2001). The SSR

markers have been developed for many plant species (Edwards et al.1996; Connell et al.1998;

Fisher and Bachmann 1998; Jones et al. 2001; Kolliker et al. 2001; Zane et al. 2002; Cai et al.

2003; Wang et al. 2004) including several trees (Rossetto et al. 1999; Liebhard et al. 2002;

Marinoni et al. 2003; Merdinoglu et al. 2005). The construction of SSR-enriched libraries can

be tedious and expensive work, especially without some type of enhancement process to

eliminate non-SSR-containing clones. Several enrichment methodologies have been

developed (Edwards et al. 1996; Connell et al. 1998; Fisher and Bachmann 1998; Hamilton et

20

al. 1999; Wang et al. 2004), but regardless of the protocols, SSR detection and discovery

invariably relies on sequencing all the insert of selected clones to confirm the presence of the

desired motif. Several polymerase chain reaction (PCR)- based methods have been applied to

both non-enriched and enriched libraries to identify inserts containing SSRs before

sequencing and these strategies have increased the efficiency of detecting colonies with

desired SSR inserts (Lench et al. 1996; Lunt et al. 1999; Chen et al. 2005). Microsatellite

DNA loci have become important source of genetic information for a variety of purposes

(Goldstein and Schlotterer 1999). To amplify microsatellite loci by PCR, primers must be

developed from the DNA that flanks specific microsatellite repeats. These regions of DNA

are among the most variable in the genome, thus primer-binding sites are not well conserved

among distantly related species (Pepin et al. 1995; Primmer et al. 1996; Zhu et al. 2000).

Among various strategies for obtaining microsatellite loci, the cloning small genomic

fragments and using radio labeled oligonucleotide probes of microsatellite repeats to identify

clones with microsatellite was the first described and works well in organisms with abundant

microsatellite loci (Tautz1989; Weber and May 1989; Weissenbach et al. 1992).

Unfortunately, this approach does not work well when microsatellite repeats are less

abundant. Thus, two classes of enrichment strategies have been developed: 1) uracil-DNA

selection (Ostrander et al. 1992) and 2) selective hybridization capture (Armour et al. 1994;

Kandpal et al.1994; Kijas et al. 1994). The selective hybridization strategy based SSR

enrichment technique is a relatively simple, robust, reproducible and cost effective approach

for isolating large number of SSRs from diverse plant species with higher efficiency. In the

first step of selective hybridization approach, fragments generated by sonication or

endonuclease digestion of genomic DNA are ligated to known sequence, a vector or an

adaptor. Following the fragmentation-ligation step, DNA is denatured and hybridized with

the repeat containing probes (Figure 2.4). The probes can be bound to a nylon membrane

(Karagyozov et al. 1993, Armour et al. 1994) or biotynylated and captured on streptavidin

coated beads (Kandpal et al.1994; Kijas et al. 1994). After the hybridization step and several

washes with buffer to remove nonspecific binding, the probe bound DNA is eluted and

recovered by southern blotting, PCR or direct sequencing (Zane et al. 2002).

Two protocols were proposed to produce genomic libraries that were highly enriched

for specific SSRs using a primer extension reaction (Ostrander et al. 1992; Paetkau 1999).

Both methods rely on the construction of a primary genomic library, in which fragmented

genomic DNA is inserted into a phagemid or a phage vector in order to obtain a single strand

DNA (ssDNA) library. ssDNA is then used as a template for a primer extension reaction,

primed with repeat-specific oligonucleotides, which generates a double stranded product only

21

from vectors containing the desired repeat. During the primary library production step, for

practical reasons, only a limited portion of the investigated.

2.4.2 Development of SSRs through screening of RAPD amplicons

To avoid library construction and screening, some authors proposed modifications of the

RAPD approach for the amplification of unknown SSRs, by either using repeat-anchored

random primers (Wu et al. 1994) or using RAPD primers and subsequent southern

hybridization of polymerase chain reaction (PCR) bands with SSR probes (Cifarelli et al.

1995; Richardson et al. 1995). Although not useful for single-locus analyses as no

information on SSR flanking regions is obtained, these methods inspired alternative strategies

for the identification of single SSRs. Based on the observed abundance of repeat regions in

RAPD amplicons, isolation of SSR regions was achieved simply by means of southern

hybridization of RAPD profiles with SSR-repeat containing probes, followed by the selective

cloning of positive bands (Ender et al. 1996), or through the cloning of all the RAPD

products and screening of arrayed clones (Lunt et al. 1999). Other non-library PCR based

strategies rely on the use of repeat-anchored primers to isolate and then sequence one (Fisher

et al. 1996) or both regions (Lench et al. 1996; Cooper et al. 1997) flanking SSR repeats. All

these methods provide, if successful, a quick alternative to laborious and time- consuming

library screening, but their use has not been that much frequent (Zane et al. 2002).

22

Figure 2.4 A general protocol for developing SSR markers with a SSR-enrichment step

(reproduced from Park et al. 2009)

2.4.3 Development of SSRs from EST sequences (genic or EST-SSRs)

A wealth of sequence data of ESTs has been generated as a result of sequencing projects for

gene discovery from several plant species, giving scientists the flexibility to access many full-

length cDNA clones and characterized genes. These sequences are usually available in online

database in public domain and can be downloaded and scanned for identification of SSRs.

The expressed sequence tag-simple sequence repeats (EST-SSRs), genic SSRs or gene-

derived SSRs, found in complementary DNA (cDNA) or ESTs sequence, between 1.1% and

4.8% have EST-SSR tandems (Saha et al. 2004). These informative markers are becoming

Extracted genomic DNA from young leaves

Digested Genomic DNA with restriction enzymes

Size fractionated from 300 to 1500bp

Adaptor ligation

One way PCR amplification using adaptor primers

Hybridization with SSR motif biotin probe and Streptavidin

coated magnetic beads in amplicons

Substract SSR motif elements using magnet

Washing

Denaturation

Reamplification

Cloning and Sequencing with SSR motif

fragments

23

the marker of choice as a large number of cDNA and EST sequences are being uploaded over

the public databases. These markers are more attractive than genome-based SSR markers

because they are obtained in a fast, efficient and low-cost method and are present in coding

regions of the genome, which makes them absolute markers of functional genes (Haimei et al.

2005; Varshney et al. 2005). Thus, the development of EST-SSRs is relatively easy and

inexpensive because they are produced form ESTs that are publicly available. However, the

generation of EST-SSRs is largely limited to those species or close relatives for which the

ESTs sequences are available. The EST-SSRs have some intrinsic advantages over genomic

SSRs as they have higher transferability rate because the primers are designed from the more

conserved coding regions of the genome. Several search modules or programmes have been

extensively used to identify SSRs such as:

MISA(MIcroSAtellite,http://pgrc.ipk-gatersleben.de/misa),

SSRprimer (http://hornbill.cspp.latrobe.edu.au.http://acpfg.imb.uq.edu.au),

Sputnik (http://abajian.nert/sputnik/index.html),

SSRIT (SSR Identification Tool, http://www.gramene.org/db/searches/ssrtool),

SSRSEARCH (ftp://ftp.gramene.org/pub/gramene/software/scripts/ssr.pl),

TRF (Tandem Repeat Finder, http://tandem.bu.edu/trf/trf.html) etc. Recently, Yadav et al.

(2011) developed and deployed 50 EST-SSRs over 25 accessions of J. curcas collected from

different geographical regions of India for genetic diversity analysis. Cubry et al. (2014)

developed 226 EST-SSR markers and tested over 19 Hevea accessions to assess the

polymorphism and diversity analysis. Wang et al. (2014) developed 1129 EST-SSRs from

two 454 sequencing cDNA libraries of Gossypium barbadense and 311 polymorphic loci

were integrated into interspecific BC1 genetic linkage map. Jain et al. (2014) downloaded

13,513 ESTs from NCBI and obtained 7552 unigenes from these ESTs and developed 377

ESTs-SSR in Jatropha curcas. SSRs have been isolated for a number of plant species using

this strategy. Greater DNA sequence conservation in transcribed regions, however, leads to

lower polymorphism in EST-SSRs making them less efficient compared to genomic SSRs for

distinguishing the closely related genotypes. Therefore, genomic SSRs are superior over

EST-SSRs for fingerprinting or varietal identification studies.

A major drawback of EST-SSRs is the sequence redundancy that yields multiple sets

of markers at the same locus (Parida et al. 2006). To circumvent the problem of redundancy

in EST data base, a non-redundant unigene EST data set (random EST sequences assembled

into unique gene sequences called unigenes) should be used. However, EST sequence

analyses revealed approximately 1.5-7.5% of sequences containing microsatellite motifs in

cereals (Kantety et al. 2002; Thiel et al. 2003). Among dicotyledonous species the frequency

24

of ESTs containing SSRs was found to range from 2.65 to 16.82% (Kumpatla and

Mukhopadhyay 2005). Therefore, regardless of the plant considered, an ever-increasing

number of EST sequences provide a complementary source for microsatellite marker

identification. Although the conserved nature of coding sequences may limit their

polymorphism, it should facilitate cross-amplification of loci among phylogenetically related

species (Scott et al. 2000) and even genera.

2.4.4 Development of SSRs through search of genome sequences

Another approach for the isolation of SSRs involves a computational search of the genome

databases. Weber and May (1989) reported abundance of the (CA)n SSRs in the human

genome through search of human genome sequence database. In plants, the first report of

(CA)n repeats in soybean sequence was the one from a computer search of the Gene Bank

databases (Akkaya et al. 1992). Morgante and Olivieri (1993) showed an abundance of (AT)n

repeats in 30 different plant genomes and demonstrated that the analysis of repeat number

variations by PCR was highly informative in genome analyses. Wang et al. (1994) surveyed

mono-, di-, tri- and tetra-nucleotide repeats which were all present in non-coding regions, but

57% of the tri-nucleotide repeats, containing G-C base pairs, reside in the coding region. The

abundance of tri-nucleotide repeats in coding region was attributed to the other types being

eliminated from the coding sequences because of their ability to cause frame-shift mutations.

However, in species lacking genomic sequences, a computer search would not be

useful in developing large scale SSR markers. Construction of SSR-enriched libraries leading

to development of large-scale sequences would be a plausible way to develop SSR markers.

2.4.5 Development of SSR through next generation sequencing

Traditional methods for the identification of microsatellite markers usually demand the

construction of small-insert genomic libraries, colony selection by microsatellite- containing

probe hybridization, sequencing of selected clones, primer design for suitable flanking

regions and assessments on the marker polymorphism by PCR analysis on a germplasm

sample. Later on, methods employing microsatellite- enriched genomic libraries diminished

costs, time and workload necessary for marker development (Billotte et al. 1999; Ostrander et

al. 1992; Paetkau 1999). More recently, researchers are being applying next-generation

sequencing technologies to generate sequence data for the genome identification of

microsatellite regions and primer design (Abdelkrim et al. 2009; Castoe et al. 2010;

Csencsics et al. 2010; Zhu et al. 2012). Next-generation sequencing (NGS) techniques

25

became commercially available around 2005 and since then, several different sequencing

methods have been developed, all of which are continuously being improved.

2.5 Cross-species amplification of SSRs

Sometimes, sequence information for a particular genome is not available or insufficient to

develop large scale SSRs. In such cases, it may be advantageous to utilize primer sequences

identified for one species in the analysis of other closely related species. In general, the EST-

SSRs showed higher rate of cross-species transferability as compared to genomic SSRs. In

this way SSRs developed for one species can be successfully utilized for other related

species. A large numbers of reports available cross-species transferability of SSRs was used

and few are listed below:

Wen et al. (2010) selected 419 Expressed sequence tag (EST)-SSR that had been

developed for Manihot esculenta (Cassava), and investigated whether they could be

transferred to J.curcas and found that the transferability of EST-SSR markers was high across

the species. Pamidimarri et al. (2011) isolated 49 SSR markers from J. curcas and checked

the ability of cross species amplification to deduce the generic relationship with its 6 sister

taxa (J. glandulifera, J. gossypifolia, J. integerrima, J. multifida, J. podagroca and J.

tanjorensis). Yadav et al. (2011) designed 406 EST-SSR markers and showed 57% to 95.6%

transferability among 5 species of Jatropha and 47.0% transferability across genera in

Ricinus communis. Laosatit et al. (2013) developed 163 EST-SSRs to evaluate the

transferability and genetic relatedness among 4 accessions of J. curcas from China, Mexico,

Thailand and Vietnam, 5 accessions of congeneric species, viz. J. gossypifolia, dwarf J.

integerrima, J. multifida, J. podagrica and Ricinus communis. The polymorphic markers

showed 75.56-85.19% transferability among four species of Jatropha and 26.67%

transferability across genera in Ricinus communis. The cross species SSR transferability has

also been used in various crop plant as few are listed here. Peakall et al. (1998) demonstrated

the successful cross-species amplification of 31 Soybean (Glycine max) SSR loci to within

and among legume genera, for the study of variation within population of single species. The

cross- species amplification within soybean (Glycine max) was up to 65% whereas, outside

the genus was much lower about 3% to 13%. Gupta et al. (2003) used 78 SSRs primer

successfully for the study of their transferability to 18 related wild species and 5 cereal

species (barley, oat, rye, rice and maize). Kuleung et al. (2004) examined the transferability

of SSR markers among wheat (Triticum aestivum L.), rye (Secale cereaale L.) and triticale

(T.duram L. or T. aestivum L. X Secale sp.). One hundred forty-eight wheat and 28 rye SSR

26

markers were used to amplify genomic DNA from five lines each of wheat, rye and triticale.

Tang et al. (2006) analyzed the utility of EST-SSR markers among monocots in silico and by

experiment for homologous analysis of SSR-ESTs and transferability of wheat SSR-EST

markers across barley, rice and maize. Chen et al. (2010) assessed the transferability 120 rice

SSR markers to 21 different bamboo species and reported 68.3% transferability. Sathya and

Jayamani (2013) used 35 microsatellite primers pairs derived from the adzukibean (Vigna

angularis (wild.) Ohwi & Ohashi) for the assessment of transferability over 36 genotype of

greengram and related Vigna species.

2.6 Advantage of SSR analysis

SSR markers have many advantages over the other marker systems. A few advantages are

listed as 1) High reproducibility: The high reproducibility would be the most important in

genetic analysis. While reproducibility of the SSR profile is as robust as it is with RFLPs,

experimental procedures for SSR analysis are much simpler and require only a small amount

of the template DNA. Since SSR analysis does not require restriction with enzymes, it can

reproduce the same profiles regardless of the state of the template DNA. It also does not

require template DNA to be ultra pure, which is a requirement in AFLP analysis since

contaminated or impure DNA is often recalcitrant in restriction enzyme digestion to produce

nonspecific spurious bands. This is a real benefit when one is dealing with specimens that are

dry, contaminated, mummified or even in fossilized form in the wild (Manen et al. 2003;

Boder et al. 2006). 2) Hyper-variability: The hyper-variable nature of SSRs produces very

high allelic variations even among very closely related varieties. A literature survey showed

that number of alleles varied from 1 to 37 with diversity indices of 0.29-0.95 in major crop

species (Powell et al. 1996). The level of genetic variation detected by SSRs analysis was

almost two times higher than that detected by RFLPs, with 61 soybean lines (Morgante et al.

1994). In a comparative study of the utility of RFLP, RAPD, AFLP and SSR marker systems

for germplasm analysis, SSRs showed the highest expected heterozygosity, while AFLP had

the highest effective multiplex ratio (Powell et al. 1996). 3) Co-dominancy: The third

advantage has to do with the co-dominat nature of SSR polymorphisms. Although

homoplasious bands can be misleading in scoring SSR profiles, the SSR bands produced

from the same set of primers are intuitively orthologous. The multiple bands generated from

RAPD and AFLP analysis do not permit their designation as allelic or orthologous bands

until they are converted into STS markers after sequencing. The co-dominant nature of SSRs

is suitable for genetical analysis in segregating F2 population or parentage analysis in hybrids

27

(Scott et al. 2000; Slavov et al. 2005). 4) Abundance and distribution: The SSRs are shown to

be highly abundant and distributed throughout the genomes (Wang et al. 1994; Toth et al.

2000, Varshney et al. 2005). Genetic analysis often becomes more complex by the fact that

large numbers of anonymous on RAPD or AFLP markers are clustered in specific locations

of chromosomes or linkage maps (Vuylsteke et al. 1999; Kwon et al. 2006). In silico

investigation by Varshney et al. (2002) for frequency and distribution of microsatellites in

ESTs of some cereal species like barley, maize, oats, rice, rye and wheat showed that the

frequency of SSRs was 1/7.5 Kb in barley, 1/7.5 Kb in maize, 1/6.2 Kb in wheat, 1/5.5 kb in

rye and sorghum and 1/3.9 Kb in rice. Gupta et al. (2003) recorded the average density of

SSRs per 9.2 kb of EST sequence of wheat. Lawson and Zhang (2006) studied on Arbidopsis

thaliana (2n=10) and Rice (2n=24) genome to find out the distinct pattern of SSR distribution

in whole genome. Comparative analysis of the Arabidopsis and rice genomes have yielded a

number of insights about the two plants because Arabidopsis has been traditionally used as a

model plant species and rice gathered much attention due to its significance in being one of

the major food resources in the world. The Arabidopsis genome contains a total of 104,102

SSRs and the average of SSR density in genome is approximately 875 per mega-base (MB).

In comparison, the rice genome contains a total of 298,819 SSRs and the average of SSR

density in genome is approximately 807/MB. The whole genome analysis of castor bean

identified 5, 80,986 SSRs with a frequency of 1 per 680 bp. The genomic distribution of

SSRs revealed that 27% were present in the non-genic regions whereas 73% were also

present in the putative genic regions with 26% in 5’UTRs, 25% in introns, 16% in 3’ UTRs

and 6% in exons (Sharma and Chauhan 2011). Qu and Liu (2013) analyzed genome-wide

simple sequence repeats in maize (Zea mays ssp. Mays L.). A total of 179,681 SSRs were

identified on the whole, 10 chromosomes and the average distance between repeat unit varied

from 11.12 Kb (chromosome 6) to 11.89 Kb (chromosome 4), with an average of 11.46 Kb.

The other advantage of the SSR marker system is that SSR are preferentially associated with

non-repetitive DNA (Varshney et al. 2005; Morgante et al. 2002; Andersen et al. 2003).

Genomic sites of SSR markers, derived from genomic libraries, fall into either the transcribed

region (genic SSRs) or the non- transcribed region (genomic SSRs). The SSRs, derived from

ESTs or cDNAs are mostly genic SSRs, which have the potential for application in such areas

as gene function characterization (Ronning et al. 2003), association analysis for gene tagging

(Szalma et al. 2005; Shin et al. 2006; Crossa et al. 2007) and QTL analysis (Buerstmayr et al.

2002; Breseghello et al. 2006a; Zeng et al. 2009).

28

2.7 Application of SSR markers

Microsatellites have become a marker of choice for an array of applications in plants due to

hypervariable nature and extensive genome coverage. There are many applications of

microsatellites in plants, a few can be categorized as: (1) genome mapping (2) cultivar

identification and markers-assisted selection (3) genetic diversity analysis and phylogenetic

relationship and (4) population and evolutionary studies (Figure 2.5).

Figure 2.5 Microsatellite- a summary of development, distribution, functions and

applications (reproduced from Kalia et al. 2011).

Gender

Identification

Hybridization

& breeding Transgenics Population genetics

Taxonomic and

phylogenetic

studies

Functional

Genomics

Genome

mapping

Diversity &

cultivar

analysis

Gene tagging

& QTL

analysis

Da

tab

ase

structu

re Applications

Techniques

based on SSR Development

Distribution and

functions

RA

HM

,

RA

MP

o S

AM

PL

Cro

ss species

am

plifica

tion

Scree

nin

g

gen

om

ic

libra

ries

Scree

nin

g o

f

RA

PD

an

aly

sis R

EM

AP

F

IAS

CO

S

TM

S

RA

MP

dR

AM

P

rR

AM

P

MP

-PC

R o

r

SS

R

Microsatellite

3’UTR Intro

n

Exon 5’UTR

Gene

Transcription

/Translation

Inactive genes or

truncated

proteins

Gene silencing mRNA splicing Transcription

slipage

29

(1) Genome mapping

Genome mapping consists of (a) genetic mapping (b) comparative mapping and (c) physical

mapping.

(a) Genetic mapping

Genetic mapping with microsatellite markers in plants was first reported in tropical trees

(Condit and Hubbell 1991) and then reported in soybean (Akkaya et al. 1992) and rice (Wu

and Tanksley 1993; Zhao and Kochert 1993).

(b) Comparative mapping

Comparative mapping is the alignment of chromosome or chromosomal fragment of related

species based on genetic mapping of common DNA markers and can trace the history of

chromosome rearrangements during the evolution of plants, animals and insects.

Microsatellite markers can facilitate comparative mapping which definitely helps to identify

‘linkage blocks’, major gene syntenies, chromosome rearrangement and micro-syntenies

among species. Major or micro-synthesis will further help to develop DNA markers for

specific chromosomal regions for marker-assisted selection and even for cross-species

homologous cloning (Wang et al. 2009b). Comparative genomics of Arabidopsis relatives has

great potential to improve our understanding of molecular function and evolutionary

processes. Recent studies of phylogenetic relationships within Family Brassicaceae provide

an important framework for comparative genomics research. Comparative linkage mapping

and chromosome painting in the close relatives of Arabidopsis inferred an ancestral

karyotype of these species. In addition, comparative mapping to Brassica identified genomic

blocks that have been maintained since the divergence of the Arabidopsis and Brassica

lineages (Schranz et al. 2007). Microsatellite markers have been used for comparative

mapping between Quercus robur (L.) and Castanea sativa (Mill.) (Barreneche et al. 2004).

EST-SSRs were used in comparative mapping in wheat, barley, rye and rice. The

conservative chromosome regions between wheat and rice and the presence of orthologues of

barley EST-SSRs in different species have been confirmed and identified (Yu et al. 2013,

Varshney et al. 2005; Stein et al. 2007). SSR markers have also been used to construct whole

genome physical maps of model crop species, for anchoring and comparing the frames of

soybean genetic and physical maps (Shultz et al. 2007; Shoemaker et al. 2008).

30

(c) Physical mapping

SSR markers can be used as anchor markers for joining large pieces of overlapped DNA

fragments such as bacterial artificial clones (BACs). Physical maps give us a real physical

distance between markers or genes in bp (base pair) or kbp. SSR markers have been used to

construct a whole genome physical map of model crop species.

(2) Cultivar identification

SSR marker-assisted selection (MAS) can also greatly enhance the efficiency of plant

breeding programs. SSR markers used for selection can be classified into flanking SSR

markers (closely linked to the locus for a trait) and targeted gene SSR markers (developed

within the targeted gene itself). In tomato, a set of 65 SSR markers has been selected for

distinguishing 19 diverse tomato cultivars (He et al. 2003).

(3) Genetic diversity and phylogenetic relationships

Genetic diversity refers to any variation in nucleotides, genes, chromosomes or whole

genomes of organism. Genetic diversity can be assessed at different levels within a species or

among species. Phylogenetic relationships reflect the relatedness of a group of species based

on a calculated genetic distance (sequence conservation or diversification in their

evolutionary history). SSR markers often are a powerful system for revealing interspecific or

intraspecific phylogenetic relationship. For example, the genetic diversity and phylogenetic

relationship from germplasm collection such as a temperate bamboo collection (Barkley et al.

2005), a citrus variety collection (Barkley et al. 2006) and a cultivated and wild peanut

collection (Barkley et al. 2007; Cuc et al. 2008) have been assessed by SSR markers.

(4) Population and evolutionary studies

Studies of plant evolution were traditionally based on taxonomic and phenotypic data (such

as morphological and karyotype). Microsatellite markers can be used to determine the

population structure within and among the natural population and /or identify the potential

progenitors. The development of organelle specific SSR markers (i.e. cpSSR and mtSSR) had

a great impact on the determination of structure and variation within a natural population as

well as phylogenetic relationships. The uni-parental mode of inheritance, conserved gene

order and lack of heteroplasmy and recombination of organelle genomes make them an

attractive tool for evolutionary studies, mainly patterns of migration, population histories and

31

levels of differentiation (Provan et al. 2001). However, ESTs are also being used for such

analysis because in such studies, one actually looks at the evolution of functional genes (Joshi

et al. 1999). Evolution of genetic diversity and phylogenetic relationships has resulted in

identification of some misclassified accessions that were reclassified. Genetic diversity

assessment and phylogenetic relationship construction will provide important information for

choosing parental lines for breeding programs, classification of plant germplasm accessions

and further curation and acquisition of new plant germplasm accessions (Wang et al. 2009a).

(5) QTL mapping and marker assisted breeding (MAB)

Microsatellite markers have been efficiently deployed in determination of specific genomic

regions that are responsible for the expression of important physiological and agronomic

traits. It has been used in analyzing quantitative trait loci (QTLs) which can lead to the

identification of candidate genes for the trait of interest that are particularly vital for the

breeding program like yield, disease resistance, stress tolerance, seed and fruit quality etc.

(Neeraja et al. 2007; Romero et al. 2009). Countless number of studies are available on QTL

mapping and MAB in plant using SSR markers. A few important and latest studies are

mentioned here.

(A) QTL mapping

Ding et al. (2011) developed genetic map consisting of 180 polymorphic SSR markers with

an average linkage distance of 11.0 cM and identified 5 QTL for maize test weight. Wang et

al. (2012) explored the genetic basis for stay-green traits in maize using 112 polymorphic

SSR markers and identified 14 quantitative trait loci (QTLs) for three stay-green and related

traits. Guo et al. (2013) utilized 245 RILs and 236 markers (211 SSR, 6 CAPS, 5 STS, 2 SNP

and 12 IDP) to identify QTLs for oil, starch and protein content traits in maize. Liu et al.

(2014) identified a total of 83 QTL for maize kernel-size traits and kernel weight in multi-

environments. Uga et al. (2013) identified five quantitative trait loci (QTLs) for rice using

SSR and SNP for the ratio of deep rooting (RDR). Lim et al. (2014) identified QTLs for 6

traits (plant height, tiller number, panicle diameter, panicle length, flag leaf length and flag

leaf width) for japonica rice. They identified 11 main- and 16 minor-effect QTLs for 6 traits

using 131 molecular markers (68 SSR, 35 STS and 28 insertion/deletion). Feng et al. (2013)

constructed genetic map for common wheat (Triticum aestivum L.) using 195 SSR and STS

markers and identified 3 QTL for A-type starch granule content. Rustgi et al. (2013)

developed high-density linkage map of wheat using 81 molecular markers (31 SSR, 1 STM,

32

25 RFLP, and 24 DArT) on chromosome 3A to increase the precision of previously identified

yield component QTLs and to map QTLs for biomass- related trait and identified four novel

QTLs for shoot biomass, total biomass, kernels/spike (KPS) and Pseudocercosporella

induced lodging (PsIL). Cui et al. (2014) constructed a genetic map consisting of 159 loci

distributed across 21 wheat chromosomes using 589 molecular markers (g-SSR, e-SSR,

DArT, STS, SRAP and ISSR). They identified 22 QTL for yield per plant (YD) and 12 QTLs

for yield difference between the value (YDDV) under higher nitrogen (HN) and the value

under lower nitrogen (LN). Rajkumar et al. (2013) constructed a linkage map with 104

markers loci comprising 50 EST-SSRs, 34 non-genic nuclear SSR and 20 SNPs in Sorghum

(Sorghum bicolor L. Moench) and identified a total of 28 QTLs for root and yield related

traits.

(B) Marker assisted breeding (MAB)

Furthermore, a large number of monogenic and polygenic loci for various traits could be

identified and exploited for marker-assisted selection (Joshi et al. 1999). Marker assisted

selection or marker assisted breeding can help breeders bypass the traditional phenotype

based selections in the field and save time and labour to develop new varieties. MAB can not

be affected by environmental factors, thus allowing the selection to be performed under any

environmental conditions. MAB is highly suitable for gene pyramiding. The rice variety

Swarna has been efficiently converted to a submergence tolerant variety in three backcross

generations using markers assisted backcrossing (Neeraja et al. 2007). Singh et al. (2013a)

utilized marker-assisted simultaneous and stepwise backcross breeding for pyramiding blast

resistance genes, Piz5 and Pi54, from non-Basmati donors, C101A51 and Tetep, respectively,

into PRR78, an elite Basmati restorer line of rice hybrid, Pusa RH10. Cao et al. (2014)

demonstrated the first practical use of chromosome segment introgression lines (CSILs) for

the transfer of fiber quality QTLs into upland cotton cultivars using SSR markers without

detrimentally affecting desirable agronomic characteristics. Tyagi et al. (2014) developed

high-yielding wheat variety with better nutritional quality and resistance to all major diseases.

For this purpose a popular elite wheat cultivar PBW343 were pyramided eight QTLs /gene

for four grain quality traits (high grain weight, high grain protein content, pre-harvest

sprouting tolerance and desirable high-molecular-weight glutenin subunits) and resistance

against the three rusts. Hao et al. (2014) transferred a major QTL for oil content using

markers-assisted backcrossing into an elite hybrid to increase the oil content in maize.

33

(C) Association mapping

Association mapping which refers to significant association of a molecular marker with a

phenotypic trait is especially useful for implementing marker-assisted selection for

quantitative traits in plant breeding programs (Breseghello and Sorrells 2006b). QTL

mapping usually uses a population from a bi-parental cross, while association mapping uses a

collection of individuals often with varying ancestry. In recent years, genetic maps have been

prepared in several plant species including rice, wheat, barley, cotton, ryegrass, white clover,

raspberry, potato, sorghum, etc. Once mapped, microsatellite markers could be employed in

tagging several individual traits that are particularly important for a breeding program.

Association mapping using SSR markers has been successfully conducted in many important

crop species as: Zhao et al. (2014) studied on genetic structure, linkage disequilibrium and

association mapping of Verticillium wilt resistance in elite cotton (Gossypium hirsutum L.)

germplasm population. Xiao et al. (2013) studied the associate markers with drought

tolerance at vegetative stage and examined the pattern of LD in a diversity and stress

adaptation rice panel containing 184 rice germplasm accessions with 141 polymorphic SSR

markers that were nearly evenly distributed at 3 mb bin on the 12 rice chromosomes. Hu et al.

(2014) identified yield-enhancing QTL and conducted association mapping with 85 SSR

markers in wild soybeans accessions (Glycine soja). Laido et al. (2014) studied on linkage

disequilibrium and genome-wide association mapping in tetraploid wheat (Triticum turgidum

ssp) using 26 SSR and 970 DArT markers.

2.8 Application of molecular markers in J. curcas

Different types of molecular markers such as RAPD, ISSR, AFLP, SSR and SNPs have been

used for various molecular and genetic studies in J. curcas. The detailed survey of related

literatures on molecular marker applications in J. curcas is presented here under different

sub-heads.

2.8.1 Molecular marker based genetic diversity analysis in J. curcas

Conventional methods have shown that morphological characteristics are useful to establish

phylogenetic relationship at the generic level, but are insufficient to define genetic diversity

and relationships among accessions of J. curcas due to the strong influence of the

environment on traits like seed weight, seed protein and oil content (Medina et al. 2011). It is

therefore clear that evaluation of genetic variation is more feasible and reliable using neutral

34

molecular markers (Basha and Sujatha 2007). A number of molecular markers have been

used for the genetic diversity analysis in J. curcas as follows:

2.8.1.1 RAPD based genetic diversity analysis

The earliest report of application of RAPD markers to investigate the genetic similarity

between toxic Indian accessions and non-toxic Mexican accessions was made by Sujatha et

al. (2005). They had shown similarity index of 96.3% and developed unique fingerprints for

identifying the Indian toxic accessions and the Mexican non-toxic accessions.

Basha and Sujatha (2007) evaluated the genetic diversity among 42 J. curcas

germplasm collected from various geographic locations of India along with a non-toxic

genotype from Mexico using 400 RAPD and 100 ISSR markers and reported 64%

polymorphism.

Ranade et al. (2008) studied genetic diversity among 18 accessions of J. curcas using

RAPD and DAMD markers alongwith two other species J. gossypifolia L. and Phyllanthus

emblica L. (Family Euphorbiaceae). The UPGMA tree clearly shows the accessions collected

from NBRI, Bhubaneswar, North-East, Lalkuan and outgroup accessions are all separated

from each other. The North East accessions are most dissimilar relative to the NBRI and

Bhubaneswar accessions.

Ganesh et al. (2008) assessed genetic diversity using 26 RAPD primers with 5 J.

curcas accession from Tamil Nadu and seven Jatropha species native to India. They found

high genetic variability among the 8 species (80.2% polymorphic). The Jaccard’s coefficient

of similarity ranged from 0.00 to 0.85 which showed a high level of genetic variation among

the genotypes. The UPGMA cluster analysis indicated three distinct clusters, one comprising

all accessions of J.curcas while second included 6 species viz., J. ramanadensis, J.

gossypifolia, J. podagrica, J. tanjorensis and J. integerrima. However, J. glandulifera

remained distinct and formed third cluster indicating its higher genetic distinctness from other

species.

Kumar et al. (2009) differentiated 26 accessions of J. curcas using 55 RAPD markers.

Out of 55 primers, 26 primers produced good amplification products. The 5 primers

exhibited the highest level of variability and the percentage of polymorphic bands was 42.94

to 62.89. The polymorphic primers were selected to generate dendrogram UPGMA based for

genetic relationship among the accessions of J. curcas. Finally, it was concluded that RAPD

markers could be used for estimation of genetic relationship, which will be helpful in the

characterization of J. curcas germplasm.

35

Jubera et al. (2009) deployed 5 RAPD markers to assess the genetic diversity among 7

J. curcas (L.) genotypes. Four RAPD primers revealed polymorphism among the genotypes.

The genetic distance (%) based on Jaccard’s similarity co-efficient ranged from 81.8-100,

indicating narrow genetic variability among the genotypes based on RAPD markers.

Pamidimarri et al. (2009) characterized the toxic and non-toxic varieties at molecular

level and to develop PCR based markers for distinguishing non-toxic from toxic or vice

versa. In total 371 RAPD, 1,442 AFLP markers were analyzed and 56 (15%) RAPD, 238

(16.49%) AFLP markers were found specific to either of the varieties. Genetic similarity

between non-toxic and toxic varieties was found to be 0.92% by RAPD and 0.90% by AFLP

fingerprinting.

Ambrosi et al. (2010) analyzed 27 accessions of J. curcas from different geographic

locations in the world using flow cytometric seed screening (FCSS) and 10 RAPD markers to

study the genetic diversity and reproductive strategy. The study reiterated the fact that the

germplasm from different geographical regions had limited genetic variation with the

exception of accessions from Mexico.

Ikbal et al. (2010) assessed the genetic diversity among 40 accessions of J. curcas L.

collected from different eco-geographical regions of India using 50 RAPD primers. Cluster

analysis based on Jaccard’s similarity coefficient using UPGMA grouped all the 40

genotypes into two major groups at a similarity coefficient of 0.54. The similarity indices

ranged from 0.44 to 0.92 with an average of 0.73, indicating a moderate to high genetic

variability among the genotypes.

Subramanyam et al. (2010) analyzed the genetic diversity among 10 accessions of J.

curcas collected from different eco-climatic zones of India using 43 RAPD markers. Out of

43 primers, 10 primers were found to be polymorphic and showed reproducibility. The

average number of polymorphic bands per primers was 7.6 and the percentage of

polymorphism ranged from 41.66 to 92.30 which indicated the variable potentiality of the

primers in resolving the variation in genotypes. The visualization of genetic relatedness

among the J. curcas genotypes revealed a wider genetic base.

Rosado et al. (2010) studied diversity among 192 germplasm of J. curcas collected

from Brazil. A total of 96 RAPD primers, 48 of which were selected on previous studies that

reported primers amplifying a large number of polymorphic markers for J. curcas and 6 SSR

markers used for diversity analysis. Only 23 of the RAPD and one microsatellite were

polymorphic. Surprisingly, all the accessions were homozygous at all but one microsatellite,

in contrast with the outcrossing mating system reported for the species, suggesting that J.

curcas not only supports selfing but possibly breeds by geitonogamy.

36

Machua et al. (2011) analyzed diversity among 160 individuals of J. curcas randomly

selected from eight populations including three populations from the Coastal region (Likoni,

Kilifi and Kwale), three in South Rift (Ngurumani, Kajiado and Namanga) and two in Eastern

Kenya (Kibwezi and Kitui) using 40 RAPD primers. Only 10 RAPD produced unambiguous

polymorphic and reproducible fragments. Analysis of molecular variance (AMOVA) showed

that, more variation (53%; P=0.01) was partitioned among population while 47% (P=0.01)

variation was partitioned within the population.

Rafii et al. (2012) used 8 RAPD primers for genetic diversity analysis of 48

accessions of J. curcas collected from different locations of the Selangor, Terengganu and

Kelantan states of Malaysia. The polymorphism percentages of J. curcas accessions for

Selangor, Kelantan and Terengganu states were 80.4, 50.0, and 58.7% respectively, with an

average of 63.04%.

Khurana-Kaul et al. (2012) assessed the genetic relationship of 29 J. curcas accession

using 52 RAPD and 26 ISSR markers. Out of 52 RAPD primers, 47 showed polymorphic

banding patterns which produced 552 bands that could be scored and of which 334 were

polymorphic with an average of 7.1 polymorphic fragments per primer. The results indicated

the modest level of genetic diversity in the J. curcas accessions.

Gopale and Zunjarrao (2013) used 10 RAPD primers to evaluate the genetic diversity

in 20 accessions of J. curcas L. collected from four agro-climatic regions of Maharashtra,

India. Ten selected markers produced 125 bands, of which 94 (75.2%) were polymorphic.

Cluster analysis based on Jaccard’s similarity coefficient using UPGMA grouped all the 20

accessions in two major groups at similarity coefficient of 0.54. The similarity indices from

the 20 accessions ranged from 0.14 to 0.98 with an average of 0.63, indicating moderate to

high genetic variability among the accessions.

Kumar et al. (2013b) assessed the genetic diversity in 20 J. curcas genotypes using 10

RAPD primers. Out of 10 RAPD primers, only 6 primers showed amplification in all the 20

genotyps. A total of 47 sharp and reproducible bands were obtained, out of which 44 bands

were polymorphic resulting in 93.61% polymorphism among the genotypes. Genetic

similarity matrices of the genotypes ranged from 0.88 to 0.93, indicating a moderate genetic

variability among the genotypes.

Alves et al. (2013) assessed the genetic variability of the Brazilian physic nut

germplasm (117 accessions) using a combination of phenotypic and molecular markers

(RAPD and SSR). They reported that molecular markers did not adequately sample the

genomic regions that were relevant for phenotypic differentiation of the accessions. The

diversity varied from 0 to 1.29 among the 117 accessions, with an average dissimilarity

37

among genotypes of 0.51. Joint analysis of phenotypic and molecular diversity indicated that

the genetic diversity of the physic nut germplasm was 156% and 64% higher than the

diversity estimated from phenotypic and molecular data, respectively.

Murty et al. (2013) assessed genetic diversity using RAPD (100 primers), ISSR (11

primers) and DAMD (4 primers) markers over 15 Jatropha accession and 4 different species.

Highest polymorphism (96.67%) was recorded by RAPD followed by DAMD (91.02%) and

ISSR (90%). All the markers proved J. gossypiifolia as one of the parents of J. tanjorensis.

Dhillon et al. (2014) assessed the genetic variability in J. curcas induced by different

doses of gamma- rays using 47 RAPD primers. Molecular characterization of induced

mutants (M1 generation) with 47 RAPD primers showed 65.27% polymorphism. The

variability created by gamma rays ranged from 9 to 28%. The gamma- rays exposure changed

the patterns of RAPD in comparison with control and hence can be adopted in mutation

breeding of J. curcas.

2.8.1.2 AFLP based genetic diversity analysis

Tatikonda et al. (2009) studied genetic diversity among 48 accessions collected from

6 different states of India using 7 AFLP primer combinations. A total of 770 fragments were

produced with an average of 110 fragments per primer combination. A total of 680 (88%)

fragments showed polymorphism in the germplasm analyzed, of which 59 (8.7%) fragments

were unique (accession specific) and 108 (15.9%) fragments were rare (present in less than

10% accessions). They reported a moderate to high level of genetic variability among the

accessions surveyed and also the clustering was in accordance with the geographical

distribution.

Shen et al. (2010) studied 38 populations of J. curcas L. collected from China using 9

AFLP primer combinations. These AFLP primers generated a total of 246 fragments and of

which 72 (26.99%) were polymorphic. The Jaccard’s similarity coefficient showed a high

similarity range from 0.866 to 0.977, suggesting a low genetic diversity among the 38

populations.

Santos et al. (2010) studied genetic relationship between 12 accessions of J. curcas

using 17 AFLP primer combinations which generated 164 bands. An UPGMA dendrogram

constructed, based on genetic distances estimated by Jaccard’s similarity coefficient showed a

cophenetic value of 0.91. Groups of plants were observed in 6 of the 12 accessions studied

with similarity of over 30%, indicating high genetic variability. The variation among

accessions was estimated to be 0.275, also indicating high variability.

38

Pecina-Quintero et al. (2011) used 6 AFLP primer pairs for genetic diversity analysis

in 88 accessions of J. curcas collected from the state of Chiapas, Mexico and reported 79% of

the genetic variation within the populations, 16% among populations within regions and 5%

among regions. The AFLP analysis detected high levels of polymorphism (90%) among 88

accessions of J. curcas from the state of Chiapas, Mexico and allowed for discrimination

between accessions.

Ovando-Medina et al. (2011) assessed genetic diversity using AFLP primers in 134

genotypes of J. curcas collected from Chiapas, Mexico. Out of 209 AFLP markers, 152

useful markers were obtained for diversity analysis. The polymorphism rates were found to

range between 71.7% and 92.1%, while the average polymorphism was 81.1%, the effective

number of alleles (Ne) was between 1.181 and 1.398 with an average of 1.303, the Shannon

diversity index (I) ranged from 0.202 to 0.378, the average was 0.306, the genetic diversity of

Nei (He) ranged from 0.121 to 0.245 with an average of 0.192. These results revealed high

genetic diversity in the population studied.

Shen et al. (2012) characterized the genetic variation among 63 populations of J.

curcas from 10 countries in Asia, Africa and Mexico grown in the provenance trials in China

and Vietnam using AFLP markers. Four primer combinations were used to generate a total of

89 bands of which 87 were polymorphic. The total genetic diversity (Ht) was low (0.15) and

genetic diversity within populations (Hs) was 0.07. AMOVA indicated that 69% of the total

variation corresponded to variation within populations and 31% to variation among

populations. The polymorphic loci (Np=16), the percentage of polymorphic loci (Pp=18%),

Nei’s diversity index (H=0.07) and Shannon’s information index (I=0.10) all indicated low

genetic diversity among populations.

Mastan et al. (2012) assessed genetic diversity in 15 selected germplasm of J. curcas

using 180 RAPD, 21 AFLP primer combinations and 19 SSR markers. The percentage

polymorphism among the selected germplasm using RAPD, AFLP and SSR was found to be

56.43, 57.9 and 36.84, respectively. The variation of diversity among the germplasm using 3

markers in the present study may be due to the codominant nature of SSR and dominant

nature of RAPD and AFLP markers. As the AFLP provides more amplified fragments than

RAPD followed by SSRs, it shows highest polymorphism when compared with other.

Sinha and Tripathi (2013) used 4 AFLP primer combinations for genetic diversity

study in 6 Jatropha species namely J. curcas, J. integerrima, J. gossypifolia, J. glandulifera,

J. Podagrica and J. dioca. Four AFLP primer combinations produced a total of 178

fragments with an average of 44.5 fragments per primer combination. A total of 167

(93.80%) fragments showed polymorphism in the germplasm analyzed. The UPGMA cluster

39

analysis revealed the level of similarity among 6 Jatropha species and maximum similarity

was observed between J. curcas and J. dioca which was 59%, J. integerrima (53%) was

found next to it in similarity matrix generated. J. glandulifera showed highest dissimilarity

with other Jatropha species.

Osorio et al. (2014) assessed the genetic variation of 182 J. curcas accessions

collected from Central America (47 accessions), South America (9 accessions), Africa (35

accessions) and Asia (91 accessions) with 29 SSR, 13 TRAP and 20 AFLP primers. Out of 29

SSR, 13 TRAP, 20 AFLP, 14, 13 and 2 markers yielding polymorphic patterns respectively.

The Jaccard’s similarity, cluster analysis by UPGMA and PCA (principle component

analysis) indicated higher variability in Central America accessions compared to Asian,

African and South America accessions.

Pamidimarri and Reddy (2014) assessed the genetic diversity among 63 germplasm

collected from India, Madagascar Spain, Mexico, Cape Verde using RAPD, AFLP and

nrDNA-ITS. Using 18 combinations of AFLP selective primers, 911 markers were generated,

out of which 667 markers were resulted to be polymorphic. A total of 180 RAPD primers

were screened and 52 primers responded with more than 6 bands were considered to study

the genetic polymorphism. The germplasm of J. curcas taken for RAPD and AFLP analysis

were also used for nrDNA-ITS sequence analysis. The study was focused to understand the

molecular diversity at reported probable center of origin (Mexico) and to reveal the dispersal

route to other regions. They found that the overall genetic diversity of J. curcas was narrow

and the highest genetic diversity was observed in the germplasm collected from Mexico.

Least genetic diversity found in the Indian germplasm and clustering results revealed that the

species was introduced simultaneously by two distinct germplasm and subsequently

distributed in different parts of India.

2.8.1.3 ISSR based genetic diversity analysis

He et al. (2007) assessed genetic diversity of 9 populations from 5 provinces of China

using 10 ISSR markers. All the primers generated highly reproducible and stable DNA

fragments. The polymorphic loci (PPB=97.04%) and POPGENE analysis result (H=0.2357,

I=0.3760) suggested a high level of genetic variation of J. curcas among the different

populations.

Vijayanand et al. (2009) assessed genetic diversity of among 12 Jatropha accessions

collected from different geographical areas of India using 19 morphological traits and 21

ISSR primers. Five accessions of J. curcas were collected from Coimbatore (Tamil Nadu), 6

40

Jatropha species i.e. J. ramanadensis, J. villosa, J. glandulifera, J. gossypifolia, J. podagrica,

J. tanjorensis were collected from Tamil Nadu and one species J. integerrima was collected

from Andhra Pradesh. The twenty-one ISSR primers generated 156 polymorphic alleles with

an average 7.47 per primer. Maximum diversity was noticed between J. villosa and J.

integerrima.

Cai et al. (2010) collected a set of 224 accessions including 219 from all the

adaptation areas in China and 5 from Myanmar to assess genetic diversity using 100 ISSR

primers. Out of 100 ISSR primers, 15, that had good reproducibility were selected to

genotype the population. Among the 169 amplified bands, 127 (75.15%) were polymorphic

which showed that Chinese Jatropha had high genetic diversity.

Grativol et al. (2011) assessed genetic diversity in 332 accessions of J. curcas

collected from eight states of Brazil using 7 ISSR primers. Seven ISSR primers amplified 104

loci with a total of 21,253 bands in all accessions, of which 19,472 bands (91%) showed

polymorphism. Polymorphic information content (PIC), marker index (MI) and resolving

power (RP) averaged 0.26, 17.86 and 19.87 per primers, respectively, showing the high

efficiency and reliability of the markers used. The UPGMA-phenogram and

multidimensional scaling MDS showed that Brazilian accessions are closely related but have

a higher level of genetic diversity than accessions from other countries.

Tanya et al. (2011) studied 39 Jatropha plants that included 30 accessions of J. curcas

from China, Mexico, Thailand and Vietnam, two J. gossypifolia (Bellyache bush), two J.

integerrima (Spicy Jatropha), J. podagrica (Bottle plant shrub) and 3 R. communis (castor

bean) using 86 ISSR markers. Genetic relationships were evaluated using 27 of 86 ISSR

markers, yielding 307 polymorphic bands with polymorphism contents ranging from 0.76 to

0.95. Dice’s genetic similarity coefficient ranged from 0.39 to 0.99, which clearly separated

the plant sample into seven groups at the coefficient of 0.48. The first group comprised

J.curcas from Mexico, the second group comprised J. curcas from China and Vietnam, the

third group comprised J. curcas from Thailand, the fourth group was J. integerrima, the fifth

group was J. gossypifolia, the sixth group was J. podagrica and the last and most distinct

group was Ricinus communis.

Sunil et al. (2011) assessed the genetic diversity in 34 accessions of J. curcas

collected from the diverse geographical regions of India using 56 RAPD and 40 ISSR

markers. All the tested RAPD primers amplified across the 34 genotypes. The total number of

polymorphic markers was 543 with a mean of 9.69 per primer and the percentage of

polymorphism was 87.3. The 40 ISSR primers produced a total of 494 scorable markers

among the genotypes. The total number of polymorphic markers and the percentage of

41

polymorphism were 325 and 68.5, respectively. The dendrogram based on both marker

systems separated the genotypes into 7 clusters, with similarity coefficient ranging from 0.50

to 0.92. The molecular markers (RAPD and ISSR) yielded 868 polymorphic markers that

discriminate the 34 genotypes into 7 and 8 clusters, respectively.

Arolu et al. (2012) characterized 48 accessions of J. curcas L. collected from 3 states

(Kelantan, Selangor and Terengganu) in Peninsular Malaysia using 10 ISSR markers. The 48

J. curcas accessions were grouped into 3 different populations based on state from where

they were collected. The percentage of polymorphism in these 3 populations ranged from

90.75% (Terengganu) to 100% (Kelantan). Analysis of molecular variance (AMOVA)

showed that 94% of the total variation was observed within the populations while variation

among the populations accounted for the remaining 6%.

Camellia et al. (2012) assessed the genetic diversity in the 16 accessions of J. curcas

and Ricinus sp. of Malaysia using 8 ISSR primers. From the 8 ISSR primers used, the number

of amplicons per primers varied from 2 to 14 and amplicons size from 151 bp to 2779 bp.

From the total of 63 bands, 25 (40%) were polymorphic with an average of 4.16 polymorphic

bands per primer. Jaccard’s coefficient of similarity varied from 0.72 to 1, indicating low

level of genetic variation among the genotypes.

Biabani et al. (2013) studied genetic diversity on 114 accessions of Jatropha collected

from 6 populations from 4 countries (Malaysia, Philippines, India and Indonesia) using 35

ISSR primers. Ten ISSR primer combinations were generated a total of 143 polymorphic

fragments varied from 4 to 27. The percentage of polymorphic bands for the Indonesia1,

Indonesia2, Malaysia1, Malaysia2, Philippines and India, Jatropha populations were 54.6,

59.4, 46.2, 53.2, 60.8 and 56.4%, respectively with an average of 55.1%. An UPGMA

dendrogram was constructed and the Jatropha populations were grouped into four major

clusters at a coefficient level of 0.28. The genetic similarities between the populations ranged

from 0.31% to 0.25%.

Alkimim et al. (2013) assessed the genetic diversity among the 46 accessions of

Jatropha from Brazil using 69 RAPD and 37 ISSR markers. The genetic distance between

accessions ranged from 0.13 to 0.76, with an average genetic distance of 0.21. A

dendrogram generated by UPGMA presented that only two phylogenetic groups, one of

which contained only 3 individuals, the remaining group included 95.6% of the analyzed

genotypes which showed low genetic diversity.

42

2.8.1.4 SSR based genetic diversity analysis

Wen et al. (2010) selected 419 EST-SSR and 182 genomic SSRs that had been

developed for the Manihot esculenta (Cassava) to investigate the transferability to J. curcas.

In total, 187, out of 419 EST-SSR and 54 out of 182 genomic SSR from Cassava were found

polymorphic among the 5 J. curcas accessions. Thirty-six EST-SSRs and 20 genomic SSRs

were chosen to analyze the genetic diversity of 45 accessions of J. curcas and reported a

total of 216 alleles and 183 (84.72%) of them were polymorphic. On the basis of the

distribution of these polymorphic alleles, the 45 accessions were classified into 6 groups.

The estimated mean genetic diversity index was 0.55 showing that J. curcas germplasm

collection has high level of genetic diversity.

Na-ek et al. (2011) assessed genetic diversity of 32 J. curcas accessions, including 6

from Thailand, 9 non-toxic accessions from Mexico, 12 gamma radiation-treated seeds from

Mukdahan province and 1 accession each from Myanmar, Cambodia, India, Laos and China

using 10 SSR markers. Low level of average genetic diversity were observed (He=0.160).

The analysis of molecular variance (AMOVA) showed higher variability (63.753%) among

groups than within groups (36.247%).

Yadav et al. (2011) deployed 50 EST-SSRs over 25 J.curcas accessions collected

from different geographical regions of India for genetic diversity analysis. Out of 50 EST-

SSR markers 21 SSR markers were polymorphic and used to determine genetic relationships

among 25 J. curcas accessions collected from different geographical regions of India.

Twenty-one SSR markers were polymorphic and with allele variation from two to four. The

Polymorphic information content (PIC) value ranged between 0.04 to 0.61 with an average

of 0.25±0.16, indicating low to moderate level of informativeness within these EST-SSRs.

Sato et al. (2011) carried out genome sequencing of J. curcas and identified large

number of SSRs. They selected 100 genome-derived microsatellites and assessed genetic

diversity analysis with 12 Jatropha accessions collected from the various parts of the world.

A total of 88 markers generated specific amplicons, whereas the other 8 and 4 markers

showed no amplification and non-specific amplification, respectively. The number of alleles

per locus ranged from 1 to 4 with a mean value of 1.31. The large number of markers

detecting no polymorphism and the low mean value of PIC (0 to 0.45) indicated that genetic

diversity in Jatropha lines is generally narrow.

Bressan et al. (2012) developed 40 SSR and validated in 41 accessions of J. curcas

collected from 6 populations from Brazil, Mexico and Colombia. Nine loci were polymorphic

revealing from 2 to 8 alleles per locus and 6 primers were able to amplify alleles in the

43

congeners J. podagrica, J. pohliana and J. gossypifolia. The primers developed that revealed

polymorphic loci are suitable for genetic diversity and structure, mating system and gene

flow studies in J. curcas and some congeners.

Ricci et al. (2012) collected 64 Jatropha germplasm from 7 geographical locations on

2 continents and analyzed with 32 SSRs and two candidate gene-specific primers (ISPJ-1

gene and Curcin-P2 gene promoter). In general, markers were found to be highly conserved

and many (40%) were monomorphic. The polymorphic information content of the

polymorphic markers ranged from 0.03 to 0.47. The genetic similarity analysis identified 2

distinct groups at 0.73 DICE similarity coefficient. Group I included germplasm collected

from the islands of Cuba and Cape Verde and group II consisted of Brazil, Mozambique and

Senegal populations. They developed several population-specific microsatellite markers

(JC03, JC05 and JC09) and a single base substitution at Jcps9 locus that clearly separated the

island and mainland populations.

Vischi et al. (2013) studied genetic diversity between 29 toxic and non-toxic

accessions of Jatropha collected from America (North, central, South) and Mexico using 40

SSR markers. The genetic study pointed out a high degree of similarity both within and

among the non- Mexican accessions. The Mexican accessions proved to be non-toxic and

genetically differentiated forming a well separated cluster from out of Mexico accessions.

Yue et al. (2013) assessed genetic variation using 29 microsatellites located on 11

linkage groups in 276 accessions of J. curcas collected from 9 locations in 5 countries in

South- America, Asia and Africa. They did not detect any genetic diversity at all 29

microsatellites loci. All the 276 accessions were homozygous at all loci and shared the same

genotype at each locus, suggesting no microsatellite variation in the genome of Jatropha

curcas.

Kumari et al. (2013) used 32 EST-SSR markers to analyze the genetic diversity

among 42 accessions collected from different parts of India. Out of 32 EST-SSR primers, 24

primer pairs exhibited polymorphism among the genotypes with varying amplicons from 1 to

8 with an average of 2.33 alleles per polymorphic marker. The polymorphic information

content value ranged from 0.02 to 0.5 with an average of 0.402 indicating moderate level of

informativeness within these EST-SSR markers.

Jain et al. (2014) developed in silico microsatellites (SSR) using J. curcas ESTs from

various tissues viz. embryo, root, leaf and seed available in the public domain of NCBI to

assess genetic diversity and identification of markers for the selection of high oil yielding

clones in Jatropha. A total of 13,513 ESTs were downloaded and from these ESTs, 7552

44

unigenes were obtained and 395 SSRs were generated from 377 SSR-ESTs. In this study they

reported low level of polymorphism in Jatropha.

Ouattara et al. (2014) assessed genetic diversity of 103 accessions including 82

accessions from different agro- ecological zones in Senegal and 21 exotic accessions through

33 microsatellite markers. They found low level of genetic variation because introduction of

J. curcas in Senegal seems to have been done from or few origins and the species have not

retained genetic diversity since then, due to vegetative propagation.

2.8.1.5 SNP based genetic diversity analysis

Gupta et al. (2012a) discovered 2,482 informative single nucleotide polymorphisms

(SNPs) and did genotyping of selected SNPs among 148 accessions of J. curcas for diversity

analysis. The 148 genotypes of J. curcas were collected from different crop-growing regions

of India, North America, South America and Africa. The diversity analysis revealed that a

narrow level of genetic diversity existed among the indigenous genotypes as compared to the

exotic genotypes of J. curcas.

Montes et al. (2014) assessed the genetic structure and diversity in Jatropha

germplasm with 54 SSR and 120 SNP markers in a diverse, worldwide, germplasm panel of

70 accessions. They reported a high level of homozygosis in the germplasm that does not

correspond to the purely out-crossing mating system assumed to be present in Jatropha. They

hypothesize that the prevalent mating system of Jatropha comprise a high level of self-

fertilization and that the outcrossing rate is low. Genetic diversity in accessions from Central

America and Mexico was higher than in accession from Africa, Asia, and South America.

They also identified markers associated with the presence of phorbol esters.

2.8.2 Linkage and QTL mapping in J. curcas

Wang et al. (2011) have constructed first generation linkage map of J. curcas with

216 SSRs and 290 SNPs using backcross mapping population derived from J. curcas and J.

integerrima. They established first generation linkage map using a mapping panel containing

two backcross populations with 93 progeny and mapped 506 markers (216 SSR and 290 SNP

from ESTs) onto 11 linkage groups. The total length of the map was 1440.9 cM with an

average markers space of 2.8 cM.

Liu et al. (2011) identified 18 QTL underlying the oil traits and 3 eQTLs of the

oleosin genes using backcrossing population (286) derived from crosses with J. curcas and J.

integerrima through SNP markers. The QTLs and eQTLs, especially qC18:1-1, qOil-4 and

45

qOlelll-5 with contribution rates (R2) higher than 10%, controlling oleic acid, total oil content

and oleosin gene expression respectively.

Sun et al. (2012) developed linkage map and reported QTLs association with growth

and seed in J. curcas using 105 SSR markers on 11 linkage groups. They identified a total pf

28 QTL for 11 growth and seed traits using a population of 296 back crossing Jatropha trees.

Two QTLs qTSW-5 and qTSW-7 controlling seed yield were mapped on LGs 5 and 7

respectively. These two QTL clusters were critical with pleotropic roles in regulating plant

growth and seed yield.

King et al. (2013) developed linkage map from 4 F2 mapping population of J. curcas

that reveals a locus controlling the biosynthesis of phorbol esters which cause seed toxicity

and restricted for animal feed. The consensus linkage map contains 502 co-dominant

markers, distributed over 11 linkage groups, with a mean markers density of 1.8 cM per

unique locus. They observed segregation ratio of 3:1 within seeds collected from F2 plants

and QTL analysis revealed that a locus on linkage group 8 was responsible for phorbol ester

biosynthesis. The identification of the locus responsible for PE biosynthesis will be useful to

develop new non-toxic varieties.

2.8.3 Morphological traits based variability and genetic diversity analysis

Characterization of genetic material based on morphometric traits is an important step

towards the genetic improvement of J. curcas. Such studies help in assessment of the genetic

variability, identification of diverse parental line, interrogation of desirable trait etc. Some

preliminary genetic diversity studies based on quantitative traits are summarized as under:

Ginwal et al. (2005) reported preliminary quantitative genetic variations in seed

morphology, germination and seedling growth among 10 accessions collected from Madhya

Pradesh, India. The phenotypic and genotypic variance, their coefficient of variability and

broad sense heritability also showed a sizeable variability. The high percentage of heritability

coupled with moderate intensity of genetic gain was reported for seed germination traits,

which signifies that germination is under strong genetic control and good amount of heritable

additive genetic component can be exploited for improvement of Jatropha.

Kaushik et al. (2007) evaluated 24 accessions of J. curcas collected from different

agro-climatic zones of Haryana, India for seed oil content variations and divergence. They

found that the phenotypic coefficient of variation was higher than the genotypic coefficient of

variation indicating the predominant role of environment. High heritability and genetic gain

46

were recorded for oil content (99% and 18.90%) and seed weight (96% and 18%),

respectively, indicating the additive gene action for these traits.

Rao et al. (2008) collected 32 wild accessions of J. curcas from Andhra Pradesh,

India and evaluated for genetic association, variability and diversity in seed traits, growth,

reproductive and yield traits. Broad sense heritability was high in general and exceeded 80%

for all the seed traits studied.

Sunil et al. (2009) studied on J. curcas for its distribution and diversity in South-East

coastal zone of India using DIVA-GIS. The analysis for richness using rarefaction method of

DIVA-GIS showed that Ranga Reddy district of Andhra Pradesh is the potential area for

germplasm with high oil content. The study revealed that diverse germplasm accessions of J.

curcas are distributed all over the South-East coastal zone and enabled them to find out gaps

in collection and diversity richness.

Das et al. (2010) collected 16 Jatropha genotypes from four different regions of India

and evaluated for 12 morphological characters. The genotypes showed significant differences

in most of the component traits and seed yield, primary branches/plant, fruits/branch and

seed/fruit. The phenotypic coefficient of variation (PCV) and genotypic coefficient of

variation (GCV) estimates were high for seed yield/plant followed by flowering

bunches/plant, fruits/plant and secondary branches/plant. Heritability was high (>80%) for

plant height, fruits/plant and 100 seed weight. Genetic advance (GA) was high (>50%) for

seed yield/plant, fruits/plant and flowering bunches/plant. Moderate to high heritability

accompanied with high genetic advance for seed yield/plant, fruits/plant and flowering

bunches/plant indicated additive gene action and selection for these characters would be

effective.

Mohapatra and Panda (2010) studied variability on growth, phenology and seed

characteristics among 20 randomly selected J. curcas collected from different agro-climatic

zones of India in a progeny trial under tropical monsoon climatic conditions of Bhubaneswar,

India. The correlation studies revealed that length and number of branches were positively

correlated with number of inflorescence and number of fruits per plant. A positive correlation

were observed between fruit diameter and oil content and also, between seed length and test

weight. On the basis of non-hierarchial Euclidian cluster analysis, 20 accessions were

grouped into 6 clusters. The maximum inter-cluster distance (7.195) between cluster III and

VI followed by cluster III and IV (7.074) indicates wider genetic diversity between the trees

in these groups.

Ghosh and Singh (2011) assessed the variation in seed and seedling characters of J.

curcas collected from 6 zones (geographical regions) within India and 4-6 provenances

47

within each zone. They found significant variation among zones and among different

provenances within zones, for all traits of seed and seedlings of J. curcas. This study has

implications for identifying potential seed sources of J. curcas for exploiting for higher oil

content.

Zapico et al. (2011) assessed intraspecific variation/interrelationships and to

determine the association of geographical distribution and phenotypic diversity in the J.

curcas accessions. Ex situ morphological characterization of 13 J. curcas genotype from

various sources was undertaken using 21 quantitative traits. The PCA analysis reduced the

collected data to 5 principal components that cumulatively explained 88.81% of total

variance. The 2 clustering mechanism, UPGMA and UPGMC, divided the provenances into

two major groups and revealed the divergence of Tubao- Philbio from the provenances.

Gairola et al. (2011) assessed the variability in seed characteristics of J. curcas L.

from hill region of Uttarakhand, India. They found significant differences among the seed

sources for all the parameters studied. The fruit weight ranged from 2.21 to 3.41g, seed

weight from 0.58 to 0.81g, seed length from 1.49 to 1.81 cm, seed circumference from 2.97

to 3.51 cm, seed moisture content from 6.20 to 10.44% and seed germination from 53.33 to

79.83%. In case of genetic component analysis, heritability value was found high for seed

germination percentage (85.34%) coupled with moderate genetic gain which signifies that

seed germination is under genetic control and can be used for improvement of this species.

Shabanimofrad et al. (2011) collected 59 accessions of J. curcas from different

locations of Selangor, Kelantan and Terengganu states of Malaysia to assess the genetic

diversity using multivariate analysis and DIVA-geographical information system (GIS). The

6 quantitative characters- seed length, seed width, fruit length, fruit width, 100 seed weight

and oil content were recorded. Based on 6 quantitative characters, 59 accessions were

grouped into 3 clusters at a coefficient level of 3.7.

Biabani et al. (2012) assessed phenotypic and genotypic variation of Jatropha

population from Malaysia, India, Indonesia and the Philippines to select superior plants with

high seed and oil yields production for commercial planting and to study inter-populations

variation in morphological, seed and oil yield characteristics. Highly significant genotypic

differences were obtained among the Jatropha populations for various traits measured.

Wani et al. (2012b) evaluated genetic variability for morphological and qualitative

attributes among 7 J. curcas L. accessions grown under subtropical conditions of North India.

The seed yield/plant had a positive and significant correlation with number of branches/plant,

oil yield, plant spread (r=0.806, 0.802, 0.782), plant spread had a highest correlation with

48

plant height (r= 0.840). The oil content varied from 24.5% to 37.9%. The evaluation will be

helpful to identify cultivar with specific yield and vegetative growth features.

Ouattara et al. (2013) assessed distribution and variability in seed traits of J. curcas in

Senegal collected from different agro-ecological zones of the country. Among the seed traits

studied, 100 seed weight ranged from 63.68 to 77.83 g and seed length from 17.89 to 19.15

mm. Variability in seed traits was not linked to the geographical location.

Singh et al. (2013b) assessed genetic association, divergence and variability in 24

accessions of J. curcas for seed and oil yield and its contributing traits. Based on genetic

divergence D2 statistics, 24 accessions were grouped into 14 clusters. Phenotypic variances

were higher than genotypic variances for all characters. The results showing significant and

positive correlation with seed and oil content can be used for genetic improvement of seed

and oil yield.

Guan et al. (2013) studied the characterization of the seed morphology and genetic

diversity of J. curcas L. from 8 different provenances for providing support for the breeding

and allocation of seed. For the morphological characterization 5 traits were investigated,

including 100 seed weight, seed length, width, lateral diameter, seed length and width ratio.

The genetic diversity of 8 populations from different provenances was assessed using five

DALP primers. The 5 DALP primers generated a reproducible DNA fragment that is 219 of

244 loci and were polymorphic, i.e. PPB was 89.75%. The result showed that seed

morphology had significant variation among location and the 8 populations had high level of

genetic diversity and show apparent genetic differentiation.

Brasileiro et al. (2013) estimated genetic parameters, selection gain and genetic

diversity in physic nut of 20 half-sib progeny of 17 families obtained from the experimental

station of Empresa Baiana de desenvolvimento Agricol (EBDA), Brazil. In the analysis of

genetic diversity, genotypes were divided into 4 groups. The genotypes 18, 18, 20 and 8

clustered together and presented the highest means for the vegetative characters and

production and the lower means were observed in 17, 12, 13 and 9 genotypes from the same

group.

Chapter 3 Materials and Methods

49

3.1 Materials

3.1.1 Plant materials for SSR enriched libraries preparation and polymorphism

detection

The J. curcas accession NBJC132 was used to extract genomic DNA for the preparation of

SSR enriched libraries. The plant materials used to identify polymorphic SSRs include 7

accessions of J. curcas i.e. NBJC132, NBJC139, NBJC147, NBJC148, NBJC161, NBJC183,

NBJC195 (Table 3.1). In addition, one accession of J. integerimma was also included in the

screening panel. The first 5 accessions of J. curcas i.e. NBJC132, NBJC139, NBJC147,

NBJC148, NBJC161 were collected from different eco-geographical and agro-climatic zones

of India. These accessions were reported to be elite and diverse accessions of J. curcas from

India. The rest two accessions i.e. NBJC183 and NBJC195 were exotic collection and

procured from South Africa and Mexico respectively. The Mexican accession (NBJC195)

was reported to be comparatively more diverse and non toxic as it contains low level of

phorbol esters as compared to other accessions of J. curcas (Makkar et al. 1998, Basha et al.

2009). The accession of J. integerrima was collected from germplasm repository of Botanical

garden of CSIR-NBRI.

3.1.2 Plant material for SSR based genetic diversity analysis

The plant materials used to assess the magnitude of genetic diversity in the present

investigation include 96 accessions of J. curcas which comprise of 70 indigenous collections

from different states of India and 26 exotic collections. The details of these accessions are

presented in Table 3.1. The exotic collections include 3 non-toxic accessions from Mexico

(NBJC194, NBJC195 and NBJC196). These non-toxic accessions were reported to have low

level of phorbol esters (Makkar et al. 1998, Basha et al. 2009).

50

Table 3.1 Details of accessions of J. curcas used in microsatellite characterization and

diversity analysis

Sl.

No. Code* Collection site States/country

Toxicity Latitude Longitude

1 NBJC101 Udaipur Rajasthan, India Toxic 27° 42' N 75° 33' E

2 NBJC102 Chittorgarh Rajasthan, India Toxic 24° 54' N 74° 42' E

3 NBJC103 Dungarpur Rajasthan, India Toxic 23° 50' N 73° 50' E

4 NBJC104 Sirohi Rajasthan, India Toxic 24° 88' N 72° 87’ E

5 NBJC105 Nainital Uttaranchal, India Toxic 29º 23' N 79º 30' E

6 NBJC106 Pithoragarh Uttaranchal, India Toxic 29° 34' N 80° 13' E

7 NBJC107 Tehri Uttaranchal, India Toxic 30º 20' N 78º 53' E

8 NBJC108 Dehradun Uttaranchal, India Toxic 30º 19' N 78º 04' E

9 NBJC109 Ranchi Jharkhand, India Toxic 23º 23' N 85º 23' E

10 NBJC110 Latehar Jharkhand, India Toxic 23º 76' N 84º 6' E

11 NBJC111 Plamau Jharkhand, India Toxic 23º 52' N 84º 17' E

12 NBJC112 Garhwa Jharkhand, India Toxic 24º 10' N 83º 52' E

13 NBJC113 Begusarai Bihar, India Toxic 23º 75' N 84º 5' E

14 NBJC114 Patna Bihar, India Toxic 25º 37' N 85º 13'E

15 NBJC115 Bihar Sharif Bihar, India Toxic 25 º 11' N 85 º 31' E

16 NBJC116 Nawada Bihar, India Toxic 24º 53' N 85º 35' E

17 NBJC117 Raigarh Chhatisgarh, India Toxic 21º 9' N 83º 4' E

18 NBJC118 Bilashpur Chhatisgarh, India Toxic 22º 05' N 82º 13' E

19 NBJC119 Korba Chhatisgarh, India Toxic 22 º 20' N 82 º 42' E

20 NBJC120 Ambikapur Chhatisgarh, India Toxic 23º 10' N 83º 15' E

21 NBJC121 Kangra Himachal Pradesh, India Toxic 32° 05' E 76° 18' E

22 NBJC122 Hamirpur Himachal Pradesh, India Toxic 31° 68' E 76° 52' E

23 NBJC123 Mandi Himachal Pradesh, India Toxic 31° 43' N 76° 58' E

24 NBJC124 Shimla Himachal Pradesh, India Toxic 31° 06' N 77° 13' E

25 NBJC125 Yamunanagar Haryana, India Toxic 30° 6' N 77° 17' E

26 NBJC126 Hisar Haryana, India Toxic 29° 10' N 75° 46' E

27 NBJC127 Gurgaon Haryana, India Toxic 28° 37' N 77° 04' E

28 NBJC128 Yamunanagar Haryana, India Toxic 30° 6' N 77° 16' E

29 NBJC129 Allahabad Uttar Pradesh, India Toxic 25° 28' N 81° 54' E

30 NBJC130 Gorakhpur Uttar Pradesh, India Toxic 26° 45' N 83° 24' E

31 NBJC131 Mahoba Uttar Pradesh, India Toxic 25° 18' N 79° 55' E

32 NBJC132* Lucknow Uttar Pradesh, India Toxic 26° 55' N 80° 59' E

33 NBJC133 Dhar Madhya Pradesh, India Toxic 22° 35' N 75° 20' E

51

34 NBJC134 Indore Madhya Pradesh, India Toxic 22° 44' N 75° 50' E

35 NBJC135 Ujjain Madhya Pradesh, India Toxic 23° 09' N 75° 43' E

36 NBJC136 Satna Madhya Pradesh, India Toxic 24° 34' N 80° 55' E

37 NBJC137 Rangareddy Andhra Pradesh, India Toxic 17°23' N 77°50' E

38 NBJC138 Vishakhpatnam Andhra Pradesh, India Toxic 17 °42' N 83° 15' E

39 NBJC139* Rangareddy Andhra Pradesh, India Toxic 17°23' N 77°50' E

40 NBJC140 Mahboobnagar Andhra Pradesh, India Toxic 16°46' N 77° 56' E

41 NBJC141 Hoshiarpur Punjab, India Toxic 31° 32' N 75° 57' E

42 NBJC142 Gurdaspur Punjab, India Toxic 32° 03' N 75° 27' E

43 NBJC143 Gurdaspur Punjab, India Toxic 32° 03' N 75° 27' E

44 NBJC144 Bankura West Bengal, India Toxic 23° 24' N 87°07' E

45 NBJC145 Midnapur West Bengal, India Toxic 22° 33' N 87° 15' E

46 NBJC146 Purulia West Bengal, India Toxic 23° 34' N 86° 36' E

47 NBJC147* Bhavnagar Gujarat, India Toxic 21° 46' N 72° 11' E

48 NBJC148* Banaskantha Gujarat, India Toxic 24° 17' N 72°43'E

49 NBJC149 Panchmahal Gujarat, India Toxic 23°23'N 74°75'E

50 NBJC150 Junagarh Gujarat, India Toxic 21° 31' N 70° 36' E

51 NBJC151 Pulbhani Orissa, India Toxic 19° 08' N 76° 5' E

52 NBJC152 Ganjam Orissa, India Toxic 19° 22' N 85° 06' E

53 NBJC153 Balasur Orissa, India Toxic 23° 51' N 90° 21' E

54 NBJC154 Bhubaneswar Orissa, India Toxic 20° 15' N 85° 52' E

55 NBJC155 Golaghat Assam, India Toxic 26° 31'N 93° 58'E

56 NBJC156 Sonitpur Assam, India Toxic 26° 69'N 92° 33'E

57 NBJC157 Karbi angling Assam, India Toxic 26°11'N 93°34'E

58 NBJC158 Lakhimpur Assam, India Toxic 27 °14'N 94°15'E

59 NBJC159 Papumpare Arunachal Pradesh, India Toxic 27°10'N 93°42'E

60 NBJC160 Dibong Valley Arunachal Pradesh, India Toxic 28°25'N 95°52'E

61 NBJC161* Mon Nagaland, India Toxic 26 °75' N 95°1' E

62 NBJC162 Mokokchung Nagaland, India Toxic 26 °33' N 94°51' E

63 NBJC163 - Karnataka, India Toxic - -

64 NBJC164 - Karnataka, India Toxic - -

65 NBJC165 West Tripura Tripura, India Toxic 23° 49'N. 91° 27'E.

66 NBJC166 West Garo hills Meghalaya, India Toxic

67 NBJC167 Coimbatore Tamil Nadu, India Toxic 11° 00' N 77° 00' E

68 NBJC168 Coimbatore Tamil Nadu, India Toxic 11° 00' N 77° 00' E

69 NBJC169 Imphal Manipur, India Toxic 24° 44' N 93° 58' E

70 NBJC170 Palakkad Kerala, India Toxic 10° 78' N 76° 65' E

52

71 NBJC171 - Eastern China Toxic - -

72 NBJC172 - Southern Asia Toxic - -

73 NBJC173 Ghana Western Africa Toxic 8.00°N 1.07°W

74 NBJC174 Mozambique Eastern Africa Toxic 18.69°S 35.55°E

75 NBJC175 Brazil South America Toxic 8.46°S 51.33°W

76 NBJC176 Lola South America Toxic - -





81 NBJC181 Suriname South America Toxic 3.89°N 56.01°W

82 NBJC182 Mexico Central America Toxic 23.68°N 102.01°W

83 NBJC183* Senegal Western Africa Toxic 14.47°N 14.52°W

84 NBJC184 Guinea Eq Middle Africa Toxic 9.94°N 11.33°W

85 NBJC185 Togo Western Africa Toxic 8.57°N 0.82°E

86 NBJC186 Togo Western Africa Toxic 8.57°N 0.82°E

87 NBJC187 Ghana Western Africa Toxic 8.00°N 1.07°W

88 NBJC188 Madagascar Eastern Africa Toxic 18.92°S 46.44°E

89 NBJC189 Tanzania Eastern Africa Toxic 6.41°S 34.92°E

90 NBJC190 Zimbabwe Eastern Africa Toxic 18.92°S 29.15°E

91 NBJC191 Cameroon Middle Africa Toxic 8.70°N 11.25°E

92 NBJC192 Cambodia Southe-Estern Asia Toxic 13.7°N 104.94°E

93 NBJC193 Indonesia South-Eastern Asia Toxic 2.77°S 118.16°E

94 NBJC194 Mexico Central America Non-toxic 23.68°N 102.01°W

95 NBJC195* Mexico Central America Non-toxic 23.68°N 102.01°W

96 NBJC196 Mexico Central America Non-toxic 23.68°N 102.01°W

*Accessions of 7 DNA panel used for polymorphism detection

53

Figure 3.1 Map of India showing collection sites of J. curcas from different states of India

RJ

PB

HP

UK HR

UP

BH

JH WB

OR

CHH

AP

TN KL

MP GJ

ASM NL

ARP

KA

MNP

TP

MG

Abbreviations HP=Himachal Pradesh PB=Punjab UK=Uttarakhand HR=Haryana UP= Uttar

Pradesh BH=Bihar JH=Jharkhand WB=West Bengal OR=Orissa CHH=Chhatisgarh MP=Madhya Pradesh RJ=Rajasthan GJ=Gujarat AP=Andhra

Pradesh TN=Tamil Nadu KR=Kerala

KA=Karnataka TP=Tripura MNP=Manipur MG=Meghalay NL= Nagaland ASM=Assam ARP= Arunachal

Pradesh

N

54

3.1.3 Materials for heterozygosity assessment

For the assessment of heterozygosity level in J. curcas, one accessions i.e. NBJC147 was

selfed to check the cross pollination. The seeds thus obtained were used to raise seedlings in

the experimental field of CSIR- National Botanical Research Institute, Lucknow, India for

heterozygosity assessment. The seedlings were raised in polybags and DNA from fresh

leaves was extracted.

3.1.4 Interspecific hybrid plant material

In order to develop interspecific mapping population in J.curcas an interspecific hybrid

between J. curcas and J. integerrima was developed and maintained at CSIR-NBRI

experimental field. The J. curcas was used as female parent and J. integerrima used as male

parent to affect the crosses. About 200 hybrid seeds were first sown in small plastic pots with

a mixture of soil and manure in glass house. The seedlings were transferred into field after 1

month. Ninety four seedlings were established in the field which were used for detail

molecular characterization alongwith their parental lines.

3.1.5 Materials for morphological characterization

For morphometric evaluation, a total of 80 accessions of J. curcas representing different eco-

geographical and agro-climatic zones of India were selected from germplasm bank

maintained at Banthra Research Center (BRC) of National Botanical Research Institute,

Lucknow India.

3.2 Methods

3.2.1 Genomic DNA extraction

The genomic DNA was isolated from young fresh leaves following CTAB (Cetyle tri-methyl

ammonium bromide) method of Saghai-Maroof et al. (1984) with minor modifications. In

brief, approximately 5 g fresh and young leaves were subjected to grinding in liquid nitrogen

alongwith 2% PVP (Polyvinylpyrrolidone) with the help of mortar and pestle to obtain a fine

powder. The fine powder was quickly transferred to 50ml polypropylene centrifuge tube

containing 15 ml of pre-warmed (at 600C) extraction buffer and homogenized the mixture by

gentle shaking. This was then incubated for 45min at 60 0c in water-bath and mixed by gentle

55

swirling after every 10 min. After incubation, equal volume of chloroform:Isoamyl alcohol

(24:1) was added and the tubes were gently shaken for 10 min. and centrifuged for 20 min. at

10000 rpm at room temperature and the upper aqueous phase was transferred to a fresh sterile

50 ml centrifuge tubes. To each of the above tubes, 2 volume ice-cold isopropanol was added

and the tube was shaken gently and kept at – 20 0C for 1 hr. With the help of a sterile glass

hook, DNA was spooled out and transferred to 2 ml sterile tube for washing with 70%

ethanol for 10 min. The 2ml tube having DNA with 70% ethanol was centrifuged for 2 min.

at 12000 rpm and ethanol was discarded. The pellet was air-dried and dissolved in 500µl of

autoclaved sterile water. For purification of extracted genomic DNA, the RNase A was added

to the sample 20µg/ml (100 µl DNA: 1µl RNase A) and the mixture was incubated at 37 0c

for 1 hour. DNA was extracted with equal volume of phenol:chloroform:isoamyl alcohol

(25:24:1) by centrifuging the tube at 10,000 rpm for 5 minutes at room temperature. The

upper aqueous supernatant was transferred to new sterile 2 ml tube and two volume of ice-

cold absolute alcohol and 1/10th

volume of 3M sodium acetate was added and kept at -20 0c

for 1 hour. After 1 hour, the tube was centrifuged at 10000 rpm for 5 min. and DNA was

pelleted. Finally, the pellet was washed with 70% ethanol two times, air dried and dissolved

in 500 µl autoclaved sterile water.

3.2.2 Quantification and quality check of genomic DNA

The concentration of genomic DNA was determined using a Nanodrop spectrophotometer

ND1000 (Nanodrop Technologies, DE, USA) as per standard manufacturer’s instructions.

The concentration was recorded in g/µl. The ratio of absorbance at 260 nm and 280 nm was

used to assess the purity of DNA and RNA. A ratio of ~1.8 is generally accepted as ‘pure’

for DNA and a ratio of ~2.0 is generally accepted ‘pure’ for RNA. If the ratio is appreciably

lower in either case, it may indicate the presence of protein, phenol or other contaminants that

absorbs strongly at or near 280 nm. Further, the quality of DNA was also checked by running

on 0.8% agarose gel.

3.2.3 Development of SSR markers

3.2.3.1 Construction of SSR enriched genomic libraries of J. curcas

The genomic DNA of J. curcas (accession NBJC 132) was utilized for the construction of

SSR- enriched genomic DNA libraries. For this purpose, 4 SSRs repeat motif i.e. (AC)n,

56

(AG)n, (AAC)n and (AAT)n were selected for the construction of 4 independent SSR-

enriched genomic libraries. The work for constructing 4 SSR-enriched genomic libraries was

outsourced to Genetic Identification Services (GIS, Chatsworth, CA, USA). The standard

protocol used to prepare SSR enriched libraries is as follows: the genomic DNA was partially

digested with a mixture of endonucleases. Size- separated DNA fragments, ranging from 300

to 750 bp, were ligated with adapters and separately enriched for each specific SSR motif

using biotinylated capture molecules (CPG, Lincoln Park, NJ). The captured fragments were

amplified and digested with HindIII to remove the adaptors and the fragments were cloned in

pUC19 vector. GIS supplied ligation mixtures of all the 4 SSR-enriched libraries. Using each

of the above 4 ligation mixtures separately, transformation of Escherichia coli strain Dh5α

(Invitrogen) was done by electroporation using MicroPulser (BIO-RAD, Gladesville, New

South Wales, Australia). The recombinant clones were used for plasmid isolation and

sequencing of cloned inserts. A graphical representation of identification of SSR and primer

designing with classical and new method were shown in figure 3.2.

3.2.3.2 Screening of SSRs containing sequences through classical method (Sanger

sequencing)

The two genomic libraries enriched for CA and GA repeat motifs of recombinant plasmid

were transformed into ElectroMaxTM

DH5-ETM

electro-competent Escherichia coli cells

(Invitrogen) ), which were then plated onto LB agar plates containing 200µg/ml ampicillin,

100µg/ml X-galactosidase and 0.1M Isopropyl –β- D- thiogalactopyranoside (IPTG) for blue

white selection. The plates were incubated overnight at 37 0c. The white colonies were picked

up and grown in 5 ml LB overnight for the isolation of plasmid DNA.

Small- scale isolation of plasmid DNA was carried out by the alkaline lysis method of

Birnboim and Doly (1979), as described by Sambrook et al. (1989). In brief, a single isolated

colony was inoculated in 3 ml luria broth (LB) medium containing appropriate antibiotic and

grown overnight at 37 0c with continuous shaking at 250 rpm in an incubator shaker (New

Brunswick, New Jersey, USA). About 1.5 ml of the overnight grown culture was transferred

to micro-centrifuge tube and spun at 12,000 rpm for 1 minute at 4 0c to pellet down the cells.

The remaining 1.5 ml culture was transferred to the same tube after discarding the

supernatant and centrifuged to collect cells. The supernatant was carefully discarded and the

pellet was thoroughly suspended in 100 l of solution I (0.05 M Glucose, 0.025 M TrisCl, pH

8.0 and 0.01 M EDTA, pH 8.0, stored at 4 0c) by vortexing. To this mixture, 200 l of freshly

57

prepared solution II (Alkaline SDS1% and 0.2 N NaOH) was added and the tube was gently

inverted to get a clear suspension. To this suspension, 150 l of solution III was added and

the content of the tube were gently mixed by inverting for a few seconds and the lysate was

kept on ice for 10-15 minutes. The tube containing lysate was centrifuged at 12,000 rpm for

15 minutes at 4oc and the supernatant was transferred to a fresh 2 ml eppendorf tube and mix-

up 2µl RNase (10mg/ml) and put on 37 0c for 30 min. Equal volume of chloroform: iso-amyl

alcohol (24:1 v/v) was added and centrifuged at 12,000 rpm for 5 min at room temperature.

The supernatant was transferred to a fresh 2 ml eppendorf tube and added equal volume of

isopropanal and centrifuged at 12,000 rpm for 15 min at 40c. The supernatant was discarded

and the pellet was then washed with 70% ethanol (v/v) by centrifuging at 12,000 rpm for 10

minute at room temperature. The DNA pellet was air-dried until the ethanol evaporated. The

dried pellet was dissolved in 50 l sterile water (MQ) and stored at –20oc. The quality of

plasmid DNA was checked on 0.8% agarose and quantified using Nanodrop. The good

quality plasmid DNA was used for Sanger sequencing using ABI 3730xl DNA analyzer.

Sequencing of the plasmid DNA was done by Sanger sequencing method with M13

primer (5΄ACGACGTTGTAAAACGACGG-3΄) on ABI 3730xl DNA Analyzer (Applied

Biosystems, Foster City, CA, USA). The BigDye®

Terminator v3.1 Cycle Sequencing Kit

(Applied Biosystems) was used for sequencing reaction. The reaction was carried out

according to the manufacturer’s instruction. The composition of the reaction mixture was as

follows-

Template - 300 ng

Ready Reaction mix - 0.5µl

Dilution Buffer - 1.75 µl

Primer (M13) - 2.0 pmol

Volume of the reaction mixture was adjusted to 10 µl with nuclease free water. The reaction

mixture was subjected to 25 cycles of PCR with following program:

Denaturation - 96oc - 30 sec

Annealing - 50oc - 15 sec

Extension - 60oc - 4 min

On completion of above PCR reaction, sequencing product was purified by Ethanol/EDTA

precipitation methods and dissolved in 10µl Hi-Di formamide which was then denatured at

950c for 5 minutes. The denatured samples were taken up in the instrument DNA Analyzer

3730xl for sequencing.

58

The sequence data obtained were checked manually and vector sequences removed.

The redundant sequences were identified by comparison using stand-alone BLAST (2.2.1.2).

After removing redundancy, unique sequences were subjected to SSR search by web based

program SSRIT (http://www.gramene.org/db/markers/ssrtool). The basic search criteria for

SSRs were a minimum of 5 repeats and maximum motif length was 6. The primer pairs

flanking the SSRs were designed using PRIMER3 software (http://frodo.wi.mit.edu/primer3)

with major primer design parameters as follows: product length 100-300bp, primer size 18-

25bp, and melting temperature between 57-630c (optimum 60

0c). In some cases, where

primer could not be designed, the criteria were relaxed. The primers were synthesized

(Eurofins, Germany) with an additional 18 base (5΄-TGTAAAACGACGGCCAGT-3΄) tag at

5΄ end to all forward primers as M13 tail.

Figure 3.2 A graphical representation of SSR identification and primer designing.

Library preparation

Emulsion PCR

Signal processing to identify

sequences

454 GS FLX sequencing

Enriched DNA

fragments

Ligated enriched DNA into pUC19 vector

Ligation mix of SSR enriched g-DNA

Electroporation of E. coli DH5α with ligation

Isolation of plasmid DNA and sequencing

Selection of transformed cells

New Method

Primer designing using PRIMER3/WebSat softwares

Identification of SSRs in the sequence

(MISA/SSRIT/WebSat)

Classical Method

59

3.2.3.3 Screening of SSR containing sequences using next generation sequencing

(Roche 454 GS FLX sequencer)

The rest two genomic libraries enriched for trinucleotide repeat motif microsatellite (AAT

and AAG) were used to develop genomic derived SSRs utilizing high throughput next

generation sequencing technique of Roche 454 whole genome sequencer. The ligated SSR

enriched fragment was diluted 1:50 and PCR amplified with M13 universal primers. The

agarose gel having smear of 300-800bp was cut and eluted using PCR gel elution kit

(Genetix, India). This eluted DNA fragment were sequenced on Roche 454 GS-FLX

sequencer (Titanium, GS sequencer v2.5, 454 Life Sciences, Branford, CT) on 1/8th

area of

the PicoTiter Plate device using GS emPCR kit1 and GS LR 70 sequencing kit, according to

the manufacturer’s protocol.

The adaptor sequences of the raw reads were removed from the DNA sequence data

obtained from the GS FLX sequencer and assembled using CAP3 assembler (Huang and

Madan 1999). The criteria for assembling the reads were minimum overlap length of 40 bases

with 90% identity. The redundant sequences were identified by comparison using stand-alone

BLAST (2.2.1.2) and removed. The resulting contigs and singlets were then used for

identification and localization of microsatellites by a microsatellite search module MISA

(MIcroSAtellite, http://pgrc.ipk-gatersleben.de/misa). The criteria for SSR search by MISA

were repeat stretches having a minimum of: 5 repeat unit in case of di-, tri-, and 4 repeat units

for tetra-, penta- and hexanucleotide SSRs. The flanking sequences of the repeat motifs were

used to design specific primer for the SSR containing sequences using PRIMER3

(http://frodo.wi.mit.edu/primer3) software with major criteria as: length, 20-26 bp; melting

temperature, 55-650c and length of PCR product 100-400bp. The primers were synthesized

(Sigma -Aldrich) with an additional 18 base (5΄-TGTAAAACGACGGCCAGT-3΄) tag at 5΄

end to all forward primers as M13 tail.

3.2.3.4 Similarity search and functional annotation

Functional annotations of the sequences were determined against NCBI non-redundant (NR)

protein database (NCBI nr, release: 20th

Dec, 2011) using BLASTX with a criteria of

minimum e-value of 1e-5 and minimum alignment length 50% of the query sequence and

classified on the basis of their plant (genus) specific associations.

60

3.2.4 SSR Genotyping

3.2.4.1 PCR amplification and confirmation on agarose gel

The SSR flanking primers were commercially synthesized with an additional 18 base (5'-

TGTAAAACGACGGCCAGT-3') at 5' end to all the forward primers as “M13 tail”

following Schuelke (2000). In addition, 4 “M13 tag” with the same sequence of 18 base of

M13 tail was also synthesized with different fluorescent dyes namely FAM, VIC, NED and

PET (Applied Biosystems, USA). The PCR amplification was carried out in 10µl reaction

mixtures that contained 10 g of genomic DNA, 1X PCR master mix (AmpliTaq Gold®,

Applied Biosystems, USA), 0.1 µl (5pmol/µl ) of forward primer (tailed with M13 tag), 0.3µl

(5pmol/µl) each of both, reverse primer and M13 tag (labeled with either 6- FAM, VIC, NED

and PET). PCR was performed on Verti Thermal Cycler (Applied Biosystems, USA) and

DNA Engine Tetrad (Bio-Rad) using the following cycling condition: initial denaturation at

950c for 5 min. followed by 36 cycle of 94

0c for 30 s, 50-55

0c (primer specific) for 45

s and

72 0c for 1 min. Subsequently, 10 cycles of denaturation for 30 s at 94,

0c annealing for 45 s at

530c, extension for 45 s at 72

0c followed by final extension for 15 min at 72

0C was

performed.

The genomic DNA amplified with SSR primers were first checked for their

amplification on 1.5% agarose gel. For agarose gel electrophoresis, gels were prepared using

agarose (Pronadisa) in 1X TBE buffer and 0.5X TBE buffer using horizontal agarose gel slab

apparatus. The TBE buffer was used to suspend the agarose and dissolved by heating in a

microwave oven. After melting the agarose properly, it was cooled down at 600c followed by

addition of 10 μg/ml ethidium bromide and pouring in tray. Slots were made by fixing comb

over the tray, prior to the pouring of molten agarose. The comb was removed after the

agarose gelled and the gel was transferred to electrophoresis tank containing 0.5X TBE

buffer. The DNA samples and molecular weight markers were mixed with tracking dye and

loaded into the slots of gel. Electrophoresis was carried out at 5V/cm. DNA bands in the gel

were examined under transmitted UV light and documented using Bio Rad Gel

Documentation system. After confirmation of PCR amplification on agarose gel the amplified

PCR amplicon were separated on PAGE or ABI 3730xl capillary electrophoresis.

61

3.2.4.2 Polyacrylamide gel electrophoresis (PAGE) and data scoring

Some PCR amplified SSR were separated on PAGE and some on ABI DNA Analyzer. The

amplified PCR products were resolved on 8% polyacrylamide gel (29:1 acrylamide: N, N’-

methylene bis acrylamide, 1X TBE buffer) using a Bio-Rad Sequi Gen gel apparatus (Bio-

Rad, USA). Twenty base pair ladder marker was used as DNA marker in each gel. For

resolving the SSR PCR products, casting of PAGE gels and electrophoresis involved the

following steps. (i) The two glass plates, one notched IPC (Integral Plate Chamber) and one

outer plate were washed with liquid labolene detergent using warm distilled water and

allowed to air dry and plates were wiped. (ii) On the outer glass plate binding silane were

spread with 0.5% acetic acid and 95% ethanol and at notched plate repel silane were spread

(iii) The glass plates with a pair of 0.4 mm thick spacers were assembled together in the

precision caster base with gasket with the help of GT lever clamps. (iv) The required amount

of TEMED was added to the 8% polyacrylamide gel solution, which was immediately

injected/ poured between the plates through an orifice of precision caster base injection port

with the help of a 180 cc loading syringe. Precaution was taken to avoid the introduction of

air bubbles. When the solution reached the neck of the plates, a 49- well vinyl sharks tooth

comb (0.4 mm thick) facing upward was placed between the plates. The gel was allowed to

polymerize for about 1 h (v) After polymerization, the comb was removed carefully and the

walls of the gel were carefully cleaned with lint free paper, so that the dust/dirt or any foreign

object does not interfere with the resolution of amplified SSR fragments. The plates were

mounted onto the sequencing system with the assembly and upper and lower buffer chambers

were filled to the required volume with 1X fresh TBE buffer (vi) The comb with shark teeth

facing downwards was placed at the appropriate position without damaging the gel. (vii)

After completing the assembly, gel was pre-run for 30 min at constant voltage of 1500V. The

voltage and the time of the pre-run were set with the help of Bio-Rad PowerPac 3000. (viii)

After pre-run, the 1µl PCR products with an equal volume of gel loading dye (xylene cyanol

and bromophenol blue) were properly mixed and loaded with a loading tip in the wells. After

loading, gel was run at 600 V and 60 0c for 2 h.

The PAGE was silver stained to visualize the DNA bands. Following steps were

involved in silver staining of gels: (i) After electrophoresis, the gel was carefully removed

from the glass plates and transferred to a tray containing double distilled water and kept for

5min. with gentle shaking (ii) The distilled water in the above tray was then replaced with

fixing solution containing 10% ethanol/methanol and 0.50% glacial acetic acid and kept for

62

another 5 min with gentle shaking (iii) The above fixing solution was removed from the tray

and retained for further use. Silver solution (0.3g AgNO3 powder in 150 ml 10%

ethanol/methanol solution with 750 µl glacial acetic acid) was then poured in the tray for

staining the gels. The gel was kept in the silver solution for 5 min with gentle shaking (iv)

The silver solution was removed from the tray and gel was rinsed for a while in distilled

water (v) The gel was transferred to the developing solution (prepared by dissolving 4.0 g

NaOH pellets in 150 ml distilled water with 450 µl of 40% formaldehyde). The solution in

the tray was shaken gently for 5-10 min. allowing the DNA bands to appear (vi) The staining

was stopped by rinsing the gel for 5 min in the fixing solution retained after step three (vii)

The gel was placed on a light box and photographed using a digital camera and the picture

was transferred to the computer for recording of data. A 20 bp DNA marker (0.05 µg/ µl, 25

µg) was used as a size standard.

To determine the DNA band size of visualize in PAGE the migration distance of

bands of known size of 20 bp DNA marker were measured with scale. The migration distance

of the unknown fragment was also measured. The migration distances recorded were used for

the calculation of the size of the unknown fragments by using web based software ‘Fragment

Size Calculator’ (www.basic.northwestern.edu/biotools/SizeCalc.html). Thus, the genotypic

data on all SSR bands for each genotype were recorded in terms of fragment size (bp).

3.2.4.3 Automated capillary electrophoresis (ABI 3730xl) and data scoring

The SSR amplified PCR product was also separated on ABI 3730xl DNA analyzer. The ABI

DNA analyzer not only has the ability to perform sequencing by capillary electrophoresis but

also can perform a variety of additional DNA analysis applications based on the sizing and

intensity of fluorescently labelled DNA fragments. Collectively, these applications are

referred to as “fragment analysis”. For the fragment analyses following steps were involved:

(i) Preparation of sample

After PCR amplification confirmation on 1.5% agarose gel, post PCR multiplex sets were

prepared based on fluorescence labelled primers. For post PCR multiplexing, 1µl of 6-FAM

and 2 µl of each VIC, NED and PET labelled PCR products representing different SSRs were

combined with 13 µl of water. 1 µl of this mixed product was added to 10 µl HI-Di

formamide containing 0.25 µl GeneScanTM

600 LIZ(R)

as internal size standard, denatured for

5 min at 95 0c, quickly chilled on ice for 5 min. and loaded on ABI 3730xl DNA Analyzer.

63

(ii) Capillary electrophoresis

Both the dye-labelled sample and size standard fragments were co-injected and separated

based on size and charge as they move through the capillary filled with polymer. As each of

the fluorescently labelled sample and size standard fragments moves across the laser window

and fluoresces, the signal produced is detected by an optical detection device on the

instrument. The data collection software then reports the DNA fragments.

(iii) Data analysis using software

Basic data collection occurs during a capillary electrophoresis run. After data collection,

secondary analysis software is used to analyse the data. Applied Biosystem®

Gene-Mapper®

V.4.0 software is flexible, high-performance software package which was used to calculate

SSR allelic data.

3.2.5 Statistical analysis of SSR data

The allelic SSR data obtained from PAGE and ABI DNA analyzer were subjected to various

statistical analysis using different softwares. The allelic data subjected to Power Marker

software (Liu and Muse 2005) to calculate polymorphic SSRs, observed heterozygosity (Ho),

gene diversity or expected heterozygosity (He), major allele frequency and polymorphic

information content (PIC) value. The PIC value was calculated following Botstein et al.

(1980) as:

Where Pi and Pj are the frequencies of ith

and jth

allele. GenAlex 6.5 software was

used to calculate number of observed alleles (Na), number of effective alleles (Ne), Shannon’

information index (I) and molecular variance (AMOVA) (Excoffier et al. 1992). The allelic

data were converted into 0-1 matrix which was used to calculate pair-wise genetic

dissimilarities among the accessions using Jaccard’s coefficient with bootstrap value of 1000.

The dissimilarity matrix thus generated was used to construct a neighbor-joining (NJ) tree

using DARwin 5.0.157 software (Perrier et al. 2003). Further, in order to determine the

genetic structure and define the number of clusters (gene pools), model-based cluster analysis

was also performed using software STRUCTURE version 2.3.3 (Pritchard et al. 2000). The

number of presumed population (K) was set from 2 to 10 with admixture model, without prior

64

information on their origin. Five independent runs were assessed for each fixed K and each

run consisted of 30,000 burn-in period and 1,00,000 iterations. The optimal value of K was

determined by examination of the ΔK statistic and L (K) (Evanno et al. 2005) using Structure

Harvester (Earl and vonHoldt 2012).

3.2.6 Morphometric analyses

3.2.6.1 Field experiment and morphological data measurement

A total of 80 accessions of J. curcas representing different eco-geographical and agro-

climatic zone of India were selected from germplasm bank maintained at Banthra Research

Center (BRC) of CSIR-National Botanical Research Institute, Lucknow, India. Fifteen

cuttings of each accession were raised in polybags filled with soil, cowdung manure and sand

in equal proportion during March 2008. The 6 rooted cuttings of each accession were then

transplanted in experimental plot in Randomized Block Design (RBD) with 3 replications and

2 plants/replication during July 2008. The experimental plot is situated between 260

40’N

latitude and 800

45’ E longitude and at an altitude of 129 m above sea level. The distances

between rows and plants were kept 2 meter. The field was irrigated as and when required.

Pruning of plants was practiced 2 feet above the ground in the first week of March 2009 and

2010.

Data on different morphometric traits were recorded during November 2010- January

2011. Following traits were considered for data measurement:

Female flower/plant: Number of female flowers counted during flowering period (November

– January)

Male flower/plant: Number of male flowers counted per plant.

Male/female ratio: ratio between female and male flowers per plant.

Number of fruits/plant: total number of fruits counted per plant at harvesting time.

Number of seeds/plant: Total number of seeds counted per plant.

Fruit weight/plant: total fruit weight measured in gram per plant.

Seed weight/plant: total seed weight measured in gram per plant.

Seed length and width: twenty seeds per plant randomly selected and length and width

measured in middle with vernier caliper (mm) and

Oil content: twenty five to thirty seeds randomly selected per plant and used to measure oil

content in percent through Nuclear Magnetic Resonance (NMR) spectrometer.

65

2 p

X

2g

X

x 100

x 100

3.2.6.2 Statistical analysis of morphological data

The mean values for each trait were used for statistical analysis and subjected to analysis of

variance and covariance using WINDOSTAT software (www.windostat.org). Variance

components were estimated from mean square of ANOVA (Singh and Chaudhary 1985). For

divergence studies, the variability among populations was tested by Wilk’s lambda criterion

for pooled effect of all the characters. Hierarchical clustering was carried to find out the

pattern of similarity/dissimilarity among accessions using ward’s minimum variance method

(Ward 1963). The relationships among the clusters were assessed by estimating the

intercluster distances using Mahalanobis distance (D2) statistics (Rao 1952).

Variance components

The variance components were estimated from mean square of ANOVA (Singh and

Chaudhary 1985). The phenotypic, genotypic and error variance of the mean for accessions

were calculated as

Where 2g,

2p and

2e are the variance components of genotypes, phenotypes and

environments respectively.

Coefficient of Variability

Phenotypic (PCV) and genotypic (GCV) coefficient of variation were calculated as

Heritability in broad sense

Heritability in broad sense (h2

B) was estimated on genotypic mean basis as described by Hill

et al. (1998) and Allard (1999) as

h2

B = 2g/

2p

2p =

2g +

2e

MSv – MSe

r

2e= E (MSe)

2g =

PCV =

GCV =

66

Genetic Advance

Expected genetic advance (%) of mean was estimated according to Johnson et al. (1955).

GA= k x (p) x h2

B

GA % = (GA/X) x 100

Where,

k- Standardized selection differential (2.06),

p - Phenotypic standard deviation,

h2

B – broad sense heritability and

X - Mean of the trait.

Correlation and path coefficient analysis

The correlation coefficient between different characters at genotypic and phenotypic level

was worked out according to Johnson et al. (1955).

1. Phenotypic correlation coefficient (rp)

Phenotypic correlation (rp) =

Phenotypic covariance = genotypic covariance + error covariance

Error covariance = M.S.P. error

Phenotypic variance X or Y = genotypic variance + error variance

2. Genotypic correlation coefficient (rg)

Genotypic correlation (rg) =

Genotypic covariance =

Genotypic variance X or Y=

Phenotypic covariance X Y

√Phenotypic variance X. Phenotypic variance Y

Genotypic covariance X Y

√Genotypic variance X. Genotypic variance Y

M.S.S. treatment – M.S.S. error

replication

M.S.P. treatment – M.S.P. error

replication

67

Direct and indirect effects of various traits to understand the relationship among variables

based on a priori model were calculated as described by Lynch and Walsh (1998).

Where ryi is the simple correlation coefficient between the ith

causal variable (Xi ) and effect

variable (y), rii is the simple correlation coefficient between the ith

and i’th

causal variables,

Pyi is the path coefficient (direct effect) of the ith

causal variable (Xi ), rii’ Pyi’ is the indirect

effect of the ith

causal variable via the i’th

causal variable. To determine Pyi values, square

matrices of the correlation coefficient between independent traits in all possible pairs were

inverted and then multiplied by the correlation coefficients between the independent and

dependent traits.

k

ryi = Pyi rii’ Pyi’ For ii’ and i’1

i’=1

Chapter 4 Results

68

J. curcas L. is a non-edible, oil-rich plant which has attracted global attention as a promising

renewable resource of biodiesel production. Limited efforts have been made towards its

genetic improvement and it is still considered as undomesticated/semi-domesticated plant

with various negative features. A few preliminary studies have been conducted in the recent

past using classical methods for genetic improvement of polygenic traits such as yield and oil

trait. The genetic improvement of such polygenic traits mainly rely on phenotypic and

pedigree information that are labor and time intensive with traditional breeding approaches

(Falconer et al. 1996). Integration of molecular markers technology with conventional plant

breeding approaches help in unraveling the genetic architecture of these complex quantitative

traits through identification of markers associated with quantitative trait loci (QTLs)

controlling these traits and marker assisted breeding (MAB). However, very limited efforts

have been made towards the development of molecular markers and their use in

genetic/genomic studies in J. curcas. Therefore, the present investigation was undertaken to

develop a set of large number of SSRs from genomic libraries and thereafter to characterize,

validate, identify polymorphic SSR and apply selected markers for molecular diversity

analysis in global collection of J. curcas. In addition, phenotypic characterization of large

number of J. curcas accessions collected from different part of India was carried out. Efforts

were also made to develop interspecific hybrid between J. curcas and J. integerrima and

characterize the hybrids using newly developed SSR markers. Heterozygosity level of J.

curcas was also assessed using SSR markers. The results related to the above aspects are

summarized under following heads:

Phenotypic characterization of indigenous accessions of J. curcas.

Development of large scale genomic derived SSRs from four microsatellite enriched

genomic libraries

PCR optimization, polymorphism detection and characterization of developed SSRs

for various attributes.

Study of molecular genetic diversity among indigenous and exotic accessions of

J. curcas.

Characterization of interspecific hybrid population of J. curcas x J. integerrima.

Assessment of heterozygosity in J. curcas.

69

4.1 Phenotypic characterization of indigenous accessions of J. curcas

4.1.1 Genetic variability

For studying the level of genetic variability for various agronomic traits a subset of 80

accessions of J. curcas collected from different eco-geographical and agro-climatic zones of

India were used for phenotypic evaluation. A total of 10 morphometric traits related to floral

characters and yield were measured and data were analyzed to assess the level of genetic

variability and diversity in J. curcas. The range and mean value of different traits along with

various statistical parameters is presented in Table 4.1. The analysis of variance revealed

significant differences for all the 10 traits as indicated by F value (Table 4.1). The number of

female flowers/plant varied from 72.6 to 118.0 with an average of 94.6±1.39 and the number

of male flowers/plant varied from 1627.2 to 2960.0 with an average of 2240.8±42.92.

Male/female flower ratio was found to be varied between 17.2 and 32.1 with an average of

24.0±0.37. The number of fruits/plant varied from 63.4 to 112.9 with an average of

80.5±1.31. The number of seeds/plant varied from 169.6 to 297.0 with an average of

218.3±3.89. The seed weight/plant varied from 102.7 to 273.8 g with an average of

180.2±4.20 g. The range of seed length (mm) and seed width (mm) were 13.3 - 18.5 and 7.8 -

11.8 with an average of 16.4±0.13 and 10.7±0.09 respectively. The oil content varied

between 20.8 to 36.1% with an average of 26.2±0.38. Out of 80 accessions, 37 accessions

had seed weight/plant above average value (i.e.180.2g) and of which only 4 accessions had

seed weight/plant above 250g with maximum in accession NBJC1078 (273.08g). Likewise,

26 accessions had oil content above average value of 26.2% and only 3 accessions namely

NBJC1055, NBJC1051 and NBJC1048 had oil content above 35%.

In order to assess the heritable portion of total variability, the phenotypic variance

(2p) was partitioned into genotypic (

2g) and error variance (

2e). The values of error

variance were found to be higher than those of genotypic variance (2g) for number of female

flowers/plant (163.5), male/female ratio (18.89), number of fruits/plant (143.13), number of

seeds/plant (1047.72) and seed weight/plant (1075.79) indicating the influence of

environmental factors on these traits. The other traits namely male flowers/plant, fruit

weight/plant, seed length, seed width and oil content were not much influenced by

environmental factors as these traits showed lower error variance than the genotypic variance.

70

Tab

le 4.1

Ran

ge, m

ean, estim

ates of v

ariance co

mponen

ts, bro

ad sen

se heritab

ility an

d g

enetic ad

van

ce in J. cu

rcas

R

ange

Mean

±S

D

F v

alue

σ2g

σ

2p

σ2e

GC

V

PC

V

Hb

GA

G

A%

Fem

ale flow

ers/plan

t 72.6

-118.0

94.6

±1.3

9

2.8

2**

99.5

5

263.1

0

163.5

10.5

4

17.1

3

38.0

12.6

4

13.3

5

Male flo

wers/p

lant

1627.2

-2960.0

2240.8

±42.9

2

4.0

6**

109448.1

8

216787.5

1

107339.3

14.7

6

20.7

8

50.0

484.2

4

21.6

1

Male/fem

ale ratio

17.2

-32.1

24±

0.3

7

1.7

6**

4.7

9

23.6

9

18.8

9

9.1

2

20.2

7

20.0

2.0

3

8.4

5

No. o

f fruits/p

lant

63.4

-112.9

80.5

±1.3

1

2.8

6**

88.9

5

232.0

8

143.1

3

11.7

2

18.9

3

38.0

12.0

3

14.9

5

No. o

f seeds/p

lant

169.6

-297.0

218.3

±3.8

9

3.4

3**

850.4

2

1898.1

4

1047.7

2

13.3

5

19.9

4

45.0

40.2

1

18.4

0

Fru

it weig

ht/p

lant (g

) 301.1

-575.2

397±

8.1

8

4.5

**

4128.0

7

7665.6

3

3537.5

6

16.1

5

22.0

1

54.0

97.1

3

24.4

2

Seed

weig

ht/p

lant (g

) 102.7

-273.8

180.2

±4.2

0

3.9

0**

1040.4

3

2116.2

2

1075.7

9

17.8

7

25.4

8

49.0

46.5

9

25.8

1

Seed

length

(mm

) 13.3

-18.5

16.4

±0.1

3

23.3

2**

1.3

4

1.5

2

0.1

8

7.0

5

7.5

1

88.0

2.2

4

13.6

4

seed w

idth

(mm

) 7.8

-11.8

10.7

±0.0

9

20.6

9**

0.6

1

0.7

0

0.0

9

7.2

8

7.8

2

87.0

1.5

0

13.9

7

Oil co

nten

t (%)

20.8

-36.1

26.2

±0.3

8

27.1

0**

10.8

4

12.0

9

1.2

5

12.6

0

13.3

0

90.0

6.4

2

24.5

8

*, *

* sig

nifican

t at 5%

and 1

% p

robab

ility resp

ectively

σ2g

- gen

oty

pic v

ariance, σ

2p- p

hen

oty

pic v

ariance, σ

2e- error v

ariance

GC

V- g

enoty

pic co

efficient o

f variab

ility, P

CV

- phen

oty

pic co

efficient o

f variab

ility

Hb-B

road

sense h

eritability

(%), G

A- g

enetic ad

van

ce, GA

%- g

enetic ad

van

ce percen

t

71

The phenotypic coefficient of variation (PCV) and genotypic coefficient of variation

(GCV) varied from 7.51 to 25.48 and 7.05 to 17.87% respectively. Maximum PCV and GCV

were noticed for seed weight/plant (25.48; 17.87) followed by fruit weight/plant (22.01;

16.15), number of male flowers/plant (20.78; 14.76), number of seeds/plant (19.94; 13.35).

The PCV was found to higher than that of GCV for all the traits with remarkable differences

in their values. However, the traits as seed length, seed width and oil content had very

insignificant differences in PCV and GCV values.

Broad sense heritability varied from 20% to 90% and maximum was observed for oil

content (90%) followed by seed length (88%), seed width (87%), fruit weight/plant (54%)

and male flowers/plant (49%). The lowest heritability (20%) was noticed for male/female

ratio. The female flowers/plant and number of fruits/plant had 38.0% heritability. The

number of seeds/plant had 45.0% heritability. The high heritability noticed for oil content,

seed length and seed width indicates that these characters are under genotypic control.

The genetic advance as percent of mean varied from 8.45 for male/female ratio to

25.81 for seed weight/plant. High heritability (90.0%) coupled with high GA (24.58) and

GCV (12.60) was found for oil content. Low heritability (49%) with high GA (25.81) was

noticed for seed weight/plant, fruit weight/plant (45.0; 24.42) and number of seeds/plant

(45.0; 18.40). The high heritability with low GA was found for seed length (88.0; 13.64) and

seed width (87.0; 13.97). The trait male/female ratio had lowest heritability (20%), lowest

GA (8.45%) and lowest GCV (9.12).

4.1.2 Diversity analysis based on morphological traits

The morphological data of all the 10 traits were subjected to multivariate analysis for diversity

assessment among the 80 accessions of J. curcas. The testing of significance based on Wilk’s

lambda criterion for pooled effect of all the characters showed significant differences among

the accessions (χ2=790 df=3079.06**). A hierarchical cluster analysis based on Wards

minimum variance grouped all the 80 accessions into 4 clusters (Table 4.2 Figure 4.1).

4.1.2.1 Cluster composition

The number of accessions per clusters varied from 6 (cluster III) to 29 (cluster II). The cluster

II was largest with 29 accessions collected from 11 states India i.e. Andhra Pradesh (6

accessions), Rajasthan (5), Jharkhand (4), Uttar Pradesh (2), Chhattisgarh (2), Bihar (2),

Punjab (2), Gujarat (1), Haryana (1), Kerala (1) and Tamil Nadu (1). The cluster I was second

72

largest comprising 27 accessions including 5 accessions collected from Uttar Pradesh, 4 from

Gujarat, 3 from West Bengal, 2 each from Rajasthan, Tamil Nadu, Himachal Pradesh, Bihar,

Chhattisgarh, Haryana, and 1 each from Punjab, Madhya Pradesh, Uttaranchal, Jharkhand

and Kerala. The cluster III was the smallest with 6 accessions collected from Uttaranchal (4),

Jharkhand (1) and Rajasthan (1). The cluster IV had 18 accessions collected from

Chhattisgarh (4), Uttar Pradesh (4), Tamil Nadu (3), Himachal Pradesh (2), Haryana (2),

Orissa (1), Andhra Pradesh (1), and West Bengal (1).

4.1.2.2 Cluster distance and mean value of traits

The maximum intra-cluster distance was noticed in cluster IV (30.15) followed by cluster I

(28.61), cluster II (25.89) and cluster III (24.77). The inter-cluster distance varied from 47.59

(between cluster I and cluster II) to 211.27 (between cluster III and cluster I). Based on

cluster distance, the cluster III showed maximum genetic distance with cluster I, followed by

cluster IV and cluster II suggesting comparatively wider genetic diversity among them.

Considering the cluster means for all the 10 traits, the cluster I which was second largest

cluster showed highest mean value for all the traits (Table 4.4) i.e. female flowers/plant

(103.12±2.18), male flowers/plant (2471.53±68.74), male/female ratio (24.30±0.70) number

of fruits/plant (89.31±2.37), fruit weight/plant (432.88±14.52), number of seeds/plant

(245.39±7.10), seed weight/plant (198.61±7.75), seed length (17.58±0.10) and seed width

(11.12±0.08) except oil content (24.97±0.36). On contrary, cluster III which was the smallest

cluster had lowest mean value for number of male flowers/plant (2018.00±114.06),

male/female ratio (22.29±1.07), fruit weight/plant (347.50±11.85), seed weight/plant

(151.44±6.98), seed length (13.92±0.24) and seed width (8.29±0.02). However, it had

moderate mean value for female flowers/plant (91.44±4.54), fruits/plant (75.50±3.55),

seeds/plant (205.28±9.10) and oil content (25.56±1.02). The cluster II had second highest

mean value for the traits female flowers/plant, fruits/plant, fruit weight/plant, seed

weight/plant, seed length, seed width and lowest mean value for oil content. The cluster IV

which had 18 accessions showed highest mean value for oil content, lowest mean value for

female flowers/plant, fruits/plant, and seeds/plants.

73

Tab

le 4.2

Distrib

utio

n o

f 80 accessio

ns o

f Jatro

ph

a cu

rcas in

4 clu

sters based

on th

eir 10 q

uan

titative traits

Clu

ster

Nu

mb

er o

f

accessio

ns

Accessio

ns n

am

e

Clu

ster I 27

NB

JC1001,

NB

JC1034,

NB

JC1044,

NB

JC1005,

NB

JC1039,

NB

JC1054,

NB

JC1004,

NB

JC1045,N

BJC

1060,

NB

JC1121,

NB

JC1007,

NB

JC1031,N

BJC

1137,

NB

JC1107,

NB

JC1127,N

BJC

1023, N

BJC

1097, N

BJC

1008, N

BJC

1033,N

BJC

1129, N

BJC

1064, N

BJC

1022,

NB

JC1058,N

BJC

1124,N

BJC

1078, N

BJC

1122, N

BJC

1130.

Clu

ster II 29

NB

JC1006, N

BJC

1093, N

BJC

1083, N

BJC

1057, N

BJC

1133, N

BJC

1082, N

BJC

1087, N

BJC

1135,

NB

JC1003, N

BJC

1035, N

BJC

1112, N

BJC

1036, N

BJC

1050, N

BJC

1020, N

BJC

1021, N

BJC

1085,

NB

JC1080, N

BJC

1131, N

BJC

1063, N

BJC

1067, N

BJC

1065, N

BJC

1073, N

BJC

1052, N

BJC

1071,

NB

JC1084, N

BJC

1101, N

BJC

1138, N

BJC

1094,N

BJC

1072.

Clu

ster III 6

NB

JC1009, N

BJC

1014, N

BJC

1013, N

BJC

1019, N

BJC

1017, N

BJC

1025.

Clu

ster IV

18

NB

JC1048, N

BJC

1051, N

BJC

1055, N

BJC

1049, N

BJC

1081, N

BJC

1089, N

BJC

1092, N

BJC

1079,

NB

JC1141, N

BJC

1053, N

BJC

1123, N

BJC

1069, N

BJC

1088, N

BJC

1075, N

BJC

1076, N

BJC

1070,

NB

JC1090, N

BJC

1077.

74

Tab

le 4.3

Intra- (d

iagonal b

old

) and in

ter-cluster M

ahalan

obis d

istances fo

r 80 accessio

ns in

Jatro

pha cu

rcas

C

luster I

Clu

ster II C

luster III

Clu

ster IV

Clu

ster I 28.6

1

47.5

9

211.2

7

91.6

2

Clu

ster II

25.8

9

129.4

7

66.5

1

Clu

ster III

24.7

7

153.2

4

Clu

ster IV

30.1

5

Tab

le 4.4

Clu

ster mean

s and stan

dard

errors o

f the m

eans o

f differen

t traits in Ja

trop

ha cu

rcas

F

emale

flow

er/

plan

t

Male flo

wer/

plan

t

male/fem

ale

ratio

No.

of

fruits/

plan

t

Fru

it weig

ht/

plan

t (g)

No. o

f seeds

/plan

t

Seed

weig

ht/

plan

t (g)

Seed

length

(mm

)

Seed

wid

th

(mm

)

Oil

conten

t

(%)

Clu

ster I 103.1

2±

2.1

8

2471.5

3±

68.7

4

24.3

0±

0.7

0

89.3

1±

2.3

7

432.8

8±

14.5

2

245.3

9±

7.1

0

198.6

1±

7.7

5

17.5

8±

0.1

0

11.1

2±

0.0

8

24.9

7±

0.3

6

Clu

ster II 91.7

1±

1.8

2

2200.5

3±

67.2

6

24.0

6±

0.6

5

77.7

7±

1.7

5

390.9

5±

14.1

8

209.2

9±

4.9

0

179.3

2±

6.2

4

16.1

7±

0.1

4

10.8

8±

0.0

6

24.3

4±

0.2

7

Clu

ster III 91.4

4±

4.5

4

2018.0

0±

114.0

6

22.2

9±

1.0

7

75.5

0±

3.5

5

347.5

0±

11.8

5

205.2

8±

9.1

0

151.4

4±

6.9

8

13.9

2±

0.2

4

8.2

9±

0.0

2

25.5

6±

1.0

2

Clu

ster IV

85.9

1±

2.2

6

2033.9

4±

66.7

0

24.0

4±

0.6

8

73.2

0±

1.4

2

372.9

4±

12.1

9

196.4

1±

4.1

1

163.6

3±

7.3

7

15.9

8±

0.1

6

10.7

2±

0.0

7

31.0

4±

0.6

9

75

Figure 4.1 Dendrogram of 80 J. curcas accessions derived from the Wards minimum

variance cluster analysis using Mahalanobis distances

76

In order to assess the patterns of variation, principal component analysis (PCA) was

done by considering all the 10 variables simultaneously and the results of PCA are presented

in Table 4.5. A total of 4 principal components were detected having root of more than 1. The

first principal components accounted for 42.5% of the total variation majorly due to seed

length (3.07), seed width (1.95), seed weight/plant (0.61) and number of seeds/plant (0.51)

which had maximum and positive weight on this component. Oil content had negative weight

(-0.32) on PC1. The other traits as female flowers/plant, male flowers/plant, fruits/plant, fruit

weight/plant had comparatively lower effect of the first PC. The PC2 concentrated 32% of

total variation and was positively associated with seed weight/plant (0.52), number of

seeds/plant (0.55) and female flowers/plant (0.42), male flowers/plant (0.32), fruits/plant

(0.21), fruit weight/plant (0.14). The oil content (-3.05) had highest negative weight on PC2

which was followed by the trait - seed length (-0.18), seed width (-0.74) and male female

ratio (-0.05). The third PC accounted for 12% of total variation and was mainly due to seed

weight/plant (0.88), number of seeds/plant (0.89), oil content (0.72), female flowers/plant

(0.56), number of fruits/plants (0.48), male flowers/plant (0.25) fruit weight/plant (0.29), seed

length (0.26). The seed width had negative weight (-1.10) on the PC3. The fourth principal

component (PC) accounted for 7% of the total variation and was positively associated with

seed width (0.95), seed weight/plant (0.59), and number of seed/plant (0.55). The seed length

had negative weight (-0.87). The fruit weight had lowest and positive weight (0.06) followed

by oil content (0.07) and male/female ratio (0.07). The seed weight/plant invariably had

almost equal and positive weight on all the four components.

Table 4.5 Loadings of the first four principal components of genetic divergence in 80

accessions of Jatropha curcas

Characters PC1 PC2 PC3 PC4

Female flowers/plant 0.40 0.42 0.56 0.09

Male flowers/plant 0.37 0.32 0.25 0.21

male/female ratio 0.01 -0.05 -0.00 0.07

No. of fruits/plant 0.37 0.21 0.48 0.19

No. of seeds /plant 0.51 0.55 0.89 0.55

Fruit weight/plant (g) 0.20 0.14 0.29 0.06

Seed weight/plant (g) 0.61 0.52 0.88 0.59

Seed length (mm) 3.07 -0.18 0.26 -0.87

seed width (mm) 1.95 -0.74 -1.10 0.95

Oil content (%) -0.32 -3.05 0.72 0.07

Components

Root 14.49 10.86 4.09 2.44

% Variance explained 42.54 31.87 12.02 7.18

Cum. variance explained 42.54 74.41 86.43 93.62

77

4.1.3 Correlation coefficient analysis

The morphological data was subjected to correlation coefficient analysis to understand the

genetic relationship among the various traits. The genotypic and phenotypic correlations had

been calculated in 80 diverse accessions of J. curcas and are presented in Table 4.6. The

values of phenotypic and genotypic correlations were of the same sign except in female

flowers/plant vs. male flowers/plant, male female ratio vs fruit weight/plant. In general,

genotypic correlations were slightly higher than the corresponding phenotypic correlation.

The seed weight/plant (SWP) was positively and significantly associated with female

flowers/plant (0.95; 0.36), male flowers/plant (0.64; 0.36), Number of flowers/plant (0.99;

0.39), Number of seed/plant (0.99; 0.52), fruit weight/plant (0.98; 0.40), seed width (0.23;

0.16) and negatively associated with oil content (-0.30; -0.18). Non-significant and positive

association of seed weight/plant was noticed with seed length (0.19; 0.11). The oil content

was found to be negatively and significantly correlated with female flowers/plant, male

flowers/plant, number of seeds/plant, fruit weight/plant and seed weight/plant. The negative

but non-significant association of oil content was also noticed with male female ratio, number

of female flowers/plant, seed length and seed width. Among component traits, female

flowers/plants was significantly and positively correlated with male flowers/plant (0.77;

0.44), number of fruits/plants (0.99; 0.57), number of seeds/plants (0.98; 0.41), fruit

weight/plants (0.99; 0.63) and seed length (0.32; 0.14) and had negative association with oil

content (-.039; -0.22). Male flowers/plant had significant and positive correlation with male

female ratio (0.71; 0.65), number of fruits/plant (0.73; 0.39) and fruit weight/plant (0.72;

0.39). Male female ratio had significant and negative correlation with fruit weight/plant

(0.03; -0.13). Number of fruits/plant had significant and positive correlation with number of

seeds/plant (0.99; 0.42), fruit weight/plant (0.99; 0.81), seed length (0.37; 0.17) and seed

width (0.24; 0.14). Number of seeds/plant had significant and positive correlation with fruit

weight/plant (0.99; 0.47) and negatively correlated with oil content (-0.31; -0.20). Fruit

weight/plant had significant and positive correlation with seed length (0.35; 0.14) and seed

width (0.17; 0.14) while negatively correlated with oil content (-0.30; -0.20). Seed length had

significant and positive correlation with seed width (0.74; 0.65).

78

Table 4.6 Estimates of genotypic (rg) and phenotypic (rp) correlation coefficients among

various traits determined in 80 accessions of J. curcas

MFP MFR NFP NSP FWP SWP SL SW OC

FFP rg

rp

0.77

0.44**

0.10

-0.38**

0.99

0.57**

0.98

0.41**

0.99

0.63**

0.95

0.36**

0.32

0.14*

0.10

0.09

-0.39

-0.22**

MFP rg

rp

0.71

0.65**

0.73

0.39**

0.66

0.37**

0.72

0.39**

0.64

0.36**

0.22

0.11

0.20

0.11

-0.32

-0.20**

MFR rg

rp

-0.04

-0.09

-0.05

0.04

0.03

-0.13*

-0.04

0.05

0.00

-0.00

0.24

0.05

-0.07

-0.03

NFP rg

rp

0.99

0.42**

0.99

0.81**

0.99

0.39**

0.37

0.17**

0.24

0.14*

-0.29

-0.19

NSP rg

rp

0.99

0.47**

0.99

0.52**

0.11

0.06

0.14

0.09

-0.31

-0.20**

FWP rg

rp

0.98

0.40**

0.35

0.14*

0.17

0.14*

-0.30

-0.20**

SWP rg

rp

0.19

0.11

0.23**

0.16

-0.30**

-0.18

SL rg

rp

0.74

0.65**

-0.05

-0.05

SW rg

rp

-0.03

-0.04

OC

*, ** = significant at 5% and 1% respectively.

FFP: female flowers/plant; MFP: male flowers/plant; MFR: male female ratio; NFP: number

of flowers/plant; NSP: number of seeds/plant; FWP: fruit weight/plant; SWP: seed

weight/plant; SL: seed length; SW: seed width; OC: oil content; rg: genotypic correlation

coefficient; rp: phenotypic correlation coefficient

79

4.1.4 Path coefficient analysis

Correlation analysis reveals over all relationships among different traits which may be

negative or positive in nature and it is the net result of direct effect of a particular trait and

indirect effects via other traits. In order to predict the direct and indirect effect of traits on

correlation among various traits the path analysis studies are being carried out. The path

coefficient analysis for seed yield in J. curcas germplasm was performed and results are

presented in table 4.7. The male flowers/plant had the maximum direct effect on seed yield

(4.87), followed by number of seeds/plant (0.71), seed width (0.57), number of fruits/plant

(0.28) and oil content (0.15). On the other hand, the female flowers/plant (-2.19), male female

ratio (-3.32), fruit weight/plant (-1.14), seed length (-0.4) exhibited negative and direct effect

on seed yield but showed positive and significant correlation on seed yield except male

female ratio. The negative direct effect of female flowers/plant, fruit weight/plant, seed

length on seed yield/plant was counterbalanced by indirect positive effect via male

flowers/plant, number of fruits/plant, number of seeds/plant, and seed width. Male

flowers/plant (3.78), number of fruits/plant (0.31), number of seeds/plant (0.70), seed width

(0.05) had indirect positive effect and influenced female flower which indirectly affect yield.

Male female ratio showed negative indirect effect and negative correlation with seed yield.

Oil content had positive direct effect on seed yield/plant though it had negative association

which was due to negative indirect effect via male flower/plant, number of seeds/plant,

number of fruit/plant and seed width.

80

Tab

le 4.7

Path

coefficien

t analy

ses for seed

yield

/plan

t in J. cu

rcas g

ermplasm

Traits

In

direct effect v

ia

Direct E

ffect F

FP

M

FP

M

FR

N

FP

N

SP

F

WP

S

L

SW

O

C

r2

Total In

direct effect

FF

P

-2.1

9

_

3.7

8

-0.3

3

0.3

1

0.7

-1

.18

-0.1

3

0.0

5

-0.0

6

0.9

5

3.1

4

MF

P

4.8

7

-1.6

7

_

-2.3

6

0.2

0.4

6

-0.8

3

-0.0

9

0.1

1

-0.0

5

0.6

4

-4.2

3

MF

R

-3.3

2

-0.2

2

3.4

6

_

-0.0

1

-0.0

4

-0.0

4

0

0.1

4

-0.0

1

-0.0

4

3.2

8

NF

P

0.2

8

-2.4

2

3.5

0.1

4

_

0.8

-1

.25

-0.1

5

0.1

4

-0.0

5

0.9

9

0.7

1

NS

P

0.7

1

-2.1

7

3.2

0.1

6

0.3

2

_

-1.2

-0

.05

0.0

8

-0.0

6

0.9

9

0.2

8

FW

P

-1.1

4

-2.2

6

3.5

4

-0.1

2

0.3

0.7

5

_

-0.1

4

0.0

9

-0.0

4

0.9

8

2.1

2

SL

-0

.4

-0.7

1.1

0

0.1

0.0

8

-0.4

_

0.4

1

0

0.1

9

0.5

9

SW

0.5

7

-0.2

0.9

9

-0.8

0.0

7

0.1

-0

.2

-0.3

_

0

0.2

3

-0.3

4

OC

0.1

5

0.8

5

-1.5

7

0.2

3

-0.0

8

-0.2

2

0.3

4

0.0

2

-0.0

2

_

-0.3

0

-0.4

5

FF

P: fem

ale flow

ers/plan

t; MF

P: m

ale flow

ers/plan

t; MF

R: m

ale female ratio

; NF

P: n

um

ber o

f flow

ers/plan

t; NS

P: n

um

ber o

f seeds/p

lant;

FW

P: fru

it weig

ht/p

lant; S

WP

: seed w

eight/p

lant; S

L: seed

length

; SW

: seed w

idth

; OC

: oil co

nten

t

81

4.2 Development of large scale genomic derived SSRs from four

microsatellite enriched genomic libraries (two di-nucleotide and two tri-

nucleotide)

4.2.1 Development of SSRs from di-nucleotide enriched genomic libraries

The genomic SSRs were developed from genomic libraries enriched with CA (designated as

Lib A) and GA (designated as Lib B) repeat units. These libraries were custom made from

Genetic Identification Service (GIS), USA. The SSR enriched genomic library mixture were

electroporated into E. coli. The transformed clones were selected through blue-white

screening and plasmids were isolated for sequencing. More than 3500 clones were sequenced

and a total of 1740 and 1530 good quality sequences were selected from Lib A and B

respectively. The details are presented in Table 4.8.

Table 4.8 Summary of genomic SSRs developed from enriched libraries of J. curcas L.

Parameters Lib A Lib B Total

Clones sequenced 1740 1530 3270

Sequences containing SSR motifs 1385 (79.6%) 1345 (87.9%) 2730 (83.5%)

SSR containing unique sequences 639 (46.1%) 676(50.3%) 1315 (48.2%)

Primers designed 574(41.4%) 633(47.1%) 1207(44.2%)

Total number of SSRs identified 759 857 1616

Sequences with >1 SSRs 152 (26.5%) 191 (30.2%) 343 (28.4%)

Compound SSRs 115(20.0%) 154(24.0%) 269 (22.0%)

Perfect SSRs 455 (79.0%) 473 (75.0%) 928 (77.0%)

Interrupted SSRs 4 (1.0%) 6 (1.0%) 10 (1.0%)

A total of 1385 (79.6%) sequences from Lib A and 1530 (87.9%) sequences from Lib

B were found to have SSR motifs. After removing the redundant sequences, a total of 639

(46.1%) and 676 (50.3%) SSR containing sequences were found to be unique from Lib A and

B respectively. Now these unique sequences were subjected to primer designing and primers

were successfully designed for 574 (out of 639 of Lib A) SSR containing sequences. The

remaining 65 SSR containing sequences were not able to design primers due to either

marginal SSRs or flanking sequences were not suitable for primer designing criteria.

Likewise, out of 674 SSR containing sequences of Lib B, primers were designed for 633

sequences. Thus, from both the libraries a total of 1207 (44.2%) SSR primer pairs flanking

1616 SSRs were successfully designed and synthesized. From Lib A, 152 (26.5%) sequences

82

and from Lib B, 191 (30.2%) sequences were found to have >1 SSRs. In total, 343 (28.4%)

sequences were found to have >1 SSRs from the both libraries. Among the total SSRs

identified, the perfect SSRs were found to be 455 (79.0%) from Lib A and 473 (75.0%) from

Lib B accounting for a total of 928 (77.0%) from both the libraries. The interrupted SSRs

were found in very less amount. A total of only 10 SSRs (1%) were found to be interrupted

type including 4 (1%) from Lib A and 6 (1%) from Lib B (Table 4.8). These 1207 SSR

containing sequences were submitted to NCBI under GSS (Acc. No. JM427845 - JM429048).

The enrichment percentage was found to be higher for GA repeat motif (87.9%) than CA

repeat motif (79.6%). The SSR motif analysis of all the SSRs revealed that the di-nucleotide

repeats (DNR) were recovered in higher proportion from both the libraries (Figure 4.2) which

was as per expectation, since these libraries were enriched for dinucleotide repeat motif. The

Lib A produced 720 (45.55%) and Lib B produced 806 (49.87%) di-nucleotide repeat motif

SSRs. Apart from recovery of di-nucleotide repeat SSRs, some other SSR repeat types, such

as tri-nucleotide (TNR), tetra-nucleotide (TtNR), penta-nucleotide (PNR) and hexa-

nucleotide repeat (HNR) SSR also recovered though in very less numbers. From Lib A, 21

TNR, 13 TtNR, 2 PNR and 3HNR SSRs were identified. Likewise, from Lib B, 33 TNR,

10TtNR, 4 PNR and 4 HNR identified (Figure 4.2).

Figure 4.2 Histogram showing frequency of different types of SSRs repeat motif recovered.

720

21 13 2 3

806

33 10 4 4

0

100

200

300

400

500

600

700

800

900

DNR TNR TtNR PNR HNR

No. of

SS

Rs

Repeat motif types

Lib.A Lib.B

83

The SSR repeat length analysis revealed that the majority of identified SSRs i.e.

557(34.46%) from Lib A and 500 (30.94%) from Lib B were of short length varied from 4 to

10 repeat units (Figure 4.3). The 151 SSRs from Lib A and 245 from Lib B were found to be

in the range of 11-15 repeat units. The repeat unit ranged from 16 to 20 had only 42 and 84

SSRs from Lib A and Lib B respectively. There were only 37 SSRs (9 from Lib A and 28

from Lib B) were recovered having repeat length more than 20.

Figure 4.3 Histogram showing frequency of different repeat units recovered for Lib A and

Lib B.

The SSR motif types showed that the AC/GT and AG/CT motif, the targeted motif,

were recovered in higher frequency of 506 (32.5%) and 703 (45.3%) of Lib A and Lib B

respectively (Figure 4.4). Next to these motifs, the AT/AT SSR motifs were also recovered in

significant numbers (164 in Lib A and 68 in Lib B) among DNRs as compared to other non-

targeted motifs. Among TNR, the AAG/GTT and AAT/ATT SSR motif accounted for 23 and

21 respectively from both the libraries.

557

151

42

9

500

245

84

28

0

100

200

300

400

500

600

4-10 11-15 16-20 >20

No

. o

f S

SR

s

Repeat units

Lib.A

Lib.B

84

Figure 4.4 Histogram showing frequency of different repeat motifs recovered from

Lib A and Lib B.

4.2.2 Development of SSR markers from tri-nucleotide SSR enriched genomic libraries

The isolation of SSRs from tri-nucleotide repeat libraries was done with a different and new

approach using next generation sequencing technology. For the development of tri-nucleotide

SSR markers, the SSR enriched genomic libraries AAC (designated as Lib. C) and AAT

(designated as Lib. D) were sequenced using Roche 454 GS-FLX whole genome sequencer

on 1/8th

sequencing run. The 454 pyrosequencing resulted in 124,279 reads covering 29.3 Mb

size (Table 4.9) with more than 95% Q40 plus bases which represents about 7% of J. curcas

genome. The average read length was found 235.16 bases with longest one of 601 bases

(Figure 4.5).

506

47

164

3 0 6 10 1 1 1 2 1 4 0 2 1 3 0 33

703

68

2 1 17 11 1 1 1 1 2 2 1 1 2 0 2

0

100

200

300

400

500

600

700

No. of

SS

Rs

Repeat motifs

Lib.A

Lib.B

85

Figure 4.5 The average read length and unique sequence length

The assembly of raw reads generated 25,495 sequences (6.79 Mb) including 2,845

contigs and 22,650 singletons. A total of 72.62% reads and 73.32% bases were aligned. The

average contig size after assembly was found to be 584.54 bases. A total 933 contigs were

recovered with length of > 500 bp.

Table 4.9 Statistical details of 454 sequencing of SSR enriched libraries of J. curcas L.

Raw read statistics Numbers

Total reads generated 124279

Total bases generated (bases) 29226427

Avg. Read length (bases) 235.16

Longest Read length (bases) 601

Trashed Reads ( < 50 bp and low quality reads) 7940

Total reads used for assembly 116339

Details of assembly statistics

All Contigs (≥ 100 bp) 2845

Large contigs (≥ 500 bp) 933

Singletons 22650

Average contig size (bp) 584.54

% Reads Aligned 72.62

% Bases Aligned 73.32

%Q40 Plus basesb 94.56

a N50 corresponds to the length of the smallest contig in the set of largest contigs whose combined length

represents 50% of the total assembly size. b Percentage of bases called that have a quality score of 40 or above

All the assembled contigs and singletons were used for search of SSR motifs using the MISA

software. A total of 5,844 putative microsatellites were identified in 5142 sequences with

86

23% SSR recovery (Table 4.10). Out of 2,845 contigs, 155 had 190 SSRs (6.6%) and 4986

singleton had 5654 SSRs (24.9%). Six hundred nine sequences have more than one SSR loci

and 485 SSRs were found to be present in compound form. Primer pairs were successfully

designed for 1122 SSR containing sequences while the remaining sequences failed to primer

design either due to marginal SSRs or flanking sequences not suitable for primer designing

criteria. Finally, 1122 primer pairs were synthesized and used for validation and

characterization.

Table 4.10 Details of SSR search using MISA

Parameters Numbers

Total number of sequence examined 25495

Total number of identified SSRs 5844

Number of SSR containing sequences 5141

Number of SSR containing more than 1 SSRs 609

Number of SSRs present in compound form 485

Total Primers designed 1122

With respect to recovery of different types of SSR types, it was noticed that trinucleotide

repeat SSRs was found in maximum numbers (59%) (Figure 4.6a) which was as per

expectation as the libraries were enriched for TNR. The tetra-nucleotide was found to be

second highest (25%). The other repeat motifs i.e. DNR, PNR and HNR were recovered with

less than 6%. Among the TNR, the repeat motif AAT/ATT and AAG/CTT were found in

major frequency (73.2%) followed by ATC/ATG (7.0%) and ACC/GGT (6.0%) (Figure 4.6

b). Although, we have targeted TNR SSRs but significant amount of TtNR SSRs were also

recovered and among them AAAT/ATTT motif was found in maximum frequency (51.0%).

The AG/CT and AT/AT was also recovered among DNR SSRs. The repeat motif ACC/GGT,

ATC/ATG among TNR, AAAG/CTTT, AATT/AATT among TtNR were also recovered in

good number.

87

Figure 4.6 Histogram showing distribution of SSRs based on (a) Repeat types and (b) Repeat

motifs

Considering the repeat length of SSR motif, it was noticed that most of the SSRs

(5559; 59%) had short repeat length which varied in the range of 4-10 (Figure 4.7). Two

hundred twenty eight SSRs were found in the range of 11-15 and 41 were in the range of 16-

20. There were only 16 SSRs which had repeat length of more than 20.

88

Figure 4.7 Histogram showing repeat units of different types of SSRs recovered

4.3 PCR optimization, polymorphism detection and characterization of

developed SSRs for various attributes

4.3.1 PCR amplification optimization and polymorphism detection

The newly developed SSR primers were initially used for optimization PCR amplification

conditions to select the SSRs giving good quality amplification with expected amplicon size.

The PCR optimization and polymorphism detection was carried out with a set of 7

accessions of J. curcas and one accessions of J. integerrima. The 3 µl PCR products

were first checked on 1.5% Agarose gel for identification of amplified and non- amplified of

SSRs (Figure 4.8). The non-amplified SSR primers were selected and repeated the PCR

reaction with increasing or decreasing the annealing temperature and changing the master

mix or Taq polymerase concentrations. In this way, finally a total of 1089, out of 1207 SSRs

(Lib A and Lib B) and 1017 SSRs, out of 1122 SSRs (Lib C and Lib D) showed good

amplification with expected amplicon size were identified.

0

1000

2000

3000

4000

5000

6000

4-10 11-15 16-20 >20

5559

228 41 16

Nu

mb

er o

f S

SR

s

Repeat units

89

Figure 4.8 A representative 1.5% Agarose gel image showing PCR amplification and non-

amplification. Sample from first 3 accessions (out of 8) were loaded for each SSR primers.

Primer 1 from well No. (1-3), Primer 2 (4-6), Primer 3 (7-9), Primer 4 (10-12), Primer 5 (13-

15), Primer 6 (16-18), Primer 7 (19-21), Primer 8 (22-24), Primer 9 (25-27), Primer 10 (28-

30), Primer 11 (31-33) and Primer 12 (34-36). All primers amplified except primer number

10th

.

The SSR primers which showed amplification of the expected band size (checked on

agarose gel) were selected for the polymorphism detection. The polymorphism

identification was carried out using polyacrylamide gel electrophoresis (PAGE)

(Figure 4.9) and automated capillary electrophoresis DNA analyzer ABI 3730xl

(Figure 4.10). Out of 574 Lib A SSRs, 106 (18.5%) were found to be polymorphic

among 7 accessions of J. curcas. The 420 (73.2%) SSRs showed monomorphic band

and the rest 48 (8.3%) failed to amplify (Table 4.11). When data was analyzed

excluding the non-toxic accession (NBJC195), the polymorphism percentage was

further reduced to 9.6% (Table 4.11). The polymorphism screening panel included

parental lines for one intraspecific and one interspecific mapping population. The

parental polymorphism screening for intraspecific population (NBJC132 x NBJC195)

revealed that 71 (12.4%) from Lib A and 446 were monomorphic, while 57 were not

amplified in both the accessions. In case of the parents of interspecific population i.e.

Chhatrapati (NBJC147) x J.integerrima, 144 (25%) SSRs were polymorphic and 177

(30.8%) were monomorphic. Similarly, 251 and 2 SSRs were not amplified with J.

100 bp ladder 1 2 3 4 5 6 7 8 9 10 11 12

13 14 15 16 17 18 19 20 21 22 23 24

25 26 27 28 29 30 31 32 33 34 35 36

90

integerrima and Chhatrapati respectively. Furthermore, the results obtained with Lib

B SSRs showed that 70 SSRs did not amplified with any of the accessions of J. curcas

and out of the rest 563, 421(75.0%) were found to be monomorphic. The polymorphic

SSRs of Lib B were found to be 142 (25.0%) with 7 accessions of J. curcas. The

number of polymorphic SSRs reduced from 142 to 74 when accession NBJC195 (non-

toxic) was excluded from the analysis (Table 4.11). A total of 108 and 187 SSRs were

found to be polymorphic among intra-specific and interspecific mapping population

respectively.

Figure 4.9 A representative 6% non-denaturing polyacrylamide gel image showing

polymorphism

Figure 4.10 A representative snapshots from GeneMapper showing polymorphism. Arrow

showing polymorphic peak.

M

700 bp

500bp

400bp

300bp

200bp

150bp

100bp

1 2 3 4 5 6 7 8 9 10 11 12 13 14

91

Table 4.11 Polymorphism screening details of SSRs with different accessions of J. curcas

and mapping populations

Lib A SSRs Lib B SSRs

Polymorphic Monomorphic Fail Polymorphic Monomorphic Fail

Seven acc. of J. curcas 106 420 48 142 421 70

Six acc. J. curcas

(excluding NBJC195)

55 471 48 74 489 70

NBRI-J05 (NBJC132) x

EC643912 (NBJC195)*

71 446 57 108 450 75

Chhatrapati (NBJC147 x

J.integerrima#

144 177 253 187 228 218

* Parental lines of intraspecific mapping population

# Parental lines of inter-specific mapping population

Likewise, the polymorphism analysis of SSRs developed from Lib C and D showed

that out of 1122 SSRs, 447 (39.83%) were found to be polymorphic among 7 accessions of J.

curcas and 570 (50.80%) were monomorphic. The rest 105 (9.35%) SSRs either failed to

amplify or gave non-specific amplification (Table 4.12). The polymorphism analysis among

J. curcas and J. integerrima revealed that 273 (24%) SSR primers failed to amplify with J.

integerrima and 226 (20%) were found to be monomorphic. However, the polymorphic SSRs

increased from 447 to 623 (73%) with respect to J. integerrima and J.curcas (Table 4.12).

Table 4.12 Polymorphism details of 1122 SSRs developed from Lib C and D among two taxa

of Jatropha

In J.curcas In J.integerrima*

Failed/nonspecific 105 (9%) 273 (24%)

Monomorphic 570 (51%) 226 (20%)

Polymorphic 447 (40%) 623 (56%)

* Monomorphism and polymorphism with reference to J. curcas accessions

92

4.3.2 Statistical analysis and characterization of SSRs for various attributes

In addition to the polymorphism detection, the genotypic data of polymorphic SSRs with 7

accessions of J. curcas was also subjected to various statistical analyses to assess the

potential of newly developed SSRs for further genetic studies. The 106 polymorphic Lib A

SSRs showed different degree of variability at each locus as the number of alleles varied

from 2 to 5 (JGM_A281) with an average of 2.24±0.55 alleles/SSR (Table 4.13). The major

allele frequency of polymorphic SSRs varied from 0.43 to 0.93 with an average of 0.83±0.11.

The PIC values of these polymorphic SSRs ranged between 0.12-0.63 with an average of

0.23±0.11. The maximum PIC was noticed for JGM_A281 and JGM_A326 (0.63) followed

by JGM_A244 (0.55) and JGM_A107 (0.52). Most of the SSRs (85; 53.1%) showed lower

PIC value less than 0.3 (Figure 4.11). The rest 11 SSR PIC values were found to be in the

range of 0.30-0.40 and 6 SSRs were found between 0.51-0.60. There were only four SSRs

which had PIC value in the range of 0.51-0.70 (Figure 4.11). The observed heterozygosity

(Ho) varied between 0.00 and 1.00 with an average of 0.3±0.23 and expected heterozygosity

(He) or gene diversity varied from 0.12 to 0.65 with an average of 0.24±0.11 (Table.4.13).

The maximum heterozygosity (He) was noticed for the JGM_A244, JGM_A326 and

JGM_A577. The maximum gene diversity (0.65) was found for the JGM_A236

Table 4.13 Polymorphism features of 106 newly developed SSRs (Lib A) in J. curcas

S.N.

Marker

major allele

frequency Allele No.

Gene

Diversity Heterozygosity PIC

1. JGM_A102 0.86 2 0.21 0.00 0.21

2. JGM_A105 0.86 2 0.21 0.00 0.21

3. JGM_A107 0.57 4 0.51 0.14 0.52

4. JGM_A112 0.86 2 0.21 0.00 0.21

5. JGM_A113 0.79 3 0.32 0.14 0.33

6. JGM_A115 0.93 2 0.12 0.14 0.12

7. JGM_A116 0.86 2 0.21 0.00 0.21

8. JGM_A120 0.86 3 0.23 0.14 0.24

9. JGM_A121 0.86 2 0.21 0.00 0.21

10. JGM_A124 0.80 2 0.26 0.00 0.27

11. JGM_A125 0.93 2 0.12 0.14 0.12

12. JGM_A132 0.86 2 0.21 0.00 0.21

13. JGM_A141 0.86 2 0.21 0.00 0.21

14. JGM_A143 0.92 2 0.14 0.17 0.14

15. JGM_A146 0.79 3 0.32 0.14 0.33

16. JGM_A150 0.86 2 0.21 0.00 0.21

17. JGM_A151 0.71 3 0.38 0.00 0.41

18. JGM_A159 0.86 2 0.21 0.00 0.21

93

19. JGM_A160 0.86 2 0.21 0.00 0.21

20. JGM_A162 0.86 2 0.21 0.00 0.21

21. JGM_A163 0.86 2 0.21 0.00 0.21

22. JGM_A164 0.86 2 0.21 0.00 0.21

23. JGM_A172 0.79 3 0.32 0.14 0.33

24. JGM_A184 0.83 2 0.23 0.00 0.24

25. JGM_A188 0.79 3 0.32 0.14 0.33

26. JGM_A197 0.86 2 0.21 0.00 0.21

27. JGM_A203 0.86 2 0.21 0.00 0.21

28. JGM_A207 0.86 2 0.21 0.00 0.21

29. JGM_A212 0.86 2 0.21 0.00 0.21

30. JGM_A215 0.83 2 0.23 0.00 0.24

31. JGM_A219 0.86 2 0.21 0.00 0.21

32. JGM_A222 0.93 2 0.12 0.14 0.12

33. JGM_A224 0.86 2 0.21 0.00 0.21

34. JGM_A243 0.93 2 0.12 0.14 0.12

35. JGM_A244 0.43 4 0.60 1.00 0.55

36. JGM_A250 0.86 2 0.21 0.00 0.21

37. JGM_A259 0.86 2 0.21 0.00 0.21

38. JGM_A264 0.86 2 0.21 0.00 0.21

39. JGM_A266 0.86 3 0.23 0.14 0.24

40. JGM_A267 0.83 2 0.23 0.00 0.24

41. JGM_A270 0.86 2 0.21 0.00 0.21

42. JGM_A271 0.93 2 0.12 0.14 0.12

43. JGM_A277 0.86 2 0.21 0.00 0.21

44. JGM_A279 0.64 2 0.44 0.71 0.35

45. JGM_A281 0.50 5 0.62 0.71 0.63

46. JGM_A288 0.71 2 0.35 0.00 0.32

47. JGM_A290 0.93 2 0.12 0.14 0.12

48. JGM_A310 0.93 2 0.12 0.14 0.12

49. JGM_A322 0.93 2 0.12 0.14 0.12

50. JGM_A323 0.83 3 0.26 0.17 0.27

51. JGM_A326 0.43 4 0.65 1.00 0.63

52. JGM_A327 0.79 3 0.32 0.14 0.33

53. JGM_A328 0.86 2 0.21 0.00 0.21

54. JGM_A329 0.64 3 0.47 0.43 0.46

55. JGM_A330 0.93 2 0.12 0.14 0.12

56. JGM_A331 0.57 2 0.48 0.86 0.37

57. JGM_A334 0.86 2 0.21 0.00 0.21

58. JGM_A342 0.93 2 0.12 0.14 0.12

59. JGM_A351 0.93 2 0.12 0.14 0.12

60. JGM_A361 0.86 2 0.21 0.00 0.21

61. JGM_A372 0.86 2 0.21 0.00 0.21

62. JGM_A380 0.86 2 0.21 0.00 0.21

63. JGM_A384 0.93 2 0.12 0.14 0.12

64. JGM_A387 0.93 2 0.12 0.14 0.12

65. JGM_A390 0.93 2 0.12 0.14 0.12

94

66. JGM_A392 0.93 2 0.12 0.14 0.12

67. JGM_A399 0.93 2 0.12 0.14 0.12

68. JGM_A401 0.93 2 0.12 0.14 0.12

69. JGM_A406 0.86 2 0.23 0.29 0.21

70. JGM_A411 0.86 2 0.21 0.00 0.21

71. JGM_A424 0.86 2 0.21 0.00 0.21

72. JGM_A427 0.86 3 0.23 0.14 0.24

73. JGM_A428 0.79 2 0.32 0.43 0.28

74. JGM_A430 0.86 2 0.21 0.00 0.21

75. JGM_A434 0.86 2 0.21 0.00 0.21

76. JGM_A439 0.86 2 0.23 0.29 0.21

77. JGM_A444 0.93 2 0.12 0.14 0.12

78. JGM_A445 0.93 2 0.12 0.14 0.12

79. JGM_A451 0.86 2 0.21 0.00 0.21

80. JGM_A464 0.71 3 0.38 0.00 0.41

81. JGM_A468 0.86 2 0.21 0.00 0.21

82. JGM_A472 0.93 2 0.12 0.14 0.12

83. JGM_A475 0.71 3 0.38 0.00 0.41

84. JGM_A476 0.86 2 0.21 0.00 0.21

85. JGM_A484 0.86 2 0.21 0.00 0.21

86. JGM_A490 0.86 2 0.21 0.00 0.21

87. JGM_A500 0.86 2 0.21 0.00 0.21

88. JGM_A506 0.86 2 0.21 0.00 0.21

89. JGM_A513 0.86 2 0.21 0.00 0.21

90. JGM_A521 0.83 2 0.23 0.00 0.24

91. JGM_A536 0.86 2 0.21 0.00 0.21

92. JGM_A540 0.93 2 0.12 0.14 0.12

93. JGM_A553 0.86 2 0.21 0.00 0.21

94. JGM_A572 0.86 2 0.21 0.00 0.21

95. JGM_A577 0.50 3 0.55 1.00 0.46

96. JGM_A604 0.86 2 0.21 0.00 0.21

97. JGM_A615 0.86 2 0.21 0.00 0.21

98. JGM_A631 0.86 2 0.21 0.00 0.21

99. JGM_A632 0.57 2 0.48 0.86 0.37

100. JGM_A639 0.93 2 0.12 0.14 0.12

101. JGM_A650 0.64 3 0.47 0.43 0.46

102. JGM_A653 0.79 3 0.32 0.14 0.33

103. JGM_A654 0.86 2 0.21 0.00 0.21

104. JGM_A656 0.79 3 0.33 0.43 0.33

105. JGM_A658 0.86 2 0.21 0.00 0.21

106. JGM_A675 0.86 2 0.21 0.00 0.21

Range 0.43-0.93 2.0-5.0 0.12-0.65 0.00-1.00 0.12-0.63

Average±SD 0.83±0.11 2.25±0.55 0.24±0.11 0.13±0.23 0.23±0.11

95

The detailed analysis of 142 polymorphic Lib B SSRs revealed considerable

variability at each locus and showed allele range from 2 to 5 with an average of 2.42±0.62

alleles/SSR. The PIC value varied between 0.12-0.62 with an average of 0.28±0.13 per SSR

(Table 4.14). Also in case of Lib B, most of the SSRs (92; 65%) had lower PIC value (< 0.30)

(Figure 4.11). The PIC values of 19 SSRs were found between the range of 0.30-0.40 and 22

SSRs were found between 0.41-0.50. There were only 9 SSRs which showed PIC value in the

range of 0.51-0.70. The maximum PIC value was observed for JGM_B300 (0.62) followed

by JGM_B361 (0.58), JGM_B595 (0.55) and JGM_B176 (0.55). Major allele frequency of

polymorphic SSRs of Lib B ranged from 0.43 to 0.93 with an average of 0.78±0.15. The

observed heterozygosity (Ho) varied between 0.00 and 1.00 with an average of 0.31±0.21 and

expected heterozygosity (He) or gene diversity varied from 0.12 to 0.65 with an average of

0.29±0.14 (Table 4.14).

Table 4.14 Polymorphism features of 142 newly developed SSRs (Lib B) in Jatropha curcas

S.N. Marker Major

allele

frequency

Allele No. Gene

Diversity

Heteroz-

ygosity

PIC

1. JGM_B008 0.71 2 0.35 0.00 0.32

2. JGM_B010 0.71 2 0.35 0.00 0.32

3. JGM_B012 0.86 2 0.21 0.00 0.21

4. JGM_B013 0.93 2 0.12 0.14 0.12

5. JGM_B016 0.86 2 0.21 0.00 0.21

6. JGM_B017 0.86 2 0.21 0.00 0.21

7. JGM_B019 0.57 3 0.49 0.00 0.50

8. JGM_B020 0.43 3 0.52 0.00 0.53

9. JGM_B021 0.93 2 0.12 0.14 0.12

10. JGM_B022 0.93 2 0.12 0.14 0.12

11. JGM_B024 0.86 2 0.21 0.00 0.21

12. JGM_B026 0.71 2 0.35 0.00 0.32

13. JGM_B028 0.86 2 0.21 0.00 0.21

14. JGM_B030 0.64 4 0.49 0.57 0.48

15. JGM_B033 0.57 3 0.49 0.00 0.50

16. JGM_B034 0.71 2 0.35 0.00 0.32

17. JGM_B035 0.86 2 0.21 0.00 0.21

18. JGM_B038 0.86 2 0.21 0.00 0.21

19. JGM_B041 0.71 2 0.35 0.00 0.32

20. JGM_B043 0.93 2 0.12 0.14 0.12

21. JGM_B044 0.93 2 0.12 0.14 0.12

22. JGM_B047 0.86 2 0.21 0.00 0.21

23. JGM_B054 0.71 2 0.35 0.00 0.32

24. JGM_B062 0.71 3 0.38 0.00 0.41

25. JGM_B063 0.86 2 0.21 0.00 0.21

96

26. JGM_B065 0.86 2 0.21 0.00 0.21

27. JGM_B076 0.86 2 0.21 0.00 0.21

28. JGM_B094 0.86 2 0.21 0.00 0.21

29. JGM_B132 0.86 2 0.21 0.00 0.21

30. JGM_B161 0.86 2 0.21 0.00 0.21

31. JGM_B167 0.79 3 0.32 0.14 0.33

32. JGM_B176 0.57 4 0.54 0.43 0.55

33. JGM_B185 0.57 3 0.51 0.29 0.50

34. JGM_B189 0.86 2 0.21 0.00 0.21

35. JGM_B190 0.93 2 0.12 0.14 0.12

36. JGM_B191 0.57 3 0.54 0.71 0.50

37. JGM_B196 0.86 2 0.21 0.00 0.21

38. JGM_B199 0.64 4 0.49 0.43 0.50

39. JGM_B201 0.71 2 0.35 0.00 0.32

40. JGM_B203 0.64 3 0.47 0.43 0.46

41. JGM_B204 0.86 2 0.21 0.00 0.21

42. JGM_B205 0.93 2 0.12 0.14 0.12

43. JGM_B207 0.86 3 0.24 0.29 0.24

44. JGM_B211 0.86 2 0.21 0.00 0.21

45. JGM_B212 0.86 2 0.21 0.00 0.21

46. JGM_B213 0.79 3 0.32 0.14 0.33

47. JGM_B215 0.71 2 0.35 0.00 0.32

48. JGM_B216 0.86 2 0.21 0.00 0.21

49. JGM_B220 0.86 2 0.21 0.00 0.21

50. JGM_B222 0.71 3 0.41 0.57 0.39

51. JGM_B223 0.93 2 0.12 0.14 0.12

52. JGM_B231 0.86 2 0.21 0.00 0.21

53. JGM_B234 0.50 3 0.55 1.00 0.46

54. JGM_B235 0.92 2 0.14 0.17 0.14

55. JGM_B236 0.64 3 0.48 0.71 0.43

56. JGM_B237 0.86 3 0.23 0.14 0.24

57. JGM_B238 0.50 3 0.51 0.43 0.46

58. JGM_B240 0.86 2 0.21 0.00 0.21

59. JGM_B242 0.93 2 0.12 0.14 0.12

60. JGM_B246 0.79 4 0.33 0.29 0.35

61. JGM_B247 0.64 2 0.44 0.71 0.35

62. JGM_B248 0.57 4 0.55 0.71 0.52

63. JGM_B251 0.79 2 0.32 0.43 0.28

64. JGM_B254 0.86 3 0.24 0.29 0.24

65. JGM_B256 0.79 3 0.32 0.14 0.33

66. JGM_B257 0.71 3 0.38 0.00 0.41

67. JGM_B268 0.93 2 0.12 0.14 0.12

68. JGM_B273 0.86 2 0.21 0.00 0.21

69. JGM_B276 0.86 2 0.21 0.00 0.21

70. JGM_B277 0.86 2 0.21 0.00 0.21

71. JGM_B280 0.86 2 0.21 0.00 0.21

72. JGM_B282 0.93 2 0.12 0.14 0.12

97

73. JGM_B284 0.86 2 0.21 0.00 0.21

74. JGM_B285 0.50 3 0.55 1.00 0.46

75. JGM_B287 0.93 2 0.12 0.14 0.12

76. JGM_B288 0.86 2 0.21 0.00 0.21

77. JGM_B290 0.75 2 0.32 0.17 0.30

78. JGM_B291 0.50 3 0.55 1.00 0.46

79. JGM_B292 0.86 3 0.24 0.29 0.24

80. JGM_B294 0.93 2 0.12 0.14 0.12

81. JGM_B297 0.86 2 0.21 0.00 0.21

82. JGM_B300 0.43 5 0.65 1.00 0.62

83. JGM_B316 0.86 2 0.21 0.00 0.21

84. JGM_B318 0.43 3 0.52 0.00 0.53

85. JGM_B323 0.86 2 0.21 0.00 0.21

86. JGM_B329 0.93 2 0.12 0.14 0.12

87. JGM_B330 0.43 3 0.58 0.86 0.53

88. JGM_B332 0.86 2 0.21 0.00 0.21

89. JGM_B334 0.86 3 0.23 0.14 0.24

90. JGM_B361 0.50 4 0.60 0.71 0.58

91. JGM_B363 0.86 2 0.21 0.00 0.21

92. JGM_B368 0.50 3 0.55 1.00 0.46

93. JGM_B373 0.50 3 0.55 1.00 0.46

94. JGM_B393 0.79 3 0.32 0.14 0.33

95. JGM_B395 0.86 2 0.21 0.00 0.21

96. JGM_B396 0.86 2 0.21 0.00 0.21

97. JGM_B431 0.86 2 0.21 0.00 0.21

98. JGM_B433 0.71 3 0.38 0.00 0.41

99. JGM_B438 0.93 2 0.12 0.14 0.12

100. JGM_B439 0.79 3 0.33 0.43 0.33

101. JGM_B440 0.79 3 0.33 0.43 0.33

102. JGM_B443 0.86 3 0.23 0.14 0.24

103. JGM_B455 0.86 2 0.21 0.00 0.21

104. JGM_B456 0.93 2 0.12 0.14 0.12

105. JGM_B461 0.93 2 0.12 0.14 0.12

106. JGM_B463 0.86 2 0.21 0.00 0.21

107. JGM_B467 0.71 2 0.35 0.00 0.32

108. JGM_B469 0.93 2 0.12 0.14 0.12

109. JGM_B472 0.79 3 0.32 0.14 0.33

110. JGM_B479 0.86 2 0.21 0.00 0.21

111. JGM_B481 0.86 2 0.21 0.00 0.21

112. JGM_B483 0.86 2 0.21 0.00 0.21

113. JGM_B486 0.86 2 0.21 0.00 0.21

114. JGM_B491 0.50 3 0.55 1.00 0.46

115. JGM_B492 0.50 3 0.55 1.00 0.46

116. JGM_B499 0.86 2 0.21 0.00 0.21

117. JGM_B501 0.86 2 0.21 0.00 0.21

118. JGM_B503 0.93 2 0.12 0.14 0.12

119. JGM_B507 0.50 3 0.55 1.00 0.46

98

120. JGM_B512 0.86 2 0.21 0.00 0.21

121. JGM_B516 0.93 2 0.12 0.14 0.12

122. JGM_B518 0.86 3 0.24 0.29 0.24

123. JGM_B524 0.86 3 0.23 0.14 0.24

124. JGM_B527 0.86 2 0.21 0.00 0.21

125. JGM_B534 0.50 3 0.55 1.00 0.46

126. JGM_B548 0.86 2 0.21 0.00 0.21

127. JGM_B558 0.93 2 0.12 0.14 0.12

128. JGM_B571 0.86 2 0.21 0.00 0.21

129. JGM_B585 0.93 2 0.12 0.14 0.12

130. JGM_B586 0.79 2 0.32 0.43 0.28

131. JGM_B589 0.86 2 0.21 0.00 0.21

132. JGM_B590 0.86 3 0.23 0.14 0.24

133. JGM_B594 0.86 3 0.23 0.14 0.24

134. JGM_B595 0.43 4 0.60 1.00 0.55

135. JGM_B603 0.86 2 0.21 0.00 0.21

136. JGM_B617 0.93 2 0.12 0.14 0.12

137. JGM_B620 0.86 3 0.23 0.14 0.24

138. JGM_B623 0.86 3 0.24 0.29 0.24

139. JGM_B625 0.86 2 0.21 0.00 0.21

140. JGM_B627 0.50 3 0.55 1.00 0.46

141. JGM_B629 0.79 2 0.30 0.14 0.28

142. JGM_B631 0.43 3 0.58 0.86 0.53

Range 0.43-0.93 2.0-5.0 0.12-0.65 0.00-1.00 0.12-0.62

Average±SD 0.78±0.15 2.43±0.62 0.29±0.14 0.21±0.31 0.28±0.13

Figure 4.11 PIC distribution of 248 polymorphic SSR loci (106 from Lib A and 142 from

Lib B) calculated from 7 accessions of J. curcas including non-toxic accession

23

62

11

6 2 2

24

68

19 22

8

1

0

10

20

30

40

50

60

70

80

0.10-0.20 0.21-0.30 0.30-0.40 0.41-0.50 0.51-0.60 0.61-0.70

No

. o

f p

oly

mo

rph

ic S

SR

s

Range of PIC value

Lib A

Lib B

99

The number of alleles of 447 polymorphic SSRs from Lib C and D varied between 2

to 9 with an average of 2.7±1.18 per markers (Table 4.15). Majority of the markers i.e. 57%

produced two alleles followed by three (23%) and four alleles (10%). There were only two

markers JGM_CD797 and JGM_CD928 which produced 8 and 9 alleles respectively. The

PIC value ranged from 0.12 to 0.85 (average 0.34±0.17). Majority of SSRs (211, 47%)

showed low PIC value (<0.30) (Figure 4.12). Approximately half of the SSRs (49.2%)

showed moderate range of PIC values. However, only 4% SSRs showed higher PIC value in

the range of 0.71-0.90. The 68 (15.21%) SSRs were having PIC value between 0.11-0.20.

The maximum number of SSRs i.e.143 (32%) were having PIC value ranging between 0.21-

0.30 followed by 98 (21.92%) which ranged between 0.31-0.40. The 56 SSRs showed PIC

value in the range of 41.0-0.50 and 38 (8.50%) in the range of 0.51-0.60. The 28 (6.26%)

SSRs had PIC value between 0.61-0.70 and 14 were found in the range between 0.71-0.80.

Only 2 (0.44%) SSR were found to have maximum PIC value ranged between 0.81-0.90. The

observed heterozygosity (Ho) and expected heterozygosity (He) or gene diversity varied in

the range of 0.00-1.0 (0.36±0.34) and 0.12 – 0.78 (0.35±0.17) respectively. The major allele

frequency of polymorphic SSR varied from 0.21 to 0.93 with an average of 0.71±0.19.

Table 4.15 Polymorphism features of 247 polymorphic SSRs developed from Lib C and D in

J. curcas

S.N. Markers Major allele

frequency

Allele No Gene

Diversity

Heteroz-

ygosity

PIC

1 JGM_CD002 0.86 3 0.23 0.14 0.24

2 JGM_CD009 0.86 2 0.21 0.00 0.21

3 JGM_CD016 0.50 2 0.50 1.00 0.38

4 JGM_CD022 0.86 2 0.21 0.00 0.21

5 JGM_CD023 0.50 2 0.50 1.00 0.38

6 JGM_CD024 0.43 3 0.58 0.86 0.53

7 JGM_CD025 0.86 3 0.23 0.14 0.24

8 JGM_CD026 0.43 3 0.58 0.86 0.53

9 JGM_CD027 0.86 2 0.21 0.00 0.21

10 JGM_CD028 0.64 6 0.51 0.43 0.54

11 JGM_CD029 0.93 2 0.12 0.14 0.12

12 JGM_CD032 0.50 2 0.50 1.00 0.38

13 JGM_CD033 0.64 2 0.40 0.14 0.35

14 JGM_CD035 0.86 3 0.24 0.29 0.24

15 JGM_CD040 0.50 2 0.50 1.00 0.38

16 JGM_CD041 0.50 3 0.51 0.43 0.46

17 JGM_CD044 0.50 2 0.50 1.00 0.38

18 JGM_CD045 0.50 2 0.50 1.00 0.38

100

19 JGM_CD046 0.50 2 0.50 1.00 0.38

20 JGM_CD062 0.86 3 0.23 0.14 0.24

21 JGM_CD064 0.86 2 0.21 0.00 0.21

22 JGM_CD066 0.50 2 0.50 1.00 0.38

23 JGM_CD071 0.86 2 0.21 0.00 0.21

24 JGM_CD073 0.50 2 0.50 1.00 0.38

25 JGM_CD076 0.71 4 0.40 0.14 0.43

26 JGM_CD077 0.64 4 0.47 0.29 0.48

27 JGM_CD079 0.86 3 0.23 0.14 0.24

28 JGM_CD083 0.50 2 0.50 1.00 0.38

29 JGM_CD091 0.71 2 0.35 0.00 0.32

30 JGM_CD092 0.86 2 0.21 0.00 0.21

31 JGM_CD102 0.50 2 0.50 1.00 0.38

32 JGM_CD104 0.86 2 0.21 0.00 0.21

33 JGM_CD105 0.86 2 0.21 0.00 0.21

34 JGM_CD109 0.86 3 0.23 0.14 0.24

35 JGM_CD116 0.71 2 0.35 0.00 0.32

36 JGM_CD128 0.86 2 0.21 0.00 0.21

37 JGM_CD134 0.86 2 0.21 0.00 0.21

38 JGM_CD146 0.93 2 0.12 0.14 0.12

39 JGM_CD147 0.86 3 0.23 0.14 0.24

40 JGM_CD150 0.86 2 0.21 0.00 0.21

41 JGM_CD151 0.86 2 0.23 0.29 0.21

42 JGM_CD152 0.86 3 0.24 0.29 0.24

43 JGM_CD154 0.86 2 0.21 0.00 0.21

44 JGM_CD155 0.64 3 0.44 0.14 0.43

45 JGM_CD172 0.86 2 0.21 0.00 0.21

46 JGM_CD182 0.86 2 0.21 0.00 0.21

47 JGM_CD191 0.86 2 0.21 0.00 0.21

48 JGM_CD193 0.86 2 0.21 0.00 0.21

49 JGM_CD197 0.86 2 0.21 0.00 0.21

50 JGM_CD200 0.50 2 0.50 1.00 0.38

51 JGM_CD201 0.50 2 0.50 1.00 0.38

52 JGM_CD202 0.86 2 0.21 0.00 0.21

53 JGM_CD203 0.86 2 0.21 0.00 0.21

54 JGM_CD209 0.86 2 0.21 0.00 0.21

55 JGM_CD214 0.86 3 0.23 0.14 0.24

56 JGM_CD215 0.86 2 0.21 0.00 0.21

57 JGM_CD216 0.86 2 0.21 0.00 0.21

58 JGM_CD217 0.36 4 0.65 0.71 0.65

59 JGM_CD224 0.86 2 0.21 0.00 0.21

60 JGM_CD229 0.86 2 0.21 0.00 0.21

61 JGM_CD231 0.50 2 0.50 1.00 0.38

62 JGM_CD244 0.50 2 0.50 1.00 0.38

63 JGM_CD249 0.50 2 0.50 1.00 0.38

101

64 JGM_CD278 0.86 2 0.21 0.00 0.21

65 JGM_CD284 0.50 3 0.55 1.00 0.46

66 JGM_CD285 0.50 2 0.50 1.00 0.38

67 JGM_CD302 0.50 2 0.50 1.00 0.38

68 JGM_CD307 0.50 2 0.50 1.00 0.38

69 JGM_CD310 0.86 3 0.23 0.14 0.24

70 JGM_CD323 0.93 2 0.12 0.14 0.12

71 JGM_CD325 0.86 2 0.21 0.00 0.21

72 JGM_CD329 0.71 3 0.38 0.00 0.41

73 JGM_CD330 0.86 2 0.21 0.00 0.21

74 JGM_CD338 0.71 3 0.38 0.00 0.41

75 JGM_CD345 0.86 2 0.21 0.00 0.21

76 JGM_CD346 0.86 2 0.21 0.00 0.21

77 JGM_CD351 0.86 2 0.21 0.00 0.21

78 JGM_CD353 0.86 2 0.21 0.00 0.21

79 JGM_CD359 0.86 2 0.21 0.00 0.21

80 JGM_CD362 0.86 2 0.21 0.00 0.21

81 JGM_CD367 0.50 2 0.50 1.00 0.38

82 JGM_CD368 0.50 2 0.50 1.00 0.38

83 JGM_CD370 0.43 5 0.65 1.00 0.62

84 JGM_CD386 0.71 3 0.40 0.43 0.39

85 JGM_CD387 0.57 3 0.52 0.86 0.45

86 JGM_CD388 0.93 2 0.12 0.14 0.12

87 JGM_CD391 0.93 2 0.12 0.14 0.12

88 JGM_CD394 0.57 2 0.48 0.86 0.37

89 JGM_CD404 0.93 2 0.12 0.14 0.12

90 JGM_CD406 0.64 3 0.48 0.57 0.46

91 JGM_CD411 0.93 2 0.12 0.14 0.12

92 JGM_CD414 0.71 3 0.40 0.29 0.41

93 JGM_CD416 0.86 2 0.21 0.00 0.21

94 JGM_CD418 0.43 4 0.60 1.00 0.55

95 JGM_CD420 0.64 4 0.49 0.43 0.50

96 JGM_CD421 0.86 2 0.21 0.00 0.21

97 JGM_CD422 0.64 5 0.51 0.57 0.52

98 JGM_CD423 0.64 2 0.44 0.71 0.35

99 JGM_CD425 0.71 2 0.35 0.00 0.32

100 JGM_CD432 0.57 2 0.48 0.86 0.37

101 JGM_CD433 0.86 2 0.21 0.00 0.21

102 JGM_CD435 0.71 3 0.41 0.57 0.39

103 JGM_CD445 0.93 2 0.12 0.14 0.12

104 JGM_CD452 0.86 2 0.21 0.00 0.21

105 JGM_CD453 0.86 2 0.21 0.00 0.21

106 JGM_CD454 0.71 3 0.38 0.00 0.41

107 JGM_CD455 0.71 4 0.41 0.29 0.43

108 JGM_CD457 0.50 3 0.56 0.71 0.52

102

109 JGM_CD461 0.50 3 0.55 1.00 0.46

110 JGM_CD463 0.57 3 0.51 0.29 0.50

111 JGM_CD464 0.86 2 0.23 0.29 0.21

112 JGM_CD467 0.71 3 0.38 0.00 0.41

113 JGM_CD469 0.36 4 0.65 0.71 0.65

114 JGM_CD470 0.93 2 0.12 0.14 0.12

115 JGM_CD476 0.71 2 0.39 0.57 0.32

116 JGM_CD477 0.93 2 0.12 0.14 0.12

117 JGM_CD480 0.86 2 0.21 0.00 0.21

118 JGM_CD481 0.86 2 0.21 0.00 0.21

119 JGM_CD486 0.50 3 0.56 0.71 0.52

120 JGM_CD494 0.93 2 0.12 0.14 0.12

121 JGM_CD505 0.86 2 0.21 0.00 0.21

122 JGM_CD510 0.64 2 0.42 0.43 0.35

123 JGM_CD511 0.79 3 0.32 0.14 0.33

124 JGM_CD513 0.57 2 0.48 0.86 0.37

125 JGM_CD515 0.93 2 0.12 0.14 0.12

126 JGM_CD516 0.79 2 0.30 0.14 0.28

127 JGM_CD517 0.93 2 0.12 0.14 0.12

128 JGM_CD518 0.86 2 0.23 0.29 0.21

129 JGM_CD519 0.86 2 0.21 0.00 0.21

130 JGM_CD521 0.86 2 0.21 0.00 0.21

131 JGM_CD522 0.93 2 0.12 0.14 0.12

132 JGM_CD523 0.93 2 0.12 0.14 0.12

133 JGM_CD525 0.93 2 0.12 0.14 0.12

134 JGM_CD538 0.86 2 0.21 0.00 0.21

135 JGM_CD541 0.50 2 0.50 1.00 0.38

136 JGM_CD543 0.93 2 0.12 0.14 0.12

137 JGM_CD544 0.43 3 0.58 0.86 0.53

138 JGM_CD546 0.86 2 0.23 0.29 0.21

139 JGM_CD547 0.50 3 0.56 0.71 0.52

140 JGM_CD559 0.93 2 0.12 0.14 0.12

141 JGM_CD560 0.50 2 0.50 1.00 0.38

142 JGM_CD564 0.93 2 0.12 0.14 0.12

143 JGM_CD566 0.86 2 0.21 0.00 0.21

144 JGM_CD567 0.86 2 0.21 0.00 0.21

145 JGM_CD575 0.93 2 0.12 0.14 0.12

146 JGM_CD580 0.86 2 0.23 0.29 0.21

147 JGM_CD582 0.79 2 0.30 0.14 0.28

148 JGM_CD584 0.93 2 0.12 0.14 0.12

149 JGM_CD587 0.50 2 0.50 1.00 0.38

150 JGM_CD593 0.86 2 0.21 0.00 0.21

151 JGM_CD598 0.86 2 0.23 0.29 0.21

152 JGM_CD600 0.64 4 0.50 0.57 0.50

153 JGM_CD601 0.64 3 0.47 0.43 0.46

103

154 JGM_CD602 0.86 2 0.21 0.00 0.21

155 JGM_CD605 0.57 2 0.46 0.57 0.37

156 JGM_CD610 0.86 2 0.21 0.00 0.21

157 JGM_CD611 0.57 5 0.55 0.29 0.59

158 JGM_CD616 0.93 2 0.12 0.14 0.12

159 JGM_CD617 0.86 2 0.21 0.00 0.21

160 JGM_CD618 0.79 3 0.33 0.43 0.33

161 JGM_CD619 0.71 4 0.41 0.29 0.43

162 JGM_CD620 0.86 3 0.23 0.14 0.24

163 JGM_CD621 0.79 3 0.32 0.14 0.33

164 JGM_CD622 0.93 2 0.12 0.14 0.12

165 JGM_CD623 0.93 2 0.12 0.14 0.12

166 JGM_CD624 0.50 3 0.55 1.00 0.46

167 JGM_CD628 0.57 3 0.51 0.71 0.45

168 JGM_CD630 0.67 2 0.30 0.00 0.35

169 JGM_CD631 0.79 3 0.32 0.29 0.33

170 JGM_CD632 0.36 5 0.67 0.86 0.67

171 JGM_CD633 0.57 3 0.47 0.14 0.45

172 JGM_CD634 0.86 2 0.21 0.00 0.21

173 JGM_CD635 0.50 3 0.54 0.43 0.52

174 JGM_CD636 0.50 2 0.50 1.00 0.38

175 JGM_CD640 0.57 2 0.48 0.86 0.37

176 JGM_CD643 0.86 2 0.21 0.00 0.21

177 JGM_CD644 0.38 4 0.57 0.50 0.63

178 JGM_CD645 0.50 4 0.60 0.71 0.58

179 JGM_CD648 0.50 4 0.60 0.43 0.62

180 JGM_CD649 0.93 2 0.12 0.14 0.12

181 JGM_CD650 0.29 6 0.73 0.71 0.77

182 JGM_CD651 0.86 2 0.23 0.29 0.21

183 JGM_CD659 0.93 2 0.12 0.14 0.12

184 JGM_CD663 0.50 2 0.50 1.00 0.38

185 JGM_CD665 0.50 3 0.54 0.86 0.46

186 JGM_CD674 0.93 2 0.12 0.14 0.12

187 JGM_CD675 0.36 4 0.66 0.86 0.65

188 JGM_CD676 0.86 2 0.21 0.00 0.21

189 JGM_CD677 0.86 2 0.23 0.29 0.21

190 JGM_CD678 0.33 4 0.63 0.33 0.67

191 JGM_CD681 0.86 2 0.21 0.00 0.21

192 JGM_CD683 0.64 2 0.44 0.71 0.35

193 JGM_CD684 0.86 2 0.23 0.29 0.21

194 JGM_CD687 0.86 2 0.21 0.00 0.21

195 JGM_CD688 0.43 3 0.58 0.86 0.53

196 JGM_CD691 0.88 2 0.19 0.25 0.19

197 JGM_CD697 0.57 2 0.42 0.00 0.37

198 JGM_CD698 0.79 3 0.32 0.14 0.33

104

199 JGM_CD700 0.50 5 0.62 0.71 0.63

200 JGM_CD706 0.86 2 0.21 0.00 0.21

201 JGM_CD712 0.64 2 0.44 0.71 0.35

202 JGM_CD714 0.86 2 0.21 0.00 0.21

203 JGM_CD716 0.75 2 0.28 0.00 0.30

204 JGM_CD719 0.86 2 0.21 0.00 0.21

205 JGM_CD724 0.93 2 0.12 0.14 0.12

206 JGM_CD725 0.93 2 0.12 0.14 0.12

207 JGM_CD727 0.71 2 0.39 0.57 0.32

208 JGM_CD734 0.86 2 0.21 0.00 0.21

209 JGM_CD740 0.93 2 0.12 0.14 0.12

210 JGM_CD741 0.71 2 0.39 0.57 0.32

211 JGM_CD745 0.50 3 0.56 1.00 0.51

212 JGM_CD746 0.93 2 0.12 0.14 0.12

213 JGM_CD747 0.50 2 0.25 0.00 0.38

214 JGM_CD748 0.93 2 0.12 0.14 0.12

215 JGM_CD749 0.64 2 0.44 0.71 0.35

216 JGM_CD750 0.71 3 0.40 0.43 0.39

217 JGM_CD758 0.86 2 0.21 0.00 0.21

218 JGM_CD760 0.93 2 0.12 0.14 0.12

219 JGM_CD762 0.36 5 0.71 1.00 0.72

220 JGM_CD766 0.86 2 0.21 0.00 0.21

221 JGM_CD769 0.86 2 0.21 0.00 0.21

222 JGM_CD770 0.43 3 0.61 0.86 0.57

223 JGM_CD771 0.86 2 0.21 0.00 0.21

224 JGM_CD772 0.57 3 0.52 0.86 0.45

225 JGM_CD774 0.92 2 0.14 0.17 0.14

226 JGM_CD775 0.64 3 0.44 0.14 0.43

227 JGM_CD777 0.50 7 0.63 0.43 0.68

228 JGM_CD778 0.93 2 0.12 0.14 0.12

229 JGM_CD779 0.93 2 0.12 0.14 0.12

230 JGM_CD780 0.86 3 0.24 0.29 0.24

231 JGM_CD782 0.64 3 0.46 0.43 0.43

232 JGM_CD783 0.50 2 0.50 1.00 0.38

233 JGM_CD784 0.86 2 0.21 0.00 0.21

234 JGM_CD786 0.79 3 0.33 0.43 0.33

235 JGM_CD787 0.93 2 0.12 0.14 0.12

236 JGM_CD788 0.71 2 0.37 0.29 0.32

237 JGM_CD790 0.71 3 0.40 0.29 0.41

238 JGM_CD792 0.71 2 0.37 0.29 0.32

239 JGM_CD793 0.50 5 0.59 0.33 0.64

240 JGM_CD794 0.86 2 0.21 0.00 0.21

241 JGM_CD795 0.71 4 0.40 0.14 0.43

242 JGM_CD796 0.64 4 0.50 0.57 0.50

243 JGM_CD797 0.36 8 0.75 0.86 0.79

105

244 JGM_CD798 0.57 5 0.58 0.71 0.59

245 JGM_CD799 0.83 3 0.26 0.17 0.27

246 JGM_CD800 0.90 2 0.16 0.20 0.16

247 JGM_CD801 0.88 2 0.19 0.25 0.19

248 JGM_CD802 0.79 2 0.30 0.14 0.28

249 JGM_CD803 0.93 2 0.12 0.14 0.12

250 JGM_CD807 0.86 2 0.21 0.00 0.21

251 JGM_CD808 0.50 2 0.25 0.00 0.38

252 JGM_CD810 0.43 3 0.57 0.29 0.57

253 JGM_CD811 0.50 5 0.61 0.43 0.65

254 JGM_CD813 0.50 3 0.50 0.50 0.51

255 JGM_CD814 0.93 2 0.12 0.14 0.12

256 JGM_CD815 0.86 2 0.21 0.00 0.21

257 JGM_CD816 0.86 2 0.21 0.00 0.21

258 JGM_CD818 0.86 2 0.21 0.00 0.21

259 JGM_CD819 0.86 2 0.23 0.29 0.21

260 JGM_CD822 0.70 3 0.40 0.40 0.41

261 JGM_CD823 0.43 5 0.66 0.71 0.67

262 JGM_CD826 0.43 4 0.60 1.00 0.55

263 JGM_CD828 0.93 2 0.12 0.14 0.12

264 JGM_CD831 0.86 2 0.21 0.00 0.21

265 JGM_CD832 0.79 4 0.33 0.29 0.35

266 JGM_CD833 0.71 3 0.39 0.14 0.39

267 JGM_CD834 0.86 2 0.21 0.00 0.21

268 JGM_CD835 0.57 4 0.51 0.14 0.52

269 JGM_CD836 0.38 4 0.57 0.50 0.63

270 JGM_CD837 0.75 2 0.28 0.00 0.30

271 JGM_CD838 0.71 2 0.37 0.29 0.32

272 JGM_CD839 0.64 2 0.44 0.71 0.35

273 JGM_CD841 0.86 2 0.21 0.00 0.21

274 JGM_CD844 0.71 4 0.43 0.57 0.43

275 JGM_CD847 0.83 2 0.26 0.33 0.24

276 JGM_CD849 0.86 2 0.23 0.29 0.21

277 JGM_CD850 0.50 2 0.50 1.00 0.38

278 JGM_CD851 0.75 2 0.35 0.50 0.30

279 JGM_CD853 0.50 5 0.61 0.43 0.65

280 JGM_CD855 0.57 3 0.47 0.14 0.45

281 JGM_CD856 0.79 3 0.32 0.14 0.33

282 JGM_CD857 0.33 6 0.70 1.00 0.71

283 JGM_CD858 0.64 3 0.48 0.71 0.43

284 JGM_CD859 0.93 2 0.12 0.14 0.12

285 JGM_CD860 0.86 2 0.21 0.00 0.21

286 JGM_CD862 0.93 2 0.12 0.14 0.12

287 JGM_CD863 0.71 2 0.39 0.57 0.32

106

288 JGM_CD864 0.71 2 0.37 0.29 0.32

289 JGM_CD866 0.71 4 0.41 0.29 0.43

290 JGM_CD867 0.64 3 0.48 0.71 0.43

291 JGM_CD868 0.93 2 0.12 0.14 0.12

292 JGM_CD869 0.50 3 0.55 1.00 0.46

293 JGM_CD870 0.86 3 0.24 0.29 0.24

294 JGM_CD871 0.67 4 0.45 0.33 0.48

295 JGM_CD872 0.86 3 0.24 0.29 0.24

296 JGM_CD873 0.33 6 0.70 1.00 0.71

297 JGM_CD874 0.57 6 0.60 0.86 0.60

298 JGM_CD875 0.36 7 0.73 0.71 0.77

299 JGM_CD876 0.64 3 0.47 0.43 0.46

300 JGM_CD877 0.50 5 0.58 0.29 0.60

301 JGM_CD878 0.71 4 0.42 0.43 0.43

302 JGM_CD879 0.36 6 0.71 1.00 0.72

303 JGM_CD880 0.86 3 0.24 0.29 0.24

304 JGM_CD881 0.79 3 0.32 0.14 0.33

305 JGM_CD882 0.43 5 0.65 1.00 0.62

306 JGM_CD886 0.86 2 0.21 0.00 0.21

307 JGM_CD887 0.33 6 0.67 0.33 0.75

308 JGM_CD888 0.86 2 0.21 0.00 0.21

309 JGM_CD889 0.36 5 0.70 1.00 0.69

310 JGM_CD890 0.50 4 0.58 0.14 0.62

311 JGM_CD892 0.86 2 0.21 0.00 0.21

312 JGM_CD893 0.57 3 0.53 0.57 0.50

313 JGM_CD894 0.57 2 0.48 0.86 0.37

314 JGM_CD895 0.93 2 0.12 0.14 0.12

315 JGM_CD896 0.93 2 0.12 0.14 0.12

316 JGM_CD897 0.86 3 0.24 0.29 0.24

317 JGM_CD899 0.93 2 0.12 0.14 0.12

318 JGM_CD902 0.93 2 0.12 0.14 0.12

319 JGM_CD903 0.50 3 0.52 0.57 0.46

320 JGM_CD904 0.79 3 0.33 0.43 0.33

321 JGM_CD905 0.86 2 0.23 0.29 0.21

322 JGM_CD906 0.50 2 0.25 0.00 0.38

323 JGM_CD907 0.42 5 0.63 0.33 0.68

324 JGM_CD908 0.86 2 0.23 0.29 0.21

325 JGM_CD909 0.83 2 0.26 0.33 0.24

326 JGM_CD910 0.83 3 0.27 0.33 0.27

327 JGM_CD911 0.29 6 0.71 0.29 0.78

328 JGM_CD912 0.83 3 0.27 0.33 0.27

329 JGM_CD913 0.64 5 0.51 0.57 0.52

330 JGM_CD914 0.29 6 0.72 0.71 0.75

331 JGM_CD915 0.57 4 0.55 0.71 0.52

107

332 JGM_CD916 0.29 5 0.68 0.29 0.74

333 JGM_CD917 0.50 6 0.64 0.86 0.65

334 JGM_CD918 0.79 3 0.33 0.43 0.33

335 JGM_CD919 0.58 3 0.48 0.33 0.46

336 JGM_CD920 0.57 5 0.55 0.29 0.59

337 JGM_CD921 0.75 3 0.37 0.50 0.36

338 JGM_CD923 0.57 7 0.60 0.71 0.62

339 JGM_CD924 0.42 4 0.61 1.00 0.57

340 JGM_CD925 0.75 4 0.39 0.50 0.39

341 JGM_CD926 0.29 7 0.75 1.00 0.77

342 JGM_CD927 0.83 2 0.23 0.00 0.24

343 JGM_CD928 0.21 9 0.78 0.57 0.85

344 JGM_CD929 0.64 2 0.44 0.71 0.35

345 JGM_CD930 0.86 2 0.23 0.29 0.21

346 JGM_CD931 0.33 4 0.63 0.50 0.65

347 JGM_CD932 0.71 4 0.42 0.43 0.43

348 JGM_CD933 0.50 4 0.59 0.83 0.56

349 JGM_CD934 0.50 4 0.56 0.00 0.62

350 JGM_CD935 0.29 7 0.74 0.43 0.80

351 JGM_CD936 0.75 4 0.39 0.50 0.39

352 JGM_CD937 0.50 3 0.55 1.00 0.46

353 JGM_CD938 0.86 3 0.24 0.29 0.24

354 JGM_CD939 0.43 4 0.65 1.00 0.63

355 JGM_CD940 0.29 5 0.71 0.57 0.75

356 JGM_CD941 0.36 4 0.67 0.71 0.67

357 JGM_CD943 0.50 5 0.59 0.43 0.60

358 JGM_CD944 0.64 5 0.49 0.29 0.52

359 JGM_CD945 0.67 3 0.43 0.33 0.42

360 JGM_CD946 0.50 3 0.39 0.50 0.55

361 JGM_CD947 0.50 2 0.25 0.00 0.38

362 JGM_CD948 0.50 2 0.50 1.00 0.38

363 JGM_CD949 0.79 3 0.32 0.14 0.33

364 JGM_CD950 0.64 4 0.48 0.43 0.48

365 JGM_CD951 0.86 3 0.24 0.29 0.24

366 JGM_CD952 0.71 3 0.38 0.00 0.41

367 JGM_CD953 0.50 3 0.55 1.00 0.46

368 JGM_CD956 0.71 3 0.40 0.29 0.41

369 JGM_CD958 0.71 2 0.35 0.00 0.32

370 JGM_CD959 0.64 4 0.47 0.14 0.50

371 JGM_CD960 0.93 2 0.12 0.14 0.12

372 JGM_CD961 0.71 4 0.42 0.43 0.43

373 JGM_CD964 0.93 2 0.12 0.14 0.12

374 JGM_CD966 0.86 2 0.21 0.00 0.21

375 JGM_CD968 0.79 3 0.33 0.43 0.33

376 JGM_CD970 0.79 4 0.33 0.29 0.35

108

377 JGM_CD972 0.50 4 0.58 0.43 0.58

378 JGM_CD973 0.57 5 0.55 0.29 0.59

379 JGM_CD976 0.86 2 0.23 0.29 0.21

380 JGM_CD977 0.93 2 0.12 0.14 0.12

381 JGM_CD978 0.50 3 0.55 1.00 0.46

382 JGM_CD979 0.71 3 0.40 0.29 0.41

383 JGM_CD980 0.79 2 0.32 0.43 0.28

384 JGM_CD981 0.86 3 0.24 0.29 0.24

385 JGM_CD983 0.71 2 0.35 0.00 0.32

386 JGM_CD984 0.86 3 0.23 0.14 0.24

387 JGM_CD985 0.93 2 0.12 0.14 0.12

388 JGM_CD986 0.79 3 0.32 0.14 0.33

389 JGM_CD987 0.43 4 0.65 0.86 0.64

390 JGM_CD988 0.71 3 0.40 0.43 0.39

391 JGM_CD991 0.86 2 0.21 0.00 0.21

392 JGM_CD993 0.64 3 0.48 0.71 0.43

393 JGM_CD995 0.57 2 0.48 0.86 0.37

394 JGM_CD996 0.86 2 0.21 0.00 0.21

395 JGM_CD997 0.50 2 0.50 1.00 0.38

396 JGM_CD998 0.64 2 0.44 0.71 0.35

397 JGM_CD1001 0.43 4 0.63 0.86 0.60

398 JGM_CD1006 0.86 2 0.21 0.00 0.21

399 JGM_CD1008 0.93 2 0.12 0.14 0.12

400 JGM_CD1009 0.86 2 0.21 0.00 0.21

401 JGM_CD1014 0.50 3 0.60 1.00 0.55

402 JGM_CD1016 0.79 3 0.32 0.29 0.33

403 JGM_CD1018 0.93 2 0.12 0.14 0.12

404 JGM_CD1020 0.79 3 0.32 0.14 0.33

405 JGM_CD1022 0.93 2 0.12 0.14 0.12

406 JGM_CD1023 0.50 3 0.39 0.50 0.55

407 JGM_CD1026 0.57 6 0.59 0.71 0.60

408 JGM_CD1027 0.64 2 0.44 0.71 0.35

409 JGM_CD1028 0.93 2 0.12 0.14 0.12

410 JGM_CD1031 0.86 2 0.21 0.00 0.21

411 JGM_CD1032 0.86 2 0.21 0.00 0.21

412 JGM_CD1033 0.64 5 0.50 0.43 0.52

413 JGM_CD1035 0.93 2 0.12 0.14 0.12

414 JGM_CD1038 0.50 2 0.50 1.00 0.38

415 JGM_CD1042 0.93 2 0.12 0.14 0.12

416 JGM_CD1043 0.86 2 0.21 0.00 0.21

417 JGM_CD1047 0.90 2 0.16 0.20 0.16

418 JGM_CD1054 0.43 5 0.65 0.57 0.67

419 JGM_CD1057 0.64 3 0.48 0.71 0.43

420 JGM_CD1059 0.57 3 0.51 0.29 0.50

421 JGM_CD1060 0.86 3 0.23 0.14 0.24

109

422 JGM_CD1061 0.86 3 0.24 0.29 0.24

423 JGM_CD1066 0.50 4 0.58 0.43 0.58

424 JGM_CD1067 0.29 6 0.71 0.29 0.78

425 JGM_CD1070 0.50 2 0.50 1.00 0.38

426 JGM_CD1073 0.93 2 0.12 0.14 0.12

427 JGM_CD1077 0.71 2 0.39 0.57 0.32

428 JGM_CD1080 0.86 2 0.23 0.29 0.21

429 JGM_CD1085 0.93 2 0.12 0.14 0.12

430 JGM_CD1086 0.79 2 0.32 0.43 0.28

431 JGM_CD1087 0.79 2 0.32 0.43 0.28

432 JGM_CD1088 0.36 4 0.67 0.71 0.67

433 JGM_CD1090 0.86 2 0.21 0.00 0.21

434 JGM_CD1091 0.86 2 0.23 0.29 0.21

435 JGM_CD1092 0.93 2 0.12 0.14 0.12

436 JGM_CD1093 0.50 2 0.50 1.00 0.38

437 JGM_CD1094 0.79 3 0.32 0.14 0.33

438 JGM_CD1099 0.93 2 0.12 0.14 0.12

439 JGM_CD1100 0.83 2 0.26 0.33 0.24

440 JGM_CD1101 0.86 2 0.21 0.00 0.21

441 JGM_CD1102 0.86 2 0.21 0.00 0.21

442 JGM_CD1104 0.93 2 0.12 0.14 0.12

443 JGM_CD1110 0.93 2 0.12 0.14 0.12

444 JGM_CD1111 0.93 2 0.12 0.14 0.12

445 JGM_CD1112 0.86 2 0.23 0.29 0.21

446 JGM_CD1114 0.71 3 0.39 0.14 0.39

447 JGM_CD1115 0.79 3 0.32 0.14 0.33

Range 0.21-0.93 2.0-9.0 0.12-0.78 0.00-1.00 0.12-0.85

Average±SD 0.71±0.19 2.78±1.18 0.35±0.18 0.36±0.34 0.34±0.17

110

Figure 4.12 PIC distribution of polymorphic SSRs (447) based on 7 accessions of

J. curcas

4.3.3 Similarity search and functional annotations

The similarity search of Lib A and B SSR containing sequences showed maximum similarity

of 50 % with Ricinus communis (Euphorbiaceae family) (Figure 4.13). Significant similarity

was also observed with Populus trichocarpa (22 %), Vitis vinifera (16 %), Arabidopsis spp.

(4.5 %) and Oryza sativa (3%). The similarity were also found 1% with the Glycine max,

Jatropha curcas, Antirrhinum majus, Helianthus annuus, Panax ginseng, Malus domestica,

Phycomitrella patens, Pisum sativum, Selaginella moellendorffii, Solanum demissum and

Theobroma cacao. Gene Ontology analysis (http://arabidopsis.org/tools/bulk/go/index.jsp/)

resulted into a total of 1539 GO terms (against 1207 sequences), which were further

categorized under biological processes, (638 terms, 41%), cellular components (366 terms,

24%) and molecular functions (535 terms, 35%) category. Besides, significant numbers of

terms for unknown/ unclassified annotations (under all the three categories), maximum

numbers of terms were assigned for protein metabolism (10%) in biological processes

category (Figure 4.14). Similarly, maximum number of terms was assigned for nucleus (9%)

and chloroplast (7%) in cellular components and for protein binding (13%) in molecular

functions category.

68

143

98

56

38 28

14

2

0

20

40

60

80

100

120

140

160

Nu

mb

er o

f p

oly

mo

rph

ic S

SR

s

PIC value range

111

Figure 4.13 Annotation of genomic SSRs developed from Lib A and B of J. curcas. Each bar

indicates the percent sequence similarity with various plant genomes based on BLASTIX.

Figure 4.14 Gene Ontology (GO) classification of the SSR containing genomic sequences derived

from microsatellite enriched libraries of J. curcas. The relative frequencies of GO hits to functional

categories of cellular components, biological process and molecular functions.

Ricinus communis ,

50%

Populus trichocarpa,

21%

Vitis vinifera, 16% Arabidopsis spp. ,

4%

Oryza sativa , 3%

Glycine max, 1%

Jatropha curcas, 1%

Antirrhinum

majus, 1% Helianthus annuus,

1%

Malus x domestica,

1%

Panax ginseng, 1%

Physcomitrella

patens , 1%

Pisum sativum, 1%

Selaginella

moellendorffii , 1%

Solanum demissum,

1%

Theobroma cacao,

1%

112

The sequence similarity search of SSR containing sequences from Lib C and D was also

carried out against the Nr database (E-value<1e-5

) using the BLASTx algorithm.

The

similarity search of SSR containing sequences showed maximum similarity of 63% with

Ricinus communis (F. Euphorbiaceae) (Fig 4.15). Significant similarity was also observed

with Populus trichocarpa (19%), Vitis vinifera (9%), Glycine max (3%), J. curcas (2%) and

Medicago truncatula (2%). The similarity was also observed 1% with Arabidopsis byrata and

Arabidopsis thaliana.

Figure 4.15 Annotation of genomic SSRs developed from enriched libraries of J. curcas.

Each pie indicates the percent sequence similarity with various plant genomes based on

BLASTX

113

4.4 Study of molecular genetic diversity among indigenous and exotic

accessions of J. curcas.

4.4.1 Allelic diversity

A subset of 41 randomly selected polymorphic SSRs was amplified with 96 accessions of J.

curcas which include 70 indigenous (collections from different states of India) and 26 exotic

collections including 3 non- toxic accessions. A total of 152 alleles were produced by these

41 SSRs among the 96 accessions. A considerable variability was noticed with respect to

allele diversity as the number of alleles varied from 2 to 9 with an average of 4.0±1.9

alleles/SSR (Table 4.16). The PIC value for these 41 polymorphic SSRs with respect to 96

accessions varied from 0.01 to 0.80 with an average of 0.22±0.19 (Table 4.16). Maximum

PIC value was noticed for JGM_CD055 (0.80) followed by JGM_CD097 (0.73) and

JGM_CD094, while the lowest PIC value (0.01) was noticed for JGM_CD158, JGM_CD165,

and JGM_CD185. Majority of the SSRs (71%) showed low PIC value (<0.30) (Figure 4.16)

and there were only three SSRs which showed PIC value in between 0.61 and 0.80. The

observed heterozygosity (Ho) varied between 0.00 and 0.99 with an average of 0.16±0.27 and

expected heterozygosity (He) or gene diversity varied from 0.01 to 0.82 with an average of

0.24±0.21.

Table 4.16 Polymorphism features of 41 SSR markers surveyed over 96 accessions of

J. curcas

Sl.

No. SSR Loci

No. of

alleles PIC MAF Ho

He

1 JGM_CD002 2 0.14 0.92 0.03 0.15

2 JGM_CD007 2 0.08 0.96 0.00 0.08

3 JGM_CD010 4 0.07 0.96 0.07 0.07

4 JGM_CD011 5 0.06 0.97 0.06 0.06

5 JGM_CD021 6 0.23 0.87 0.04 0.24

6 JGM_CD026 8 0.32 0.81 0.05 0.33

7 JGM_CD029 2 0.07 0.96 0.07 0.07

8 JGM_CD031 2 0.23 0.84 0.07 0.27

9 JGM_CD033 5 0.31 0.81 0.16 0.33

10 JGM_CD036 3 0.39 0.49 0.99 0.51

11 JGM_CD041 2 0.03 0.98 0.03 0.03

12 JGM_CD044 4 0.10 0.95 0.07 0.1

13 JGM_CD046 3 0.40 0.51 0.30 0.52

14 JGM_CD048 2 0.22 0.85 0.03 0.26

15 JGM_CD050 3 0.05 0.97 0.01 0.05

16 JGM_CD055 9 0.80 0.24 0.77 0.82

17 JGM_CD069 3 0.17 0.90 0.08 0.19

114

PIC= polymorphism information content, MAF=Major allele frequency, Ho/He = observed

heterozygosity/expected heterozygosity,

18 JGM_CD071 5 0.23 0.87 0.15 0.24

19 JGM_CD081 2 0.03 0.98 0.03 0.03

20 JGM_CD085 5 0.12 0.94 0.04 0.12

21 JGM_CD090 4 0.28 0.82 0.15 0.3

22 JGM_CD094 5 0.64 0.46 0.14 0.69

23 JGM_CD096 2 0.08 0.96 0.01 0.08

24 JGM_CD097 9 0.73 0.34 0.61 0.77

25 JGM_CD105 2 0.10 0.95 0.00 0.1

26 JGM_CD120 2 0.11 0.94 0.00 0.12

27 JGM_CD126 3 0.35 0.68 0.00 0.44

28 JGM_CD130 2 0.11 0.94 0.00 0.12

29 JGM_CD140 6 0.33 0.79 0.06 0.36

30 JGM_CD149 4 0.31 0.80 0.22 0.34

31 JGM_CD153 5 0.12 0.94 0.09 0.12

32 JGM_CD158 2 0.01 0.99 0.01 0.01

33 JGM_CD165 2 0.01 0.99 0.01 0.01

34 JGM_CD170 2 0.28 0.79 0.14 0.34

35 JGM_CD176 2 0.37 0.54 0.93 0.5

36 JGM_CD181 5 0.08 0.96 0.00 0.08

37 JGM_CD185 2 0.01 0.99 0.01 0.01

38 JGM_CD198 4 0.43 0.49 0.96 0.54

39 JGM_CD204 7 0.24 0.86 0.02 0.25

40 JGM_CD210 2 0.02 0.99 0.00 0.02

41 JGM_CD218 3 0.18 0.89 0.03 0.2

Range 2 – 9 0.01 – 0.80 0.24 - 0.99 0.00-0.99 0.01-0.82

Average 4.0±2.0 0.22±0.19 0.83±0.20 0.16±0.24 0.24±0.21

115

Figure 4.16 PIC distributions of 41 polymorphic SSR loci calculated across 96 J. curcas

accessions

4.4.2 Analysis of molecular variance (AMOVA)

To understand the pattern of differentiation of genetic variation among and between the

populations AMOVA was carried out considering indigenous and exotic accessions as two

distinct populations. The partitioning of genetic variations within and between population

(Indigenous and exotic collections) showed that 6% of the total genetic variation existed

among the population (Table 4.17) with an average pairwise PT (similar to FST) was 0.063.

The percentage variation within population was high as 94 %. The accessions used in the

present investigation showed low level of variation as indicated by the number of observed

alleles Na (2.80±0.24), effective number of alleles Ne (1.48±0.08), gene diversity He

(0.23±0.02) and Shannon’s information index I (0.43±0.04) (Table 4.18). However, as

compared to exotic collections, the indigenous accessions showed comparatively higher

values for the genetic parameters except He. The exotic accessions had comparatively higher

number of unique alleles (0.92 allele/accessions) than indigenous accessions (0.71

allele/accession) (Table 4.18).

15

7 7

8

1

0

1

2

0

2

4

6

8

10

12

14

16

Num

ber

of

Poly

mo

rphic

SS

R l

oci

PIC Value

116

Table 4.17 Analysis of molecular variance (AMOVA) for 96 J. curcas accessions

Source Degree of freedom Sum of squares Variance % variation

Among population 1 35.99 0.68 6

Within population 94 947.77 10.08 94

Total 95 983.77 10.77 100

Fst=0.066 P>0.001

Table 4.18 Different genetic diversity estimates for two populations (Indigenous and exotic)

of J. curcas based on 41 SSR loci

Population Sample size Na Ne I He Unique

allele

Indigenous 70 3.12±0.29 1.53±0.14 0.45±0.07 0.23±0.03 50

Exotic 26 2.48±0.19 1.43±0.09 0.41±0.05 0.23±0.03 24

Mean 2.80±0.24 1.48±0.08 0.43±0.04 0.23±0.02

Na– observed number of alleles; Ne– effective number of alleles; He– Nei’s (1973) gene

diversity; I– Shannon’s information index

4.4.3 Genetic distance and cluster analysis

Genetic diversity/interrelationship among 96 accessions representing diverse eco-

geographical and agro-climatic zones of the world was also assessed. The genetic

dissimilarity ranged from 0.02 to 0.80 with an average of 0.33±0.11. The maximum genetic

dissimilarity (80%) was noticed between accessions NBJC195 and NBJC143 (0.80) followed

by NBJC195 and NBJC124 (0.79) and NBJC195 and NBJC153 (0.77). The accession

NBJC131 showed minimum genetic dissimilarity with accessions NBJC150 followed by

NBJC183 and NBJC190. Considering the average genetic dissimilarity of one accession with

other accessions showed that the accessions NBJC131 and NBJC150 had minimum average

genetic dissimilarity coefficient of 0.25±0.12 which varied from 0.02 to 0.63 (Table 4.19).

Based on genetic distance the accessions NBJC195, NBJC194, NBJC124, NBJC143,

NBJC1534, NBJC115, NBJC194 and NBJC104 were found to be the most divergent among

all the accessions studied.

117

Table 4.19 Minimum, maximum and mean of the genetic dissimilarity coefficient of 96

accessions of J. curcas

SN. Accession Minimum Maximum Mean±SD

1 NBJC101 0.04 0.61 0.25±0.13

2 NBJC102 0.04 0.62 0.25±0.13

3 NBJC103 0.09 0.62 0.29±0.11

4 NBJC104 0.25 0.68 0.48±0.06

5 NBJC105 0.07 0.60 0.25±0.12

6 NBJC106 0.04 0.62 0.27±0.12

7 NBJC107 0.09 0.60 0.27±0.11

8 NBJC108 0.12 0.64 0.33±0.11

9 NBJC109 0.09 0.60 0.26±0.12

10 NBJC110 0.09 0.62 0.26±0.12

11 NBJC111 0.10 0.61 0.27±0.12

12 NBJC112 0.06 0.62 0.27±0.12

13 NBJC113 0.16 0.65 0.31±0.11

14 NBJC114 0.10 0.64 0.30±0.12

15 NBJC115 0.26 0.73 0.54±0.07

16 NBJC116 0.12 0.64 0.30±0.12

17 NBJC117 0.10 0.65 0.29±0.12

18 NBJC118 0.12 0.66 0.33±0.09

19 NBJC119 0.35 0.68 0.44±0.06

20 NBJC120 0.13 0.65 0.33±0.11

21 NBJC121 0.18 0.63 0.31±0.10

22 NBJC122 0.07 0.59 0.25±0.12

23 NBJC123 0.14 0.63 0.29±0.10

24 NBJC124 0.31 0.79 0.59±0.08

25 NBJC125 0.10 0.61 0.26±0.12

26 NBJC126 0.07 0.61 0.26±0.11

27 NBJC127 0.11 0.63 0.28±0.11

28 NBJC128 0.12 0.60 0.27±0.11

29 NBJC129 0.11 0.65 0.31±0.11

30 NBJC130 0.15 0.68 0.33±0.11

31 NBJC131 0.02 0.63 0.25±0.12

32 NBJC132 0.25 0.73 0.48±0.08

33 NBJC133 0.12 0.64 0.27±0.12

34 NBJC134 0.06 0.64 0.29±0.13

35 NBJC135 0.16 0.66 0.31±0.12

36 NBJC136 0.30 0.70 0.45±0.06

37 NBJC137 0.11 0.63 0.32±0.12

38 NBJC138 0.15 0.67 0.31±0.11

39 NBJC139 0.19 0.65 0.35±0.11

40 NBJC140 0.11 0.65 0.32±0.11

41 NBJC141 0.09 0.66 0.28±0.12

42 NBJC142 0.04 0.63 0.27±0.12

43 NBJC143 0.31 0.80 0.58±0.08

44 NBJC144 0.15 0.65 0.34±0.10

45 NBJC145 0.29 0.72 0.50±0.06

46 NBJC146 0.04 0.63 0.26±0.13

47 NBJC147 0.26 0.66 0.37±0.07

48 NBJC148 0.05 0.66 0.29±0.12

49 NBJC149 0.09 0.62 0.28±0.11

50 NBJC150 0.02 0.62 0.25±0.12

51 NBJC151 0.10 0.64 0.33±0.11

118

52 NBJC152 0.23 0.66 0.39±0.06

53 NBJC153 0.29 0.77 0.55±0.08

54 NBJC154 0.19 0.64 0.33±0.10

55 NBJC155 0.05 0.65 0.28±0.12

56 NBJC156 0.09 0.62 0.28±0.11

57 NBJC157 0.18 0.66 0.39±0.07

58 NBJC158 0.12 0.63 0.28±0.11

59 NBJC159 0.12 0.67 0.31±0.12

60 NBJC160 0.06 0.62 0.27±0.13

61 NBJC161 0.12 0.64 0.29±0.12

62 NBJC162 0.08 0.58 0.25±0.12

63 NBJC163 0.09 0.64 0.31±0.12

64 NBJC164 0.09 0.62 0.26±0.12

65 NBJC165 0.14 0.65 0.30±0.11

66 NBJC166 0.23 0.68 0.40±0.07

67 NBJC167 0.14 0.64 0.31±0.11

68 NBJC168 0.23 0.63 0.44±0.06

69 NBJC169 0.10 0.59 0.26±0.11

70 NBJC170 0.14 0.62 0.30±0.11

71 NBJC171 0.14 0.66 0.32±0.11

72 NBJC172 0.08 0.60 0.26±0.12

73 NBJC173 0.10 0.61 0.30±0.11

74 NBJC174 0.10 0.65 0.30±0.12

75 NBJC175 0.09 0.60 0.26±0.11

76 NBJC176 0.10 0.65 0.31±0.12

77 NBJC177 0.07 0.60 0.25±0.12

78 NBJC178 0.15 0.64 0.30±0.11

79 NBJC179 0.10 0.60 0.28±0.12

80 NBJC180 0.07 0.61 0.26±0.12

81 NBJC181 0.10 0.61 0.29±0.11

82 NBJC182 0.08 0.60 0.28±0.11

83 NBJC183 0.02 0.62 0.26±0.12

84 NBJC184 0.09 0.63 0.27±0.11

85 NBJC185 0.18 0.64 0.31±0.10

86 NBJC186 0.11 0.65 0.29±0.12

87 NBJC187 0.27 0.65 0.40±0.06

88 NBJC188 0.18 0.63 0.33±0.09

89 NBJC189 0.11 0.63 0.30±0.11

90 NBJC190 0.02 0.61 0.27±0.11

91 NBJC191 0.23 0.62 0.36±0.07

92 NBJC192 0.09 0.65 0.34±0.11

93 NBJC193 0.11 0.64 0.27±0.12

94 NBJC194 0.29 0.73 0.50±0.07

95 NBJC195 0.26 0.80 0.61±0.07

96 NBJC196 0.26 0.77 0.59±0.07

Range 0.02-0.35 0.58-0.80 0.25-0.61

Average±SD 0.13±0.08 0.65±0.04 0.33±0.11

Based on genotypic data of 152 alleles at 41 polymorphic SSR loci, a neighbor

joining (NJ) tree was constructed using DARwin 5 with 1000 replicate bootstrap. The

dendrogram classified all the 96 accessions into three major clusters namely A, B and C

(Figure 4.17). The bootstrap value of cluster C was found to be highly significant and that of

119

cluster A and B was very low. Considering the cluster wise genetic dissimilarity, accessions

of cluster B showed higher mean genetic dissimilarity (0.40±0.02) followed by cluster C

(0.29±0.05) and cluster A (0.25±0.01) and thus the accessions from cluster B can be

considered comparatively diverse than those of cluster A. The clustering of accessions

revealed that the cluster A accommodated maximum of 77 (80%) accessions followed by

cluster B (26 accessions) and cluster C (3 accessions). The cluster A could be further divided

into three sub-clusters i.e. AI (51 accessions) AII (25 accessions) and AIII (1accession

NBJC188). The sub-cluster AI included 16 accessions of exotic origin including 4 each from

South America (NBJC175, NBJC176, NBJC179 and NBJC181) and Western Africa

(NBJC173, NBJC183, NBJC185, NBJC186) 3 from Eastern Africa (NBJC174, NBJC189 and

NBJC190) and one each from Central America (NBJC182), Middle Africa (NBJC184),

Southern Asia (NBJC172), South-East Asia (NBJC193) and Eastern China (NBJC171). Apart

from the exotic accessions, it contained indigenous accessions from 16 different states of

India including 4 each from Uttaranchal (NBJC105, NBJC106, NBJC107, NBJC108) and

Jharkhand (NBJC109, NBJC110, NBJC111, NBJC112) 3 each from Rajasthan (NBJC101,

NBJC102, NBJC103) Bihar (NBJC113, NBJC114, NBJC116) Himachal Pradesh (NBJC121,

NBJC123, NBJC122) Chhattisgarh (NBJC117, NBJC118, NBJC120) 2 each from West

Bengal (NBJC144, NBJC145),Orissa (NBJC151, NBJC154), Arunachal Pradesh (NBJC159,

NBJC160), Haryana (NBJC125, NBJC126), Madhya Pradesh (NBJC134, NBJC135) and one

each from Manipur (NBJC169), Andhra Pradesh (NBJC139), Tripura (NBJC165), Kerala

(NBJC170) and Uttar Pradesh (NBJC130). Sub-cluster AII had 25 accessions and among

these 4 (16.0%) were of exotic origin and rest 21 were represented by indigenous accessions

from 10 different states of India. The cluster B contained 16 (26%) accessions (NBJC143,

NBJC124, NBJC153, NBJC132, NBJC115, NBJC104, NBJC145, NBJC168, NBJC165,

NBJC152, NBJC119, NBJC157, NBJC136, NBJC187, NBJC191, NBJC147) with maximum

of indigenous origin (87%) representing 12 states of India. The cluster C was found to be

very unique and had 3 non-toxic accessions (NBJC194, NBJC195 and NBJC196) of J. curcas

from Mexico, Central America. The neighbor joining clustering showed that most of the

exotic accessions (81.0%, 21 out of 26) were grouped in single cluster i.e. cluster A along

with the indigenous accessions indicating low level of genetic diversity across the global

collections of J. curcas.

120

Figure 4.17 Genetic relationship among 96 accessions of J. curcas based on NJ tree

constructed using genotyping data of 41 polymorphic genomic SSRs. The numbers on

branches indicate bootstrap values based on 1000 replications.

0 0.1

NBJC101

NBJC102

NBJC103

NBJC104

NBJC105

NBJC106

NBJC107

NBJC108

NBJC109NBJC110

NBJC111

NBJC112

NBJC113

NBJC114

NBJC115

NBJC116

NBJC117

NBJC118

NBJC119

NBJC120

NBJC121

NBJC122

NBJC123

NBJC124

NBJC125

NBJC126

NBJC127

NBJC128

NBJC129

NBJC130

NBJC131

NBJC132

NBJC133

NBJC134

NBJC135

NBJC136

NBJC137

NBJC138

NBJC139

NBJC140NBJC141

NBJC142

NBJC143

NBJC144

NBJC145

NBJC146

NBJC147

NBJC148

NBJC149

NBJC150

NBJC151

NBJC152

NBJC153

NBJC154

NBJC155

NBJC156

NBJC157

NBJC158

NBJC159

NBJC160

NBJC161

NBJC162

NBJC163

NBJC164

NBJC165

NBJC166

NBJC167

NBJC168

NBJC169

NBJC170

NBJC171

NBJC172

NBJC173

NBJC174

NBJC175

NBJC176

NBJC177

NBJC178

NBJC179

NBJC180

NBJC181

NBJC182

NBJC183

NBJC184

NBJC185

NBJC186

NBJC187

NBJC188

NBJC189

NBJC190

NBJC191

NBJC192

NBJC193

NBJC194NBJC195

NBJC196

61

32

26

56

57

46

42

47

12

13

45

19

20

40

23

27

67

31

21

24

22

33

36

10

21

20

47

68

13

58

64

31

47

40

32

13

22

83

25

25

71

22

38

80

30

26

90

47

19

12

16

10

10098

44

27

15

10

11

12

10

12

121

4.4.4 Population structure analysis

In order to determine the genetic structure and define the number of cluster (gene pool),

model-based cluster analysis was performed using software STRUCTURE version 2.3.3

(Pritchard et al. 2000). The model based cluster analysis also showed almost similar result as

obtained in dendrogram. All the 96 accessions of J. curcas grouped into 4 genetically distinct

subpopulations (K=4) based on maximum K values (Figure 4.18 a, b). Based on the

membership probability, 29 accessions assigned to subpopulation 1 with mixed accessions of

indigenous (77%) and exotic (23%) collections and 51 accessions with 70% indigenous and

30% exotic collections assigned to subpopulation 2. The subpopulation 1 had total 14

accessions (NBJC116, NBJC117, NBJC134, NBJC146, NBJC159, NBJC102, NBJC176,

NBJC112, NBJC125, NBJC160, NBJC174, NBJC139, NBJC114 and NBJC169) maintain

their identity without admixture of allele of other accessions while 3 accessions (NBJC179,

NBJC144 and NBJC155) showed admixture with sub population 3. The 7 accessions

(NBJC171, NBJC111, NBJC135, NBJC106, NBJC130, NBJC161 and NBJC109) showed

admixture with sub population 2 and 5 accessions (NBJC114, NBJC169, NBJC151,

NBJC173 and NBJC123) showed admixture of subpopulation 2 and 3. The accessions 93 and

85 showed admixture with subpopulation 4 (non-toxic) and 2. In the subpopulation 2, a total

of 16 accessions (NBJC141, NBJC148, NBJC131, NBJC155, NBJC163, NBJC178,


NBJC158 and NBJC126) showed no admixture while 22 accessions (NBJC142, NBJC162,



NBJC190, NBJC101, NBJC176 and NBJC165) showed admixture with subpopulation 1. The

5 accessions (NBJC192, NBJC137, NBJC140, NBJC186 and NBJC189) showed the

admixture with subpopulation 4 (non-toxic). The 3 accessions (NBJC18, NBJC118 and

NBJC147) showed the admixture with subpopulation 2 and 3 and one accession (NBJC191)

with subpopulation 3 and 4. The three accessions (NBJC188, NBJC157 and NBJC167)

showed admixture with subpopulation 3. The subpopulation 3 had 13 accessions and all

belonging to the indigenous collections except NBJC187. The 7 accessions (NBJC145,

NBJC166, NBJC152, NBJC168, NBJC136, NBJC187 and NBJC119) were found to show the

admixture with subpopulation 1 and 2. The subpopulation 4 is quite unique having 3

exclusive exotic and non-toxic accessions (NBJC194, NBJC195 and NBJC196). The

clustering of both exotic and indigenous accessions in the same subpopulation indicated low

genetic differentiation.

122

Figure 4.18 Population structure analysis showing; a) Delta K showing highest probability of

four (K=4) subpopulation and b) Structural plot of 96 accessions of J. curcas.

a

b

123

4.5 Molecular characterization of interspecific hybrid of J. curcas x J.

integerrima

A successful interspecific cross was made between J. curcas and J. integerrima (Figure

4.19). About more than 300 crosses were attempted and 150 fruits successfully developed

and harvested. After harvesting of mature fruits about 220 hybrid seeds were obtained. These

seeds were shown in polybags in glass house. Out of 220 seeds, 160 seeds germinated

(72.7%). A total of 120 seedlings were successfully grown (Table 4.19). The two month old

seedlings were transferred in the field and 94 seedlings successfully survived at maturity in

the field. The developed interspecific hybrids were vigorous, freely flowering and

morphologically intermediate between the parents. The hybrids resembled the female parents

(J. curcas) in terms of stem type and leaf shape while for leaf pigmentation, fruit size tended

towards the male parent (J. integerrima) (Figure 4.19 and 4.20). The F1 plants exhibited

wider variations in terms of stem character (semi-hard wood), flower colour (pink, white and

yellow), fruit size (small and round) and stigma of female flower was larger than parents. In

the early stage, a large number of hybrid plant leaves showed dark red pigmentation on

ventral surface and at maturity maximum plants leaves became green while some plants

leaves retained their colour. Some hybrids showed complete dark red pigmentation on ventral

surface, some showed pigmentation of half part and some showed scattered pattern of

pigmentation (Figure 4.20). The hybrid plants flowered within 7-8 months unlike the parents

which bore flowers 10-15 months after establishment. The capsule variability were of

intermediate type, either round like J. curcas but purple colored or oval like J. integerrima

without pigmentation.

Table 4.20 Crossability success in crosses between J. curcas and J. integerrima

No. of crosses No. of capsules

formed approx.

No. of

seeds

set

No. of seeds

germinated

Seedlings

established in

field

300 150 220 160 94

124

(A) (B)

(C) (D)

(E) (F)

(G) (H)

Figure 4.19 Photograph showing (A) J. curcas (Female parent) (B) J. integerrima (Male

parent) selected for interspecific hybridization (C) Developing successful fruits after crossing

(D) Seedling of F1 interspecific hybrids in glasshouse, (E) Transplanted F1 hybrid plants

growing in field, (F) Close up view of F1 hybrid plant (G) Close up view of inflorescence of

hybrid plant and (H) Hybrid plant showing fruits

125

Figure 4.20 (A) Dorsal side and (B) ventral side showing morphological variation in leaf

shape and pigmentation of parents (J. curcas and J. integerrima) and hybrid plants.

4.5.1 Confirmation of hybridity using SSR markers

The hybrid identification based on morphological characters is influenced by environmental

factors and frequently lacks the resolving power to identify hybrids at the juvenile stage. For

the identification of true hybrids at early stage, 15 polymorphic SSRs were randomly selected

from Lib A and B which had been screened previously with 7 accessions of J. curcas and one

accession of J. integerrima. These selected SSRs were used to confirm the hybridity and

characterize the 94 interspecific hybrids alongwith their parents (J. curcas and J.

integerrima). The polymorphic microsatellite banding pattern of the parents compared with

hybrid was able to clearly recognize the true hybrid as shown in Figure 4.21. All the 15 SSR

primers were clearly distinguished the hybrid from the parents. A representative snap shot

from GeneMapper showing the parental allele of J. curcas, J. integerrimma and hybrid are

shown in Figure 4.21.

The J. curcas and J. integerrima are reported to be cross-pollinated species and

therefore we could expect genetic variability and segregation in large population F1 hybrid as

in case of tree plants. The amplified primers generated different types of allelic patterns

(Table 4.21) viz. some alleles shared pattern of parents and showed true hybrid, some of the

alleles were segregated in the hybrids but absent in both parents, some alleles were common

to one parent and other to another parent, some alleles showed more similarity to the female

126

parents and some showed more similarity to the male parents (Table 4.21). Out of 94

interspecific hybrids, the maximum number of hybrids sharing parental alleles were detected

with SSR JGM_B334 (93), JGM_A 182 (89) and JGM_A247 (80) followed by JGM_A109

(75) and JGM_A162 (56). On the other hand, the SSR JGM_A291 (1), JGM_A147 (1) and

JGM _A335 (1) showed lowest numbers of hybrids sharing both the parental alleles. The

maximum similarities of hybrid alleles with female parents was found with the JGM B576

(93) and JGM_B368 (88) followed by JGM_A179 (81) and JGM_A335 (62). The minimum

similarities of hybrid alleles with female parents were found with JGM_B334 (1) and

JGM_A182 (2) followed by JGM_B278 (3) and JGM_A247 (6). The similarity of male

parent with hybrids was found less than female parents. The maximum similarities of hybrids

alleles with male parent was found with JGM_A291 (55), JGM_ B 278 (28) followed by

JGM_ A162 (13) and JGM_ A296 (11). The minimum allelic similarities of male parents

with hybrids were found with JGM_A182 (1), JGM_B576 (1) followed by JGM_B335 (2)

and JGM_A247 (3). The number of alleles showed non-parental type of alleles with hybrid

genotypes. The maximum non-parental type of allelic pattern was found with JGM_A295

(51), JGM_ A120 (42) and JGM_ B278 (38) followed by JGM_ A296 (37) and JGM_ A335

(29). The minimum non-parental types of alleles were found with JGM_B369 (1) and JGM_

A182 (2) followed by JGM_ A 162 (3) and JGM_ A 109 (4).

Figure 4.21 A snapshot of GeneMapper showing the allelic pattern of J. curcas,

J. integerrimma and their hybrid

J. curcas

J. integerrima

Hybrid (J. curcas X J. integerrima)

127

Tab

le 4.2

1 G

enoty

pin

g d

etails of 1

5 p

oly

morp

hic p

rimers w

ith 9

4 in

terspecific h

yb

rids an

d its p

arents (J. cu

rcas an

d J. in

tegerrim

a)

SS

R N

ame

Allele size

J. curca

s

Allele size

J. integ

errima

Num

ber o

f hybrid

s with

differen

t allele com

bin

ations

Hyb

rids b

oth

paren

tal type

Hyb

rids fem

ale

paren

tal type

Hyb

rids m

ale

paren

tal type

Non- p

arental

type

JGM

_A

109

244

263

75

9

6

4

JGM

_A

120

252

245

10

36

6

42

JGM

_A

162

275

285

56

22

13

3

JGM

_A

179

175

263

4

81

3

6

JGM

_A

182

178

166

89

2

1

2

JGM

_A

247

288

301

80

6

3

5

JGM

_A

291

290/3

03

298/3

03

1

14

55

18

JGM

_A

295

239

212

13

20

10

51

JGM

_A

296

192

184

15

31

11

37

JGM

_B

244

185

183

20

48

6

20

JGM

_B

278

155

142

25

3

28

38

JGM

_B

334

167

142

93

1

JGM

_B

369

147

142/1

47

1

88

4

1

JGM

_B

576

251

243/2

68

93

1

JGM

_A

335

320

291

1

62

2

29

128

The allelic data of interspecific hybrids alongwith their parental lines were converted

into 0-1 data matrix and used to calculate genetic similarity among hybrids and their parents

according to Jaccord’s similarity coefficient using NTSYS-PC software v2.02 (Exeter

software, New York). The similarity matrix was then used to generate dendrogarm depicting

clustering patterns of hybrids using unweighted pair group method with arithmetic average

(UPGMA) method. Jaccard’s pairwise similarity coefficient values ranged from 0.06 to 0.99.

The dendrogram shows two clades, one major and one minor (Figure 4.22). In the cluster -I

the maximum hybrids grouped with J. curcas that showed more similarity with female

parents. The minimum genetic similarity coefficient was found between J. curcas and J.

integerrima (0.06%). The maximum similarity coefficients (0.99%) were found among

hybrid NBJIS17, NBJIS22, NBJIS25, NBJIS28 and NBJIS21, NBJIS32, and NBJIS53,

NBJIS57, NBJIS83, and NBJIS72, NBJIS90. In cluster-II, male parent, J. integerrima, was

out- grouped with the hybrids-NBJIS1, NBJIS2, NBJIS15, NBJIS48 and NBJIS14. The 2-D

PCA clustering was also in agreement with the clustering depicted in dendrogram. The J.

integerrima occupied a distinct and far position in the plot alongwith some hybrids near by it.

However, J. curcas was found to be closer towards the larger numbers of hybrids (Figure

4.23).

129

Fig

ure 4

.22 D

endro

gram

show

ing clu

stering o

f 94

intersp

ecific hyb

rids alo

ngw

ith th

eir paren

tal lines J. cu

rcas (JC

) and J. in

tegerrim

a (JI).

Coefficient

0.200.40

0.600.80

1.00

JC

NB

JIS3 N

BJIS7

NB

JIS4 N

BJIS17

NB

JIS22 N

BJIS25

NB

JIS28 N

BJIS35

NB

JIS75 N

BJIS34

NB

JIS52 N

BJIS60

NB

JIS62 N

BJIS64

NB

JIS85 N

BJIS38

NB

JIS92 N

BJIS51

NB

JIS77 N

BJIS94

NB

JIS45 N

BJIS91

NB

JIS23 N

BJIS8

NB

JIS79 N

BJIS16

NB

JIS19 N

BJIS33

NB

JIS18 N

BJIS21

NB

JIS32 N

BJIS86

NB

JIS30 N

BJIS44

NB

JIS59 N

BJIS36

NB

JIS20 N

BJIS69

NB

JIS39 N

BJIS42

NB

JIS53 N

BJIS57

NB

JIS83 N

BJIS41

NB

JIS76 N

BJIS71

NB

JIS82 N

BJIS29

NB

JIS37 N

BJIS61

NB

JIS50 N

BJIS84

NB

JIS43 N

BJIS65

NB

JIS81 N

BJIS89

NB

JIS6 N

BJIS27

NB

JIS78 N

BJIS67

NB

JIS88 N

BJIS93

NB

JIS49 N

BJIS55

NB

JIS63 N

BJIS68

NB

JIS24 N

BJIS26

NB

JIS54 N

BJIS47

NB

JIS5 N

BJIS11

NB

JIS10 N

BJIS9

NB

JIS12 N

BJIS13

NB

JIS46 N

BJIS66

NB

JIS56 N

BJIS58

NB

JIS74 N

BJIS72

NB

JIS90 N

BJIS73

NB

JIS31 N

BJIS70

NB

JIS87 N

BJIS40

NB

JIS80 N

BJIS1

NB

JIS2 N

BJIS15

NB

JIS48 N

BJIS14

JI

130

Fig

ure 4

.23

Tw

o- D

imen

sional p

lot b

y P

CA

show

ing clu

stering o

f 94 in

terspecific h

yb

rids alo

ngw

ith th

eir paren

tal lines J. cu

rcas (JC

) and J.

integ

errima

(JI).

0.2

50.4

30.6

10.8

00.9

8

-0.3

0

-0.1

3

0.0

5

0.2

2

0.4

0

JC

JI

NB

JIS1

NB

JIS2

NB

JIS3

NB

JIS4

NB

JIS5

NB

JIS6

NB

JIS7

NB

JIS8

NB

JIS9

NB

JIS10

NB

JIS11

NB

JIS12

NB

JIS13

NB

JIS14

NB

JIS15

NB

JIS16

NB

JIS17

NB

JIS18

NB

JIS19

NB

JIS20

NB

JIS21

NB

JIS22

NB

JIS23

NB

JIS24

NB

JIS25

NB

JIS26

NB

JIS27

NB

JIS28

NB

JIS29 NB

JIS30

NB

JIS31

NB

JIS32

NB

JIS33

NB

JIS34

NB

JIS35

NB

JIS36

NB

JIS37N

BJIS

38

NB

JIS39

NB

JIS40

NB

JIS41

NB

JIS42

NB

JIS43

NB

JIS44

NB

JIS45

NB

JIS46

NB

JIS47

NB

JIS48

NB

JIS49

NB

JIS50

NB

JIS51

NB

JIS52

NB

JIS53

NB

JIS54

NB

JIS55

NB

JIS56

NB

JIS57

NB

JIS58

NB

JIS59

NB

JIS60

NB

JIS61

NB

JIS62

NB

JIS63

NB

JIS64

NB

JIS65

NB

JIS66

NB

JIS67

NB

JIS68

NB

JIS69

NB

JIS70

NB

JIS71

NB

JIS72

NB

JIS73

NB

JIS74

NB

JIS75

NB

JIS76

NB

JIS77

NB

JIS78

NB

JIS79

NB

JIS80

NB

JIS81

NB

JIS82

NB

JIS83

NB

JIS84

NB

JIS85

NB

JIS86

NB

JIS87

NB

JIS88

NB

JIS89

NB

JIS90

NB

JIS91

NB

JIS92

NB

JIS93

NB

JIS94

131

4.6 Heterozygosity assessment of J. curcas

In order to assess the level of heterozygosity of J. curcas, 56 SSRs (14 each 4 four SSR

libraries) were selected. These SSRs were selected on the basis of polymorphism, PIC value

and allelic data over 7 J. curcas accessions. These SSRs were used to genotype 48 progeny

derived from selfed seeds of a single plant. Out of 56, 7 SSRs could not produce sufficient

and significant data as they failed to amplify in more than 70% samples and thus not

considered for further analysis. Therefore, genotypic data of 49 SSRs were used for

heterozygosity assessment. Out of 49 SSRs, 31 SSRs were found to be monomorphic

indicating that all the plants were homozygous at those loci. The rest, 18 SSRs were found

polymorphic producing more than one allele and thus indicating heterozygous condition on

these loci (Table 4.22). The polymorphic SSRs showed allele variation from 2 to 9 with an

average of 3.56 alleles per SSRs (Table 4.22). The SSR JGM_CD232 showed maximum of 9

alleles followed by SSR JGM_CD348, JGM_CD421, JGM_CD092 with 5 alleles and

JGM_B034, JGM_B038 with 4 alleles. The rest 8 and 4 SSR produces 3 and 2 alleles

respectively. The heterozygosity, calculated as proportion of heterozygous individuals in

population, varied from 0.00 to 1.00 with an average of 0.37. However, majority of the

markers (61%, 11 out of 18) showed heterozygosity variation from 0.00 to 0.22 indicating

low level of heterozygosity at these loci. The rest 7 SSRs showed heterozygosity from 0.6 to

1.0 (mean 0.84) indicating higher proportion of heterozygosity at these loci.

132

Table 4.22 Polymorphism feature of 18 polymorphic SSRs among 48 progenies of single

plants used for heterozygosity assessment.

Marker Allele No. Gene Diversity Heterozygosity PIC

JGM_A439 3.00 0.06 0.02 0.06

JGM_A464 2.00 0.49 0.63 0.37

JGM_B034 4.00 0.48 0.22 0.40

JGM_B038 4.00 0.12 0.04 0.12

JGM_B041 2.00 0.17 0.06 0.16

JGM_B054 3.00 0.10 0.02 0.10

JGM_B062 3.00 0.25 0.13 0.23

JGM_B190 2.00 0.50 0.96 0.37

JGM_B479 3.00 0.50 0.84 0.39

JGM_B586 3.00 0.54 1.00 0.44

JGM_CD005 3.00 0.06 0.02 0.06

JGM_CD092 5.00 0.29 0.00 0.28

JGM_CD106 3.00 0.52 0.77 0.41

JGM_CD232 9.00 0.25 0.08 0.25

JGM_CD421 5.00 0.53 1.00 0.42

JGM_CD469 3.00 0.50 0.09 0.40

JGM_CD128 2.00 0.04 0.00 0.04

JGM_CD348 5.00 0.55 0.73 0.48

Range 2.00-9.00 0.04-0.55 0.00-1.00 0.04-0.48

Mean±SD 3.56±1.69 0.33±0.20 0.37±0.41 0.28±0.45

Chapter 5 Discussion

133

Jatropha curcas L. (2n=22), a perennial shrub producing non-edible oil, has emerged as a

renewable source of biodiesel production. J. curcas was distributed by Portuguese seafarers from

the Caribbean via the Cape Verde Islands and Bissau Guinea to other countries as in Africa and

Asia (Heller 1996). Among the potential biofuel crops, J. curcas has been gaining importance as

the most promising oilseed plant as it does not compete with the edible oil supplies (Johnson et

al. 2011; Achten et al. 2008; Fairles 2007). It attracts worldwide attention of research

communities to study and analyze its potential for biodiesel production. Various research

programs have been initiated on different aspects of crop improvement including agronomy,

conventional breeding, molecular breeding, genetic engineering etc (Kaushik et al. 2007; Rao et

al. 2008; Sunil et al. 2009; Liu et al. 2011; Wang et al. 2011; Shabanimofrad et al. 2011; Sun et

al. 2012; King et al. 2013) As per previous reports, J. curcas is still considered as an

undomesticated/semi-domesticated plant which needs further genetic improvement through

intervention of both conventional as well as molecular breeding approaches.

Therefore, there is a serious need to take initiative for its genetic improvement for

adaptability, agronomically desirable traits, yield and oil content. To initiate any genetic

improvement or breeding program the knowledge of the amount of genetic variability existing in

the material, type of gene action governing the yield and its components and a clear-cut

understanding of other important genetic parameters are essential. Based on these genetic

parameters successful exploitation of the available genetic variability and formulation of

efficient breeding program could be exercised. The assessment of the level and pattern of genetic

relationship among germplasm accessions is an important component of genetic improvement

program. The informations obtained could be utilized for (i) analysis of genetic variability (Cox

et al. 1986) (ii) identification of diverse parental combinations to create genetic variability for

further selection (Barrett and Kidwell 1998) and (iii) introgression of desirable genes from

diverse germplasm into the available genetic base (Thompson et al. 1998). Several types of data

sets (morphological, biochemical and molecular markers) and tools have been used for studying

genetic variability and relationship among accessions.

Genetic diversity can be assessed using either morphological traits or molecular markers

or a combination of both. The study of genetic diversity based on morphological traits has not

been always reliable as they are highly influenced by environment as compared to diversity

based on DNA markers which has been shown to be more reliable and independent of

environmental factors. The application of molecular markers for various genetic studies in a

134

particular plant depends upon the availability and types of molecular markers. If there is limited

number or complex markers available, then their application might be limited to that particular

plant. Thus, it would very important to develop large number and simple marker system to be

implemented for various genetic studies. There is a surge of interest in identifying a large

number of molecular markers for rapid application in the assessment of genetic diversity and the

selection of desired genotypes. Among the various molecular markers employed to assess

genetic diversity and other genetic studies, PCR-based molecular markers such as random

amplified polymorphic DNA (RAPD), amplified fragment length polymorphism (AFLP), inter

simple sequence repeats (ISSR), microsatellite or simple sequence repeats (SSRs) and single

nucleotide polymorphism (SNP) are the important ones. The plant of J. curcas has been

projected as potential source of biodiesel production, however very limited genetic studies have

been reported including marker based diversity assessment in larger population. A few marker

based diversity studies reported low level of genetic variability in J. curcas. Thus, in view of the

previous report of low level of genetic diversity and limited number of validated markers, there

is a need to enrich the genetic pool, develop and validate large number of polymorphic markers

and evaluate larger population for variability and new alleles. Therefore, the present

investigation was undertaken to develop a large set of SSR from four microsatellite enriched

genomic libraries and their characterization, validation, identification of polymorphic SSRs,

assessment of genetic diversity in global population. In addition, phenotypic evaluation of large

collections from various part of the India was also carried out to assess the variability in various

important quantitative traits associated with yield and its contributing traits. Efforts have also

been made to develop interspecific hybrid between J. curcas x J. integerrima and their

characterization based on SSR marker. Further, level of heterozygosity was also assessed using

SSR markers. The results obtained on the above various aspects are discussed below.

5.1 Phenotypic characterization of indigenous accessions of J. curcas

The morphological characterization is the first and important step in the description and

classification of germplasm (Smith and Smith 1989). The systematic work on germplasm

exploration, characterization, utilization and documentation is of paramount importance to

identify genetic variability for the desired traits. There are a very few reports on genetic

characterization of J. curcas and even fewer on systematic assessment of phenotypic characters

of different accessions. Most of the morpho-metric trait based genetic studies carried out earlier

135

were restricted to smaller number of accessions collected from limited areas. However, to

explore and exploit the available genetic resources, an extensive survey, collection and

evaluation was required to find out potential genetic material for further genetic improvement.

The basic genetic studies related to various statistical parameters such as range value of trait,

coefficient of variability, variances, heritability and genetic advance guides plant breeders to

devise suitable breeding strategies. The genetic studies based on the multivariate analysis is a

powerful tool for determining the degree of divergence between populations, the relative

contribution of different components to the total divergence and the nature of forces operating at

different levels. Additionally, the selection parameter such as correlation and path coefficient

also provides direction for selected desired plant types. Thus, in the present investigation, 80

accessions of J. curcas collected from different states of India were undertaken to evaluate

genetic variability, assess genetic divergence, correlation and path coefficient among various

traits. The major traits evaluated and studied include female flowers/plant, male flowers/plant,

male/female ratio, no. of fruits/plant, no. of seeds/plant, fruit weight/plant (g), seed weight/plant

(g) seed length (mm), seed width (mm) and oil content (%).

(a) Genetic variability and divergence

The genetic variability studies are the prerequisite and significant for developing genetic

improvement strategies (Gairola et al. 2011). The genetic divergence has been of considerable

importance and powerful tool for determining the degree of divergence between population,

relative contribution of different components to the total divergence and the nature of forces

operating at different levels. The measurement of genetic divergence helps the breeder to select

genotypes to be utilized in hybridization program for creating genetic variability, heterosis or

identification of donor parents for useful agronomic traits and select desirable segregants (Patra

1985, Singh et al. 2003, Singh 1991, Singh and Mittal 1993). Yield is an ultimate expression of

various yield contributing characters; direct selection for yield could be misleading (Islam and

Rasul 1998; Nath and Alam 2002). This is difficult to judge what proportion is non-heritable,

that is, environmental. If variability in population is largely due to genetic causes with least

environmental effect, probability of isolating superior genotype is a prerequisite for obtaining

higher yield. The process of breeding such population is primarily conditioned by magnitude and

nature of interaction of genotypic and environmental variations in plant characters. So, it

becomes necessary to partition the observed variability into its different components and to have

136

an understanding of parameter such as genetic coefficient of variation, heritability and genetic

advance. There are few reports on genetic characterization and assessment of phenotypic

characteristics of different accessions of J. curcas. Comparatively larger variations in seed traits

have been reported from various parts of the world (Gairola et al. 2011, Ghosh and Singh 2011,

Ginwal et al. 2005, Kaushik et al. 2007, Rao et al. 2008, Mohapatra and Panda 2010, Biabani et

al. 2012, Ouattara et al. 2013, Guan et al. 2013, Christo et al. 2014), while other traits exhibited

high genetic similarity. Singh et al. (2013b) found substantial amount of genetic variability

among the seed yield, oil content and its contributing traits in 24 accessions of J. curcas. Das et

al. (2010) revealed the considerable genetic variability in most component traits and seed yield,

except primary branches/plant, fruits/bunch and seeds/fruit among 16 J. curcas genotypes.

In the present investigation, the genetic variability and divergence analysis was estimated

among 80 accessions of J. curcas for 10 quantitative characters (Table 4.1). The genetic

variability of 80 accessions of J. curcas showed that the oil content varied from 20.8 to 36.1%

with an average of 26.2±0.38. Out of 80 accessions, 26 accessions were found to have oil content

above average value of 26.2% and only 3 accessions namely NBJC1055, NBJC1051 and

NBJC1048 were found to have oil content above 35%. So, majority of the accessions studied in

the present investigation were found to be low oil yielding. Contrary to this, Foidl et al. (1996)

and Berchmans and Hirata (2008) reported high oil content upto 40%. Kaushik et al (2007)

studied on 24 accessions of J. curcas and found the oil content varied from 28% to 38.80%. Rao

et al. (2008) studied on 32 wild accessions of J. curcas and found the oil content ranged from

29.85% to 37.05%. Likewise, only 37 accessions had seed weight/plant above average value

(i.e.180.2g) and of which only 4 accessions had seed weight/plant above 250g with maximum in

accession NBJC1078 (273.08g). This study showed that the accessions of J. curcas had low seed

and oil yield. Similarly, Das et al. (2010) recorded seed yield/plant above average value

(i.e.>84.7 g) of 16 J. curcas genotypes, which is lower than our result, average value of seed

yield/plant (i.e. 180.2 g). Rao et al. (2008) recorded 263.97 g as maximum seed yield/plant.

In order to assess the heritable portion of total variability, the phenotypic variance (2p)

was partitioned into genotypic (2g) and error variance (

2e). The values of error variance were

found to be higher than those of genotypic variance (2g) for number of female flowers/plant,

male/female ratio, number of fruits/plant, number of seeds/plant and seed weight/plant

suggesting much influence of environmental factors on these traits. The phenotypic coefficient of

variation (PCV) and genotypic coefficient of variation (GCV) varied from 7.51 to 25.48 and 7.05

137

to 17.87% respectively. Maximum PCV and GCV were noticed for seed weight/plant followed

by fruit weight/plant, number of male flowers/plant, number of seeds/plant. The PCV was found

higher than that of respective GCV for all the traits with remarkable differences in their values.

Similar finding of higher PCV over GCV for various traits in J. curcas has also reported

previously by several researchers (Rao et al. 2008, Kaushik et al 2007, Singh et al 2013b). The

traits as seed length, seed width and oil content have insignificant differences in PCV and GCV

values. Similar findings of small differences in PCV and GCV values were also been reported

earlier by Kaushik et al. (2007) and Rao et al. (2008) for these traits. The genetic improvement of

these traits with small differences in PCV and GCV values can easily be achieved by selection of

promising plant types and also through crossing the desirable accessions among themselves

followed by selection in segregating generations.

The estimate of genetic variability alone is considered as not much helpful in determining

the heritable portion of total genetic variation unless until coupled with the estimate of

heritability. The estimate of heritability along with variability can provide more insight towards

the amount of genetic advance to be expected from the selection process. Thus, the knowledge

of heritability of a character become important as it indicates the possibility and extent to which

improvement is possible through selection (Robinson et al. 1949). It measures the genetic

relationship between parents and progeny and has been widely used in determining the degree to

which a character may be transmitted from parent to offspring. However, the most important

function of heritability in the genetic studies is its predictive role, expressing the reliability of

phenotypic value as a guide to breeding value that determines the influence on the next

generation. Therefore, if a breeder chooses individuals to be parents according to their

phenotypic values, the success in changing the characteristics of population can be predicted

only from knowledge of the degree of correspondence between phenotypic values and breeding

values. Broad sense heritability varied from 20% to 90% and maximum was observed for oil

content (90%) followed by seed length (88%) and seed width (87%). The lowest heritability

(20%) was noticed for male/female ratio and other traits have moderate heritability ranging from

38% to 54%. High heritability was recorded for oil content (99.00%) and seed weight (96.00%)

by Kaushik et al. (2007). High heritability for seed traits was also reported by Rao et al (2008).

However, low heritability (77.956%) was reported for oil content by Singh et al. (2013b).The

high heritability noticed for oil content, seed length and seed width indicates that these characters

are under genotypic control. However, heritability estimates may differ widely in the same crop

138

and same trait (Rasmuson 2002) because heritability always refers to a defined population and a

specific experimental set up (Holland et al. 2002, Nyquist 1991).

Considering high heritability alone is not enough in making efficient selection in advance

generation unless accompanied by substantial amount of genetic advance (GA), which provides

the information about the degree of gain in a character obtained under a particular selection

pressure (Johnson et al. 1955). Expected genetic advance, as a function of selection intensity,

phenotypic variance and heritability has an added advantage over heritability as a guiding factor

to the breeders in a selection program (Singh and Singh 1981). The genetic advance as percent of

mean varied from 8.45 for male/female ratio to 25.81 for seed weight/plant. The higher genetic

advance for seed weight/plant, oil content, and number of fruits/plant was might be due to the

presence of variation in these traits. High heritability coupled with high GA and GCV for oil

content suggests that this trait was primarily controlled by additive gene action and any simple

selection model would be advantageous to obtain the desired genetic gain. Low heritability with

high GA for seed weight/plant, fruit weight/plant and number of seeds/plant and high heritability

with low GA for seed length and seed width indicates that these traits might be largely governed

by non–additive gene actions and hence much improvement cannot be achieved through

selection. Similar to the present findings, high heritability and low GA for seeds length and seed

width was also reported by Kaushik et al. (2007) and Rao et al. (2008). Das et al. (2010) reported

moderate to high heritability accompanied with high genetic advance for seed yield/plant,

fruits/plant and flowering bunches/plant indicated additive gene action. Singh et al. (2013b)

reported the higher heritability with GA for plant height (74.101%, 80.171), oil content

(77.956%, 16.120) and fruit set (57.430%)

Further, the potential accessions could be identified by analyzing genetic diversity in the

genetic resources collected/available, which will further facilitate various genetic improvement

programs. The accurate assessment of the level and pattern of genetic diversity is an important

component of breeding for crop improvement program. It has diverse applications viz. i) analysis

of genetic variability (Smith 1984; Cox et al. 1986), ii) identification of diverse parental

combinations to create genetic variability for further selection (Barrett and Kidwell 1998) and iii)

introgression of desirable genes from diverse germplasm into the available genetic base

(Thompson et al. 1998). Multivariate analysis techniques, which simultaneously analyze multiple

measurements on each individual under investigation, are widely used in analysis of genetic

diversity.

139

The multivariate analysis for diversity assessment results showed that the simultaneous

testing of significance based on Wilk’s lambda criterion for pooled effect of all the characters

have significant differences among the population (2 = 790 df = 3079.06**). A hierarchical

cluster analysis based on Wards minimum variance grouped all the 80 accessions into 4 clusters

(Table 4.2, Figure 4.1). The number of accessions per clusters varied from 6 (cluster III) to 29

(cluster II). The cluster II was largest with 29 accessions collected from 11 states of India

followed by the cluster I which was second largest comprising 27 accessions collected form

Uttar Pradesh, Gujarat, West Bengal, Rajasthan, Tamil Nadu, Himachal Pradesh, Bihar,

Chhattisgarh, Haryana, Punjab, Madhya Pradesh, Uttaranchal, Jharkhand and Kerala. The cluster

III was the smallest with 6 accessions collected from 3 states i.e. Uttaranchal (4), Jharkhand (1)

and Rajasthan (1). The cluster IV had 18 accessions collected from Chhattisgarh, Uttar Pradesh,

Tamil Nadu, Himachal Pradesh, Haryana, Orissa, Andhra Pradesh and West Bengal. The

clustering of the accessions based on multivariate analysis showed that majority of accessions

i.e. 56 accessions (70%) were genetically close to each other and grouped only into two clusters

i.e. cluster I and II. The distribution of accessions from same origin/geo-graphical region into

different clusters or vice-versa indicated that the geographical origin is not related to genetic

divergence. Similar findings were also reported earlier by Rao et al. (2008) and Sudheer et al.

(2010) based on morphological traits and molecular marker studies respectively. The tendency of

accessions occurring in clusters cutting across the eco-geographical boundaries demonstrate that

geographical isolation need not necessarily be related to genetic diversity and was at random

(Murty and Arunachalam1966; Ramanujam et al. 1974). Such unparallelism between

geographical distribution and genetic divergence might be due to some forces other than

geographical distance as ancestral relationship, genetic drift, free exchange of genetic material

from one place to another and variable degree of selection in different regions.

The maximum intra cluster distance was noticed in cluster IV (30.15) followed by cluster

I (28.61), cluster II (25.89) and cluster III (24.77). The accessions of cluster IV had

comparatively more diversity as compared to other clusters as indicated by its intra cluster

distance. The inter cluster distance varied from 47.59 (between cluster I and cluster II) to 211.27

(between cluster III and cluster I). Based on cluster distance, the cluster III showed maximum

genetic distance with cluster I, followed by cluster IV and cluster II suggesting comparatively

wider genetic diversity among them (Table 4.3). The accessions from these clusters could be

utilized in hybridization program to get desirable transgressive segregants in their offspring, as

140

there is a higher probability that unrelated genotypes would contribute unique desirable alleles at

different loci (Beer et al. 1993; Peeters and Martinelli 1989). Considering the cluster means

(Table 4.4), the cluster I showed highest mean value for all the traits except oil content. On

contrary, the cluster III had lowest mean value for number of male flowers/plant, male/female

ratio, fruit weight/plant, seed weight/plant, seed length and seed width. The cluster I and cluster

III seems to be unique in having highest and lowest cluster mean value for most of the traits

respectively and also had highest inter cluster distance between them. The crossing among the

accessions of these two clusters may yield hybrids with desirable traits.

In order to assess the patterns of variation, principal component analysis (PCA) was also

carried out by considering all the 10 variables simultaneously. The first four principal

components (PCs) accounted for more than 93% of the total variation (Table 4.5). PCA is a

multivariate technique that allows to find the major patterns within a multivariate data set.

Associations between traits emphasized by this method may correspond to genetic linkage

between loci controlling traits or a pleiotropic effect (Iezzoni and Pritts 1991). The first principal

components accounted for 42.5% of the total variation due to seed length, seed width, seed

weight/plant and number of seeds/plant which had maximum and positive weight on this

component. Oil content had negative weight on PC1. Thus, PC1 related to the accessions with

thick seeds, moderate to high seed yielder with low oil content. The PC2 concentrated 32% of

total variation and was positively associated with seed weight/plant, number of seeds/plant and

female flower/plant. The oil content had highest negative weight on PC2 also followed by seed

length, seed width and male female ratio. The third PC accounted for 12% variation was mainly

due to seed weight/plant, number of seeds/plant, number of fruits/plants and seed width had

negative weight. The seed weight/plant invariably had almost equal and positive weight on all

four components.

(b) Correlation and path coefficient analyses

Correlation studies provide reliable information on the nature, extent and direction of selection.

A breeder is always concerned with selection of genotypes whose performance is dependent on

the phenotypic performance. In general, the selection based on phenotypic performance does not

always lead to the expected genetic advance due to the presence of genotype x environmental

interaction. Dewey and Lu (1959) emphasized to recognize the nature of population under

consideration as the magnitude of correlation coefficient could often be influenced by the choice

141

of individuals upon which the observations are made. However, there was no way in which yield

could be changed without changing one or more of the components (Grafius 1964; Singh and

Singh 1979; Singh and Khanna 1993). The phenotypic correlation is a function of genotypic and

environmental correlations. Partitioning of phenotypic correlation into these two components, is

therefore important to breeders as it is only the genetic component which decides the usefulness

of such associations between the characters as it indicates the degree to which various traits of

the plant are associated with economic productivity. Correlation study thus provides information

on correlated response of important plant traits and therefore leads to a directional model for

yield response. So, in the present study genotypic and phenotypic correlation between the yield

and its various contributing traits had been calculated in 80 diverse accessions of J. curcas over

pooled data of two years and are presented in Table 4.1. The values of phenotypic and genotypic

correlations were of the same sign except for few such as female flowers/plant vs male

flowers/plant, male female ratio vs fruit weight/plant. In general, the magnitude of genotypic

correlations was slightly higher than the corresponding phenotypic correlation, which might be

due to modifying effect of environment (Singh et al. 2003, 2004). In some cases, where the

magnitude of genotypic and phenotypic correlations were nearly the same, the environmental

covariance was very small, which indicates that the influence of environment on these

correlations was minimal (Falconer 1989). The seed weight/plant was significantly and

positively associated with female flowers/plant, male flowers/plant, number of flower/plant,

number of seeds/plant, fruit weight/plant, seed width and negatively associated with oil content.

Positive and significant correlation of seed yield with male female ratio and flower number was

also reported earlier by Rao et al. (2008). Negative association of seed length with oil content

was also reported by Kaushik et al. (2007) and Rao et al. (2008). On contrary to the present

investigation, positive and significant association of seed yield with oil content was reported by

Kaushik et al. (2007). The oil content was found to be negatively and significantly correlated

with all the traits studied with strong negative association with female flower/plant followed by

male flowers/plant, number of seeds/plant, fruit weight/plant and seed weight/plant. Positive

association of seed yield/plant with fruits/plant was also reported by Das et al. (2010).

Among component traits, female flower/plants was significantly and positively correlated

with male flowers/plant, number of fruits/plants, number of seeds/plants, fruit weight/plants and

seed length and negative association with oil content. Male flower/plant had significant and

positive correlation with male female ratio, number of fruits/plant and fruit weight/plant. Male

142

female ratio had significant and negative correlation with fruit weight/plant. The negative

association of male female ratio with seed yield was also reported earlier by Singh et al. (2013b).

Number of fruits/plant had significant and positive correlation with number of seeds/plant, fruit

weight/plant, seed length and seed width. Number of seeds/plant had significant and positive

correlation with fruit weight/plant and negatively correlated with oil content. Fruit weight/plant

had significant and positive correlation with seed length and seed width, while negatively

correlated with oil content. Seed length had significant and positive correlation with seed width

which was also previously reported by Kaushik et al. (2007) and Guan et al. (2013). The positive

and significant association of major component traits among themselves in general and with seed

weight/plant in particular suggests that selection of component traits jointly or individually may

enhance the seed yield. However, oil yield could be compromised upto some extent as it showed

negative association with most of the component traits. Therefore, precaution should be taken

while selecting plant type, based on component traits so that both seed yield and oil yield could

be optimized to its maximum potential.

Correlation coefficient reveals over all relationship between two traits which may be

negative or positive in nature and it is the net result of direct effect of a particular trait and

indirect effects via other traits. It does not permit the cause and effect relationship of traits

contributing directly and indirectly towards economic yield. In order to predict the direct and

indirect effect of traits on correlation among various traits, the path coefficient analysis studies

are being carried out. The path analysis is a statistical technique used primarily to examine the

comparative strength of direct and indirect relationship among variables and thus permits a

critical examination of components that influence a given correlation and can be helpful in

formulating an efficient selection strategy (Shipley 1997; Scheiner et al. 2000). This approach

quantifies the relationship among variables based on a priori assumptions, which traits are to be

included in analysis. Such assumptions are somewhat subjective but path coefficient may allow a

better understanding of the interrelationships between traits than correlation tables with all

possible combinations between all the traits. The path analysis data specifies the causal and non-

causal paths between independent and dependent variables.

The path coefficient analysis for seed yield in J. curcas germplasm was performed and

results are presented in Table 4.7. The male flowers per plant had the maximum direct effect on

seed yield, followed by number of seeds/plant, seed width, number of fruits/plant and oil content.

On the other hand female flowers/plant, male female ratio, fruit weight/plant, seed length

143

exhibited negative direct path effect on seed yield but showed positive and significant correlation

on seed yield except male female ratio. Similarly, Singh et al. (2013b) also reported negative

direct effect of path coefficient and negative correlation of male female ratio on seed yield/plant

in J. curcas. The negative direct effect of female flowers/plant, fruit weight/plant, seed length on

seed yield/plant was counterbalanced by indirect positive effect via male flowers/plant, number

of fruits/plant, number of seeds/plant, and seed width. The apparent contradiction was probably

due to the fact that the total correlation simply measures mutual association without considering

the causation, whereas the path analysis specifies the causes and measures their relative

importance (Bhatt 1973). Male flowers/plant, number of fruits/plant, number of seeds/plant, seed

width had indirect positive effect and influenced female flowers which indirectly affect yield.

Male female ratio showed negative indirect effect and negative correlation with yield. Oil

content had positive direct effect on seed yield/plant though it had negative association which

was due to negative indirect effect via male flowers/plant, number of seeds/plant, number of

fruit/plant and seed width. The positive direct effect of male flowers/plant, number of

fruits/plant, number of seeds/plant, seed width and oil content for seed yield/plant indicated that

plant types with larger seeds, higher number of male flowers, fruits and seeds would be desirable

trait for improving seed yield in J. curcas. The results of the present investigation suggests that

selection in J. curcas based on male flowers/plant, number of fruits/plant, number of seeds/plant,

seed width and oil content would be advantageous to achieve the desirable goals. The indirect

selection through other component traits would also be rewarding to improve the seed yield.

5.2 Development of large scale genomic derived SSRs from di- and tri-

nucleotide enriched genomic libraries

Microsatellites or simple sequence repeat (SSR) has been widely preferred and used in recent

years for various genetic studies in plants including diversity analysis, linkage map construction,

QTL/association mapping (Wang et al. 2011; Liu et al. 2011; Sun et al. 2012; King et al. 2013;

Yue et al. 2013) and marker assisted selection. Among different approaches of developing

molecular markers, the use of microsatellite enriched genomic libraries has been widely used for

SSRs development. Construction and screening of partial genomic libraries and sequencing of

SSR positive clones have been reported to be an effective method for SSR isolation (Rafalski et

al. 1996; Lench et al. 1996; Lunt et al. 1999; Chen et al. 2005). However, the process of SSR

marker development and characterization of J. curcas is still very limited. In the present

144

investigation, 2329 SSRs (1207 from di-nucleotide enriched libraries, Lib A and B, and 1122

from tetranucleotide enriched libraries, Lib C and D) were developed, characterized, evaluated

for polymorphism. Traditional methods for the identification of microsatellite markers usually

demand the construction of small-insert genomic libraries, colony selection and plasmid isolation

and sequencing of selected clones, primer design for suitable flanking regions and assessments

on the marker polymorphism by PCR analysis on a germplasm sample. The advent and

advancement in next-generation sequencing (NGS) technology offers fast and cheap mode of

genome-wide and gene based SSR development more efficiently (Zalapa et al. 2012). More

recently, researchers are being applying next-generation sequencing technologies to generate

sequence data for the genome identification of microsatellite regions and primer design

(Abdelkrim et al. 2009; Castoe et al. 2010; Csencsics et al. 2010; Zhu et al. 2012). A few

preliminary studies were conducted in the past for the development of molecular markers and

diversity analysis in J. curcas. Thus, in view of the previous report of low level of genetic

diversity and limited number of validated markers we applied next generation Roche 454 GS

FLX sequencer for the development of genomic SSRs from two tri-nucleotide (AAC and AAT)

enriched genomic libraries of J. curcas. The newly identified SSRs were characterized,

validated, analyzed for polymorphism and were used to evaluate the global collection of J.

curcas for variability and new alleles.

(a) SSRs developed from Lib A and B

The microsatellite enrichment level was reported to vary from 11 to 99% in previous studies in

different crop plants (Techen et al. 2010). In the present investigation, the microsatellite

enrichment efficiency was noticed upto 87.9% and a total of 1315 unique sequences were

recovered from both the libraries at 44.2% recovery rate (Table 4.8). The primers were designed

for 1207 SSR containing sequences with 92% of success for primer designing. Wang et al.

(2008) developed SSRs for flowering dogwood (Cornus florida L.) and approximately 94.6%

primers were designed from 351 unique SSR sequences. Sun et al. (2008) have also reported

almost similar enrichment efficiency (89.5%) in J. curcas, however, Sudheer et al. (2010) found

lower enrichment efficiency (39.0%). Most of the SSRs recovered were of perfect types (77%)

which were found to be higher than reported earlier 44.6% in Solanum melongena L. by Nunome

et al. (2009). Das et al. (2012) recovered 62.03% of perfect SSRs in Jute (Corchorus). Wang et

al. (2008) found 43.5% perfect SSRs in Cornus florida. Since, the libraries were enriched with

145

CA and GA repeats, the frequency of core repeats was found to be high for dinucleotide motif,

though other repeat motif were also recovered. The maximum number of clones of CA enriched

library had AC/GT repeat motif and GA enriched library had AG/CT repeat (Figure 4.4). The

non-targeted AT motifs were also recovered in significant number and were the major part of

compound SSRs with targeted repeat motifs. Similar findings have also been reported in other

plant genomes like Maize (Taramino and Tingey 1996), sunflower (Paniego et al. 2002), and

safflower (Hamdan et al. 2011). The AT repeats motifs among DNRs are reported to be most

common in genomic sequences of Arabidopsis (Cardle et al. 2000). Recently, Sato et al. (2011)

also reported high frequency of AT repeat motif (71%) among DNR and AAT (60%) among

TNR in the genome sequence of J. curcas. The appearance of AT motif in the present

investigation might be due to its higher frequency in J. curcas genome. Considering the repeat

length of SSR motif, it was noticed that most of the SSRs (65%) showed short repeat length in

the range of 4-10 and only 37 (2.2%) were found to have repeat length of more than 20 (Figure

4.3).

(b) SSRs developed from Lib C and D

New and revolutionary sequencing methods, referred to as next-generation sequencing (NGS)

are extremely high-throughput technology that produce thousands or millions of sequences at

once at a fraction of the cost of traditional Sanger methods (Shendure and Ji 2008; Ekblom and

Galindo 2011). A specific application of this new technology in plants is the possibility of rapid

and cost-effective discovery of simple sequence repeat (SSR) or microsatellite loci. In the

present investigation, we used Roche 454 GS-FLX sequencer for developing genomic SSRs

from tri-nucleotide repeat motif (AAT and AAG) enriched genomic libraries of J. curcas. The

assembled sequences data yielded a total of 5,142 unique sequences having 5,844 putative SSRs.

In total only 1,122 flaking primers were successfully designed. The SSR recovery rate (23%) and

success of primer designing (22%) was quite low as compared to the SSRs developed from Lib

A and B using Sanger sequencing. The low rate of primer designing might be due to the

existence of marginal SSRs as the sequences generated from 454 sequences were shorter than

those generated from Sanger sequencing. As the libraries were enriched with trinucleotide

repeats, 59.0% of the SSRs recovered were of trinucleotide repeat type followed by

tetranucleotide repeats (25.0%). The other repeat motifs, i.e. DNR, PNR and HNR were also

recovered with less than 6% frequency. Among the TNR, the repeat motif AAT/ATT and

146

AAG/CTT were found in major frequency (73.2%) followed by ATC/ATG (7.0%) and

ACC/GGT (6.0%). Although, we have targeted TNR SSRs but significant amount of TtNR SSRs

was also recovered and among them AAAT/ATTT motif was found in maximum frequency

(51.0%). Considering the repeat length of SSR motif, it was noticed that most of the SSRs (95%)

showed short repeat length in the range of 4-10 and only 16 (0.3%) were found to have repeat

length of more than 20. Pootakham et al. (2012) developed genomic derived SSR markers in

Hevea brasiliensis using 454 pyrosequencing. They identified a total of 24674 SSRs, of which

the frequency of mononucleotide repeats (47.12%) and dinucleotide repeats (29.72%) comprise

of the two largest groups of repeat motifs. Less commonly found were trinucleotide repeats

(13.42%), followed by tetranucleotide repeats (6.07%) and pentanucleotide repeats (2.94%), with

hexanucleotide repeats being the least abundant at merely (0.71%). Yang et al. (2012) developed

microsatellite markers for Faba bean using 454 pyrosequencer. They identified a total of 250,393

SSRs and most common SSR motifs comprised trinucleotide and dinucleotide. The dinucleotide

repeats (AC/GT)n and (AG/CT)n were predominant, representing 99.2% of all the dinucleotide

characterized. Trinucleotide (AAC/GTT)n repeats were the most abundant (96.5%) which

showed higher enrichment than our results.

(c) Similarity search and functional annotation

A sequence similarity search SSR containing sequences derived from di-nucleotide enriched

libraries showed 50% similarity with Ricinus communis and a total of 1539 GO term assigned

based on Gene Ontology. A significant putative function could be assigned for 17% of

sequences. Likewise, the similarity search of trinucleotide SSR containing sequences showed

maximum similarity of 63% with Ricinus communis (F. Euphorbiaceae). Significant similarity

was also observed with Populus trichocarpa (19%), Vitis vinifera (9%), Glycine max (3%) and

Medicago truncatula (2%).

147

5.3 PCR optimization, polymorphism detection and characterization of

developed SSRs for various attributes

With the help of Sanger and next generation sequencing a total of 2,329 genomic derived SSRs

have been developed for J. curcas. The discovery and development of SSR markers is not of

much use until unless characterized and studied for polymorphism. The characterization and

polymorphism analysis helps to identify suitable markers which will be deployed in various

genetic studies. Therefore, all the SSRs developed in the present investigation were subjected to

PCR optimization for proper amplification protocol, polymorphism detection and studies on

various marker attributes. The results obtained are discussed below.

All the 1,207 SSRs developed from Lib A and B were amplified with 7 accessions of

J. curcas and one accession of J. integerrima for PCR optimization and polymorphism detection.

J. integerrima was included in the polymorphism screening panel to check the cross species

transferability of the newly developed SSRs and also to identify polymorphic SSRs between

J. curcas and J. integerrima which were used as parental lines for developing an interspecific

mapping population. J. integerrima is the only reported species of the genus Jatropha which

showed hybridization compatibility. Previously, researchers have used it to develop interspecific

hybrids for genetic improvement, linkage mapping and other genetic studies (Sujatha and

Prabhakaran 2003; Wang et al. 2011). The PCR amplification results showed that approximately

90% of the primers (1089 out of 1207 SSRs) have been successfully amplified with expected

amplicon. This was comparatively higher proportion of successful primer amplification than

reported earlier in J. curcas by Yadav et al. (2011, 78%) and Sudheer et al. (2010, 74%).

However, it was comparable to results obtained by Sato et al (2011, 88%). The polymorphism

rate among different accessions of J. curcas was found to be low (23%) as compared to the

previous studies in Jatropha (Yadav et al. 2011; Sudheer et al. 2010), however it was higher than

that reported by Sun et al (2008). We observed ~52% polymorphism reduction when non toxic

accession (NBJC195) was excluded from the analysis (only 129 polymorphic out of 1089) which

showed that the maximum polymorphism was due to non toxic accession. The diverse nature of

non toxic accession has also been reported in earlier studies (Basha et al. 2009; Makkar et al.

1998). Further, the polymorphism analysis with respect to parental lines of intra and interspecific

mapping population showed low among intraspecific parental lines (Lib A SSRs ~12%; Lib B

SSRs ~19%) as compared to interspecific parental lines (Lib A SSRs ~25%; Lib B SSRs ~33%).

The cross species transferability of SSRs to J. integerrima was ~52%. The 248 polymorphic

148

SSRs could be potential markers for future genetic studies in Jatropha as they showed

considerable amount of allelic variation (2-5 alleles per locus) among limited accession of J.

curcas.

The PIC value reveals the informativeness level and accordingly defined into high

(PIC>0.5), moderate (0.5>PIC >0.25) and low (PIC<0.25) categories (Botstein et al. 1980). The

SSRs developed exhibited low level of informativeness with an average PIC value of 0.24±0.10

and 0.28±0.13 of Lib A and Lib B respectively. The majority of the SSRs (71.0%) showed lower

PIC value (<0.30) which might be due to either low level of genetic diversity available in the

germplasm or less number of accessions studied. This is further supported by the fact that

exclusion of NBJC195 (non-toxic accession) further reduces the PIC value. No specific

correlation between number of repeats and number of alleles or PIC value was observed in J.

curcas. Similar findings were also reported in previous studies (Ferguson et al 2004; Hossain et

al. 2000; Gupta et al. 2012b). However, a positive correlation was reported in grape (Bowers et

al. 1996) and Arachis (Moretzsohn et al. 2005). The low average PIC value and number of

alleles observed suggests narrow genetic diversity available in the gene pool of J. curcas. Based

on the PIC value and other parameters the SSRs JGM_A281, JGM_A326, JGM_A244,

JGM_A107, JGM_A577, JGM_B300, JGM_B361, JGM_B595, JGM_B176, JGM_B330 and

JGM_B248 were found to be highly informative and could be potential markers for various

genetic studies in J. curcas.

Similarly, all the 1,122 SSRs developed from Lib C and D were also evaluated with the 7

accession of J. curcas and on accession of J. integerrima for PCR optimization and

polymorphism detection. Out of 1122 SSRs, 447 (40%) were found to be polymorphic among 7

accessions of J. curcas, 570 (51%) were monomorphic and the rest 105(9%) either failed to

amplify or gave non-specific amplification. The genotypic data showed that 273 SSR primers

failed to amplify with J. integerrima and 226 found to be monomorphic. However, the

polymorphic SSRs increases from 447 to 623 (73%) with respect to J. integerrima and J. curcas.

In this way, approximately 91% primers amplified among the different accessions of J. curcas

and 76% between J. curcas and J. integerrima which was in accordance with our previous report

with dinucleotide SSRs (Maurya et al. 2013). SSR polymorphism among 7 accessions was found

to be substantially higher (44%) as compared to earlier report of Maurya et al. (2013), Ouattara

et al. (2014) but less than that reported by Yadav et al. (2011), Mastan et al. (2012), Salvador-

Figueroa et al. (2014) and Osorio et al. (2014). The transferability of SSRs from one species to

149

another species might be useful for various genetic analyses and such studies have been reported

in various plants (Castillo et al. 2010; Ince et al. 2010; Chai et al. 2013, Yu et al. 2013). The

SSRs developed in the present investigation showed a higher transferability rate (76%) to J.

integerrima in comparison to reported earlier (Maurya et al. 2013), where it was noticed 52%

with dinucleotide repeat SSRs.

The allele diversity of all the 447 polymorphic SSRs among 7 accessions of J. curcas

varied from 2 to 9 with an average of 2.7±1.18 per markers. The average number of alleles

produced by trinucleotide SSRs in the present investigation was found to be higher than that

reported by Maurya et al. (2013) with dinucleotide SSRs and Salvador-Figueroa et al. (2014).

The majority of the markers, i.e. 57% produced two alleles followed by three (23%) and four

alleles (10%). There were only two markers JGM_CD797 and JGM_CD928 which produced 8

and 9 alleles respectively. The PIC value ranged from 0.12 to 0.85 (average 0.34±0.17). Most of

the SSRs (211, 47%) showed low PIC value (<0.30) (Figure 2). The average PIC value was

found to be higher as compared to previous reports (Ricci et al. 2012; Maurya et al. 2013, Osorio

et al. 2014) suggesting usefulness of the SSRs developed in the present investigation.

Approximately half of the SSRs (49.2%) showed a moderate range of PIC value. However, only

4% SSRs showed higher PIC value in the range of 0.71 – 0.90. The higher interspecific

transferability rate and average PIC value (0.34±0.17) was calculated based on 7 accessions of J.

curcas as compared to earlier reports (Maurya et al. 2013) which indicates that trinucleotide

SSRs are more useful in J. curcas for cross species transferability and comparative genetic

studies. Based on the PIC value and other parameters the SSRs JGM_CD928, JGM_CD1067,

JGM_CD911, JGM_CD650, JGM_CD926, JGM_CD887, JGM_CD914, JGM_CD940,

JGM_CD916, JGM_CD857 and JGM_CD873 were found to be highly informative and could be

potential markers for various genetic studies in J. curcas.

Based on polymorphism analysis of all the newly developed SSRs (2,329), 2,016

working SSRs (1089 from Lib A and B, 1017 from Lib C and D) have been identified to enrich

the repertoire of SSR marker for J. curcas. A total of 695 polymorphic SSRs have been

identified for J. curcas and characterized for various attributes. These newly developed SSRs

may facilitate construction of high density linkage map, diversity analysis, and QTL/association

mapping and may be further utilized in making strategies for marker assisted breeding towards

developing high seed and oil yielding cultivars of J. curcas.

150

5.4 Study of molecular genetic diversity among indigenous and exotic

accessions of J. curcas L.

A few studies on SSR based diversity assessment have been reported in J. curcas for Indian

accessions. Sujatha et al. (2005) investigated diversity among toxic Indian accessions and non-

toxic Mexican accessions using RAPD markers. Reddy et al. (2007) used AFLP and RAPD on

20 Indian accessions while Basha and Sujatha (2007) used RAPD and ISSR primers on 42

accessions from different regions of India. They also included the non-toxic Mexican accessions

in their analysis and although it could be easily differentiated from the Indian accessions. Ranade

et al. (2008) used single-primers amplification reaction (SPAR) to compare 21 accessions from

different parts of India and demonstrated that 3 North-East accessions were different among

them and from other accessions analyzed. Ganesh Ram et al. (2008) and Pamidimarri et al.

(2009) compared J. curcas with additional Jatropha species from India and demonstrated clear

divergence. Apart from these, several other genetic diversity studies have been reported in J.

curcas using different types of molecular markers like RAPD, AFLP, ISSR and SSR (details

summarized in review of literature). In all these studies, J. curcas accessions from different eco-

geographic regions of India were shown to be 60 to 80% similar. It is therefore important to

evaluate J. curcas accessions from wider eco-geographic regions to obtain more genetic

information. Therefore, in the present investigation, a random set of 41 polymorphic SSRs have

been utilized to evaluate 96 accessions of J. curcas including 70 Indigenous and 26 exotic

collections for genetic diversity/interrelationship and new alleles.

A total of 152 alleles produced at 41 SSR loci ranging from 2 to 9 with an average of

4.0±1.9 alleles/SSR (Table 4.16). The majority of the SSRs (71%) showed low PIC value

(<0.30) and there were only three SSRs which showed PIC value in between 0.61 and 0.80. The

range of PIC value obtained here was found to be comparable to previous report (Salvador-

Figueroa et al. 2014) and higher than reported by Ricci et al (2012). The observed heterozygosity

(Ho) varied between 0.00 and 0.99 with an average of 0.16±0.27 and expected heterozygosity

(He) or gene diversity varied from 0.01 to 0.82 with an average of 0.24±0.21. The AMOVA was

carried out considering indigenous and exotic as two distinct populations to understand the

pattern of differentiation of genetic variation among and between the populations. The

partitioning of genetic variations within and between populations (indigenous and exotic

collections) showed that 6% of the total genetic variation exists among the population and 94%

within populations with an average pairwise PT (similar to FST) was 0.063. The study carried

151

out by Salvador et al. (2014) with 93 accessions of J. curcas native to Chiapas, Mexico using

SSR markers also showed that the population were structured and moderately differentiated (FST

0.087). Likewise, Osorio et al (2014) performed AMOVA on 182 accessions collected from

Asia, Africa, South America and Central America and indicated high genetic variation within

population as compared to between populations. In contrast, Pecina-Quintero et al. (2014)

showed higher variance among population than variance within populations while studying

genetic diversity among 175 accessions from 9 Central and South-Estern Mexican states

including toxic and non-toxic accessions of J. curcas with AFLP markers. The accessions used

in the present investigation showed low level of variation as indicated by Na, Ne, He and I.

However, as compared with exotic collections, indigenous accessions showed comparatively

higher values for these genetic parameters except He. The exotic accessions had comparatively

higher number of unique alleles (0.92 allele/accessions) than indigenous accessions (0.71

alleles/accessions) which might be due to the fact that exotic accessions were collected from a

wider range of geographic and climatic conditions.

Further, the data of 152 alleles of 41 loci produced on 96 accessions were utilized to

prepare distance matrix based of Jaccard’s coefficient. The genetic dissimilarity ranged from

0.02 to 0.80 with an average of 0.32±0.11 and maximum was noticed between accession

NBJC195 and NBJC143 (0.80) followed by NBJC195 and NBJC124 (0.79) and NBJC195 and

NBJC153 (0.77). The intra-specific crosses among these accessions could be useful for

developing mapping population in J. curcas as well as genetic improvement through selection of

desirable hybrids. In agreement to the present investigation, the low level of genetic diversity in

J. curcas was also reported earlier by Rosado et al. (2010), Sudheer et al. (2010), Sato et al.

(2011), Shen et al. (2012) and Ouattara et al. (2014). However, higher genetic diversity were

reported by Pecina-Quintero et al. (2014), and Salvador-Figueroa et al. (2014) among Mexican

accessions. Tatikonda et al. (2009) and Wen et al. (2010) also reported high level of genetic

diversity in J. curcas.

The genetic distance matrix produced was used to create a neighbor joining (NJ) tree

using DARwin 5 with 1,000 replicate bootstrap. The dendrogram classified all the 96 accessions

into 3 major clusters, namely A, B and C (Figure 4.17). The bootstrap value of cluster C was

found to be highly significant and that of cluster A and B was very low. The cluster A

accommodated maximum of 77 (80%) accessions, followed by cluster B (26 accessions) and

cluster C (3 accessions). The cluster A could be further divided into 3 subclusters i.e. AI (51

152

accessions), AII (25 accessions) and AIII (1 accession). The sub-cluster AI included 16 (31.0%)

exotic accessions from different countries such as South America, Western Africa, Eastern

Africa, Central America, Middle Africa, Southern Asia, South-East Asia and Eastern China.

Besides, it also contained indigenous accessions from 16 different states of India. Sub-cluster AII

had 25 accessions and among these 4 (16.0%) were of exotic origin from South America,and

Southeastern Asia. The rest 21 were represented by indigenous accessions from 10 different

states of India. The subcluster AIII had only one accession from South Africa. The cluster B

contained 16 (26%) accessions with maximum of indigenous accessions (87%) representing 12

states of India. The cluster C was very unique and had three non-toxic accessions of J. curcas

from Mexico, Central America (NBJC 194, NBJC 195 and NBJC 196). The neighbor joining

clustering showed that most of the exotic accessions (81.0%, 21 out of 26) were grouped in

single cluster i.e. cluster A along with the indigenous accessions indicating low levels of genetic

diversity across the global collections of J. curcas. On the other hand, three non-toxic accessions

grouped into a single and distant cluster (cluster C) indicating that these accessions are widely

distinct from toxic accessions at genetic level too. The higher bootstrap value of cluster C

strongly supported the distinct group, however, the low bootstrap value of cluster A and B

indicated that the relative positions of the accessions of these clusters may vary if the

dendrogram is rebuilt. In contrary to Tatikonda et al. (2009) the clustering pattern in the present

investigation was not in accordance to geographical distribution. Considering the cluster wise

genetic dissimilarity, accessions of cluster B showed higher mean genetic dissimilarity

(0.40±0.02) followed by cluster C (0.29±0.05) and cluster A (0.25±0.01) and thus the accessions

from cluster B can be considered comparatively diverse than those of cluster A.

The clustering pattern indicated that the maximum genetic similarity found among

various accessions might be due to the ancestral closeness or duplication of the accessions.

Since, J. curcas is mainly propagated through cuttings (vegetative propagation) the possibility of

duplication of accessions and distribution in different parts of the country becomes high. Thus,

analysis of genetic diversity using molecular markers becomes more important to avoid wrong

selection of accessions for hybridization and genetic improvement programs. The vegetative

mode of reproduction could also be one of the reasons for low genetic diversity in J. curcas.

According to the review by Ellstrand and Roose (1987), the plant species with predominantly

vegetative reproduction generally, have lower level of genetic diversity than species that

successfully produce progeny solely by sexual reproduction. It is interesting to note that the

153

cluster A comprises at least one representative indigenous accession from all the 23 states of

India used in the study except Meghalaya and 81.0% exotic accessions. This further illustrates

the existence of low level of genetic diversity not only in indigenous accessions but also in exotic

accessions.

Further, in order to determine the genetic structure and define the number of cluster (gene

pool), model-based clustering was also carried out which showed almost similar pattern as

obtained in neighbor joining based dendrogram. All the 96 accessions were differentiated into

four genetically distinct subpopulations (K=4) based on maximum K values (Figure 4.18 a, b).

Based on membership probability, 29 accessions were assigned to subpopulation 1 with mixed

accessions of indigenous (77%) and exotic (23%) collections and 51 accessions with 70%

indigenous and 30% exotic collections assigned to subpopulation 2. The subpopulation 3 had 13

accessions and all belonged to the indigenous collections except one (NBJC187). The

subpopulation 4 is quite unique having three exclusive non-toxic accessions indicating their

diverse genetic makeup. The clustering of both exotic and indigenous accessions in the same

subpopulation indicated low genetic differentiation. Earlier, Sato et al. (2011) has also reported

significant phylogenetic relationship among Asian and African lines using SSR markers. Basha

and Sujatha (2007) also showed a unique pattern of non-toxic accession while studying genetic

diversity among 42 accessions of J. curcas using RAPD and ISSR markers and developed SCAR

markers that can differentiate non-toxic accessions. Basha et al. (2009) further evaluated 72 J.

curcas accessions from 13 countries using RAPD, ISSR and SSR markers and showed that 12

SSR markers differentiated the non-toxic Mexican accessions and disclosed novel alleles.

Pamidimarri et al. (2009) characterized toxic and non-toxic accession of J. curcas through

RAPD and AFLP markers and identified polymorphic markers that can differentiate toxic and

non-toxic accessions. Tanya et al. (2011) developed new EST-SSRs and identified 5

polymorphic SSRs which clearly displayed distinct banding pattern between non-toxic Mexican

and toxic Asian accessions. Furthermore, high molecular genetic diversity among J. curcas

population from Mexico has been reported earlier by Pecina-Quintero et al. (2011, 2014), Shen

et al. (2012), Pamidimarri and Reddy (2014) and Salvador-Figueroa et al. (2014). It is interesting

to note that the newly developed SSR markers also have the ability to differentiate toxic and non-

toxic accessions. Further, apart from utilization in the development of mapping population, the

non-toxic accessions will also serve as a potential genetic resource for further genetic

improvement of J. curcas.

154

5.5 Molecular characterization of interspecific hybrid of J. curcas x J.

integerrima

Almost every study carried out to evaluate the genetic variability in J. curcas either based on

morphological traits or on molecular markers showed low genetic divergence. Therefore, it was

an urgent need to search an alternative way to create genetic variation which could be later

exploited for various genetic studies and improvement of J. curcas. Previously, Sujatha and

Prabakaran (2003) and Parthiban et al. (2009) have reported the development of hybrid progeny

by interspecific crosses between J. curcas and J. integerrima. Therefore, an interspecific cross

was developed between J. curcas and J. intergerrima and large number of F1 hybrids raised.

These F1 hybrids were characterized using SSR markers and results obtained are discussed here.

More than 300 crosses were attempted and able to develop and establish 94 F1 hybrids in the

field. The crosses were made by taking J. curcas as female and J. intergerrima as male. The

reciprocal crosses were not successful. It has previously been also shown that crosses were

unsuccessful when J. integerrima was used as female parent (Dhillon et al. 2009). The reason for

unsuccessful reciprocal crosses has not been explored yet. The developed interspecific hybrids

were found to be vigorous in appearance, freely flowering and morphologically intermediate

between the parents. The most of the hybrids resembled the female parents in terms of stem type

and leaf shape while for flower color, pigmentation, fruit size tended towards the male parent.

The hybrid plants flowered much early than J. curcas. This trait might be governed by the

genetic loci coming from J. integerrima as it has early flowering than J. curcas. The SSR based

characterization of all the 94 F1 hybrids showed variation at various loci. The allelic pattern of

parents was shared in the hybrids showing true hybrids. However, some new allelic pattern was

also observed having no similarities with the parents which might be due to new allelic

recombination developed during crossing of genome of two different species. The different

allelic pattern in different F1 hybrid was expected as the parental lines were reported to be

heterozygous in nature due to cross pollination behavior as in case of tree plants. The clustering

of F1 hybrid based on SSR data showed that the maximum number of interspecific hybrids had

similarities with the female parents while few were grouped with male parents. Likewise,

Sujatha and Prabakaran (2003) and Dhillon et al. (2009) reported similar results. J. integerrima

does not exhibit most agronomic or commercially useful traits. However, interspecific

hybridization with compatible species leads to generation of valuable material due to

heterozygosity at several loci. A large number of genes express differential alleles in a hybrid,

155

which if resulting from genetically diverse parents permits large numbers of permutations and

combinations, in turn resulting in pronounced range of differences in the progeny in comparison

to the parents. The developed interspecific population can be used in the future for linkage and

QTL mapping. Several researchers have utilized interspecific population for linkage and QTL

mapping. Wang et al. (2011) constructed first generation linkage map of J. curcas with 216 SSRs

and 290 SNPs. King et al. (2013) developed linkage map from four F2 mapping populations that

reveals a locus controlling the biosynthesis of phorbol esters. Sun et al. (2012) developed linkage

map and reported QTLs associated with growth and seed in J. curcas. Liu et al. (2011) identified

18 QTL underlying the oil traits and 3 eQTLs of the oleosin genes using backcrossing population

(286) derived from crosses with J. curcas and J. integerrima. The interspecific population

developed in the present investigation could also be utilized for linkage and QTL mapping for

identification of additional markers related to various agronomic important traits. The trait

associated markers could be used for marker assisted breeding (MAB) in Jatropha genetic

improvement programs.

5.6 Assessment of heterozygosity in J. curcas L.

The tree breeding is more difficult by the changes that occur during the transition from juvenility

to maturity. Breeding populations can be characterized by quantifying the levels and

organization of genetic variation within and between different breeding groups. Under the

appropriate conditions, markers can replace phenotypic selection, thereby removing the need for

growing or rearing of individuals (Chen et al. 2010). Markers- based systems have been used to

study and compare the levels of random genetic variation throughout the different cycles of a

breeding programme, thus allowing much greater flexibility and control over the rate of

reduction of genetic variability (Lia and Wua 2007). A correlation between individual

heterozygosities of parents and their offspring arises from the fact that, at most allelic

frequencies, heterozygous parents produce higher proportion of heterozygous progeny than do

homozygous parents (Mitton et al. 1993). Microsatellite markers are an efficient tool for the

assessment of heterozygosity and homozygosity. In the present investigation, the heterozygosity

was assessed in J. curcas using SSR markers. Majority of the SSRs (64%) used to assess the

heterozygosity were found to be monomorphic and 36% polymorphic. Majority of the markers

(61%, 11 out of 18) showed heterozygosity variation from 0.00 to 0.22 indicating low level of

heterozygosity at these loci.

Chapter 6 Summary

156

The present investigation entitled “Development and application of microsatellite markers

for diversity analysis in Jatropha curcas L.” was undertaken with the following objectives:

1. Phenotypic characterization of indigenous accessions of J. curcas.

2. Development of large scale genomic derived SSRs from four microsatellite enriched

genomic libraries.

3. PCR optimization, polymorphism detection and characterization of developed SSRs for

various attributes.

4. Study of molecular genetic diversity among indigenous and exotic accessions of

J. curcas.

The outcome and conclusion of the research work done during the present investigation

are summarized as under:

The material for the phenotypic characterization of J. curcas was collected from the

different states of India and represented by 80 accessions. The genetic variability of 80

accessions were recorded for the 10 agronomically important traits i.e. Male flowers/plant,

Male/female ratio, Number of fruits/plant, Number of seeds/plant, Fruit weight/plant (g),

Seed weight/plant (g), Seed length (mm), Seed width (mm) and Oil content (%)

The analysis of variance revealed significant differences for all the traits. The oil

content varied between 20.8 to 36.1% with an average of 26.2±0.38. Out of 80 accessions, 37

accessions had seed weight/plant above average value (i.e.180.2g) and of which only 4

accessions had seed weight/plant above 250g with maximum in accession NBJC1078

(273.08g). Likewise, 26 accessions had oil content above average value of 26.2% and only 3

accessions namely NBJC1055, NBJC1051 and NBJC1048 having oil content above 35%.

The genetic variance showed that the traits as male flowers/plant, fruit weight/plant,

seed length, seed width and oil content were not much influenced by environmental factors as

these traits showed lower error variance than the genotypic variance. The PCV was found to

higher than that of GCV for all the traits with remarkable differences in their values.

However, the traits as seed length, seed width and oil content had very insignificant

differences in PCV and GCV values. Higher broad sense heritability was observed for oil

content (90%), seed length (88%), and seed width (87%) which indicates that these characters

are under genotypic control.

The hierarchical clustering based on morphological traits grouped all the accessions

into 4 clusters. The majority of accessions (70%) were genetically close to each other and

grouped in two clusters. The cluster II was largest with 29 accessions collected from 11 states

India. The cluster I is second largest comprising 27 accessions. The cluster III was the

157

smallest with 6 accessions. The cluster III showed maximum genetic distance with cluster I,

followed by cluster IV and cluster II suggesting comparatively wider genetic diversity among

them. The accessions from these clusters could be utilized in hybridization program to get

desirable transgressive segregants in their offspring, as there is a higher probability that

unrelated genotypes would contribute unique desirable alleles at different loci

The Principal Component Analysis (PCA) showed that first four principal

components (PCs) accounting for more than 93% of the total variation. The first principal

components accounted for 42.5% of the total variation mainly due to seed length, seed width,

seed weight/plant and number of seeds/plant which had maximum and positive weight on this

component. Oil content had negative weight on PC1. Thus, PC1 related to the accessions

with thick seeds, moderate to high seed yielder with low oil content.

Correlation analysis showed that seed weight/plant was significantly and positively

associated with female flowers/plant, male flowers/plant, number of flowers/plant, number of

seeds/plant, fruit weight/plant, seed width and negatively associated with oil content. Oil

content was negatively and significantly correlated with all the traits studied with strong

negative association with female flowers/plant followed by male flowers/plant, number of

seeds/plant, fruit weight/plant and seed weight/plant. The positive and significant association

of major component traits among themselves in general and with seed weight/plant in

particular suggests that selection of component traits jointly or individually may enhance the

seed yield. However, oil yield could be compromised upto some extent as it showed negative

association with most of the component traits. Therefore, precaution should be taken while

selecting a plant type based on component traits so that both seed yield and oil yield could be

optimized to its maximum potential.

Path coefficient showed that the male flowers/plant had the maximum positive direct

effect on seed yield while female flowers/plant, male female ratio, fruit weight/plant, seed

length exhibited negative direct effect on seed yield. The oil content had negative correlation

for all the traits. The positive direct effect of male flowers/plant, number of fruits/plant,

number of seeds/plant, seed width and oil content for seed yield/plant. The results of the

present investigation suggests that selection in J. curcas based on male flowers/plant, number

of fruits/plant, number of seeds/plant, seed width and oil content would be advantageous to

achieve the desirable goals.

Four SSR enriched genomic libraries (Lib A, Lib B, Lib C and Lib D) were developed

from genomic DNA of J. curcas including two di-nucleotide enriched and two tri-nucleotide

enriched SSR motifs. Among di-nucleotide a total of 639 (46.1%) and 676 (50.3%) SSR

158

containing sequences were developed from Lib A and B respectively. Out of 1,315 SSRs

(48.2%), 1207 (44.2%) were found to be suitable for primer designing from both the libraries.

From Lib A, 152 sequences and from Lib B, 191 sequences were found to have >1

SSRs. In total, 343 sequences were found to have >1 SSRs from the both libraries. Among

the total SSRs identified, the perfect SSRs were found to be 928 from both the libraries. The

enrichment percentage was found to be higher for GA repeat motif (87.9%) than CA repeat

motif (79.6%). The SSR repeat length analysis revealed that the majority of identified SSRs

i.e. 557 from Lib A and 500 from Lib B were of short length varied from 4 to 10 repeat units.

The AC/GT and AG/CT motif, the targeted motif, were recovered in higher frequency from

Lib A and Lib B respectively.

Out of 1207, a total of 248 polymorphic SSR were identified with a panel of 7

accessions of J. curcas. Furthermore, 179 and 331 SSR were found polymorphic among

parental lines of NBRI-J05 (NBJC132) x EC643912 (NBJC195) and Chhatrapati (NBJC147)

x J. integerrima used for development of two mapping populations respectively. The rate of

polymorphism was quite low among intra-specific parental lines (Lib A SSRs ~12%, Lib. B

SSRs~ 19%) as compared to interspecific parental lines (Lib. A SSRs ~25%, Lib B SSRs

~33%). The majority of the SSRs (71%) showed lower PIC value. The cross species

transferability of SSRs to J. integerrima was ~52%.

The trinucleotide enriched libraries (Lib C and D) were subjected to Roche 454 GS-

FLX sequencing for SSR development. A total of 25,495 raw reads (6.79 Mb) were generated

including 2,845 contigs and 22,650 singletons. A total of 5,844 putative microsatellites were

identified in 5,141 sequences with 23% SSR recovery. Six hundred nine sequences have

more than one SSR loci and 485 SSRs were found to be present in compound form. Out of

5,141 microsatellites containing sequences, 1,122 primer pairs were designed and synthesized

for validation and characterization.

The tri-nucleotide repeat SSRs was found in maximum numbers (59%) which was as

per expectation and among which AAT/ATT and AAG/CTT were found in major frequency

(73.2%) followed by ATC/ATG (7.0%) and ACC/GGT (6.0%). Most of the SSRs (5,559;

59%) had short repeat length varied in the range of 4-10.

The 1,122 SSRs were genotyped with a subset of 7 accessions of J. curcas and one

accession of J. integerrima. Out of 1,122 SSRs, 447 (39.83%) were found to be polymorphic

among 7 accessions of J. curcas. The number of alleles varied between 2 to 9 with an

average of 2.7±1.18 per markers. The PIC values of polymorphic SSRs ranged from 0.12 to

0.85 (average 0.34±0.17). Majority of SSRs (211, 47%) showed low PIC value (<0.30).

Approximately half of the SSRs (49.2%) showed moderate range of PIC value. However,

159

only 4% SSRs showed higher PIC value in the range of 0.71 – 0.90. The developed SSRs

showed higher transferability rate (76%) to J. integerrima.

Based on the PIC value and other parameters the SSRs JGM_A281, JGM_A326,

JGM_A244, JGM_A107, JGM_A577, JGM_B300, JGM_B361, JGM_B595, JGM_B176,

JGM_B330, JGM_B248 from Lib A and B and JGM_CD928, JGM_CD1067, JGM_CD911,

JGM_CD650, JGM_CD926, JGM_CD887, JGM_CD914, JGM_CD940, JGM_CD916,

JGM_CD857, JGM_CD873 from Lib C and D were found to be highly informative and could

be potential markers for various genetic studies in J.curcas.

The similarity search of SSR containing sequences showed maximum similarity of

63% with Ricinus communis (F. Euphorbiaceae). Significant similarity was also observed

with Populus trichocarpa (19%), Vitis vinifera (9%), Glycine max (3%) and Medicago

truncatula (2%).

The polymorphism analysis of all the newly developed SSRs (2,329) resulted in

identification of 2,016 working SSRs (1089 from Lib A and B, 1017 from Lib C and D) to

enrich the repertoire of SSR marker for J. cu/cas. A total of 695 polymorphic SSRs have been

identified for J. curcas and have been characterized for various attributes. These newly

developed SSRs may facilitate construction of high density linkage map, diversity analysis,

and QTL/association mapping and may be further utilized in making strategies for marker

assisted breeding towards developing high seed and oil yielding cultivars of J. curcas.

A total of 96 accessions of J. curcas which comprises 70 indigenous collections from

different states of India and 26 exotic collections were used for the genetic diversity analysis

with a set of 41 selected polymorphic SSR markers. Based on genotypic data of 152 alleles at

41 polymorphic SSR loci, a neighbor joining (NJ) tree was constructed using DARwin5 with

1,000 replicate bootstrap. The dendrogram classified all the 96 accessions into 3 major

clusters namely A, B and C. The cluster A accommodated maximum of 77 (80%) accessions

followed by cluster B (26 accessions) and cluster C (3 accessions). The cluster A could be

further divided into 3 subclusters i.e. AI (51 accessions), AII (25 accessions) and AIII (1

accession).

The neighbor joining clustering showed that most of the exotic accessions (81.0%, 21

out of 26) were grouped in single cluster i.e. cluster A along with the indigenous accessions

indicating low level of genetic diversity across the global collections of J. curcas. On the

other hand, three non-toxic accessions grouped into a single and distant cluster (cluster C)

indicating that these accessions are widely distinct from toxic accessions at genetic level too.

160

The model based cluster analysis also showed almost similar result as obtained in

dendrogram and grouped 96 accessions of J. curcas into four genetically distinct

subpopulations (K=4) based on maximum K values. Based on membership probability, 30

accessions assigned to subpopulation 1 with mixed accessions of indigenous (77%) and

exotic (23%) collections and 50 accessions with 70% indigenous and 30% exotic collections

assigned to subpopulation 2. The subpopulation 2 had 30 accessions. The subpopulation 3

had 13 accessions and all belonging to the indigenous collections except NBJC187. The

subpopulation 4 is quite unique having 3 exclusive exotic and non-toxic accessions

(NBJC194, NBJC195 and NBJC196).

In addition, a successful interspecific hybrid between J. curcas and J. integerrima

developed and 94 seedlings successfully raised in the field. The developed interspecific

hybrids were found to be vigorous in appearance, freely flowering and morphologically

intermediate between the parents. The 94 F1 hybrids were characterized using SSR markers

which showed variation at various SSR loci. The interspecific population could be utilized for

linkage and QTL mapping for identification of additional markers related to various

agronomic important traits.

The heterozygosity assessment of J. curcas using SSR markers showed majority of

the markers (61%, 11 out of 18) varied from 0.00 to 0.22 indicating low level of

heterozygosity in J. curcas. The heterozygosity assessment will also facilitate for the

selection of desired parent in the hybridization program for the genetic improvement of

J. curcas.

Chapter 7 References

161

Abdelkrim J, Robertson B, Stanton JA, Gemmell N (2009) Fast, cost-effective development of species-

specific microsatellite markers by genomic sequencing. Biotechniques 46:185-192.

Achten WMJ, Verchot L, Franken YJ, Mathijs E, Singh VP, Aerts R, Muys B (2008) Jatropha bio-

diesel production and use. Biomass and Bioenergy 32:1063-1084.

Agarwal M, Shrivastava N, Padh H (2008) Advances in molecular marker techniques and their

applications in plant sciences. Plant Cell Reports 27:617-631.

Agbogidi OM, Akparobi SO and Eruotor PG (2013) Health and environmental benefits of Jatropha

curcas linn. Applied Scientific Report 1: 36-39.

Akkaya MS, Bhagwat AA and Cregan PB (1992) Length polymorphisms of simple sequence repeat DNA

in Soybean. Genetics 132:1131-1139.

Alkimim ER, Sousa TV, Soares BO, Souza DA, Juhasz ACP, Nietsche S, Costa MR (2013) Genetic

diversity and molecular characterization of physic nut genotypes from the active germplasm bank

of the Agricultural Research Company of Minas Gerais, Brazil. African Journal of Biotechnology

12:907-913.

Allard RW (1999) Principles of Plant Breeding, 2nd

edn. New York, Wiley.

Alves AA, Bhering LL, Rosado TB, Laviola BG, Formighieri EF, Cruz CD (2013) Joint analysis of

phenotypic and molecular diversity provides new insights on the genetic variability of the Brazilian

physic nut germplasm bank. Genetics and Molecular Biology 36:371-381.

Ambrosi DG, Galla G, Purelli M, Barbi T, Fabbri A, Lucretti S (2010) DNA markers and FCSS analyses

shed light on the genetic diversity and reproductive strategy of Jatropha curcas L. Diversity 2:

810-836.

Andersen JR, Liberstedt T (2003) Functional markers in plants. Trends in Plant Science 8:554-560.

Anderson JA, Churchill GA, Autrique JE, Tanksley SD and Sorrells ME (1993) Optimizing parental

selection for genetic linkage maps. Genome 36:181-186.

Arcot SS, Wang Z, Weber JL, Deininger L and Batzer MA (1995) Alu repeats: A source for the genesis

of primate micro-satellites. Genomics 29:136-144.

Armour JAL, Alegre SA, Miles S, Williams LJ, Badge RM (1999) Minisatellites and mutation

processes in tandemly repetitive DNA. In: Goldstein DB, Schlotterer C (eds) Microsatellites:

evolution and applications. Oxford University Press, Oxford, pp. 24-33.

Armour JAL, Neumann R, Gobert S and Jefferys AJ (1994) Isolation of human simple repeat loci by

hybridization selection. Human Molecular Genetics 3: 599-605.

Arolu IW, Rafii MY, Hanafi MM, Mahmud TMM, Latif MA (2012) Molecular characterization of

Jatropha curcas germplasm using inter simple sequence repeat (ISSR) markers in Peninsular

Malaysia. Australian Journal of Crop Science 6: 1666-1673.

Atabani AE, Silitonga AS, Ong HC, Mahlia TMI, Masjuki HH, Irfan AB (2013) Non-edible vegetable

oils: a critical evaluation of oil extraction, fatty acid compositions, biodiesel production,

characteristics, engine performance and emissions production. Renewable and Sustainable

Energy Reviews 8:211-245.

Barkley NA, Dean RE, Pittman RN, Wang ML, Holbrook CC, Pederson GA (2007) Genetic diversity of

cultivated and wild-type peanuts evaluated with M13-tailed SSR markers and sequencing.

Genetical Research 89:93-106.

Barkley NA, Newman ML, Wang ML, Hotchkiss MW, Pederson GA (2005) Assessment of the genetic

diversity and phylogenetic relationships of a temperate bamboo collection by using transferred

EST-SSR markers. Genome 48:731-737.

Barkley NA, Roose ML, Krueger RR, Federici CT (2006) Assessing genetic diversity and population

structure in a Citrus germplasm collection utilizing simple sequence repeat markers (SSRs).

Theoretical and Applied Genetics 112:1519-1531.

Barreneche T, Casasoli M, Russell K, Akkak A, Meddour H,Plomion C, Villani F, Kremer A (2004)

Comparative mapping between Quercus and Castanea using simple-sequence repeats (SSRs).


162

Barrett BA and Kidwell KK (1998) AFLP-based genetic diversity assessment among wheat cultivars

from the pacific North-West. Crop Science 38: 1261-1271.

Basha SD, Francis G, Makkar HPS, Becker K, Sujatha M (2009) A comparative study of biochemical

traits and molecular markers for assessment of genetic relationships between Jatropha curcas L.

germplasm from different countries. Plant Science 6:812-823.

Basha SD, Sujatha EM (2007) Inter and intra-population variability of Jatropha curcas (L.) characterized

by RAPD and ISSR markers and development of population specific SCAR markers. Euphytica

156: 375-386.

Beckmann JS, Weber JL (1992) Survey of human and rat microsatellites. Genomics 12:627-631.

Beer SC, Goffreda J, Phillips TD, Murphy JP, Sorrells ME (1993) Assessment of genetic variation in

Avena sterilis using morphological traits, isozymes and RFLPs. Crop Science 33:1386-1393.

Belewu MA, Adekola FA, Adebayo GB, Ameen OM, Muhammed NO, Olaniyan AM, Adekola OF and

Musa AK (2010) Physcio-Chemical characteristics of oil and Biodiesel from Nigerian and Indian

Jatropha curcas seeds. International Journal of Biological and Chemical Science 4:524-529.

Berchmans HJ, Hirata S (2008) Biodiesel production from crude Jatropha curcas L. seed oil with a high

content of free fatty acids. Bioresource Technology 99:1716-1721.

Bhatt GM (1973) Significance of path coefficient analysis in determining the nature of character

association. Euphytica 2:338-343.

Biabani A, Rafii MY, Saleh GB, Shabanimofrad M, Latif MA (2012) Phenotypic and genetic variation of

Jatropha curcas L. populations from different countries. Maydica 57:164-171.

BiabaniA, Rafii MY, Saleh GB, Latif MA (2013) Inter- and intra-population genetic variations in

Jatropha curcas populations revealed by inter-simple sequence repeat molecular markers. Maydica

58:111-118.

Billotte N, Lagoda PJL, Risterucci AM, Baurens FC (1999) Microsatellite-enriched libraries: applied

methodology for the development of SSR markers in tropical crops. Fruits 54:277-288.

Birnboim HC and Doly JA (1979) A rapid alkaline extraction procedure for screening recombinant

plasmid DNA. Nucleic Acid Research 7:1513-1523.

Boder P, Deak T, Bacso R, Velich I, Bisztray GD, Fascar G, Gyulai P (2006) Morphological and genetic

investigation of medieval grape seeds. Acta Horticulture (ISHS) 713-718.

Botstein D, White RL, Skolnick M, Davis RW (1980) Construction of genetic linkage map in man using

restriction fragment length polymorphisms. American Journal of Human Genetics 32:314-331.

Bowers JE, Dangl GS, Vignani R, Meredith CP (1996) Isolation and characterization of new

polymorphic simple sequence repeat loci in grape (Vitis vinifera L.). Genome 39: 628-633.

Brasileiro BP, Silva SA, Souza DR, Santos PA, Oliveira RS, Lyra DH (2013) Genetic diversity and

selection gain in the physic nut (Jatropha curcas). Genetics and Molecular Research 12:2341-

2350.

Breseghello F, Sorrells ME (2006a) Association analysis as a strategy for improvement of quantitative

traits in plants. Crop Science 46: 1323-1330.

Breseghello F, Sorrells ME (2006b) Association analysis as a strategy for improvement of quantitative

traits in plants. Crop Science 46:1323-1330.

Bressan EA, Scotton DC, Ferreira RR, Jorge EC, Sebbenn, AM, Gerald L, Figueira A (2012)

Development of microsatellite primers for Jatropha curcas (Euphorbiaceae) and transferability to

congeners. American Journal of Botany 99:237-239.

Bretting PK, Widrlechner MP (1995) Genetic markers and horticultural germplasm management. Hort

Science 30:1349-1356.

Brittaine R and Lutaladio Ne B (2010) Jatropha: A smallholder bioenergy crop the potential for Pro-poor

development. Integrated Crop Management Vol. 8-2010.

Brohele J, Ellegren H (1999) Microsatellite evolution: polarity of substitution within repeats and

neutrality of flanking sequences. Proceedings of Royal Society of London B, Biological Sciences

266: 825-833.

Brookfield JFY (1992) DNA Fingerprint in clonal organisms. Molecular Ecology 1:21-26.

163

Buerstmayr H, Lemmens M, Hartl L, Doldi L, Steiner B, Stierschneider M, Ruckenbauer P (2002)

Molecular mapping of QTLs for Fusarium head blight resistance in spring wheat. I. Resistance to

fungal spread (Type II resistance). Theoretical and Applied Genetics 104: 84-91.

Cai HW, Yuyama N, Tamaki H, Yoshizawa A (2003) Isolation and characterization of simple sequence

repeat markers in the hexaploid forage grass timothy (Phleum pretense L.). Theoretical and

Applied Genetics 107:1337-1349.

Cai Y, Sun D, Wu G, Peng J (2010) ISSR based genetic diversity of Jatropha curcas germplasm in

China. Biomass and Bioenergy 34:1739-50.

Camellia NA, Thohirah LA and Abdullah NAP (2012) Genetic relationships and diversity of Jatropha

curcas accessions in Malaysia. African Journal of Biotechnology 11: 3048-3054.

Cao Z, Wang P, Zhu X, Chen H, Zhang T (2014) SSR marker‑ assisted improvement of fiber qualities in

Gossypium hirsutum using G. barbadense introgression lines. Theoretical Applied Genetics DOI

10.1007/s00122-013-2241-3.

Cardle L, Ramsay L, Milbourne D, Macaulay M, Marshall D, Waugh R (2000) Computational and

experimental characterization of physically clustered simple sequence repeats in plants. Genetics

156:847-854.

Carvalho CR, Clarindo WR, Prac MM FS, Araujo Carels N (2008) Genome size base composition and

karyotype of Jatropha curcas L.: an important biofuel plant. Plant Science 174: 613-617.

Castillo A, Budak H, Martin AC, Dorado G, Borner A, Roder M, Hernandez P (2010) Interspecies and

intergenus transferability of barley and wheat D-genome microsatellite markers. Annals of Applied

Biology 156:347-356.

Castoe TA, Poole AW, Gu W, de Koning AP J, Daza JM, Smith EN, Pollock DD (2010) Rapid

identification of thousands of copperhead snake (Agkistrodon contortrix) microsatellite loci from

modest amounts of 454 shotgun genome sequence. Molecular Ecology Resources 10:341-347.

Chambers GK and MacAvoy ES (2000) Microsatellites: consensus and controversy. Comparative

Biochemistry and Physiology 126: 455-476.

Chai L, Biswas MK, Yi H, Guo W, Deng X (2013) Transferability, polymorphism and effectiveness for

genetic mapping of the Pummelo (Citrus grandis Osbeck) EST-SSR markers. Scientia

Horticulturae 155:85-91.

Charlesworth B, Sniegowski P, Stephan W (1994) The evolutionary dynamics of repetitive DNA in

eukaryotes. Nature 371: 215-220.

Chen K, Knorr C, Bornemann KK, Ren J, Huang L, Rohrer GA, Brenig B (2005) Targeted

oligonucleotide-mediated microsatellite identification (TOMMI) from large-insert library clones.

BMC Genetics 6:54.

Chen SY, Lin YT, Lin CW, Chen WY, Yang CH, Ku HM (2010) Transferability of rice SSR markers to

bamboo. Euphytica 175:23-33.

Chen X and Sullivan PF (2003) Single nucleotide polymorphism genotyping: biochemistry, protocol, cost

and throughput. Pharmacogenomics Journal 3: 77-96.

Christo LF, Colodetti TV, Rodrigues WN, Martins LD, Brinate SB, Amaral JFT, Laviola BG, Tomaz MA

(2014) Genetic variability among genotypes of physic nut regarding seed biometry. American

Journal of Plant Sciences 5:1416-1422.

Cifarelli RA, Gallitelli M and Cellini F (1995) Random amplified hybridization microsatellites (RAHM):

isolation of a new class of microsatellite-containing DNA clones. Nucleic Acids Research

23:3802-3803.

Condit R, Hubbell SP (1991) Abundance and DNA sequence of two-base repeat regions in tropical tree

genomes. Genome 34: 66-71.

Connell JP, Pammi S, Iqbal MJ, Huizinga T, Reddy AS (1998) A high throughput procedure for capturing

microsatellites from complex plant genomes. Plant Molecular Biology Report 16:341-49.

Cooper S, Bull CM, Gardner MG (1997) Characterization of microsatellite loci from the socially

monogamous lizard Tiliqua rugosa usisng a PCR-based isolation technique. Molecular Ecology

6:793-795.

164

Cordeiro GM, Casu R, McIntyre CL, Manners JM, Henry RJ (2001) Microsatellite markers from

sugarcane (Saccharum spp.) ESTs across transferable to Erianthus and Sorghum. Plant Science

160:1115-1123.

Costa HPS, Oliveira JTA, Sousa DOB, Morais JKS, Moreno FB, Monteiro-Moreira ACO, Viegas RA and

Vasconcelos IM (2014) JcTI-I: a novel trypsin inhibitor from Jatropha curcas seed cake with

potential for bacterial infection treatment. Frontier in Microbiology 5:1-12.

Cox TS, Murphy JP and Rodgers DM (1986) Changes in genetic diversity in the red winter wheat regions

of the United States. Proceeding of National Academy Science (USA) 83:5583-5586.

Crossa J, Burgueno J, Dreisigacker S, Vargas M, Herrera FSA, Lillemo M, Singh RP, Trethowan R,

Warburton M, Franco J (2007) Association analysis of historical bread wheat germplasm using

additive genetic covariance of relatives and population structure. Genetics 177: 1889-1913.

Csencsics D, Brodbeck S, Holderegger R (2010) Cost-effective, species-specific microsatellite

development for the endangered Dwarf Bulrush (Typhaminima) using next-generation sequencing

technology. Journal of Heredity 101:789-793.

Cubry P, Pujade-Renaud V, Garcia D, Espeout S, Le Guen V, Granet F, Seguin M (2014) Development

and characterization of a new set of 164 polymorphic EST-SSR markers for diversity and breeding

studies in rubber tree (Hevea brasiliensis Mull. Arg.). Plant Breeding DOI: 10.1111/pbr.12158.

Cuc LM, Mace ES, Crouch JH, Quang VD, Long TD, Varshney RK (2008) Isolation and characterization

of novel microsatellite markers and their application for diversity assessment in cultivated

groundnut (Arachis hypogaea). BMC Plant Biology 8: 55.

Cui F, Fan X, Zhao C, Zhang W, Chen M, Ji J, Li J (2014) A novel genetic map of wheat: utility for

mapping QTL for yield under different nitrogen treatments. BMC Genetics 15:57.

Dahake R, Roy S, Patil D, Chowdhary A, Deshmukh RA (2012) Evaluation of anti-viral activity of

Jatropha curcas leaf extracts against potentially drug-resistant HIV isolates. BMC Infectious

Diseases 12:P14.

Das S, Misra RC, Mahapatra AK, Gantayat BP, Pattnaik RK (2010) Genetic variability, character

association and path analysis in Jatropha Curcas. American-Eurasian Journal of Sustainable

Agriculture 4:147-151.

Dehgan B (1984): Phylogenetic significance of interspecific hybridization in Jatropha (Euphorbiaceae).

Systematics Botany 9:467- 478.

Dewey JR and Lu KH (1959) A correlation and path co-efficient analysis of components of crested wheat

seed production. Agronomy Journal 51:515-518.

Dhillon RS, Hooda MS, Jattan M, Chawla V, Bhardwaj M, Goyal SC (2009) Development and molecular

characterization of interspecific hybrids of Jatropha curcas and J. integerrima. Indian Journal of

Biotechnology 8:384-390.

Dhillon RS, Saharan RP, Jattan M, Rani T, Sheok RN, Dalal V and Von Wuehlisch G (2014) Molecular

characterization of induced mutagenesis through gamma radiation using RAPD markers in

Jatropha curcas L. African Journal of Biotechnology 13:807-813.

Dieringer D & Schlotterer C (2003) Two distinct modes of microsatellite mutation processes: evidence

from the complete genomic sequences of nine species. Genome Research 13:2242-2251.

Ding JQ, Ma JL, Zhang CR, Dong HF, Xi ZY, Xia ZL, Wu JY (2011) QTL mapping for test weight by

using F2:3 population in maize. Journal of Genetics 90:75-80.

Divakara BN, Upadhyaya HD, Wani SP, Gowda LCL (2010) Biology and genetic improvement of

Jatropha curcas L.: A review. Applied Energy 87:732-742.

Earl DA, vonHoldt BM (2012) STRUCTURE HARVESTER: a website and program for visualizing

STRUCTURE output and implementing the Evanno method. Conservation Genetics Resources

4:359-361.

Edwards KJ, Barker JHA, Daly A, Jones C, and Karp A (1996) Microsatellite libraries enriched for

several microsatellite sequences in plants. Biotechniques 20: 758-760.

Ekblom R and Galindo J (2011) Application of next generation sequencing in molecular ecology of non-

model organisms. Heredity 107:1-15.

165

Ellegren Hans (2004) Microsatellites: simple sequences with complex evolution. Nature Reviews 5: 435-

445.

Ellstrand NC and Roose ML (1987) Patterns of genotypic diversity in clonal plant species. American

Journal of Botany 74: 123-131.

Ender A, Schwenk K, Stdler T, Streit B, Schierwater B (1996) RAPD identification of microsatellites in

Daphnia. Molecular Ecology 5:437-441.

Erschadi S, Haberer G, Schoniger M, TorresRuiz RA (2000) Estimating genetic diversity of Arabidopsis

thaliana ecotypes with amplified fragment length polymorphisms (AFLP). Theoretical and

Applied Genetics 100: 633-640.

Evanno G, Regnaut S, Goudet J (2005) Detecting the number of clusters of individuals using the software

STRUCTURE: a simulation study. Molecular Ecology 14:2611-2620.

Excoffier L, Smouse PE, Quattro JM (1992) Analysis of molecular variance inferred from metric

distances among DNA haplotypes: Application to human mitochondrial DNA restriction data.

Genetics 131:479-491.

Fairless D (2007) Biofuel: the little shrub that could-maybe. Nature 449: 652-655.

Falconer DS (1989) Introduction to quantitative genetics. 3rd

ed. Longman Sci. and Techn, London.

Feng N, Hea Z, Zhanga Y, Xia X, Zhang Y (2013) QTL mapping of starch granule size in common wheat

using recombinant inbred lines derived from a PH82-2/Neixiang 188 cross. The crop Journal

1:166-171.

Ferguson ME, Burow MD, Schultz SR, Bramel PJ, Paterson AH, Kresovich S, Mitchell S (2004)

Microsatellite identification and characterization in peanut (A. hypogaea L.). Theoretical and


Fisher D, Bachmann K (1998) Microsatellite enrichment in organisms with large genomes (Allium cepa

L.). BioTechniques 24:796-802.

Fisher PJ, Gardner RC and Richardson TE (1996) Single locus microsatellites isolated using 5′anchored

PCR. Nucleic Acids Research 24:4369-4371.

Flachowsky H, Schumann E, Weber WE, Peil A (2001) Application of AFLP for the detection of sex-

specific markers in hemp. Plant Breeding120:305-309.

Foidl N, Foidl G, Sanchez M, Mittelbach M, Hackel S (1996) Jatropha curcas L. as a source for the

production of biofuel in Nicaragua. Bioresour Technology 58:77-82.

Gairola KC, Nautiyal AR, Sharma G, Dwivedi AK (2011) Variability in seed characteristics of Jatropha

curcas Linn. from hill region of Uttarakhand. Bulletin of Environment, Pharmacology and Life

Sciences 1:64-69.

Ganesh Ram S, Parthiban KT, Kumar RS, Thiruvengadam V, Paramathma M (2008) Genetic diversity

among Jatropha species as revealed by RAPD markers. Genetic Recourses Crop Evolution

55:803-809.

Gao SQ, Qu J, Chua NH, Ye J (2010) A new strain of Indian cassava mosaic virus causes a mosaic

disease in the biodiesel crop Jatropha curcas. Archives of Virology 155:607-612.

Ghosh L and Singh L (2011) Variation in seed and seedling characters of Jatropha curcas L. with varying

zones and provenances. Tropical Ecology 52:113-122.

Ginwal HS, Phartyal SS, Rawat PS, Srivastava RL (2005) Seed source variation in morphology,

germination and seedling growth of Jatropha curcas Linn. in central India. Silvae Genetica 54:76-

80.

Gmunder S, Singh R, Pfister S, Adheloya A and Zah R (2012) Environmental impacts of J. curcas

biodiesel in India. Journal of Biomedicine and Biotechnology Volume 2012.

Godt MJ, Hamrick JL (1999) Population genetic analysis of Elliottia racemosa (Ericaceae), a rare

Georgia shrub. Molecular Ecology 8:75-82.

Goldstein DB and Schlotterer C (1999) “Microsatellites: Evolution and Applications.” Oxford Press,

Oxford UK.

Goldstein DB, Pollock DD (1997) Launching microsatellites: a review of mutation processes and

methods of phylogenetic inference. Journal of Heredity 88:335-342.

166

Gopale KD and Zunjarrao RS (2013) Evaluation of genetic diversity of Jatropha Curcas L. using RAPD

markers in Maharashtra. International Journal of Pure and Applied Science and Technology14:

12-24.

Grafius JE (1964) A geometry for plant breeding. Crop Science 4:241-246.

Grativol C, Fonseca D, Lira-MC, Hemerly AS, Ferreira PC (2011) High efficiency and reliability of inter-

simple sequence repeats (ISSR) markers for evaluation of genetic diversity in Brazilian cultivated

Jatropha curcas L. accessions. Molecular Biological Report 38:4245-4256.

Grodzicker T, Williams J, Sharp P, Sambrook J (1975) Physical mapping of temperature sensitive

mutants of adenovirus. Cold Spring Harbor Symposia on Quantitative Biology 39:439-446.

Guan J, Yu H, Zhang J, Yang R, Fan Y-h (2013) Study on seed morphology and genetic diversity of

Jatropha curcas L. from different provenances. Advance Journal of Food Science and

Technology 5:169-173.

Guo Y, Yang X, Chander S, Yana J, Zhang J, Song T, Li J (2013) Identification of unconditional and

conditional QTL for oil, protein and starch content in maize. The Crop Journal 1:34-42.

Gupta PK, Rustgi S, Sharma S, Sing R, Kumar N, Balyan HS (2003)Transferable EST-SSR

markers for the study of polymorphism and genetic diversity in bread wheat. Molecular

Genetics and Genomics 270:315-323.

Gupta P, Idris A, Mantri S, Asif MH, Yadav HK, Roy JK, Tuli R, Mohanty CS, Sawant SV (2012a)

Discovery and use of single nucleotide polymorphic (SNP) markers in Jatropha curcas L.

Molecular Breeding 30:1325-1335.

Gupta S, Kumari K, Sahu PP, Vidapu S, Prasad M (2012b) Sequence-based novel genomic microsatellite

markers for robust genotyping purposes in foxtail millet [Setaria italica (L.) P Beauv]. Plant

Cell Reports 31:323-337.

Gupta PK, Roy JK, Prasad M (2001) Single nucleotide polymorphisms: A new paradigm for molecular

marker technology and DNA polymorphism detection with emphasis on their use in plants.

Current Science 80:524-535.

Haimei C, Linzhi L, Xianyun W, Sishen L, Tiandong L, Haizhou H., Honggang W, Xiansheng Z (2005)

Development, chromosome location and genetic mapping of EST-SSR markers in wheat. Chinese

Science Bulletin 50: 2328-2336.

Hamada H, Petrino MG, Kakunaga T (1982) A novel repeated element with Z-DNA-forming potential is

widely found in evolutionarily diverse eukaryotic genomes. Proceeding of the National Academy

of Sciences USA 79:6465-6469.

Hamdan YAS, García-Moreno MJ, Redondo-Nevado J, Velasco L, Pérez-Vich B (2011) Development

and characterization of genomic microsatellite markers in safflower (Carthamus tinctorius L.).

Plant breeding 130:237-241.

Hamilton MB, Pincus EL, Di Fiore A, and Fleischer RC (1999) Universal linker and ligation procedures

for construction of genomic DNA libraries enriched for microsatellites. BioTechniques 27: 500-

507.

Hamrick JL, Godt MJ (1990) Allozyme diversity in plant species. Pages 43-63 in Brown AHD, MT

Clegg, AL Kahler and BS Weir eds. Plant population genetics, breeding and genetic resources.

Sinauer Associates Inc, Sunderland.

Hancock JM (1995) The contribution of slippage like process to genome evaluation. Journal of

Molecular Evolution 41:1038-1047.

Hancock JM (1999) Microsatellites and other simple sequences: genomic context and mutational

mechanisms. In: Goldstein DB, Schlotterer C (eds) Microsatellites: evolution and applications.

Oxford University Press, Oxford, pp. 1-9.

Hao X, Li X, Yang X, Li J (2014) Transferring a major QTL for oil content using marker-assisted

backcrossing into an elite hybrid to increase the oil content in maize. Molecular Breeding DOI

10.1007/s11032-014-0071-x.

Harding RM, Boyce AJ and Clegg JB (1992) The evolution of tandemly repetitive DNA: recombination

rules. Genetics 132: 847-859.

167

He C, Poysa V, Yu K (2003) Development and characterization of simple sequence repeat (SSR) markers

and their use in determining relationships among Lycopersicon esculentum cultivars. Theoretical

and Applied Genetics 106:363-373.

He W, Guo L, Wang L, Yang W, Tang L, Cheng F (2007) ISSR analysis of genetic diversity of J. curcas

L. Chinese. Journal of Applied and Environmental Biology 13:466–70.

Helentjaris T, Slocum M, Wright S, Schaefer A, Nienhuis J (1986) Construction of genetic

linkage maps in maize and tomato usingrestriction fragment length polymorphisms.


Heller J (1996) Physic Nut- Jatropha curcas L.-Promoting the conservation and use of underutilized and

neglected crops. PhD dissertation at the Institute of Plant Genetic and Crop Plant Research,

Gatersleben, Germany & International Plant Genetic Resource.

Hill J, Becker HC and Tigerstedt PMA (1998) Quantitative and ecological aspects of Plant Breeding.

Chapman and Hall, London.

Holland JB, Nyquist WE, Cervantes-Martinez CT (2002) Estimation and interpreting heritability for plant

breeding: An update. Plant Breeding Reviews 22: 9-112.

Hossain KG, Kawai H, Hayashi M, Hoshi M, Yamanaka N, Harada K (2000) Characterization and

identification of (CT)n microsatellites in soybean using sheared genomic libraries. DNA

Research7:103-110.

Hu Z, Zhang D, Zhang G, Kan G, Hong D, Yu D (2014) Association mapping of yield-related traits and

SSR markers in wild soybean (Glycine soja Sieb. and Zucc.). Breeding Science 63:441-449.

Huang X, Madan A (1999) CAP3: A DNA sequence assembly program. Genome research 9:868-877.

Iezzoni AF, Pritts MP (1991) Applications of principal components analysis to horticultural research.

HortScience 26:334-338.

Ikbal BKS, Dhillon RS (2010) Evaluation of genetic diversity in Jatropha curcas L. using RAPD

markers. Indian Journal of Biotechnology 9:50-57.

Ince AG, Karaca M, Onus AN (2010) Polymorphic microsatellite markers transferable

across Capsicum species. Plant Mol Biol Report 28:285-291.

Islam MS, Rasul MG (1998) Genetic parameters, correlation and path coefficient analysis in groundnut

(Arachis hypogeal L.). Bangladesh Journal of Science and Industrial Research 33:250-254.

Jain N, Patil GB, Bhargava P, Nadgauda RS (2014) In Silico Mining of EST-SSRs in Jatropha curcas L.

towards assessing genetic polymorphism and marker development for selection of high oil yielding

clones. American Journal of Plant Sciences 5:1521-1541.

Jakse J, Javornik B (2001) High throughput isolation of microsatellites in hop (Humulus lupulus L.).

Plant Molecular Biology Report 19:217-226.

Jakupciak JP, Wells RD (2000) Gene conversion (recombination) mediates expansions of CTG.CAG

repeats. Journal of Biological Chemistry 275:4003-4013.

Jarne P and Lagoda PJL (1996) Microsatellites, from molecules to populations and back. Tree 11: 424-

429.

Jeffreys AJ, Wilson V, Thein SL (1985) Individual- specific fingerprints’ of human DNA. Nature 316:

76-79.

Johnson HW, Robinson HF and Comstock RE (1955) Estimates of genetic and environmental variability

in Soybean. Agronomy Journal 47: 314-318.

Johnson TS, Eswaran N, Sujatha M (2011) Molecular approaches to improvement of Jatropha curcas L.

as a sustainable energy crop. Plant Cell Reports 30: 1573-1591.

Jones ES, Dupal MP, Kolliker R, Drayton MC, Forster JW (2001) Development and characterization of

simple sequence repeat (SSR) markers for perennial ryegrass (Lolium perenne L.). Theoretical and

Applied Genetics 102: 405-415.

Jones N, Ougham H, Thomas H (1997) Markers and mapping: We are all geneticists now. New

Phytologist 137:165-177.

Jones N, Ougham H, Thomas H, Pasakinskiene I (2009) Markers and mapping revisited: finding your

gene. New Phytologist 183:935-966.

168

Joshi SP, Ranjekar PK, Gupta VS (1999) Molecular markers in plant genome analysis. Current Science

77:230-240.

Jubera MA, Janagoudar BS, Biradar DP, Ravikumar RL, Koti RV, Patil SJ (2009) Genetic diversity

analysis of elite Jatropha curcas (L.) genotypes using randomly amplified polymorphic DNA

markers. Karnataka Journal of Agricultural Sciences 22:293-295.

Kalendar R, Grob T, Regina MT, Suoniemi A, Schulman A (1999) IRAP and REMAP: two new

retrotransposon-based DNA fingerprinting techniques. Theoretical and Applied Genetics 98:704-

711.

Kalia RK, Rai MK, Kalia S, Singh R, Dhawan AK (2011) Microsatellite markers: an overview of the

recent progress in plants. Euphytica 177: 309-334.

Kandpal RP, Kandpal G and Weissman SM (1994) Construction of libraries enriched for sequence

repeats and jumping clones, and hybridization selection for region-specific markers. Proceeding of

the National Academy of Science, USA 91: 88-92.

Kantety RV, Rota ML, Matthews DE, Sorrells ME (2002) Data mining for simple sequence repeats in

expressed sequence tags from barley, maize, rice, sorghum and wheat. Plant Molecular Biology

48:501-510.

Karagyozov L, Kalcheva ID and Chapman VM (1993) Construction of random small-insert genomic

libraries highly enriched for simple sequence repeats. Nucleic Acids Research 21: 3911-3912.

Karp A, Edwards KJ (1997) DNA markers: a global overview. Pages 1-13 in Caetano-Anolles G and PM

Greshoff eds. DNA markers: protocols, application and overviews. Wiler-VCH, New York.

Katwal RPS, Soni PL (2003) Biofuels: an opportunity for socioeconomic development and cleaner

environment. Indian Forester 129:939-949.

Kaushik N, Kumar K, Kumar S, Kaushik N and Roy S (2007) Genetic variability and divergence studies

in seed traits and oil content of Jatropha (Jatropha curcas L.) accessions. Biomass and

Bioenergy 31: 497-502.

Khurana-Kaul V, Kachhwaha S, and Kothari SL (2012) Characterization of genetic diversity in Jatropha

curcas L. germplasm usisng RAPD and ISSR markers. Indian Journal of Biotechnology11:54-61.

Kijas JMH, Fowler JCS, Garbett CA and Thomas MR (1994). Enrichment of microsatellites from the

citrusgenome using biotinylated oligonucleotide sequences bound to streptavidin-coated magnetic

particles. BioTechniques 16: 656-662.

King AJ, Montes LR, Clarke JG, Affleck J, Li Y, Witsenboer H, van der Vossen E, van der Linde P,

Tripathi Y, Tavares E, Shukla P, Rajasekaran T, van Loo EN and Graham IA (2013) Linkage

mapping in the oilseed crop Jatropha curcas L. reveals a locus controlling the biosynthesis of

phorbol esters which cause seed toxicity. Plant Biotechnology Journal 11:986-996.

King K, He W, Cuevas JA, Freudenberger M, Ramiaramanana D, Graham IA (2009) Potential of

Jatropha curcas as a source of renewable oil and animal feed. Journal of Experimental Botany

60: 2897-2905.

Kliebenstein DJ, Gershenzon J, Mitchell-Olds T (2001) Comparative quantitative trait loci mapping of

aliphatic, indolic and benzylic glucosinolate production in Arabidopsis thaliana leaves and seeds.

Genetics 159:359-370.

Koblizkova A, Dolezel J, and Macas J (1998) Subtraction with 3’ modified oligonucleotides eliminates

amplification artifacts in DNA libraries enriched for microsatellites. BioTechniques 25:32-38.

Kolliker R, Jones ES, Drayton MC, Dupal MP, Forster JW (2001) Development and characterization of

simple sequence repeat (SSR) markers for white clover (Trifolium repens L.). Theoretical and


Kuleung C, Baenziger PS, Dweikat I (2004) Transferability of SSR markers among wheat, rye, and

triticale. Theoretical and Applied Genetics 108:1147-1150.

Kumar P, Gupta VK, Misra AK, Modi DR and Pandey BK (2009) Potential of molecular markers in plant

biotechnology. Plant Omics Journal 2:141-162.

169

Kumar S, Kumar V, Sharma MK, Kumar N, Kumar A, Sengar RS, Jaiswal N (2013) Assessment of

genetic diversity in Jatropha (Jatropha Curcus L.) by using RAPD markers. International

Journal of Biotechnology and Bioengineering Research 4:383-394.

Kumari M, Grover A, Yadav PV, Arif M and Ahmed Z (2013) Development of EST-SSR markers

through data mining and their use for genetic diversity study in Indian accessions of Jatropha

curcas L.: a potential energy crop. Genes Genome 35:661-670.

Kumpatla S and Mukhopadhyay S (2005) Mining and survey of simple sequence repeat in expressed

sequence tags of dicotyledonous species. Genome 48:985-998.

Kwon S, Hong S, Son J, Lee JK, Cha Y, Eun M, Kim N (2006) CACTA and MITE transposon

distributions on a genetic map of rice using F15 RILs derived from Milyang 23 and Gihobyeo

hybrids. Molecules and Cells 21: 360-366.

Lagercrantz U, Ellegren H, Andresson L (1993) The abundance of various polymorphic microsatellite

motifs differ between plants and vertebrates. Nucleic Acids Research 21:1111-1115.

Laido G, Marone D, Russo MA, Colecchia SA, Mastrangelo AM, Vita PD, Papa R (2014) Linkage

disequilibrium and genome-wide association mapping in tetraploid wheat (Triticum turgidum L.).

Plos One 9(4): e95211.

Laosatit K, Tanya P, Saensuk C, Srinives P (2013) Development and characterization of EST-SSR

markers from Jatropha curcas EST database and their transferability across Jatropha-related

species/genus. Biologia 68:41-47.

Lawson MJ and Zhang L (2006) Distinct patterns of SSR distribution in the Arabidopsis thaliana and rice

genomes. Genome Biology7:R14.

Lench NJ, Norris A, Bailey A, Booth A and Markham AF (1996) Vectorette PCR isolation of

microsatellite repeat sequences using anchored dinucleotide repeat primers. Nucleic Acids

Research 24: 2190-2191.

Lerceteau E, Szmidt AE (1999) Properties of AFLP markers in inheritance and genetic diversity studies

of Pinus sylvestries L. Heredity 82:252-260.

Levinson G and Gutman GA (1987) Slipped strand mispairing: A major mechanism for DNA sequence

evolution. Molecular and Biology Evolution 4:203-221.

Li XY (2005) Cutting technique of Jatropha curcas. Autumn Agri Techn Ser 7:10-12.

Li YC, Fahima T, Roder MS, Kirzhner VM, Beiles A, Korol AB and Nevo E (2003) Genetic effects on

micro-satellite diversity in wild emmer wheat, Triticum dicoccoides, at Yehudiyya microsite.

Heredity 90:150-156.

Li YC, Korol AB, Fahima T, Beiles A, Nevo E (2002) Microsatellites: genomic distribution, putative

functions and mutational mechanisms: a review. Molecular Ecology 11:2453-2465.

Li YC, Korol AB, Fahima T, Nevo E (2004) Microsatellites within genes: structure, function, and

evolution. Molecular Biology and Evolution 21:991-1007.

Lia L and Wua LC (2007) Genetic diversity and molecular markers of five snapper species. Chinese

Journal of Agricultural Biotechnology 4:39-46.

Liebhard R, Gianfranceschi L, Koller B, Ryder CD, Tarchini R, Van De Weg E, Gessler C (2002)

Development and characterization of 140 new microsatellites in apple (malus×domesticaBorkh.).

Molecular Breeding 10:217-241.

Lim JH, Yang HJ, Jung KH, Yoo SC, Paek NC (2014) Quantitative trait locus mapping and candidate

gene analysis for plant architecture traits using whole genome re-Sequencing in rice. Molecules

and Cells 37:149-160.

Linman MJ, Yu H, Chen X and Cheng Q (2009) Fabrication and characterization of a sialoside-based

carbohydrate microarray biointerface for protein binding analysis with surface plasmon resonance

imaging. American Chemical Society 1:1755-76209.

Linnaeus C. (1753) Species plantarum. In: Jatropha. Impensis Stockholm: Laurentii Salvii; 1006-1007.

Litt M and Luty JA (1989) A hypervariable microsatellite revealed by in vitro amplification of a

dinucleotide repeat within the cardiac muscle actin gene. American Journal of Human Genetics

44:397-401.

170

Liu K, Muse SV (2005) PowerMarker: an integrated analysis environment for genetic marker analysis.

Bioinformatics 21:2128-2129.

Liu P, Wang CM, LiL, Sun F, Liu P and Yue GH (2011) Mapping QTLs for oil traits and eQTLs for

oleosin genes in Jatropha. BMC Plant Biology11:132.

Liu Y, Wang L, Sun C, Zhang Z, Zheng Y, Qiu F (2014) Genetic analysis and major QTL detection for

maize kernel size and weight in multi-environment. Theoretical and Applied Genetics 127:1019-

1037.

Lunt DH, Hutchinson WF, Carvalho GR (1999) An efficient method for PCR-based isolation of

microsatellite arrarys (PIMA). Molecular Ecology 8:891-894.

Lynch M and Walsh B (1998) Genetics and analysis of quantitative traits, Sinauer.

Mace ES, Gebhardt CG, Lester RN (1999) AFLP analysis of genetic relationship in the tribe Datureae

(Solanaceae). Theoretical and Applied Genetics 99:634-641.

Machua J, Muturi G, Omondi SF, Gicheru J (2011) Genetic diversity of Jatropha curcas L. populations

in Kenya using RAPD molecular markers: Implication to plantation establishment. African

Journal of Biotechnology 10:3062-3069.

Makkar HPS, Aderibigbe AO, Becker K (1998) Comparative evaluation of non-toxic and toxic

varieties of Jatropha curcas for chemical composition, digestibility, protein degradability and

toxic factors. Food Chemistry 62:207-215.

Manen JF, Bouby L, Dalnoki O, Marinval P, Turgay M, Schlumbaum A (2003) Microsatellites from

archaeological Vitis vinifera seeds allow a tentative assignment of the geographical origin of

ancient cultivars. Journal of Archaeological Science 30: 721-729.

Marinoni D, Akkak A, Bounous G, Edwards K, Botta R (2003) Development and characterization of

microsatellite markers in Castanea sativa (Mill.). Molecular Breeding 11:127-136.

Martinez EA, Destombe C, quilled MC, Valero M (1999) Identification of random amplified polymorphic

DNA (RAPD) markers highly linked to sex determination in the red alga Gracilaria gracilis.

Molecular Ecology 8:1533-1538.

Mastan SG, Sudheer PDVN, Rahman H, Ghosh A, Rathore MS, Prakash CR, Chikara J (2012)

Molecular characterization of intra-population variability of Jatropha curcas L. using DNA based

molecular markers. Molecular Biology Report 39:4383-4390.

Maurya R, Gupta A, Singh SK, Rai KM, Chandrawati, Sawant SV, Yadav HK (2013) Microsatellite

polymorphism in Jatropha curcas L.-A biodiesel plant. Industrial Crops and Products 49:136-

142.

Medina IO, Garcia FJE, Farfan JSN and Figueroa MS (2011) State of the art of genetic diversity research

in Jatropha curcas. Scientific Research and Essays 6: 1709-1719.

Merdinoglu D, Butterlin G, Bevilacqua L, Chiquet V, Adam-Blondon AF, Decroocq S (2005)

Development and characterization of a larger set of microsatellite markers in grapevine (Vitis

vinifera L.) suitable for multiplex PCR. Molecular Breeding 15:349-366.

Messier W, Li SH, Stewart CB (1996) The birth of microsatellites. Nature 381:483.

Miesfeld R, Krystal M, Arnheim N (1981) A member of a new repeated sequence family which is

conserved throughout the eukaryotic evolution is found between the human globin genes. Nucleic

Acids Research 9:5931-5947.

Mitton JB (1993) Theory and data pertinent to the relationship between heterozygosity and fitness. In the

natural history of inbreeding and outbreeding: Theoretical and empirical perspectives (Thornhill, N.

W., ed.), pp. 352–374. Chicago, IL: University of Chicago Press.

Mofijur M, Masjuki HH, Kalam MA, Hazrat MA, Liaquat AM, Shahabuddin M (2012) Prospects of

biodiesel from Jatropha in Malaysia. Renewable and Sustainable Energy Reviews 16:5007-

5020.

Mohapatra S, Panda PK (2010) Genetic variability on growth, phenological and seed characteristics of

Jatropha curcas L. Notulae Scientia Biologicae 2:127-132.

Montes JM, Technow F, Martin M, Becker K (2014) Genetic diversity in Jatropha curcas L. assessed

with SSR and SNP markers. Diversity 6: 551-566.

171

Moretzsohn MC, Leoi L, Proite K, Guimaraes PM, Leal-Bertioli SC, Gimenes MA, Martins WS,

Valls JF, Grattapaglia D, Bertioli DJ (2005) A microsatellite-based, gene-rich linkage map for

the AA genome of Arachis (Fabaceae). Theoretical and Applied Genetics 111:1060-1071.

Morgante M, Hanafey M, Powell W (2002) Microsatellites are preferentially associated with non-

repetitive DNA in plant genomes. Nature Genetics 30:194-200.

Morgante M, Olivieri AM (1993) PCR amplified microsatellites as markers in plant genetics. Plant

Journal 3:175-182.

Morgante M, Rafalski A, Biddle P, Tingey S, Olivieri AM (1994) Genetic mapping and variability of

seven soybean simple sequence repeat loci. Genome 37:763-769.

Moursy AA, Aziz HAA and Mostafa AZ (2014) The effect of irradiated and non-irradiated sewage sludge

application on Uptake of Heavy Metals by Jatropha curcas L. plants. International Journal of

Advanced Research 2:1072-1080.

Mullis K, Faloona F, Scharf S, Saiki R, Horn G, Erlich H (1986) Specific enzymatic amplification Of

DNA in vitro: The polymerase chain reaction. Cold Spring Harbor Symposia on Quantitative

Biology 51: 263-273.

Murty SG, Patel F, Punwar BS, Patel M, Singh AS, Fougat RS (2013) Comparison of RAPD, ISSR, and

DAMD markers for genetic diversity assessment between accessions of Jatropha curcas L. and its

related species. Journal of Agriculture Science and Technology 15:1007-1022.

Murty BR and Arunachalam V (1966) The nature of genetic divergence in relation to breeding system in

crop plants. Indian Journal of Genetics 26:188-198.

Nadir E, Margalit H, Gallily T, Ben-Sasson SA (1996) Microsatellite spreading in the human genome:

evolu-tionary mechanisms and structural implications. Proceeding of the Natural Academy

Science, USA 93:6470-6475.

Na-ek Y, Wongkaew A, Phumichai T, Kongsiri N, Kaveeta R, reewongchai T, Phumichai C (2011)

Genetic diversity of physic nut (Jatropha curcas L.) revealed by SSR markers. Journal of Crop

Science and Biotechnology14:105-110.

Narayana DSA, Shankarappa KS, Govindappa MR, Prameela HA, Gururay Rao MR, Rangaswamy KT

(2006) Natural occurrence of Jatropha mosaic virus disease in India. Current Science 91:584-

586.

Nath UK and Alam MS (2002) Genetic variability, heritability and genetic advance of yield and related

traits of groundnut (Arachis hypogaea L.). Journal of Bioscience Science 2:762-764.

Neeraja C, Maghirang-Rodriguez R, Pamplona A, Heuer S, Collard B, Septiningsih EM, Vergara G,

Sanchez D, Xu K, Ismail AM (2007) A marker-assisted backcross approach for developing

submergence-tolerant rice cultivars. Theoretical and Applied Genetics 115: 767-776.

Negussie A, Achten MJW, Verboven AFH, Hermy M, Muys B (2013) Potential pollinators and floral

visitors of introduced tropical biofuel tree species Jatropha curcas L. (Euphorbiaceae), in

Southern Africa. African Crop Science Journal 21:149 -158.

Nunome T, Negoro S, Miyatake K, Yamaguchi H and Fukuoka H (2006) A protocol for the construction

of microsatellite enriched genomic library. Plant Molecular Biology Reporter 24:305-312.

Nunome T, NegoroS, KonoI, KanamoriH, MiyatakeK, YamaguchiH, OhyamaA, FukuokaH (2009)

Development of SSR markers derived from SSR-enriched genomic library of eggplant (Solanum

melongena L.).Theoretical and Applied Genetics 119:1143-1153.

Nyquist WE (1991) Estimation of heritability and prediction of selection response in plant populations.

Critical Reviews in Plant Science 10: 235-322.

Oliveira EJ, Pdua JG, Zucchi MI, Vencovsky R and Vieira MLC (2006) Origin, evolution and genome

distribution of microsatellites. Genetics and Molecular Biology 29: 294-307.

Ong HC, Mahlia TMI, Masjuki HH, Norhasyima RS (2011) Comparison of palm oil, Jatropha curcas and

Calophyllum inophyllum for biodiesel: a review. Renewable and Sustainable Energy

Reviews15:3501-3515.

Openshaw KA (2000) Review of Jatropha curcas: an oil plant of unfulfilled promise. Biomass and

Bioenergy 19:1-15.

172

Osorio LRM, Salvador AFT, Jongschaap REE, Perez CAA, Sandoval JEB, Trindade LM, Visser RGF,

van Loo EN (2014) High level of molecular and phenotypic biodiversity in Jatropha curcas from

Central America compared to Africa, Asia and South America. BMC Plant Biology 14:77.

Ostrander EA, Jong PM, Rine J and Duyk G (1992) Construction of small-insert genomic DNA libraries

highly enriched for microsatellite repeat sequences. Proceeding of the National Academy of

Science 89:3419-3423.

Ouattara B, Diedhiou I, Ndir KN, Agbangba EC, Cisse N, Diouf D, Akpo EL Zongo JD (2013) Variation

in seed traits and distribution of Jatropha curcas L. in Senegal. International Journal of Current

Research 5:17-21.

Ouattara B, Ndir KN, Gueye MC, Diedhiou I, Barnaud A, Fonceka D, Cisse N, Akpo EL, Diouf D (2014)

Genetic diversity of Jatropha curcas L. in Senegal compared with exotic accessions based on

microsatellite markers. Genetic Resources and Crop Evolution DOI 10.1007/s10722-014-0106-5.

Ovando-Medina I, Sanchez-Gutierrez A, Adriano-Anaya L, Espinosa-Garcia F, Nunez-Farfan J,

Salvador-Figueroa M (2011) Genetic diversity in Jatropha curcas populations in the state of

Chiapas, Mexico. Diversity 3:641-659.

Paetkau D (1999) Microsatellites obtained using strand extension: an enrichment protocol. Biotechniques

26: 694-697.

Palacios C, Kresovich S, Gonzales-Candelas F (1999) A population genetic study of the endangered plant

species Limonium dufourii (Plumbaginaceae) based on amplified fragment length polymorphism

(AFLP). Molecular Ecology 8:645-657.

Pamidimarri DVNS, Mastan SG, Rahman H, Ravi Prakash Ch, Singh S, Reddy MP (2011) Cross species

amplification ability of novel microsatellites isolated from Jatropha curcas and genetic relationship

with sister taxa. Molecular Biology Reports 38:1383-1388.

Pamidimarri DVNS, Reddy MP (2014) Phylogeography and molecular diversity analysis of Jatropha

curcas L. and the dispersal route revealed by RAPD, AFLP and nrDNA-ITS analysis. Molecular

Biology Report 41:3225-3234.

Pamidimarri DVNS, Singh S, Mastan SG, Patel J, Reddy MP (2009) Molecular characterization and

identification of markers for toxic and non-toxic varieties of Jatropha curcas L. using RAPD,

AFLP and SSR markers. Molecular Biology Report 36:1357-1364.

Pandey VC, Singh K, Sigh JS, Kumar A, Sigh B and Sigh RP (2012) Jatropha curcas: A potential biofuel

plant for sustainable environmental development. Renewable and Sustainable Energy

Reviews16:2870-2883.

Paniego N, Echaide M, Munoz M, Fernandez L, Torales S, Faccio P, Fuxan I, Carrera M, Zandomeni R,

Suarez EY and Hopp HE (2002) Microsatellite isolation and characterization in sunflower

(Helianthus annuus L.). Genome 45:34-43.

Pardue ML, Gall JG (1970) Chromosomal localization of mouse satellite DNA. Science 168: 1356-1358.

Parida SK, Raj Kumar KA, Dalal V, Singh NK, Mohapatra T (2006) Unigene derived microsatellite

markers for the cereal genomes. Theoretical and Applied Genetics 112:808-817.

Park YJ, Ju KL and Nam-Soo K (2009) Simple sequence repeat polymorphisms (SSRPs) for evaluation

of molecular diversity and germplasm classification of minor crops. Molecules 14: 4546-4569.

Parthiban KT, Kumar RS, Thiyagarajan P, Subbulakshmi V, Vennila S, Rao MG (2009) Hybrid progenies

in Jatropha – a new development. Current Science 96:815-23.

Patil SVR (2004) Jatropha oil-As an alternative to diesel. Agriculture and Industry Survey 13:17-22.

Patra NK (1985) Genetic diversity in relation to oil content potential in mint. Genetics 12:173-177.

Peakall R, Gilmore S, Keys W, Morgante M, Rafalski A (1998) Cross-species amplification of soybean

(Glycine max) simple sequence repeats within the genus and other legume genera: Implications for

the transferability of SSRs in plants. Molecular Biology and Evolution 15:1275 -1287.

Pearson CE, Edamura NK, Cleary JD (2005) Repeat instability: mechanisms of dynamic mutations.

Nature Reviews Genetics 6:729-742.

173

Pecina-Quintero V, Anaya JL, Zamarripa A, Montes N, Nunez C, Solis J (2011) Molecular

characterization of Jatropha curcas L. genetic resources from Chiapas, Mexico through AFLP

markers. Biomass and Bioenergy 35:1897-905.

Pecina-Quintero V, Anaya-Lopez JL, Zamarripa-Colmenero A, Nunez-Colın CA, Montes-Garcıa N,

Solıs-Bonilla JL, Jimenez-Becerril MF (2014) Genetic structure of Jatropha curcas L. in Mexico

and probable centre of origin. Biomass and Bioenergy 60:147-155.

Peeters JP and Martinelli JA (1989) Hierarchical cluster analysis as a tool to manage variation in

gremplasm collections. Theoretical and Applied Genetics 78:42-48.

Pepin L, Amigues Y, Lepingle A, Berthier J, Bensaid A and Vaiman D (1995) Sequence conservation of

microsatellites between Bos Taurus (cattle), Capra hircus (goat) and related species. Examples of

use in parentage testing and phylogeny analysis. Heredity 74:53-61.

Perrier X, Flori A, Bonnot F (2003) Data analysis methods. In: Hamon P, Seguin M, Perrier X,

Glaszmann JC (eds) Genetic diversity of cultivated tropical plants. Enfield Science Publishers,

Montpellier, pp 43-76.

Phan J, Reue K, and Peterfy M (2000) MS-IRS: A simple method for the isolation of microsatellites.

BioTechniques 28:18-20.

Pootakham W, Chanprasert J, Jomchai N, Sangsrakru D, Yoocha T, Ragoonrung S, Tangphatsornruang S

(2012) Development of genomic-derived simple sequence repeat markers in Hevea brasiliensis

from 454 genome shotgun sequences. Plant Breeding 131: 555-562.

Powell W, Machray G, Provan J (1996) Polymorphism revealed by simple sequence repeats. Trends in

Plant Sciences 1:215-222.

Primmer CR, Moller AP and Ellengren H (1996) A wide-range survey of cross species microsatellite

amplifications in birds. Molecular Ecology 5: 365-378.

Pritchard JK, Stephens M, Donnelly P (2000) Inference of population structure using multilocus genotype

data. Genetics 155:945-959.

Provan J, Powell W, Hollingsworth PM (2001) Chloroplast microsatellites: new tools for studies in plant

ecology and evolution. Trends in Ecology and Evolution 16: 142-147.

Qu J and Liu J (2013) A genome-wide analysis of simple sequence repeats in maize and the development

of polymorphism markers from next-generation sequence data. BMC Research Notes 6:403.

Rafalski JA, Vogel JM, Morgante M, Powell W, Andre C, Tingey SV (1996) Generating and using DNA

markers in plants. In: non mammalian genomic analysis: a practical guide (eds Birren B, Lai E), pp.

75–134, Academic Press, New York.

Rafii MY, Shabanimofrad M, Edaroyati PMW, Latif MA (2012) Analysis of the genetic diversity of

physic nut, Jatropha curcas L. accessions using RAPD markers. Molecular Biology Report

39:6505-6511.

Rajkumar, Fakrudin B, Kavil SP, Girma Y, Arun SS, Dadakhalandar D, Gurusiddesh BH, Patil AM,

Thudi M, Bhairappanavar SB, Narayana YD, Krishnaraj PU, Khadi BM, Kamatar MY (2013)

Molecular mapping of genomic regions harbouring QTLs for root and yield traits in sorghum

(Sorghum bicolor L. Moench). Physiol Mol Biol Plants 19:409-419.

Raju AJS, Ezradanam V (2002) Pollination ecology and fruiting behaviour in a monoecious species,

Jatropha curcas L. (Euphorbiaceae). Current Science 83:1395-1398.

Ramanujam S, Tewari AS and Mehra RB (1974) Genetic divergence and hybrid performance in

mungbean. Theoretical and applied Genetics 45:211-214.

Ranade SA, Srivastava AP, Rana TS, Srivastava J, Tuli R (2008) Easy assessment of diversity in

Jatropha curcas L. plants using two single-primer amplification reaction (SPAR) methods.

Biomass and Bioenergy32: 533-540.

Rao AC, Maya MM, Duque MC, Tohme J, Allem AC, Bonierable MW (1997) AFLPs analysis of

relationship among cassava other Manihot species. Theoretical and Applied Genetics 95: 741-

750.

Rao CR (1952) Advanced statistical methods in biometric research. John Wiley, New York.

174

Rao GR, Korwar GR, Shanker AK, Ramakrishna YS (2008) Genetic associations, variability and

diversity in seed characters, growth, reproductive phenology and yield in Jatropha curcas (L.)

accessions. Trees 22: 697-709.

Rasmuson M (2002) The genotype-phenotype link. Hereditas 136:1-6.

Reddy MP, Chikara J, Patolia JS, Ghosh A (2007) Genetic improvement of J. curcas L. adaptability and

oil yield. In: Fact seminar on J. curcas L., agronomy and genetics, 26-28 march 2007, wageningen,

the Netherlands, published by FACT foundation.

Ricci A, Chekhovskiy K, Azhaguvel P, Albertini E, Falcinelli M, Saha M (2012) Molecular

characterization of Jatropha curcas resources and identification of population-specific markers.

Bioenergy Research 5:215-224.

Richard GF, Paques F (2000) Mini- and microsatellite expansions: the recombination connection. EMBO

Reports 11:122-126.

Richardson T, Cato S, Ramser J, Kahl G, Weising K (1995) Hybridization of microsatellites to RAPD :

a new source of polymorphic markers. Nucleic Acids Research 23:3798-3799.

Robinson HF, Comstock RE, Harvey PH (1949) Estimates of heritability and the degree of dominance in

corn. Agronomy Journal 41:353-359.

Romero G, Adeva C, Battad Z (2009) Genetic fingerprinting: advancing the frontiers of crop biology

research. Philippine Science Letter 2:8-13.

Rosado TB, Laviola BG, Faria DA, Pappas MR, Bhering LL, Quirino B, Grattapaglia D (2010) Molecular

marker reveal limited genetic diversity in large germplasm collection of the biofuel crop Jatropha

curcas L. in Brazil. Crop Science 50:2372-2382.

Rose O and Falush D (1998) A threshold size for microsatellite expansion. Molecular Biology and

Evolution 15:613-615.

Rossetto M, McLauchlan A, Harriss FCL, Henry RJ, Baverstock PR, Lee LS, Maguire TL, Edwards KJ

(1999) Abundance and polymorphism of microsatellite markers in the tea tree (Melaleuca

alternifolia, Myrtaceae). Theoretical and Applied Genetics 98:1091-1098.

Rustgi S, Shafqat MN, Kumar N, Baenziger PS, Ali ML, Dweikat I, Campbell BT, Gill (2013) Genetic

dissection of yield and its component traits using high-density composite map of wheat

chromosome 3A: bridging gaps between QTLs and underlying genes. PLoS One 8(7): e70526.

Saghai-Maroof MA, Soliman KM, Jorgensen RA, Allard RW (1984) Ribosomal DNA spacer-length

polymorphisms in barley: Mendelian inheritance, chromosomal location and population

dymnamics. Proceeding of the National Academy of Science USA 81:8014-8018.

Saha MC, Mian MA, Eujayl I, Zwonitzer JC, Wang L, and May GD (2004) Tall fescue EST-SSR markers

with transferability across several grass species. Theoretical and Applied Genetics 109: 783-791.

Sahai K, Srivastava V, Rawat KK (2011) Impact assessment of fruit predation by Scutellera perplexa

Westwood on the reproductive allocation of Jatropha. Biomass and Bioenergy 35:4684-4689.

Salvador-Figueroa M, Magana-Ramos J, Vazquez-Ovando JA, Adriano-Anaya ML, Ovando-Medina I

(2014) Genetic diversity and structure of Jatropha curcas L. in its centre of origin. Plant Genetic

Resources: characterization and utilization 1–9, doi:10.1017/S1479262114000550.

Sambrook J, Fritsch EF and Maniatis T (1989) Molecular cloning: A Laboratory Manual, Ed 2. Cold

Spring Harbor Laboratory Press, New York.

Santos C, Antonio F, Marcos AD, Marciene AR and Marcio RVE (2010)Genetic similarity of Jatropha

curcas accessions based on AFLP markers. Crop Breeding and Applied Biotechnology10: 364-

369.

Sathya M, Jayamani P (2013) Cross species amplification of Adzukibean derived microsatellite loci and

diversity analysis in green gram and related Vigna Species. Molecular Plant Breeding 4:89-95.

Sato S, Hirakawa H, Isobe S (2011) Sequence analysis of the genome of an oil-bearing tree, Jatropha

curcas L. DNA Research 18: 65-76.

Scheiner SM, Mitchell RJ, Callahan HS (2000) Using path analysis to measure natural selection. Journal

of Evolutionary Biology 13:423-433.

175

Schlotterer C (1998) Microsatellites. In: Hoelzel AR (ed) Molecular genetic analysis of populations: A

practical approach. IRL Press, Oxford, pp. 237-261.

Schlotterer C (2004) The evolution of molecular markers- just a matter of fashion? Genetics 5:63-6.

Schranz ME, Song BH, Windsor AJ, Mitchell-Olds T (2007) Comparative genomics in the Brassicaceae:

a family-wide perspective. Current Opinion in Plant Biology 10:168-175.

Schuelke M (2000) An economic method for the fluorescent labeling of PCR fragments. Nature


Scott KD, Eggler P, Seaton G, Rossetto M, Ablett EM, Lee LS, Henry RJ (2000) Analysis of SSRs

derived from grape ESTs. Theoretical and Applied Genetics 100: 723-726.

Segura MPS, Oomah BD, Drover JCG, Heredia JB, Enciso TO, Torres JBV, Villa ES, Landeros FS,

Escalante MAA (2014) Physical and chemical characterization of three non-toxic oilseeds from the

Jatropha genus. Journal of Food and Nutrition Research 2: 56-61.

Semagn K, Bjornstad A and Ndjiondjop MN (2006) An overview of molecular marker methods for

plants. African Journal of Biotechnology 5: 2540-2568.

Shabanimofrad M, Yusop MR, Saad MS, Wahab PEM, Biabanikhanehkahdani A, Latif MA (2011)

Diversity of physic nut (Jatropha curcas) in Malaysia: application of DIVA-geographic

information system and cluster analysis. Australian Journal of Crop Science 5:361-368.

Sharma A and Chauhan RS (2011) Repertoire of SSRs in the castor bean genome and their utilization in

genetic diversity analysis in Jatropha curcas. Comparative and Functional Genomics. Volume

2011, Article ID 286089, 9 pages.

Shen J, Jia X, Ni H, Sun P, Niu S, Chen X (2010) AFLP analysis of genetic diversity of Jatropha curcas

grown in Hainan, China. Trees 24: 455-462.

Shen J, Khongsak P, David B, Xiaoyang C (2012) AFLP-based molecular characterization of 63

populations of Jatropha curcas L. grown in provenance trials in China and Vietnam. Biomass and

Bioenergy 37:265-274.

Shendure J and Ji H (2008) Next-generation DNA sequencing. Nature Biotechnology 26:135- 1145.

Shin JH, Kwon SJ, Lee JK, Min HK, Kim NS (2006) Genetic diversity of maize kernel starch synthesis

genes with SNAPs. Genome 49: 1287-1296.

Shipley B (1997) Exploratory path analysis with applications in ecology and evolution. Am. Nat. 149:

1113-1138.

Shoemaker RC, Grant D, Olson T, Warren WC, Wing R, Yu Y, Kim H, Cregan P, Joseph B, Futrell-

Griggs M, Nelson W, Davito J, Walker J, Wallis J, Kremitski C, Scheer D, Clifton SW, Graves T,

Nguyen H, Wu X, Luo M, Dvorak J, Nelson R, Cannon S, Tomkins J, Schmutz J, Stacey G,

Jackson S (2008) Microsatellite discovery from BAC end sequences and genetic mapping to anchor

the soybean physical and genetic maps. Genome 51:294-302.

Shultz JL, Kazi S, Bashir R, Afzal JA, Lightfoot DA (2007)The development of BAC-end sequence-

based microsatellite markers and placement in the physical and genetic maps of soybean.


Silitonga AS, Atabani AE, Mahlia TMI, Masjuki HH, Badruddin Irfan Anjum, Mekhilef S (2011) A

review on prospect of Jatropha curcas for biodiesel in Indonesia. Renewable and Sustainable

Energy Reviews15:3733-3756.

Singh VK, Singh A, Singh SP, Ellur RK, Singh D, Krishnan SG, Bhowmick PK, Nagarajan M, Vinod

KK, Singh UD, Mohapatra T, Prabhu KV, Singh AK (2013a) Marker-assisted simultaneous but

stepwise backcross breeding for pyramiding blast resistance genes Piz5 and Pi54 into an elite

Basmati rice restorer line ‘PRR78’. Plant Breeding 132:486-495.

Singh R, Pandey RM and Singh B (2013b) Genetic association, divergence and variability studies for

seed yield and oil content and its contributing traits in Jatropha (Jatropha curcas L.). Journal of

Medicinal Plants Research 7:1931-1939.

Singh RK and Chaudhary BD (1985) Biometrical methods in quantitative genetic analysis. Kalyani

Publishers. New Delhi.

176

Singh SP (1991) Genetic divergence and canonical analysis in hyacinth bean. Journal of Genetics 45:7-

12.

Singh SP and Kahana KR (1993) Path coefficient analysis for opium and seed yield in opium poppy (P.

somniferum L.). Genetika 25:119-128.

Singh SP and Mittal RK (1993) Geographical divergence in relation to genetic diversity in potato

(Solanum tuberosum). Genetika 25:63-70.

Singh SP and Singh HN (1979) Path coefficient analysis for yield component in Okra. Indian Journal of

Agricultural Science 36:166-170.

Singh SP and Singh HN (1981) Gene system involved and their implication in breeding lablab bean

(Dolichos lablab). Zeitschriftfur Pflanzenzüchtung 87: 240-247.

Singh SP, Shukla S, Yadav HK (2004) Multivariate analysis in relation to breeding system in opium

poppy (Papaver somniferum L.). Genetika 36:111-120.

Singh SP, Shukla S, Yadav HK, Chatterjee A (2003) Multivariate and canonical analyses in opium poppy

(Papaver somniferum). Journal of Medcinal and Aromtic Plant science 25:380-384.

Sinha P, Tripathi SB (2013) Molecular characterization of Jatropha genetic resources through amplified

fragment length polymorphism (AFLP) markers. International Journal of Chemical and

Technical Research 5:735-740.

Slavov GT, Howe GT, Gyaourova AV, Birkes DS, Adams WT (2005) Estimating pollen flow using SSR

markers and paternity exclusion: Accounting for mistyping. Molecular Ecology 14: 3109-3121.

Smith JSC (1984) Genetic variability within U.S. hybrid maize: Multivariate analysis of isozyme data.

Crop Science 24:1041-1046.

Smith JSC and Smith OS (1989) The description and assessment of distances between inbred lines of

maize: The utility of morphological, biochemical and genetic descriptors and a scheme for the

testing of distinctiveness between inbred lines. Maydica 34:151-161.

Snehi SK, Raj SK, Khan MS, Prasad V (2011) Molecular identification of a new begomovirus associated

with yellow mosaic disease of Jatropha gossypifoliain India. Archives of Virology 156:2303-2307.

Solomon RAJ and Ezradanam V (2002) Pollination ecology and fruiting behaviour in a monoecious

species, Jatropha curcas L. (Euphorbiaceae). Current science 83:1395-1398.

Spritz RA (1981) Duplication/deletion polymorphism 5'- to the human beta globin gene. Nucleic Acids

Research 9:5037-5047.

Stansfield WD (1986) Theory and problems of genetics. McGraw-Hill Book Company, New York.

Stein N, Prasad M, Scholz U, Thiel T, Zhang H, Wolf M, Kota R, Varshney RK, Perovic D, Grosse I,

Graner A (2007) A1,000-loci transcript map of the barley genome: new anchoring points for

integrative grass genomics. Theoretical and Applied Genetics 114:823-839.

Stephan W, Kim Y (1998) Persistence of microsatellite arrays in finite populations. Molecular Bology

and Evolution 15: 1332-1336.

Subirana J and Messeguer (2008) Structural families of genomic microsatellites. Gene 408:124-132.

Subramanyam K, Rao DM, Devanna N, Aravinda A, Pandurangadu (2010) Evaluation of genetic

diversity among Jatropha curcas (L.) by RAPD analysis. Indian Journal of Biotechnology 9:283-

288.

Sudheer DVNP, Mastan SG, Rahman H, Reddy MP (2010) Molecular characterization and genetic

diversity analysis of Jatropha curcas L. in India using RAPD and AFLP analysis. Molecular

Biology Report 37:2249-2257.

Sujatha M and Prabakaran AJ (2003) New ornamental Jatropha hybrids through interspecific

hybridization. Genetic Resources and Crop Evolution 50:75-82.

Sujatha M, Makkar HPS, Becker K (2005) Shoot bud proliferation from axillary nodes and leaf sections

of non-toxic Jatropha curcas L. Plant Growth Regulation 47:83-90.

Sujatha M, Reddy TP, Mahasi MJ (2008) Role of biotechnological interventions in the improvement

of castor (Ricinus communis L.) and Jatropha curcas L. Biotechnology Advance 26:424-35.

Sujatha M and Prabakaran AJ (1997) Characterization and utilization of Indian Jatropha curcas. Indian

Journal of Plant Genetic Resource 10:123-128.

177

Sun F, Liu P, Ye J, Lo L, Cao S, Li L, Yue G and Wang C (2012) An approach for Jatropha

improvement using pleiotropc QTLs regulating plant growth and seed yield. Biotechnology for

Biofuels 5:12.

Sun JX, Mullikin JC, Patterson N and Reich DE (2009) Microsatellites are molecular clocks that support

accurate inferences about history. Molecular Biology and Evolution 26:1017-1027.

Sun QB, Li LF, Li Y, Wu GJ, Ge XJ (2008) SSR and AFLP markers reveal low genetic diversity in the

biofuel plant Jatropha curcas in China. Crop Science 48:1865-1871.

Sunil N, Sujatha M, Kumar V, Vanaja M, Basha SD, Varaprasad KS (2011) Correlating the phenotypic

and molecular diversity in Jatropha curcas L. Biomass and Bioenergy 35 :1085 1096.

Sunil N, Sivaraj N, Anitha K, Abraham B, Kumar V, Sudhir E, Vanaja M, Varaprasad KS (2009)

Analysis of diversity and distribution of Jatropha curcas L. germplasm using Geographic

Information System (DIVA-GIS). Genet Resour Crop Evol 56:115-119.

Surwenshi A, Kumar V, Shanwad UK and Jalageri BR (2011) Critical review of diversity in Jatropha

curcas for crop improvement: A candidate biodiesel crop. Research Journal of Agricultural

Sciences 2:193-198.

Szalma SJ, Buckler ES, Snook ME, McMullen MD (2005) Association analysis of candidate genes for

maysin and chlorogenic acid accumulation in maize silks. Theoretical and Applied Genetics

110:1324-1333.

Tang J, Gao L, Cao Y, Jia J (2006) Homologous analysis of SSR-ESTs and transferability of wheat SSR-

EST markers across barley, rice and maize. Euphytica 151: 87-93.

Tanya P, Taeprayoon P, Hadkam Y, Srinives P (2011) Genetic diversity among Jatropha and Jatropha

related species based on ISSR markers. Plant Molecular Biology Reports 29:252-264.

Taramino G and Tingey SV (1996) Simple sequence repeats for germplasm analysis and mapping in

maize. Genome 39:277-287.

Tatikonda L, Wani PS, Kannan S, Beerelli N, Sreedevi KT, Hoisington AD (2009) AFLP-based

molecular characterization of an elite germplasm collection of Jatropha curcas L. biofuel plant.

Plant Science 176:505-513.

Tautz D (1989) Hyper variability of simple sequences as a general source for polymorphic DNA markers.

Nucleic Acids Research 17:6463-6471.

Tautz D, Renz M (1984) Simple sequence repeats are ubiquitous repetitive components of eukaryotic

genomes. Nucleic Acid Research 12:4127-4138.

Tautz D, Schlotterer C (1994) Simple Sequences. Current Opinion in Genetics Development 4:832-

837.

Techen N, Arias RS, Glynn NC, Pan Z, Khan IA, Scheffler BE (2010) Optimized construction of

microsatellite-enriched libraries. Mol Ecol Resour 10:508-515.

Temnykh S, DeClerck G, Lukashova A, Lipovich L, Cartinhour S, McCouch S (2001) Computational and

experimental analysis of microsatellites in rice (Oryza sativa L.): Frequency, length variation,

transposon associations, and genetic marker potential. Genome Research 11:1441-1452.

Tewari JP, Dwivedi HD, Pathak M, Srivastava SK (2007) Incidence of mosaic disease in Jatropha

curcas L. from eastern Uttar Pradesh. Current Science 93:1048-1049.

Thiel T, Michalek W, Varshney RK, Graner A (2003) Exploiting EST databases for the development of

cDNA derived microsatellite markers in barley (Hordeum vulgare L.). Theoretical and Applied

Genetics 106:411-422.

Thompson JA, RL Nelson, Vodkin LO (1998) Identification of diverse soybean germplasm using RAPD

markers. Crop Science 38:1348-1355.

Thoren PA, Paxton RJ, Estoup A (1995) Unusually high frequency of (CT)n and (GT)n microsatellite loci

in a yellow jacket wasp, Vespula rufa (L.) (Hymenoptera: Vespidae). Insect Molecular Biology

4:141-148.

Toth G, Gaspari Z, Jurka J (2000) Microsatellites in different eukaryotic genomes: survey and analysis.

Genome Research 10:967-981.

178

Tyagi S, Mir RR, Kaur H, Chhuneja P, Ramesh B, Balyan HS, Gupta PK (2014) Marker-assisted

pyramiding of eight QTLs/genes for seven different traits in common wheat (Triticum aestivum L.).

Molecular Breeding DOI 10.1007/s11032-014-0027-1.

Uga Y, Yamamoto E, Kanno N, Kawai S, Mizubayashi T, Fukuoka S (2013) A major QTL controlling

deep rooting on rice chromosome 4. Scientific Reports 3:3040.

Varshney RK, Graner A, Sorrells ME (2005) Genic microsatellite markers in plants: features and

applications. Trends in Biotechnology 23:48-55.

Varshney RK, Thiel T, Stein N, Langridge P, Graner A (2002) In silico analysis on frequency and

distribution of microsatellites in ESTs of some cereal species. Cellular and Molecular Biology

Letter 7:537-546.

Vijayanand V, Senthil N, Vellaikumar S, Paramathama M (2009) Genetic diversity of Indian Jatropha

species as revealed by morphological and ISSR markers. Journal of Crop Science and


Vischi M, Raranciuc S, Baldini M (2013) Evaluation of genetic diversity between toxic and non toxic

Jatropha curcas L. accessions using a set of simple sequence repeat (SSR) markers. African

Journal of Biotechnology12: 265-274.

Vos P, Hogers R, Bleeker M, Reijans M, van de Lee T, Hornes M, Frijters A, Pot J, Peleman J, Kuiper M

and Zabeau M (1995) AFLP: a new technique for DNA fingerprinting. Nucleic Acids Research

23:4407-4414.

Vuylsteke M, Mank R, Antonise R, Bastiaans E, Senior ML, Stuber CW, Melchinger AE, Lbberstedt T,

Xia XC, Stam P (1999) Two high-density AFLP® linkage maps of Zea mays L.: Analysis of

distribution of AFLP markers. Theoretical and Applied Genetics 99: 921-935.

Wang A, Li Y and Zhang C (2012) QTL mapping for stay-green in maize (Zea mays). Canadian Journal

of Plant Science 92: 249-256.

Wang CM, Li LH, Zhang XT, Gao Q, Wang RF and An DG (2009a) Development and application of

EST-STS markers specific to chromosome 1RS of Secale cereal. Cereal Research

Communication 37:13-21.

Wang ML, Barkley NA, Jenkins TM (2009b) Microsatellite markers in plants and insects. Part I.

Applications of biotechnology. Genes Genomes Genomics 3:54-67.

Wang CM, Liu P, Yi C, Gu K, Sun F, Li L, Lo LC, Liu X, Feng F, Lin G, Cao S, Hong Y, Yin Z, Yue

GH (2011) A first generation microsatellite and SNP based linkage map of Jatropha. PLoS ONE

6(8): e23632.

Wang R, Hanna MA, Zhou W-W, Bhadury PS, Chen Q, Song BA (2011) Production and selected fuel

properties of biodiesel from promising non-edible oils: Euphorbia lathyris L., Sapium sebiferum L.

and Jatropha curcas L. Bioresource Technology 102:1194-1199.

Wang H, Li X, Jin WGX, Zhang X, Lin Z (2014) Comparison and development of EST-SSRs from two

454 sequencing libraries of Gossypium barbadense. Euphytica DOI 10.1007/s10681-014-1104-6.

Wang X, Trigiano R, Windham M, Scheffler B, Rinehart T, Spiers J (2008) Development and

characterization of simple sequence repeats for flowering dogwood (Cornus florida L.). Tree

Genetics and Genomes 4:461-468.

Wang XW, Kaga A, Tomooka N, Vaughan DA (2004) The development of SSR markers by a new

method in plants and their application to gene flow studies in azuki bean Vigna angularis (Willd.)

Ohwi and Ohashi. Theoretical and Applied Genetics109:352-360.

Wang Z, Weber JL, Zhang G, Tanksley SD (1994) Survey of plant short tandem DNA repeats.


Wani SP, Chander G, Sahrawt KL, Rao Ch. S, Raghvendra G, Susanna P, Pavani M (2012a) Carbon

sequestration and land rehabilitation through Jatropha curcas (L.) plantation in degraded lands.

Agriculture, Ecosystem and Environment 161:112-120.

Wani TA, Kitchlu S, Ram G (2012b) Genetic variability studies for morphological and qualitative

attributes among Jatropha curcas L. accessions grown under subtropical conditions of North India.

South African Journal of Botany 79:102-105.

179

Wani SP, Osman M, D’Silva E, Sreedevi TK (2006) Improved livelihoods and environmental

protection through biodiesel plantations in Asia. Asian Biotechnology Development Review

8:11-29.

Ward JH (1963) Hierarchical grouping to optimize an objective function. Journal of the American

Statistical Association 58: 236-244.

Weber D and Helentjaris (1989) Mapping RFLP loci in Maize using B-A translocations. Genetics

121:583-590.

Weber JL (1990) Informativeness of human (dC-dA)n (dGdT)n polymorphisms. Genomics 7:524-530.

Weber JL and May PE (1989) Abundant class of human DNA polymorphisms which can be typed using

the polymerase chain reaction. American Journal of Human Genetics 44: 388-396.

Weissenbach J, Gyapay G, Dib C, Vignal A, Morissette J, Millasseau P, Vaysseix G and Lathrop M

(1992). A second-generation linkage map of the human genome. Nature 59: 794-801.

Wen M, Wang H, Xia Z, Zou M, Lu C and Wang W (2010) Development of EST-SSR and genomic-SSR

markers to assess genetic diversity in Jatropha Curcas L. BMC Research Notes 3:42.

Williams JGK, Kubelik AR, Livak KJ, Rafalski JA and Tingey SV (1990) DNA polymorphisms

amplified by arbitrary primers are useful as genetic markers. Nucleic Acids Research 18:6531-

6535.

Winter P and Kahl G (1995) Molecular marker technologies for plant improvement. World Journal of

Microbiology and Biotechnology 11: 438-448.

Wolff RK, Plaetke R, Jeffreys AJ, White R (1991) Unequal crossing over between homologous

chromosomes is not the major mechanism involved in the generation of new alleles at VNTR loci.

Genomics 5: 382-384.

Wu KS, Jones R, Danneberger L and Scolnik PA (1994) Detection of microsatellite polymorphisms

without cloning. Nucleic Acids Research 22: 3257-3258.

Wu KS, Tanksley SD (1993) Abundance, polymorphism and genetic mapping of microsatellites in rice.

Molecular and General Genetics 241:225-235.

Xiao Y, yu C, Yabes JM, Lei J, Wang X, Wang Z, Chen H, Tabanao DA (2013) Linkage disequilibrium

in a diversity and stress adaptation rice panel for association mapping. Molecular Plant Breeding

4:297-303.

Yadav HK, Ranjan A, Asif MH, Mantri S, Sawant SV, Tuli R (2011) EST-derived SSR markers in

Jatropha curcas L.: development, characterization, polymorphism, and transferability across the

species/genera. Tree Genetics and Genomes 7:207-219.

Yang T, Bao S, Ford R, Jia T, Guan J, He Y, Sun X, Jiang J, Hao J, Zhang X, Zong X (2012) High-

throughput novel microsatellite marker of faba bean via next generation sequencing. BMC

Genomics13:602.

Ye M, Li C, Francis G, Makkar HPS (2009) Current situation and prospects of Jatropha curcas as a

multipurpose tree in China. Agroforestry System 76:487-497.

Yu J, Zhao H, Zhu T, Chen L, Peng J (2013) Transferability of rice SSR markers to Miscanthus sinensis,

a potential biofuel crop. Euphytica 191:455-468.

Yue GH, Lo LC, Sun F, Cao SY, Yi CX, Hong Y, Sun WB (2013) No variation at 29 microsatellites in

the genome of Jatropha curcas. Journal of Genomics 2:59-63.

Zabeau M and Vos P (1993) Selective restriction fragment amplification: a general method for DNA

fingerprinting. European Patent 0534858 A1.

Zalapa JE, Cuevas H, Zhu H, Steffan S, Senalik D, Zeldin E, Mccown B, Harbut R, Simon P (2012)

Using next-generation sequencing approaches to isolate simple sequence repeat (SSR) loci in the

plant sciences. American Journal of Botany 99:193-208.

Zane I, Bergelloni L and Patarnello T (2002) Strategies for microsatellite isolation: a review. Molecular

Ecology 11:1-16.

Zapico FL, Nival SK, Aguilar CH, Eroy MN (2011) Phenotypic diversity of Jatropha curcas L. from

diverse origins. Journal of Agricultural Science and Technology 5:215-219.

180

Zeng Y, Yang S, Cui H, Yang X, Xu L, Du J, Pu X, Li Z, Cheng Z, Huang X (2009) QTLs of cold-related

traits at the booting stage for NIL-RILsin rice revealed by SSR. Genes and Genomics 31: 143-154.

Zhao X, Kochert G (1993) Phylogenetic distribution and genetic mapping of a (GGC) n microsatellite

from rice (Oryza sativa L.). Plant Molecular Biology 21:607-614.

Zhao Y, Wang H, Chen W, Li Y (2014) Genetic structure, linkage disequilibrium and association

mapping of verticillium wilt resistance in elite cotton (Gossypium hirsutum L.) germplasm

population. PLoS ONE 9:e86308.

Zhu H, Senalik D, McCown BH, Zeldin EL, Speers J, Hyman J, Bassil N, Hummer K, Simon PW, Zalapa

JE (2012) Mining and validation of pyrosequenced simple sequence repeats (SSRs) from American

cranberry (Vaccinium macrocarpon Ait.). Theoretical and Applied Genetics 124:87-96.

Zhu Y, Queller DC and Strassmann JE (2000) A phylogenetic perspective on sequence evolution in

microsatellite loci. Journal of Molecular Evolution 50:324-338.

Zietkiewicz E, Rafalski A and Labuda D (1994) Genome fingerprinting by simple sequence repeat (SSR)-

anchored polymerase chain reaction amplification. Genomics 20:176-183.

Appendix I. Details of chemicals and kits used

Table 1. Composition of suspension buffer used in the genomic DNA isolation

Suspension buffer (pH 8.0) Molecular weight Stock conc. Final conc.

EDTA 372.24 0.5M 50mM

Tris HCl 121.12 1M 100 mM

NaCl 58.44 5M 0.8M

Sucrose 342.3 1M 0.5M

Triton X-100 100% 2%

βME 0.1%

Table 2. Composition of extraction buffer used in the genomic DNA isolation

Extraction buffer (pH 8.0) Molecular weight Stock conc. Final conc.

EDTA 372.24 0.5M 20mM

Tris HCl 121.12 1M 100mM

NaCl 58.44 5M 1.5M

CTAB 2%

βME 0.1%

Table 3. Composition of buffers and solution used in plasmid isolation

Solution I Solution II Solution III

Glucose 50 mM NaOH 10 N Potassium acetate 5 mM

EDTA 10 mM SDS 10% Glacial acetic acid absolute

Tris HCl 25 mM

Table 4. Composition of buffer for agarose gel electrophoresis

Recipe for 1 liter 5X TBE buffer Recipe for 1 liter 50 X TAE buffer

Tris base 54 g Tris base 242 g

Boric Acid 27.5 g Acetic acid 57.1 mL

EDTA 4.65 g EDTA (0.5 M) (pH 8.0) 100 mL

Appendix II. Chemicals and reagents used

Table 5 List of chemicals used in the study

S.N. Name Chemicals S.N. Name Chemicals

1. Agarose 27. Glycerol

2. Acrylamide 28. Formaldehyde

3. Bis-Acrylamide 29. Chloroform

4. Luria agar (LA) 30. Isoamyl alcohol

5. Luria broth (LB) 31. Isopropyl alcohol

6. Luria broth agar (LBA) 32. Phenol

7. Sodium chloride 33. IPTG

8. β-mercaptoethanol 34. X-gal

9. Tris 35. PVP

10. Boric acid 36. Labolene

11. EDTA 37. SOC media

12. Silver nitrate 38. HiDi Formamide

13. Ammonium Per sulfate (APS) 39. Liz 600

14. Binding silane 40 ROX

15. TEMED 41. Polymer POP7

16 Glacial acetic acid 42. 10x buffer

17. Sodium acetate 43. PCR master mix

18. Potassium acetate 44. dNTPs

19. Sodium hydroxide 45. Taq Polymerase

20. Ammonia 46. RNAse

21. Sodium carbonate 47. Ampicillin

22. Sodium Dodacyl Sulfate (SDS) 48. Ethidium bromide (EtBr)

23. CTAB 49. Mix bed resin

24. Sucrose 50. Bromophenol blue

25. Triton 51. Xylene cyanol

26. Tris HCl

Appendix III Primers/Oligonucleotides

Table 6 Fluorescently labeled M13 tag primers

S.N. Fluorescent dye Primer Sequence (5'-3')

1. FAM TGTAAAACGACGGCCAGT

2. VIC TGTAAAACGACGGCCAGT

3. NED TGTAAAACGACGGCCAGT

4. PET TGTAAAACGACGGCCAGT

Appendix IV Electrophoresis reagents and kits

Table 7 Chemicals used for PAGE

S.N. Chemicals S.N. Chemicals

1. TBE Buffer (5X) 8. Formaldehyde

2. Acrylamide 9. Silver nitrate

3. N,N’ Methylene bis-acrylamide 10. Binding silane

4. TEMED 11. Repel silane

5. Ammonium per sulphate (APS) 12. Ethanol

6. Ammonia solution 13. CTAB

7. Sodium carbonate 14. Sodium hydroxide

Table 8 Chemicals used for capillary electrophoresis

1. HiDi Formamide

2. Size standard (LIZ 600)

3. POP-7TM

polymer

4. 10X buffer with EDTA

Appendix V. Major Equipments used

Table 9 Major equipments used in the study

S.N. Equipment Model/type Company

1. Centrifuges Mini spin, Ultra centrifuge,

96-well plate centrifuge

Eppendorf, Thermo Fisher,

Sorvall fresco, Sigma svi

Biosolutions Pvt. Ltd.

2. Nano drop Spectrophotometer ND1000 Nanodrop Technologies, DE,

USA

3. PCR machines DNA Engine Tetrad 2 and

Verti 96 well thermo cycler

Bio-Rad Inc. USA, and

Applied Biosystems

4. Agarose gel

system

Horizontal Takara and GeneiTM

5. Gel Doc Gel DocTM

XR with Image

LabTM

Bio-Rad, USA

6. PAGE system Bio-Rad Sequi Gen gel

apparatus

Bio-Rad, USA.

7. Sequencing

platform

DNA analyzer ABI 3730 xl

GS FLX 454 pyrosequencer

Applied Biosystem Inc. USA)

Roche’s life science

8. High

throughput

electrophoresis

system

DNA analyzer ABI 3730 xl Applied Biosystem Inc. USA)

List of publications

(A) RESEARCH PAPERS PUBLISHED AND COMMUNICATED

1. Maurya R, Gupta A, Singh SK, Rai KM, Chandrawati, Sawant SV, Yadav HK (2013)

Microsatellite polymorphism in Jatropha curcas L.- a biodiesel plant. Industrial Crops and

Products. 49:136-142.

2. Maurya R, Verma S, Gupta A, Singh B, Yadav HK (2013) Genetic variability and

divergence analyses in Jatropha Curcas L. based on floral and yield traits. Genetika 45:

655-666.

3. Gupta A, Maurya R, Roy RK, Sawant SV, Yadav HK (2013) AFLP genetic relationship

and population structure analysis of Canna-an ornamental plant. Scientia Horticulturae

154:1-7.

4. Chandrawati, Maurya R, Singh PK, Ranade SA, Yadav HK (2014) Diversity analysis in

Indian genotypes of linseed (Linum usitatissimum L.) using AFLP markers. Gene 549:171-

178.

5. Maurya R, Kumar U, Katiyar R, Yadav HK (2014) Correlation and path coefficient analysis

in J. curcas. Genetika (Accepted).

6. Maurya R#, Gupta A

#, Singh SK, Rai KM, Chandrawati, Ktiyar R, Sawant SV, Yadav HK

(2014) Development of microsatellite markers for Jatropha curcas using 454 sequencing and

its application for diversity analysis. (Communicated). # Equal contribution

(B) SCIENTIFIC PRESENTATIONS

1. Yadv HK, Maurya R, Gupta A, Singh S, Rai KM, Tuli R and Sawant SV (2011)

Development of microsatellite markers and their application for diversity analysis in

Jatropha curcas L. (All India Botanical conference held at Lucknow University, October

10-12, 2011). Page N. 195.

2. Gupta A, Maurya R, Singh SK, Rai KM and Yadav HK (2011) Development of large scale

genomic SSRs from microsatellite enriched libraries of Jatropha curcas L”. - a biofuel plant,

(Society of Biotechnological chemist Conference, 12-15 November, 2011, held on CSIR-

CIMAP, Lucknow). Page N. 59. 3. Maurya R, Gupta A, Kumar U, Chandrawati and Yadav HK (2013) Development and

characterization of trinucleotide repeat (TNR) motif microsatellite markers in Jatropha

curcas L. (SAB-Society for Applied Biotechnology held on 28th

-29th

, June, 2013, at

Tirupati, Andhra Pradesh. First International and Third National Conference). Page N.

28.

4. Chandrawati, Maurya R, Singh PK and Yadav HK (2013) AFLP based genetic diversity in

linseed (Linum usitatissimum L.). (SAB-Society for Applied Biotechnology held on 28th

-

29th

, June, 2013, at Tirupati, Andhra Pradesh. First International and Third National

Conference). Page N. 199. 5. Maurya R, Gupta A, Katiyar R, Ranade SA and Yadav HK (2014) Development of

interspecific hybrid of Jatropha curcas L. and its characterization using SSR markers.

(2nd UP Agricultural Science Congress held on June 14th

-16th

June, 2014 at IISR,

Lucknow). Page N. 313-314. 6. Chandrawati, Maurya R, Singh PK, Ranade SA and Yadav HK (2014) Molecular diversity

in Indian genotypes of linseed (Linum usitatissimum L.). (2nd

UP Agricultural Science

Congress held on June 14th

-16th

June, 2014 at IISR, Lucknow). Page N. 312-313.

Industrial Crops and Products 49 (2013) 136– 142

Contents lists available at SciVerse ScienceDirect

Industrial Crops and Products

journa l h om epa ge: www.elsev ier .com/ locate / indcrop

Microsatellite polymorphism in Jatropha curcas L.—A biodiesel plant

Ramanuj Maurya, Astha Gupta, Sunil Kumar Singh, Krishan Mohan Rai, Chandrawati,Samir V. Sawant, Hemant Kumar Yadav ∗

CSIR-National Botanical Research Institute, Rana Pratap Marg, Lucknow 226001, India

a r t i c l e i n f o

Article history:Received 5 February 2013Received in revised form 19 April 2013Accepted 19 April 2013

Keywords:Jatropha curcasEnriched libraryGenomic SSRBiodieselGenetic diversity

a b s t r a c t

We developed and characterized 1207 SSRs to enrich the validated markers repertoire of Jatropha curcas.A total of 248 polymorphic SSRs were identified with a panel of 7 accessions of J. curcas including someexotic accessions. Furthermore, 179 and 331 SSRs were found polymorphic among parental lines ofNBRI-J05 × EC643912 and Chhatrapati × Jatropha integerrima used in developing two mapping populationrespectively. The number of alleles varied from 2 to 5 with an average of 2.24 ± 0.55 and 2.42 ± 0.62alleles/SSR for CA and GA enriched library respectively. Most of the SSRs had lower PIC value (less than0.30) and the maximum PIC value was observed for JGM A281, JGM A326 (0.63) followed by JGM B300(0.62), JGM B361 (0.58), JGM A244 (0.55), JGM B595 (0.55) and JGM B176 (0.55). The genetic similaritycoefficient among the accessions of J. curcas and one accession of J. integerrima ranged from 0.11 to 0.92with an average of 0.57 ± 0.31. The phenogram classified all the 7 accessions of J. curcas in one cluster andthe J. integerrima remained as an out group. The BLASTX analysis of SSR containing sequences showedmaximum similarity of 50% with Ricinus communis (Euphorbiaceae) followed by Populus trichocarpa (22%),Vitis vinifera (16%) and Arabidopsis spp. (4.5%). This study may enrich the validated repertoire of SSRmarkers in J. curcas and could be used in various genetic studies including construction of linkage map,diversity analysis, and QTL/association mapping.

© 2013 Elsevier B.V. All rights reserved.

1. Introduction

Jatropha curcas L., commonly known as physic nut, belongs tothe family Euphorbiaceae and has small genome size of 416 Mb(Carvalho et al., 2008). It is reported to be native to tropical Amer-ica, but also widely distributed in other tropical and sub-tropicalareas of the world, especially in Africa, India and Southeast Asia(Heller, 1996; Rao et al., 2008). Traditionally, this plant is used asa fence plant, to prevent and/or control erosion and reclaim land.Recently, J. curcas has been projected as a promising and poten-tial source of biodiesel as crude oil from its seeds meets the fuelquality of rapeseed (www.fact-foundation.com) which can be eas-ily converted to biodiesel with US and European standards (Azamet al., 2005). However, J. curcas is still considered as undomes-ticated/semi domestic plant with various negative features withmajor knowledge gaps regarding basic genetics, ecological andagronomic properties (Achten et al., 2008; Fairless, 2007). Consid-ering the importance of the crop, there is a need for a betterunderstanding of various basic and applied aspects of this plant soas to improve or develop stable high yielding varieties. Traditionalapproaches for genetic improvement of polygenic traits mainly rely

∗ Corresponding author. Tel.: +91 522 2297938; fax: +91 522 2205836.E-mail address: [email protected] (H.K. Yadav).

on phenotypic and pedigree information (Falconer et al., 1996) thatare labor and time intensive. Therefore, the molecular breedingapproach is being considered for genetic improvement for mostof the economically important crop plants as faster alternatives.The basic requirements of any marker assisted breeding programto be successful include availability of reliable molecular markersystem, and markers tightly linked to QTLs of desired traits. Amongvarious DNA markers, SSRs and SNPs have been widely applied forestimation of genetic diversity, construction of linkage map andassociation/QTL mapping to tag the target traits. As far as J. curcasis concerned, a limited number of SSRs and SNPs marker are avail-able as the information on quantitative genetics. Although, Satoet al. (2011) reported genome sequence of J. curcas and identifiedlarge number of SSRs, but the details of these SSRs are not avail-able with the public domain. Further, they characterized only 100SSRs (<0.3% reported) over 12 accessions. So, there is still need oflarger numbers of validated markers. In recent past, various effortshave been made to develop and utilize different molecular mark-ers in J. curcas which majorly include RAPD (Ganesh et al., 2008;Rafii et al., 2012), SPAR (Ranade et al., 2008), ISSR (Grativol et al.,2010; Kumar et al., 2009) and AFLP (Tatikonda et al., 2009; Shenet al., 2010) markers. However, some reports on SSRs and SNPsmarkers are also available (Sudheer et al., 2010; Yadav et al., 2011;Wang et al., 2011; Gupta et al., 2012a). Wang et al. (2011) have con-structed linkage map of J. curcas with 216 SSRs and 290 SNPs using

0926-6690/$ – see front matter © 2013 Elsevier B.V. All rights reserved.http://dx.doi.org/10.1016/j.indcrop.2013.04.034

R. Maurya et al. / Industrial Crops and Products 49 (2013) 136– 142 137

backcross mapping population derived from J. curcas and Jatrophaintegerrima. Later, Sun et al. (2012) used linkage map developed byWang et al. (2011) and reported QTLs associated with growth andseed traits with deployment of 105 SSR markers. Till date, most ofthe markers studies in J. curcas have been performed with limitednumbers of markers. For better understanding of polygenic traits,construction of dense map and fine QTL mapping a large numberof workable and validated markers are required. Thus, in view ofthe previous report of low level of genetic diversity and limitednumber of validated markers, there is a need to enrich the geneticpool, develop and validate large number of polymorphic markers.Therefore, the present investigation was undertaken to develop aset of SSRs from genomic libraries and to thereafter characterize,validate and identify polymorphic SSRs in J. curcas.

2. Materials and methods

2.1. Plant materials and DNA isolation

The plant materials include 7 accessions of J. curcas i.e. Han-sraj, Chhatrapati, CRIDA-C-16, RRL-Mon-C-1, NBRI J05, EC685205and EC643912 and 1 accession of J. integerimma. All these acces-sions (except EC685205 and EC643912) of indigenous origin andcollected from different eco-geographical and agro-climatic zonesof India. These are reported to be elite and diverse accessions ofJ. curcas from India (Yadav et al., 2011). The accessions EC685205and EC643912 are exotic collections and were procured from SouthAfrica and Mexico respectively. The Mexican accession is reportedto be non toxic as it contains low amount of phorbol esters (Makkaret al., 1998). The NBRI-J05 × EC643912 and Chhatrapati × J. inte-gerrima were used as parental lines for developing intraspecific andinterspecific mapping population respectively at National Botani-cal Research Institute (NBRI), Lucknow, India. The DNA from youngleaves was extracted using Qiagen DNeasy Plant Mini kit (Qiagen,Valencia, California) as per manufacturer’s instructions. The qualityof DNA was checked on 0.8% agarose gel and the concentration wasdetermined using a Nanodrop spectrophotometer ND1000 (Nano-drop Technologies, DE, USA). Finally, the DNA was normalized to10 ng/�l for PCR amplification.

2.2. SSR enriched genomic library and sequencing

Two genomic libraries enriched for CA and GA repeat motifswere custom produced by Genetic Identification Services (GIS,Chatsworth, CA, USA). The SSR enriched DNA fragments werecloned in HindIII restriction site of the plasmid vector pUC19.The recombinant plasmid was transformed into ElectroMaxTM

DH5�-ETM electrocompetent Escherichia coli cells (Invitrogen)and plasmid DNA was isolated from transformed cells followingstandard alkaline lysis mini prep protocol. The sequencing was car-ried out with M13 primer (5′ACGACGTTGTAAAACGACGG-3′) andBig Dye Terminator Cycle Sequencing Kit 3.1 on ABI 3730xl DNAAnalyzer (Applied Biosystems, Foster City, CA, USA).

2.3. SSR identification, primer designing and PCR amplification

The sequence data obtained were checked manually and vectorsequences were removed. The redundant sequences were identi-fied by comparison using stand-alone BLAST (2.2.1.2) and removed.The unique sequences were subjected to SSR search by web basedprogram SSRIT (http://www.gramene.org/db/markers/ssrtool).The basic search criteria for SSRs were a minimum of fiverepeat and maximum motif length was six. The primer pairsflanking the SSRs were designed using PRIMER3 software(http://frodo.wi.mit.edu/primer3). The primers were synthe-sized with an additional 18 base (5′-TGTAAAACGACGGCCAGT-3′)

tag at 5′ end to all the forward primers as M13 tail (Eurofins,Germany). The PCR amplification was carried out in 10 �l reactionvolume containing 10 ng of genomic DNA, 1× PCR master mix (Fer-mentas Inc, USA), 0.1 �l (5 pmol/�l) of forward primer (tailed withM13 tag), 0.3 �l (5 pmol/�l) each of both normal reverse and M13tag (labeled with either 6-FAM NED, VIC and PET) using AppliedBiosystems Veriti PCR machine. The PCR conditions was as follows:initial denaturation for 5 min at 95 ◦C, followed by 35 cyclesof denaturation for 30 s at 94 ◦C, annealing for 45 s at 48–52 ◦C(primer specific) and extension for 30 s at 72 ◦C. Subsequently, 10cycles of denaturation for 30 s at 94 ◦C, annealing for 45 s at 53 ◦C,extension for 45 s at 72 ◦C followed by final extension for 15 minat 72 ◦C was performed. After PCR amplification confirmation on1.5% agarose gel, post PCR multiplex sets was prepared based onfluorescence labeled primers. For post PCR multiplexing, 1 �l of6-FAM and 2 �l of each VIC, NED and PET labeled PCR productrepresenting different SSRs were combined with 13 �l of water.1 �l of this mixed product was added to 10 �l Hi-Di formamidecontaining 0.25 �l GeneScanTM 600 LIZ® as internal size standard,denatured for 5 min at 95 ◦C, quick chilled on ice for 5 min andloaded on ABI 3730xl DNA Analyzer. The fragment analysis wasperformed by GeneMapper v4.0 software (Applied Biosystems,Foster City, CA, USA).

2.4. Data acquisition and statistical analyses

The allelic data of polymorphic SSRs were subjected to statisti-cal analysis using PowerMarker (Liu and Muse, 2005) to calculateobserved heterozygosity (Ho), gene diversity or expected heterozy-gosity (He), major allele frequency and polymorphic informationcontent (PIC) value. The PIC value was calculated following Botsteinet al. (1980) as follow:

PIC = 1 −[

n∑i=1

Pi2

]−

⎡⎣ n−1∑

i=1

n∑j=i+1

2Pi2Pj2

⎤⎦

where Pi and Pj are the frequencies of ith and jth allele.Further, pair-wise genetic similarities among all the accessions

using Jaccard’s coefficient was also calculated and a dendrogramwas prepared based on unweighted pair-group method of arith-metic average (UPGMA) using NTSYS-pc v.2.02e software (Rohlf,2000).

2.5. Functional annotation and GO analysis

The SSR containing sequences were annotated against NCBInr protein database (NCBI nr, release: 20th Dec, 2011) usingBLASTX with a criteria of minimum e-value of 1e−5 and minimumalignment length 50% of the query sequence. The annotationswere further classified on the basis of their plant specific associ-ations. The Gene Ontology (GO) analysis (http://arabidopsis.org/tools/bulk/go/index.jsp/), for functional annotations, was per-formed on the basis of TAIR GO annotations (ftp://ftp.arabidopsis.org/home/tair/Sequences/blast datasets/TAIR10blastsets/TAIR10 pep 20101214). The GO terms associated withArabidopsis loci (best BLASTX hit) were assigned for annotationsof corresponding sequences and categorized under molecularfunction, biological process and cellular component categories.

3. Results

3.1. Characterization and polymorphism evaluation ofdinucleotide SSRs

The SSRs were developed from genomic libraries enriched withCA (designated as Lib A) and GA (designated as Lib B) repeat units.

138 R. Maurya et al. / Industrial Crops and Products 49 (2013) 136– 142

Table 1Summary of genomic SSRs developed from enriched libraries of J. curcas.

Lib A (CA enriched) Lib B (GA enriched) Total

Clones Sequenced 1740 1530 3270Sequences containing SSR motifs 1385 (79.6%) 1345 (87.9%) 2730 (83.5%)SSR containing unique sequences 639 (46.1%) 676(50.3%) 1315 (48.2%)Primers designed for 574(41.4%) 633(47.1%) 1207(44.2%)Total number of SSRs identified 759 857 1616Sequences with > 1 SSRs 152 (26.5%) 191 (30.2%) 343 (28.4%)Compound SSRs 115(20.0%) 154(24.0%) 269 (22.0%)Perfect 455 (79.0%) 473 (75.0%) 928 (77.0%)Interrupted 4 (1.0%) 6(1.0%) 10 (1.0%)

More than 3500 clones were sequenced and a total of 1740 and1530 good quality sequences were selected from Lib A and B respec-tively (Table 1). A total of 1385(79.6%) sequence from Lib A and1530 (87.9%) sequences from Lib B were found to have SSR motifs.After removing redundant sequences, a total of 639 (46.1%) and 676(50.3%) SSR containing sequences were found to be unique from LibA and B respectively. Primers were successfully designed for 574(out of 639 of Lib A) SSR containing sequences while the remaining65 primers could not be designed due to either marginal SSRs orflanking sequences not suitable for primer designing criteria. Like-wise, out of 674 sequences of Lib B, primers were designed for 633sequences. Thus, a total of 1207 (44.2%) primer pairs flanking 1616SSRs were successfully designed and synthesized (SupplementaryTable 1). These 1207 SSR containing sequences were submitted toNCBI under GSS (Acc. No. JM427845–JM429048). The enrichmentpercentage was found to be higher for GA repeat motif (87.9%)than CA repeat motif (79.6%). As expected, dinucleotide repeats(DNR) were recovered in higher proportion from both the libraries(Supplementary Fig. 1a). However, other SSR repeat types such astrinucleotide repeat (TNR) and tetranucleotide repeat (TNR) werealso obtained but in limited numbers. The majority of identifiedSSRs (65.4%) were of short length, in the range of 4–10 repeat unitsfrom both the libraries (Supplementary Fig. 1b). However, 37 longerSSRs were also found with more than 20 repeat unit. Consider-ing the frequencies of different SSR motifs, the AC/GT and AG/CTrecovered in higher frequency of 32.5% and 45.3% of Lib A and Brespectively (Supplementary Fig 2).

Out of 574 Lib A SSRs, 106 (18.5%) were found to be polymorphicamong 7 accessions of J. curcas, 420 (73.2%) were monomorphic andthe rest 48 (8.3%) failed to amplify. Further, exclusion of EC643912(non-toxic accession) reduces the polymorphic SSRs to 55 (9.6%)(Table 2). The intraspecific mapping population parental screeningof Lib A SSRs revealed that 71 (12.4%) were polymorphic and 446were monomorphic among NBRI-J05 × EC643912, while 57 werenot amplified in both the accessions. In case of the parents of inter-specific population i.e. Chhatrapati × J. integerrima, 144 (25%) SSRswere polymorphic and 177 (30.8%) were monomorphic. Similarly,251 and 2 SSRs were not amplified with J. integerrima and Chhatra-pati respectively. Furthermore, the results obtained with Lib B SSRsshowed that 70 were not amplified with any accession of J. curcasand out of rest 563, 421 (75.0%) were found to be monomorphic and142 (25.0%) were polymorphic. The number of polymorphic SSRs

reduced from 142 to 74 when EC643012 (non-toxic accession) wasexcluded from the analysis. The polymorphism screening of Lib BSSRs with parental lines showed that 108 (19.2%) and 187 (33.2%)were polymorphic with NBRI-J05 × EC643912 and Chhatrapati × J.integerrima respectively (Table 2).

In addition to polymorphism detection with respect to map-ping populations, the genotypic data of polymorphic SSRs with 7accessions of J. curcas was also subjected to various statistical anal-yses to assess the potential of newly developed SSRs for furthergenetic studies. The 106 polymorphic Lib A SSRs showed differentdegree of variability at each locus as the number of alleles variedfrom 2 to 5 (JGM A281) with an average of 2.24 ± 0.55 alleles/SSR(Supplementary Table 2). The PIC value of these polymorphic SSRsranged between 0.12–0.63 with an average of 0.24 ± 0.10. The max-imum PIC was noticed for JGM A281 and JGM A326 (0.63) followedby JGM A244 (0.55) and JGM A107 (0.52). Most of the SSRs (85;53.1%) showed lower PIC value less than 0.3 (Fig. 1). There wereonly three SSRs showing PIC value in between 0.51 – 0.70. Theobserved heterozygosity (Ho) and expected heterozygosity (He)or gene diversity varied in the range of 0.00–1.0 (0.12 ± 0.23) and0.12–0.65 (0.24 ± 0.11) respectively (Supplementary Table 2). Themajor allele frequency of polymorphic SSRs varied from 0.43 to 0.93with an average of 0.83 ± 0.11. The detailed analysis of 142 poly-morphic Lib B SSRs revealed considerable variability at each locusand showed allele range from 2 to 5 with an average of 2.42 ± 0.62alleles/SSR. The PIC value varied between 0.12 – 0.62 with an aver-age of 0.28 ± 0.13 per SSR (Supplementary Table 2). Also in caseof Lib B, most of the SSRs (92; 65%) had lower PIC value (<0.30)(Fig. 1). There were only 9 SSRs which showed PIC value in rangeof 0.51–0.70. The maximum PIC value was observed for JGM B300(0.62) followed by JGM B361 (0.58), JGM B595 (0.55) and JGM B176(0.55). Major allele frequency of polymorphic SSRs of Lib B rangedfrom 0.43 to 0.93 with an average of 0.78 ± 0.15. The observed het-erozygosity (Ho) varied between 0.00 and 1.00 with an averageof 0.31 ± 0.21 and expected heterozygosity (He) or gene diversityvaried from 0.12 to 0.65 with an average of 0.28 ± 0.14 (Supple-mentary Table 2). Further, to study the extent of allelic diversityand potential of the developed SSRs for toxic accessions, the PICvalue was also calculated based on 6 accessions of J. curcas exclud-ing EC643912 (non-toxic accession). The PIC value of polymorphicLib A and Lib B SSRs varied from 0.14 to 0.64 (0.22 ± 0.11) and 0.14to 0.57 (0.56 ± 0.12) respectively (data not shown).

Table 2Polymorphism screening details of SSRs with different accession of J. curcas and mapping populations.

Lib A SSRs Lib B SSRs

Polymorphic Monomorphic Fail Polymorphic Monomorphic Fail

Seven acc. of J. curcas 106 420 48 142 421 70Six acc. J. curcas (excluding EC643912) 55 471 48 74 489 70NBRI-J05 x EC643912a 71 446 57 108 450 75Chhatrapati x J. integerrimab 144 177 253 187 228 218

a Parental lines of intraspecific mapping population.b Parental lines of inter-specific mapping population.


Fig. 1. PIC distribution of polymorphic SSR loci calculated from 7 accessions of J. curcas including non-toxic accessions.

3.2. Similarity search and functional annotations

The similarity search of SSR containing sequences showed max-imum similarity of 50% with Ricinus communis (Euphorbiaceae)(Fig. 2). Significant similarity was also observed with Populus tri-chocarpa (22%), Vitis vinifera (16%) and Arabidopsis spp. (4.5%). GeneOntology analysis (http://arabidopsis.org/tools/bulk/go/index.jsp/)resulted into a total of 1539 GO terms (against 1207 sequences),which were further categorized under biological processes, (638terms, 41%), cellular components (366 terms, 24%) and molecularfunctions (535 terms, 35%) category. Besides, significant numbersof terms for unknown/unclassified annotations (under all the threecategories), maximum numbers of terms were assigned for proteinmetabolism (10%) in biological processes category (Fig. 3). Simi-larly, maximum number of terms was assigned for nucleus (9%)and chloroplast (7%) in cellular components and for protein binding(13%) in molecular functions category.

3.3. Phylogenetic analyses

The genotypic data of all the polymorphic SSRs over 7 acces-sions of J. curcas and 1 accession of J. integerrima was used toevaluate the genetic diversity/interrelationship among the selectedaccessions. The genetic similarity coefficient ranged from 0.11to 0.92 with an average of 0.57 ± 0.31. The phenogram classi-fied all the 7 accessions of J. curcas into one cluster and the J.integerrima remained as an out group (Fig. 4). The two acces-sions i.e. Hansraj and Chhatrapati were found to be very closewith 92% of genetic similarity. Among exotic accessions, theEC643912 (non-toxic accession) showed higher degree of diver-sity and EC685202 (toxic accession) was grouped with NBRI-J05.The genetic diversity among the parental lines of intraspecificmapping population i.e. Chhatrapati × EC643912 was found to be51% while in case of interspecific mapping population it was89%.

Fig. 2. Annotation of genomic SSRs developed from enriched libraries of J. curcas. Each bar indicates the percent sequence similarity with various plant genomes based onBLASTN.


0

5

10

15

20

25

% G

ene

cou

nt

Functional categories

Cellular co mponent Molecular func tionsBiological proc ess

Fig. 3. Gene Ontology (GO) classification of the SSR containing genomic sequences derived from microsatellite enriched libraries of J. curcas. The relative frequencies of GOhits to functional categories of cellular components, biological process, and molecular functions.

4. Discussion

Jatropha curcas (2n = 22), a perennial shrub producing non edibleoil, has emerged as a renewable source of biodiesel production. Itattracts worldwide attention of research communities to study andanalyze its potential for biodiesel production. Various research pro-grams have been initiated on different aspects of crop improvementincluding agronomy, conventional breeding, molecular breeding,genetic engineering etc. As per previous reports, the J. curcas isstill considered as an undomesticated/semi-domesticated plantwhich needs further genetic improvement through intervention

of both conventional as well as molecular breeding approaches.The advent of various molecular tools and further advances inmolecular marker technologies offers fast and targeted geneticimprovement through genomic assisted breeding. Various typesof molecular markers have been widely used to track loci tightlylinked with the different agronomic and disease resistance traits inseveral crop species (Phillips and Vasil, 2001; Jain et al., 2002; Guptaand Varshney, 2000). Among different approaches of developingmolecular markers, the use of microsatellite enriched genomiclibraries has been widely used for SSRs development. In the presentinvestigation SSRs were developed from CA and GA repeat enriched

Fig. 4. Genetic relationship among 7 accessions of J. curcas and 1 of J. integerrima based on UPGMA clustering derived from 248 polymorphic genomic SSRs.


genomic libraries of J. curcas (NBRI J05). The microsatellite enrich-ment level was reported to vary from 11 to 99% in previous studiesin different crop plants (Techen et al., 2010). Here, the enrich-ment efficiency was noticed upto 87.9% and a total of 1207 uniquesequences were recovered from both the libraries at 44.2% recoveryrate. Sun et al. (2008) have also reported almost similar enrich-ment efficiency (89.5%) in J. curcas. However, Sudheer et al. (2009)found lower enrichment efficiency (39.0%). Since, the libraries wereenriched with CA and GA repeats, the frequency of core repeatswas found to be high for dinucleotide motif, though other repeatmotif were also recovered. The maximum number of clones of CAenriched library has AC/GT repeat motif and GA enriched libraryhas AG/CT repeat. The non-targeted AT motifs were also recoveredin significant number and were the major part of compound SSRswith targeted repeat motifs. Similar findings are also reported inother plant genomes like Maize (Taramino and Tingey, 1996), sun-flower (Paniego et al., 2002), and safflower (Hamdan et al., 2011).The AT repeat motifs among DNRs are reported to be most com-mon in genomic sequences of Arabidopsis (Cardle et al., 2000).Recently, Sato et al. (2011) also reported high frequency of ATrepeat motif (71%) among DNR and AAT (60%) among TNR in thegenome sequence of J. curcas. The appearance of AT motif in thepresent investigation might be due to its higher frequency in J.curcas genome.

Approximately 90% amplification of primers from both thelibraries was observed which is comparatively higher in propor-tion than earlier reports in J. curcas by Sudheer et al. (2010, 74%)and Yadav et al. (2011, 78%). However, it was comparable to resultsobtained by Sato et al. (2011, 88%). The rate of polymorphism wasquite low among intraspecific parental lines (Lib A SSRs ∼12%; Lib BSSRs ∼19%) as compared to interspecific parental lines (Lib A SSRs∼25%; Lib B SSRs ∼33%). The cross species transferability of SSRs toJ. integerrima was ∼52%. It was noticed that majority of intraspecificpolymorphism among 7 accessions of J. curcas was due to accessionEC643912 (non-toxic). We observed ∼52% polymorphism reduc-tion when EC643912 accession was excluded from analysis. Thepolymorphism rate among different accessions of J. curcas wasfound to be low (23%) as compared to the previous studies in Jat-ropha (Sudheer et al., 2010; Yadav et al., 2011), however it washigher than that reported by Sun et al. (2008). The 248 polymorphicSSRs could be potential markers for future genetic studies in Jat-ropha as they showed considerable amount of allelic variation (2–5alleles per locus) among limited accession of J. curcas. The PIC valuereveals the informativeness level and accordingly defined into high(PIC > 0.5), moderate (PIC > 0.25 and < 0.5) and low (PIC < 0.25) cat-egories (Botstein et al., 1980). The SSRs developed exhibited lowlevel of informativeness with an average PIC value of 0.24 ± 0.10and 0.28 ± 0.13 of Lib A and Lib B respectively. The majority of theSSRs (71.0%) showed lower PIC value which might be due to eitherlow level of genetic diversity available in the germplasm or lessnumber of accessions studied. This is further supported by the factthat exclusion of EC643912 (non-toxic accession) further reducesthe PIC value. No specific correlation between number of repeatsand number of alleles or PIC value was observed in J. curcas. Similarfindings were also reported in previous studies (Hossain et al., 2000;Ferguson et al., 2004; Gupta et al., 2012b). However, a positive cor-relation was reported in grape (Bowers et al., 1996) and Arachis(Moretzsohn et al., 2005). The low average PIC value and number ofalleles observed suggests narrow genetic diversity available in thegene pool of J. curcas. Therefore, the study suggests collection andevaluation of more number of accessions especially from the centerof origin and also to develop larger number of polymorphic mark-ers in order to facilitate genetic improvement of J. curcas throughmarker assisted breeding programs.

Further, significant putative function could be assigned for 17%(200) of the newly developed SSR containing sequences and 50%

of that showed homology with Ricinus communis, a member ofEuphorbiaceae followed by Populus trichocarpa, Vitis vinifera andArabidopsis spp. The genetic relationship among 7 J. curcas accessionand 1 accession of J. integerrima based on 248 polymorphic markers,depicts a clear distinctness of J. integerrima from J. curcas. Among J.curcas, the accession EC643912 (non-toxic) was found to be highlydiverse, which is also reported in earlier studies (Makkar et al.,1998; Basha et al., 2009). The clustering of the accessions suggeststhat the parental lines selected for developing mapping population(both intra specific and interspecific) was a right approach whichmay be useful in future genetic studies.

5. Conclusions

Here, we reported 1207 genomic SSRs (1089 working SSR) fromCA and GA repeat motif enriched genomic libraries of J. curcas.Polymorphic SSRs for both intraspecific and interspecific mappingpopulations have been identified and characterized for variousstatistical attributes. These newly developed SSRs may facilitateconstruction of high density linkage map, diversity analysis, andQTL/association mapping and may be further utilized in makingstrategies for marker assisted breeding toward developing highseed and oil yielding cultivars.

Acknowledgements

The authors thank the Department of Biotechnology, Ministryof Science, Government of India, for financial support. Authors alsothank Dr. S.A. Ranade, Chief Scientist, CSIR-NBRI for his critical eval-uation of manuscript and useful suggestions.

Appendix A. Supplementary data

Supplementary data associated with this article can be found,in the online version, at http://dx.doi.org/10.1016/j.indcrop.2013.04.034.

References

Achten, W.M.J., Verchot, L., Franken, Y.J., Mathijs, E., Singh, V.P., Aerts, R., Muys, B.,2008. Jatropha biodiesel production and use. Biomass Bioenergy 32, 1063–1084.

Azam, M.M., Waris, A., Nahar, N.M., 2005. Prospects and potential of fatty acidmethyl esters of some non-traditional seed oils for use as biodiesel in India.Biomass Bioenergy 4, 293–302.

Basha, S.D., Francis, G., Makkar, H.P.S., Becker, K., Sujatha, M., 2009. A comparativestudy of biochemical traits and molecular markers for assessment of geneticrelationships between Jatropha curcas L. germplasm from different countries.Plant Sci. 176, 812–823.

Botstein, D., White, R.L., Skolnick, M., Davis, R.W., 1980. Construction of geneticlinkage map in man using restriction fragment length polymorphisms. Am. J.Hum. Genet. 32, 314–331.

Bowers, J.E., Dangl, G.S., Vignani, R., Meredith, C.P., 1996. Isolation and characteri-zation of new polymorphic simple sequence repeat loci in grape (Vitis viniferaL.). Genome 39, 628–633.

Cardle, L., Ramsay, L., Milbourne, D., Macaulay, M., Marshall, D., Waugh, R., 2000.Computational and experimental characterization of physically clustered simplesequence repeats in plants. Genetics 156, 847–854.

Carvalho, C.R., Clarindo, W.R., Prac, M.M., Araujo, F.S., Carels, N., 2008. Genome size,base composition and karyotype of Jatropha curcas L. an important biofuel plant.Plant Sci. 174, 613–617.

Fairless, D., 2007. Biofuel: the little shrub that could – maybe. Nature 449 (7163),652–655.

Falconer, D.S., Mackay, T.F.C., Frankham, R., 1996. Introduction to quantitative genet-ics (4th edn). Trends Genet. 12 (7), 280.

Ferguson, M.E., Burow, M.D., Schultz, S.R., Bramel, P.J., Paterson, A.H., Kresovich, S.,Mitchell, S., 2004. Microsatellite identification and characterization in peanut(A. hypogaea L.). Theor. Appl. Genet. 108, 1064–1070.

Ganesh, R.S., Parthiban, K.T., Senthil Kumar, R., Thiruvengadam, V., Paramathma,V.M., 2008. Genetic diversity among Jatropha species as revealed by RAPD mark-ers. Genet. Resour. Crop Evol. 55, 803–809.

Grativol, C., da Fonseca Lira-Medeiros, C., Hemerly, A.S., Ferreira, P.C., 2010. Highefficiency and reliability of inter-simple sequence repeats (ISSR) markers forevaluation of genetic diversity in Brazilian cultivated Jatropha curcas L. acces-sions. Mol. Biol. Rep. 38, 4245–4256.


Gupta, P.K., Varshney, R.K., 2000. The development and use of microsatellite mark-ers for genetic analysis and plant breeding with emphasis on bread wheat.Euphytica 113, 163–165.

Gupta, P., Idris, A., Mantri, S., Asif, M.H., Yadav, H.K., Roy, J.K., Tuli, R., Mohanty, C.S.,Sawant, S.V., 2012a. Discovery and use of single nucleotide polymorphic (SNP)markers in Jatropha curcas L. Mol. Breed. 30, 1325–1335.

Gupta, S., Kumari, K., Sahu, P.P., Vidapu, S., Prasad, M., 2012b. Sequence-based novelgenomic microsatellite markers for robust genotyping purposes in foxtail millet[Setaria italica (L.) P Beauv]. Plant Cell Rep. 31, 323–337.

Hamdan, Y.A.S., García-Moreno, M.J., Redondo-Nevado, J.R., Velasco, L., Pérez-Vich,B., 2011. Development and characterization of genomic microsatellite markersin safflower (Carthamus tinctorius L.). Plant Breed. 130, 237–241.

Heller, J., PhD dissertation 1996. Physic Nut – Jatropha curcas L. – Promotingthe Conservation and Use of Underutilized and Neglected Crops. Institute ofPlant Genetic and Crop Plant Research, Gatersleben/International Plant GeneticResource Institute, Germany/Rome, Italy.

Hossain, K.G., Kawai, H., Hayashi, M., Hoshi, M., Yamanaka, N., Harada, K., 2000.Characterization and identification of (CT)n microsatellites in soybean using

sheared genomic libraries. DNA Res. 7, 103–110.Jain, S.M., Brar, D.S., Ahloowalia, B.S., 2002. Molecular Techniques in Crop Improve-

ment. Kluwer Academic Publishers, The Netherlands.Kumar, S., Parthiban, K.T., Govinda Rao, M., 2009. Molecular characterization of

Jatropha genetic resources through Inter Simple Sequence Repeats (ISSR) forcomparative analysis of genetic diversity. Mol. Biol. Rep. 36, 1951–1956.

Liu, K., Muse, S.V., 2005. Power marker: integrated analysis environment for geneticmarker data. Bioinformatics 21, 2128–2129.

Makkar, H.P.S., Aderibigbe, A.O., Becker, K., 1998. Comparative evaluation ofnon-toxic and toxic varieties of Jatropha curcas for chemical composition,digestibility, protein degradability and toxic factors. Food Chem. 62, 207–215.

Moretzsohn, M.C., Leoi, L., Proite, K., Guimaraes, P.M., Leal-Bertioli, S.C., Gimenes,M.A., Martins, W.S., Valls, J.F., Grattapaglia, D., Bertioli, D.J., 2005. Amicrosatellite-based, gene-rich linkage map for the AA genome of Arachis(Fabaceae). Theor. Appl. Genet. 111, 1060–1071.

Paniego, N., Echaide, M., Munoz, M., Fernandez, L., Torales, S., Faccio, P., Fuxan, I.,Crrera, M., Zandomeni, R., Syarez, E.Y., Hopp, E., 2002. Microsatellite isolationand characterization in sunflower (Helianthus annuus L.). Genome 45, 34–43.

Phillips, R.L., Vasil, I.K., 2001. DNA-based markers in plants. In: Phillips, R.L., Vasil, I.K.(Eds.), DNA-based Markers in Plants. Kluwer Academic Publishers, Dordrecht,The Netherlands, p. 497.

Rafii, M.Y., Shabanimofrad, M., Edaroyati, M.W.P., Latif, M.A., 2012. Analysis of thegenetic diversity of physic nut, Jatropha curcas L. accessions using RAPD markers.Mol. Biol. Rep. 39, 6505–6511.

Ranade, S.A., Srivastava, A.P., Rana, T.S., Srivastava, J., Tuli, R., 2008. Easy assessmentof diversity in Jatropha curcas L. plants using two single-primer amplificationreaction (SPAR) methods. Biomass Bioenergy 32, 533–540.

Rao, G.R., Korwar, G.R., Shanker, A.K., Ramakrishna, Y.S., 2008. Genetic asso-ciations,variability and diversity in seed characters, growth, reproductivephenology and yield in Jatropha curcas (L.) accessions. Trees 22, 697–709.

Rohlf, F.J., 2000. NTSYS-pc Numerical Taxonomy and Multivariate Analysis System,Version 2.1. Exeter Publ, New York.

Sato, S., Hirakawa, H., Isobe, S., Fukai, E., Watanabe, A., Kato, M., Kawashima, K.,Minami, C., Muraki, A., Nakazaki, N., et al., 2011. Sequence analysis of the genomeof an oil-bearing tree, Jatropha curcas L. DNA Res. 18, 65–76.

Shen, J., Jia, X., Ni, H., Sun, P., Niu, S., Chen, X., 2010. AFLP analysis of genetic diversityof Jatropha curcas grown in Hainan, China. Trees 24, 455–462.

Sudheer, D.V.N.P., Sinha, R., Kothari, P., Reddy, M.P., 2009. Isolation of novelmicrosatellites from Jatropha curcas L. and their cross-species amplification. Mol.Ecol. Resour. 9, 431–433.

Sudheer, P.D.V.N., Rahman, H., Mastan, S.G., Reddy, M.P., 2010. Isolation of novelmicrosatellites using FIASCO by dual probe enrichment from Jatropha curcas L.and study on genetic equilibrium and diversity of Indian population revealed byisolated microsatellites. Mol. Biol. Rep. 37, 3785–3793.

Sun, F., Liu, P., Ye, J., Lo, L., Cao, S., Li, L., Yue, G., Wang, C., 2012. An approach forJatropha improvement using pleiotropc QTLs regulating plant growth and seedyield. Biotechnol. Biofuels 5, 12.

Sun, Q.B., Li, L., Li, Y., Wu, G.J., Ge, X.J., 2008. SSR and AFLP markers reveal lowgenetic diversity in the biofuel plant Jatropha curcas in China. Crop Sci. 48, 1865–1871.

Taramino, G., Tingey, S.V., 1996. Simple sequence repeats for germplasm analysisand mapping in maize. Genome 39, 277–287.

Tatikonda, L., Wani, S.P., Kannan, S., Beerelli, N., Sreedevi, T.K., Hoisington, D.A.,Devi, P., Varshney, R.K., 2009. AFLP-based molecular characterization of anelite germplasm collection of Jatropha curcas L., a biofuel plant. Plant Sci. 176,505–513.

Techen, N., Arias, R., Glynn, N.C., Pan, Z., Khan, I.A., Sceffler, B.E., 2010. Opti-mized construction of microsatellite-enriched libraries. Mol. Ecol. Resour. 10,508–515.

Wang, C.M., Liu, P., Yi, C., Gu, K., Sun, F., Li, L., Lo, L.C., Liu, X., Feng, F., Lin, G., Cao, S.,Hong, Y., Yin, Z., Yue, G.H., 2011. A first generation microsatellite and SNP basedlinkage map of Jatropha. PLoS ONE 6, e23632.

Yadav, H.K., Ranjan, A., Asif, M.H., Mantri, S., Sawant, S.V., Tuli, R., 2011. EST-derivedSSR markers in Jatropha curcas L.: development, characterization, polymor-phism, and transferability across the species/genera. Tree Genet. Genom. 7,207–219.

___________________________

Corresponding author: Hemant Kumar Yadav, CSIR-National Botanical Research Institute,

Lucknow-226001, India, Tel.: +91-522-2297982, Fax: +91-522-2205836, E-mail:

[email protected]

UDC 575 DOI: 10.2298/GENSR1303655M

Original scientific paper

GENETIC VARIABILITY AND DIVERGENCE ANALYSES IN Jatropha curcas

BASED ON FLORAL AND YIELD TRAITS

Ramanuj MAURYA1, Saurabh VERMA

1, Astha GUPTA

1, Bajrang SINGH

1

and Hemant KUMAR YADAV1,2

*

1CSIR-National Botanical Research Institute, Lucknow-226001, India.

2Academy of Scientific and Innovative Research (AcSIR), New Delhi, India

Maurya R., S. Verma, A. Gupta, B.Singh, and H. Kumar Yadav

(2013): Genetic variability and divergence analyses in Jatropha curcas based

on floral and yield traits -. Genetika, Vol 45, No. 3, 655-666.

Genetic variability of 80 accessions of Jatropha curcas showed that

oil content varied between 20.8-36.1% (X=26.2±0.38). Thirty seven accessions

showed seed weight/plant above average mean value (180.2g) and 26

accessions showed oil content above average mean (26.2%). The hierarchical

clustering grouped all the accessions into 4 clusters. Clustering showed that

majority of accessions i.e. 56 (70%) were genetically close to each other and

grouped in two clusters. The maximum intra cluster distance was recorded in

cluster IV (30.15). The inter cluster distance varied from 47.59 (between

cluster I and cluster II) to 211.27 (between cluster III and cluster I). The cluster

III showed maximum genetic distance with cluster I, followed by cluster IV

and cluster II suggesting comparatively wider genetic diversity among them.

The Principal Component Analysis (PCA) showed that first four principal

components (PCs) accounted for more than 93% of the total variation. The first

principal components accounted for 42.5% of the total variation mainly due to

seed length, seed width, seed weight/plant and number of seeds/plant which

had maximum and positive weight on this component. Oil content had negative

weight on PC1. Thus, PC1 related to the accessions with thick seeds, moderate

to high seed yielder with low oil content.

Key words: Divergence, Genetic variability, Genetic advance,

Heritability, Jatropha curcas

INTRODUCTION

Jatropha curcus, also known as physic nut or purging nut, is a perennial shrub

belonging to family Euphorbiaceae. It is a diploid (2n=22) with relatively smaller genome size of

656 GENETIKA, Vol. 45, No.3,655-666, 2013

416 Mb (CARVALHO et al., 2008) and contains about 175 species worldwide. It is native to

tropical America but widely distributed in other tropical and subtropical areas of the world,

especially in Africa, India, and Southeast Asia (OPENSHAW 2000, SUJATHA and PRABHAKARAN,

1997). The Portuguese settlers are believed to have introduced Jatropha to India during the

sixteenth century (GINWAL et al., 2005). Traditionally, the plant is used as fence, to control

erosion and reclaim land, and as animal feed and manufacturing of lubricants, soaps, candles,

purgative agents, astringents, and coloring dyes (OPENSHAW, 2000). Recently, it gains popularity

worldwide as an alternative and renewable source of biodiesel production. The plant can rapidly

grow in a wide range of agro-climatic conditions and is not grazed by animals (SUBRAMANIAN et

al., 2005). Its hardy nature and high oil content make it a promising oil crop for biodiesel

(HENNING, 1998). However, J. curcas is still an undomesticated plant and its response to yield

and oil content found to be erratic with different agro-climatic zones. Therefore, there is a

serious need to take initiative for its genetic improvement for adaptability, agronomically

desirable traits, yield and oil content. The assessment of the level and pattern of genetic

relationship among germplasm accessions is an important component of genetic improvement

program. The informations obtained could be utilized for i) analysis of genetic variability ii)

identification of diverse parental combinations to create genetic variability for further selection

(BARNETT and KIDWELL, 1998) and iii) introgression of desirable genes from diverse germplasm

into the available genetic base (THOMSON et al., 1998). Several types of data sets (morphological,

biochemical, molecular markers) and tools have been used for studying genetic variability and

relationship among accessions. Currently DNA based molecular markers are being widely used

for genetic analysis. In past several genetic diversity studies have also been reported in J. curcas

using different types of molecular markers like RAPD (GANESH et al., 2008), KUAMR et al.,

2009; SUBRAMANYAM et al., 2009), AFLP (QUINTERO et al., 2011; SUN et al., 2008), ISSR

(SENTHIL et al., 2009; TANYA et al., 2011) and SSR (WEN et al., 2010). However, the

morphological characterization is the first and important step in the description and classification

of germplasm. Few preliminary studies based on quantitative genetic variations were reported in

J. curcas. GINWAL et al. (2005) reported some preliminary quantitative genetic variations in seed

morphology, germination and seedling growth among ten accessions collected mainly from

Madhya Pradesh, India. KAUSHIK et al. (2007) evaluated 24 accessions for seed oil content

variations and divergence. Genetic association, variability and diversity in seed traits, growth,

reproductive and yield traits were reported by RAO et al. (2008) among 32 wild accessions of J.

curcas collected from Andhra Pradesh, India. Most of the morphometric trait based genetic

studies carried out earlier were restricted to smaller number of accessions collected from limited

areas. However, to explore and exploit the available genetic resources, an extensive survey,

collection and evaluation required to find out potential genetic material for genetic improvement

of J. curcas. Genetic studies based on the multivariate analysis is a powerful tool for determining

the degree of divergence between populations, the relative contribution of different components

to the total divergence and the nature of forces operating at different levels. Thus, the present

investigation was undertaken with 80 accessions of J. curcas collected from different states of

the India to evaluate genetic variability, assess genetic divergence and identify diverse

accessions/groups to facilitate the future breeding strategies and genetic improvement of J.

curcas.

R. MAURYA et al: VARIABILITY AND DIVERSITY IN Jatropha 657

MATERIALS AND METHODS

A total of eighty accessions of J. curcas representing different eco-geographical and

agro-climatic zone of India (Fig.1) were selected from germplasm bank maintained at Banthra

Research Center (BRC) of National Botanical Research Institute, Lucknow India. Fifteen

cuttings of each accession were raised in polybags filled with soil, cowdung manure and sand in

equal proportion during March 2008. The six rooted cuttings of each accession were then

transplanted in experimental plot in Randomized Block Design (RBD) with 3 replications and

two plants/replication during July 2008. The experimental plot is situated between 260

40’N

latitude and 800

45’ E longitude and at an altitude of 129 m above sea level. The distances

between rows and plants were kept 2 meter. The field was irrigated as and when required.

Pruning of plants was practiced 2 feet above the ground in the first week of March 2009 and

2010. Data on different morphometric traits were recorded during November 2010- January

2011. Following traits were considered for data recording: Female flower/plant: Number of

female flower counted during flowering period (November – January), male flower/plant:

Number of male flower counted per plant, male/female ratio: ratio between female and male

flower per plant, Number of fruits/plant: total number of fruits counted per plant at harvesting

time, Number of seeds/plant: Total number of seeds counted per plant, Fruit weight/plant: total

fruit weight measured in gram per plant, Seed weight/plant: total seed weight measured in gram

per plant, Seed length and width: twenty seeds per plant randomly selected and length and width

measured in middle with vernier caliper (mm), Oil content: twenty five to thirty seeds randomly

selected per plant and used to measure oil content in percent through Nuclear Magnetic

Resonance (NMR) spectrometer.

Figure1. Map of India showing collection site of Jatropha curcas from different states.

658 GENETIKA, Vol. 45, No.3,655-666, 2013

The mean values for each trait were used for statistical analysis and subjected to

analysis of variance and covariance using WINDOSTAT software (www.windostat.org).

Variance components were estimated from mean square of ANOVA (SINGH and CHAUDHARY,

1985). Heritability in broad sense (hB) was estimated on genotypic mean basis as described by

Allard (1999). Expected genetic advance (%) of mean was estimated according to JOHNSON et al.

(1955). Phenotypic correlation (rp) was calculated using analysis of variance and covariance

values as suggested by JOHNSON et al. (1955). For divergence studies the variability among

population was tested by Wilk’s lambda criterion for pooled effect of all the characters.

Hierarchical clustering was carried to find out the pattern of similarity/dissimilarity among

accessions using ward’s minimum variance method (WARD, 1963). The relationships among the

clusters were assessed by estimating the intercluster distances using Mahalanobis distance (D2)

statistics (RAO, 1952).

RESULTS AND DISCUSSION

The range and mean value of different traits along with various statistical parameters is

presented in Table 1. The number of female flowers/plant varied from 72.6 to 118.0 with an

average of 94.6±1.39 and the number of male flowers/plant varied from 16.27.2 to 2960.0 with

an average of 2240.8±42.92. Male/female flower ratio was found variable between 17.2 - 32.1

with an average of 24.0±0.37. The number of fruits/plant and number of seeds/plant varied

between 63.4 -112.9 and 169.6-297.0 with an average of 80.5±1.31 and 218.3±3.89 respectively.

The range of seed weight/plant (g), seed length (mm) and seed width (mm) were 102.7 - 273.8,

13.3 - 18.5 and 7.8 - 11.8 with an average of 180.2±4.20, 16.4±0.13 and 10.7±0.09 respectively.

The oil content varied between 20.8 - 36.1% with an arithmetic mean of 26.2±0.38. Out of 80

accessions, 37 accessions have seed weight/plant above average value (i.e.180.2g) and of which

only 4 accessions have seed weight/plant above 250g with maximum in accession NBJC1078

(273.08g). Likewise, 26 accessions have oil content above average value of 26.2% and only

three accessions namely NBJC1055, NBJC1051 and NBJC1048 having oil content above 35%.

So, majority of the accessions in the present investigation were found to be low oil yielder.

However, FOIDL et al. (1996) and BERCHMANS and HIRATA, (2008) reported oil content upto

40%. In order to assess the heritable portion of total variability, the phenotypic variance (δ2p)

was partitioned into genotypic (δ2g) and error variance (δ2

e). The values of error variance were

found to be higher than those of genotypic variance (δ2g) for number of female flowers/plant,

male/female ratio, number of fruits/plant, number of seeds/plant and seed weight/plant

suggesting much influence of environmental factors on these traits. The phenotypic coefficient of

variation (PCV) and genotypic coefficient of variation (GCV) varied from 7.51 to 25.48 and 7.05

to 17.87% respectively. Maximum PCV and GCV were noticed for seed weight/plant followed

by fruit weight/plant, number of male flower/plant, number of seeds/plant. The PCV was found

to higher than that of GCV for all the traits with remarkable differences in their values. The traits

seed length, seed width and oil content has very small differences in PCV and GCV values.

Similarly, KAUSHIK et al. (2007) and RAO et al. (2008) also noticed higher PCV over GCV with

small differences for seed length, seed width and oil content. The genetic improvement in the

traits with small differences in PCV and GCV values can easily be achieved by selection of

promising plant types and also through crossing the desirable accessions among themselves

followed by selection in segregating generations.


Table 1. Range, mean, estimates of variance components, broad sense heritability and genetic advance in Jatropha

curcas

The estimate of genetic variability alone is considered as not much helpful in

determining the heritable portion of variation unless until coupled with the estimate of

heritability. The estimate of heritability along with variability can provide more insight towards

the amount of genetic advance to be expected from the selection process. Thus, the knowledge

of heritability of a character become important as it indicates the possibility and extent to which

improvement is possible through selection. Broad sense heritability varied from 20% to 90% and

maximum was observed for oil content (90%) followed by seed length (88%) and seed width

(87%). The lowest heritability (20%) was noticed for male/female ratio and other traits have

moderate heritability ranging from 38% to 54%. The high heritability noticed for oil content,

seed length and seed width indicates that these characters are under genotypic control. However,

heritability estimates may differ widely in the same crop and same trait (RASMUSON, 2002)

because heritability always refers to a defined population and a specific experimental set up

(HOLLAND et al., 2002). Considering high heritability alone is not enough in making efficient

selection in advance generation unless accompanied by substantial amount of genetic advance

(GA), which provides the information about the degree of gain in a character obtained under a

particular selection pressure (JOHNSON et al., 1955). Expected genetic advance, as a function of

selection intensity, phenotypic variance and heritability, has an added advantage over heritability

as a guiding factor to breeders in a selection program. The genetic advance as percent of mean

varied from 8.45 for male/female ratio to 25.81 for seed weight/plant. The higher genetic

advance for seed weight/plant, oil content, and number of fruits/plant was might be due to

presence of variation in these traits. High heritability coupled with high GA and GCV for oil

content suggests that this trait was primarily controlled by additive gene action and any simple

selection model would be advantageous to obtain the desired genetic gain. Low heritability with

high GA for seed weight/plant, fruit weight/plant and number of seeds/plant and high heritability

with low GA for seed length and seed width indicates that these traits might be largely governed

by non–additive gene actions and hence much improvement cannot be achieved through

Min Max Mean ±SD F value σ2g σ

2p σ2e GCV PCV Hb GA GA%

Female

flower

/plant

72.6

118.0

94.6±1.39

2.82**

99.55

263.10

163.5

10.54

17.13

38.0

12.64

13.35

Male flower

/plant

1627.2

2960.0

2240.8±42.92

4.06**

109448.18

216787.51

107339.3

14.76

20.78

50.0

484.24

21.61

male/female

ratio

17.2

32.1

24±0.37

1.76**

4.79

23.69

18.89

9.12

20.27

20.0

2.03

8.45

No. of fruits

/plant

63.4

112.9

80.5±1.31

2.86**

88.95

232.08

143.13

11.72

18.93

38.0

12.03

14.95

No. of seeds

/plant

169.6

297.0

218.3±3.89

3.43**

850.42

1898.14

1047.72

13.35

19.94

45.0

40.21

18.40

Fruit weight

/plant

301.1

575.2

397±8.18

4.5**

4128.07

7665.63

3537.56

16.15

22.01

54.0

97.13

24.42

Seed weight

/plant (g)

102.7

273.8

180.2±4.20

3.90**

1040.43

2116.22

1075.79

17.87

25.48

49.0

46.59

25.81

Seed length

(mm)

13.3

18.5

16.4±0.13

23.32**

1.34

1.52

0.18

7.05

7.51

88.0

2.24

13.64

seed width

(mm)

7.8

11.8

10.7±0.09

20.69**

0.61

0.70

0.09

7.28

7.82

87.0

1.50

13.97

Oil content

(%)

20.8

36.1

26.2±0.38

27.10**

10.84

12.09

1.25

12.60

13.30

90.0

6.42

24.58

660 GENETIKA, Vol. 45, No.3,655-666, 2013

selection. Similar to the present findings, high heritability and low GA for seed length and seed

width was also reported by KAUSHIK et al. (2007) and RAO et al. (2008).

Table 2. Distribution of 80 accessions of Jatropha curcas in 4 clusters based on their 10 quantitative traits

Further, the potential accessions could be identified by analyzing genetic diversity in the

genetic resources collected/available, which will further facilitate various genetic improvement

programs. The simultaneous testing of significance based on Wilk’s lambda criterion for pooled

effect of all the characters showed significant differences among the population (χ2 = 790 df =

3079.06**). A hierarchical cluster analysis (Wards minimum variance) grouped all the 80

accessions into 4 clusters (Table 2, Fig. 2). The number of accessions per clusters varied from 6

(cluster III) to 29 (cluster II). The cluster II was largest with 29 accessions collected from 11

states India i.e. Andhra Pradesh (6 accessions), Rajasthan (5), Jharkhand (4), Uttar Pradesh (2),

Chhattisgarh (2), Bihar (2), Punjab (2), Gujarat (1), Haryana (1), Kerala (1) and Tamil Nadu (1).

The cluster I is second largest comprising 27 accessions including 5 accessions collected from

Uttar Pradesh, 4 from Gujarat, 3 from West Bengal, 2 each from Rajasthan, Tamil Nadu,

Himachal Pradesh, Bihar, Chhattisgarh, Haryana, and 1 each from Punjab, Madhya Pradesh,

Uttaranchal, Jharkhand and Kerala. The cluster III was the smallest with 6 accessions collected

from Uttaranchal (4), Jharkhand (1) and Rajasthan (1). The cluster IV had 18 accessions

collected from Chhattisgarh (4), Uttar Pradesh (4), Tamil Nadu (3), Himachal Pradesh (2),

Haryana (2), Orissa (1), Andhra Pradesh (1), and West Bengal (1). The clustering of accessions

based on multivariate analysis showed that majority of accessions i.e. 56 accessions (70%) were

genetically close to each other and grouped only into two clusters. The distribution of accessions

from same origin/geo-graphical region into different clusters or vice versa indicated that the

geographical origin is not related to genetic divergence.

Cluster Number of

accessions

Accessions name

Cluster I 27 NBJC1001, NBJC1034, NBJC1044, NBJC1005, NBJC1039, NBJC1054, NBJC1004,

NBJC1045,NBJC1060, NBJC1121, NBJC1007, NBJC1031,NBJC1137, NBJC1107,

NBJC1127,NBJC1023, NBJC1097, NBJC1008, NBJC1033,NBJC1129, NBJC1064,

NBJC1022, NBJC1058,NBJC1124,NBJC1078, NBJC1122, NBJC1130.

Cluster II 29 NBJC1006, NBJC1093, NBJC1083, NBJC1057, NBJC1133, NBJC1082, NBJC1087,

NBJC1135, NBJC1003, NBJC1035, NBJC1112, NBJC1036, NBJC1050, NBJC1020,


NBJC1073, NBJC1052, NBJC1071, NBJC1084, NBJC1101, NBJC1138,

NBJC1094,NBJC1072.

Cluster III 6 NBJC1009, NBJC1014, NBJC1013, NBJC1019, NBJC1017, NBJC1025.

Cluster IV 18 NBJC1048, NBJC1051, NBJC1055, NBJC1049, NBJC1081, NBJC1089, NBJC1092,


NBJC1076, NBJC1070, NBJC1090, NBJC1077.


Figure 2. Dendrogram of 80 Jatropha curcas accessions derived from the Wards minimum variance cluster

analysis using Mahalanobis distances

662 GENETIKA, Vol. 45, No.3,655-666, 2013

Similar findings were also reported earlier by RAO et al. (2008) and SUDHEER, et al.

(2010) based on morphological traits and molecular marker studies respectively. The maximum

intra cluster distance was noticed in cluster IV (30.15) followed by cluster I (28.61) and cluster II

(25.89). The inter cluster distance varied from 47.59 (between cluster I and cluster II) to 211.27

(between cluster III and cluster I). Based on cluster distance, the cluster III showed maximum

genetic distance with cluster I, followed by cluster IV and cluster II suggesting comparatively

wider genetic diversity among them. The accessions from these clusters could be utilized in

hybridization program to get desirable transgressive segregants in their offspring, as there is a

higher probability that unrelated genotypes would contribute unique desirable alleles at different

loci. Considering the cluster means, the cluster I showed highest mean value for all the traits

except oil content. On contrary, the cluster III had lowest mean value for number of male

flower/plant, male/female ratio, fruit weight/plant, seed weight/plant, seed length and seed width.

The cluster I and cluster III seems to be unique with having highest and lowest cluster mean

value for most of the traits respectively and also had highest inter cluster distance among them.

The crossing among the accessions of these two clusters may yield hybrids with desirable traits.

Table 3. Intra- (diagonal bold) and inter-cluster Mahalanobis distances for 80 accessions in Jatropha

curcas Cluster I Cluster II Cluster III Cluster IV

Cluster I 28.61 47.59 211.27 91.62

Cluster II 25.89 129.47 66.51

Cluster III 24.77 153.24

Cluster IV 30.15

Table 4a. Cluster means and standard errors of the means of different traits in Jatropha curcas L.

Female

flower/plant

Male

flower/plant

male/female

ratio

No. of

fruits/plant

Fruit

weight/plant

(g)

Cluster I 103.12±2.18 2471.53±68.74 24.30±0.70 89.31±2.37 432.88±14.52

Cluster II 91.71±1.82 2200.53±67.26 24.06±0.65 77.77±1.75 390.95±14.18

Cluster III 91.44±4.54 2018.00±1114.06 22.29±1.07 75.50±3.55 347.50±11.85

Cluster IV 85.91±2.26 2033.94±66.70 24.04±0.68 73.20±1.42 372.94±12.19


Table 4b. Cluster means and standard errors of the means of different traits in Jatropha curcas L.

Table 5. Loadings of the first four principal components of genetic divergence in 80 accessions of Jatropha

curcas

In order to assess the patterns of variation, principal component analysis (PCA) was done

by considering all the ten variables simultaneously. The first four principal components (PCs)

accounted for more than 93% of the total variation (Table 5). PCA is a multivariate technique

that allows to find the major patterns within a multivariate data set. Associations between traits

emphasized by this method may correspond to genetic linkage between loci controlling traits or a

pleiotropic effect. The first principal components accounted for 42.5% of the total variation due

to seed length, seed width, seed weight/plant and number of seeds/plant which had maximum

and positive weight on this component. Oil content had negative weight on PC1. Thus, PC1

related to the accessions with thick seeds, moderate to high seed yielder with low oil content.

The PC2 concentrated 32% of total variation and was positively associated with seed

No. of seeds

/plant

Seed

weight/plant (g)

Seed length

(mm)

Seed width

(mm)

Oil content

(%)

Cluster I 245.39±7.10 198.61±7.75 17.58±0.10 11.12±0.08 24.97±0.36

Cluster II 209.29±4.90 179.32±6.24 16.17±0.14 10.88±0.06 24.34±0.27

Cluster III 205.28±9.10 151.44±6.98 13.92±0.24 8.29±0.02 25.56±1.02

Cluster IV 196.41±4.11 163.63±7.37 15.98±0.16 10.72±0.07 31.04±0.69

Characters PC1 PC2 PC3 PC4

Female flower/plant 0.40 0.42 0.56 0.09

Male flower/plant 0.37 0.32 0.25 0.21

male/female ratio 0.01 -0.05 -0.00 0.07

No. of Fruits/plant 0.37 0.21 0.48 0.19

No. of seeds /plant 0.51 0.55 0.89 0.55

Fruit weight/plant (g) 0.20 0.14 0.29 0.06

Seed weight/plant (g) 0.61 0.52 0.88 0.59

Seed length (mm) 3.07 -0.18 0.26 -0.87

seed width (mm) 1.95 -0.74 -1.10 0.95

Oil content (%) -0.32 -3.05 0.72 0.07

Components

Root 14.49 10.86 4.09 2.44

% Variance explained 42.54 31.87 12.02 7.18

Cum. variance Explained 42.54 74.41 86.43 93.62

664 GENETIKA, Vol. 45, No.3,655-666, 2013

weight/plant, number of seeds/plant and female flower/plant. The oil content had highest

negative weight on PC2 also followed by seed length, seed width and male female ratio. The

third PC accounted for 12% variation was mainly due to seed weight/plant, number of

seeds/plant, number of fruits/plants and seed width had negative weight. The seed weight/plant

invariably had almost equal and positive weight on all four components.

CONCLUSION

In conclusion, the phenotypic evaluation and characterization of wide range of J. curcas

accessions showed that the most of the traits have low variability as revealed by various

statistical parameters. Though, some sort of variations was noticed and cluster analysis indicated

that the accessions from the clusters I and II have some potential towards the development of

high oil yielding accessions of J. curcas. A planned hybridization programme based on inter

crossing of promising accessions of different cluster may facilitate to accumulate favorable

genes in hybrids.

ACKNOWLEDGEMENT

The authors thank the Director, NBRI for providing the necessary facilities during the

investigation

Received August 03th, 2013

Accepted October 05th, 2013

REFERENCES

ALLARD, R.W. (1999): Principles of Plant Breeding, 2nd edn. New York, Wiley.

BARNETT, B.A. and K.K. KIDWELL (1998): AFLP-based genetic diversity assessment among wheat cultivars from the

Pacific Northwest. Crop Sci., 38,1261–1271.

BERCHMANS, H.J. and S. HIRATA (2008): Biodiesel production from crude Jatropha curcas L. seed oil with a high content

of free fatty acids. Bioresour. Tech., 99, 1716-1721.

CARVALHO, C.R., W.R. CLARINDO, M.M. PRAC, F.S. ARAUJO and N. CARELS (2008): Genome size, base composition and

karyotype of Jatropha curcas L.: an important biofuel plant. Plant Sci., 174, 613–617.

FOIDL, N., G. FOIDL, M. SANCHEZ, M. MITELBACH and S. HECKEL (1996): Jatropha curcas L. as a source for the production

of biofuel in Nicaragua. Bioresour.Tech., 58,77-82.

GANESH, R.S., K.T. PARTHIBAN, R. SENTHIL KUMAR, V. THIRUVENGADAM and V.M. PARAMATHMA (2008): Genetic

diversity among Jatropha species as revealed by RAPD markers. Genetic Resour. Crop Evol., 55,803–809.

GINWAL, H.S., S.S. PHARTYAL, P.S. RAWAT and R.L. SRIVASTAVA (2005): Seed source variation in morphology,

germination and seedling growth of Jatropha curcas L. in central India. Silvae Genetica, 54,76–80.

HENNING, R (1998): Use of Jatropha curcas – household perspective and its contribution to rural employment creation.

In: Proceedings of the regional workshop on the ‘‘potential of Jatropha curcas in rural development and

environmental protection’’ May 13–15, Harare, Zimbabwe

HOLLAND, J.B., W.E. NYQUIST and C.T. CERVANTES-MARTINEZ (2002): Estimating and interpreting heritability for

plant breeding: An update. Plant Breed. Review, 22, 9-112.

KAUSHIK, N., K. KUMAR, S. KUMAR, N. KAUSHIK and S. ROY (2007) Genetic variability and divergence studies in seed

traits and oil content of Jatropha (Jatropha curcas L.) accessions. Biomass Bioenergy, 31,497–502.

KUMAR, R.V., Y.K. TRIPATHI, S. SHUKLA, S.P. AHLAWAT and V.K. GUPTA (2009): Genetic diversity and relationships

among germplasm of Jatropha curcas L. revealed by RAPD. Trees, 23,1075-1079.


OPENSHAW, K (2000): A review of Jatropha curcas: an oil plant of unfulfilled promise. Biomass Bioenergy, 19,1–15.

QUINTERO, V.P., J.L.A. LOPEZ, A.Z. COLMENERO, N.M. GARCIA, C.A.N. COLIN, J.L.S. BONILLA, M.R.A. RANGEL, H.R.G.

LANGARICA and D.J.M. BUSTAMANTE (2011): Molecular characterization of Jatropha curcas L. genetic resources

from Chiapas, Mexico through AFLP markers. Biomass Bioenergy, 35, 1897-1905.

RAO, G.R., G.R. KORWAR, A.K. SHANKER and Y.S. RAMAKRISHNA (2008): Genetic associations,variability and diversity in

seed characters, growth, reproductive phenology and yield in Jatropha curcas (L.) accessions. Trees, 22, 697–

709.

RASMUSON, M. (2002) The genotype-phenotype link. Heriditas, 136, 1-6.

SENTHIL KUMAR, R., K.T. PARTHIBAN and M. GOVINDA RAO (2009): Molecular characterization of Jatropha genetic

resources through inter-simple sequence repeat (ISSR) markers. Mol. Biol. Rep. 36, 1951-1956.

SUBRAMANIAN, K.A., S.K. SINGAL, M. SAXENA and S. SINGHAL (2005): Utilization of liquid biofuels in automotive diesel

engines: an Indian perspective. Biomass Bioenergy, 29,65–72.

SUBRAMANYAM, K., D. MURALIDHARARAO and N. DEVANNA (2009): Genetic diversity assessment of wild and cultivated

varieties of Jatropha curcas (L.) in India by RAPD analysis. African J. Biotech. 8,1900-1910.

SUDHEER. D.V.N.P., S.G. MASTAN, H. RAHMAN and M.P. REDDY (2010): Molecular characterization and genetic diversity

analysis of Jatropha curcas L. in India using RAPD and AFLP analysis. Mol.Biol. Rep. 37, 2249–57.

SUJATHA, M. and A.J. PRABAKARAN (1997): Characterization and utilization of Indian Jatropha curcas. Indian J. Plant

Genet. Resour., 10,123–28.

SUN, Q.B., L.F. LI, Y. LI, G.J. WU, and X.J. GE (2008): SSR and AFLP markers reveal low genetic diversity in the biofuel

plant Jatropha curcas in China. Crop Sci.,48,1865-71.

TANYA, P., P. TAEPRAYOON, Y. HADKAM and P. SRINIVES (2011): Genetic diversity among Jatropha and Jatropha-related

species based on ISSR markers. Plant Mol. Biol. Repor. 29,252-264.

THOMPSON, J.A., R.L. NELSON and L.O. VODKIN (1998): Identification of diverse soybean germplasm using RAPD

markers. Crop Sci., 38,1348–1355.

WEN, M., H. WANG, Z. XIA, M. ZOU, C. LU and W. WANG (2010): Development of EST-SSR and genomic-SSR markers to

assess genetic diversity in Jatropha Curcas L. BMC Res. Notes, 3, 42.

666 GENETIKA, Vol. 45, No.3,655-666, 2013

ANALIZA GENETIČKE VARIJABILNOSTI I DIVERGNTNOSTI Jatropha curcas

NA OSNOVU OSOBINA CVETA I PRINOSA

Ramanuj MAURYA

1, Saurabh VERMA

1, Astha GUPTA

1, Bajrang SINGH

1

i Hemant Kumar YADAV1,2

*

1CSIR-Nacionalni botanički istraživački Institute, Lucknow-226001, India.

2 Akademija naučnih i inovativnih istraživanja (AcSIR), New Delhi, India

Izvod Utvrđena je genetička varijabilnost sadržaja ulja u 80 genotipova Jatropha curcas u rasponu u

rasponu od 20.8-36.1% (X=26.2±0.38) a prosečan sadržaj je 26.2%. Hijerarhijskom analizom

grupisanja ispitivani genotipovi su se grupisali u četiri grupe (klastera). Utvrđenaa je različita

distanca unutar i između klastera. Analiza osnovnih komponenata variranja (PCA) je pokazala da

prve četiri komponente (PCs) učestvuju sa više od 93 % ukupne varijabilnosti.

Primljeno 03. VIII.2013.

Odobreno 05. X. 2013.

development and application of microsatellite markers for diversity analysis...

Documents