organization of microsatellites differs between mammals and cold-water teleost fishes
TRANSCRIPT
Organization of Microsatellites Differs between Mammals
and Cold-water Teleost Fishes
Amanda L. Brooker, Doug Cook, Paul Bentzen,1 Jonathan M. Wright,2 and Roger W. Doyle
Marine Gene Probe Laboratory, Department of Biology, Dalhousie University, Halifax, NS B3H 4J1, Canada
Brooker, A.L., D. Cook, P. Bentzen, J.M. Wright, and R.W. Doyle. 1994. Organization of microsatellites differsbetween mammals and cold-water teleost fishes. Can. J. Fish. Aquat. Sci. 51:1959-1966.
Microsatellites, in particular (dG-dT)n and (dG-dA)n dinucleotide repeats, are abundant and display a high
degree of length polymorphism and heterozygosity in eukaryotic genomes. Here, we report the cloning andcharacterization of 64 microsatellite sequences from Atlantic cod, Gadus morhua. The microsatelliteswere classified as perfect, imperfect, and compound repeats. The length and integrity of these repeatswere compared with microsatellites characterized from two other teleosts, rainbow trout (Oncorhynchusmykiss) and Atlantic salmon (Salmo salar), and from three mammalian genomes, human, porcine, andcanine. Differences were found in the proportions of the repeat classes; however, the most significantdifference between microsatellites from teleost fishes and mammals was the propensity of the former to beof greater length: some cod and rainbow trout microsatellites were more than twice the size of the longestmicrosatellite repeats reported for any mammalian genome. Primers for PCR amplification were constructed for seven of the cod microsatellites. Allele frequencies, degree of polymorphism, and heterozygositywere estimated for a sample population. Amplification with these cod primers was also carried out ona number of related gadids. These polymorphic microsatellite loci have enormous potential utility asgenetic markers for use in population, breeding, and evolutionary studies.
Les chaines d'ADN microsatellite, et plus particulierement les sequences r£petitives des dinucleotides(dG-dT)net (dG-dA),,, sont abondantes et leurs genomes eucaryotes presentent un taux eleve de poly-morphisme et d'heterozygosite". Dans ce rapport, nous pre*sentons les n§sultats du clonage et de la carac-terisation de 60 sequences d'ADN microsatellite de la morue franche, Gadus morhua. Les ADN microsatellites etaient classees en trois categories : parfaites, imparfaites et a sequence de repetition composee. Lalongueur et I'integrite de ces sequences de repetition ont et<§ comparees a celles d'ADN microsatellitecaracterisee chez deux autres tel£osteens, la truite arc-en-ciel {Oncorhynchus mykiss) et le saumon deI'Atlantique {Salmo salar), et chez trois genomes de mammiferes, (humain, porcin et canin). On a trouvedes differences dans les proportions des categories de sequences de repetition; toutefois, la difference laplus significative entre ies ADN microsatellites de te§l<§osteens et celle de mammiferes etait une tendancea une longueur supe>ieure chez ces derniers : les ADN microsatellites de certaines morues et truites arc-en-ciel etaient deux fois plus longues que les sequences rep<§t<§es d'ADN microsatellite observees pourtous les genomes de mammiferes. Des amorces pour I'amplification par la PCR ont et§ creees pour sept desADN microsatellites de morue. Les frequences d'alleles, le degre de polymorphisme et I'heterozygositeont ete estime*s pour une population e*chantillon. On a egalement fait I'amplification de ces amorces de
morue pour un certain number de gadides connexes. Ces locus d'ADN microsatellites polymorphespeuvent etre extremement utiles comme marqueurs genetiques dans les etudes de population, de reproductionet devolution.
Received November 24, 1993
Accepted March 16, 1994
(J12177)
Re$u le 24 novembre 1993
Accepte le 16 mars 1994
Atlantic cod, Gadus morhua, is one of the most com
mercially important fish in the North Atlantic. Highly
polymorphic genetic markers assayable via the poly-
merase chain reaction (PCR) hold the potential for consid
erable utility in selective breeding, fisheries management,
and genome mapping of cod (Bentzen et al. 1991; Wright
1993; Wright and Bentzen 1994). To this end, we have
cloned and sequenced microsatellite repeats from cod.
Microsatellites consist of tandem repeats of simple
sequence DNA, where the varying number of repeats at a
particular locus creates a polymorphism (Litt and Luty 1989;
'Current address: School of Fisheries HF-10, MAR Building,Room 201, University of Washington, Seattle, WA 98195, USA.
2Author to whom correspondence should be addressed.
Can. J. Fish. Aquat. Sci., Vol. 51, 1994
Tautz 1989; Weber and May 1989). These arrays of differing
lengths can be amplified by PCR and size fractionated on
acrylamide gels. The most abundant and widely used of these
simple repeats are the dinucleotide sequences (dG-dT)n and
(dG-dA)^. Cloning of these microsatellites has been reported
for a variety of species, although mostly mammals (Stallings
et al. 1991). At present, extensive microsatellite data for
human (Weber 1990), mouse (Love et al. 1990), rat (Beckmann
and Weber 1992), dog (Ostrander et al. 1993), pig (Wintero
et al. 1992), horse (Ellegren et al. 1992), bees (Estoup et al.
1993), Atlantic salmon {Salmo salar) (Slettan et al. 1993),
brown trout {Salmo trutta) (Estoup et al. 1993), and zebrafish
(Goff et al. 1992) have been published.
Highly polymorphic microsatellite markers have great
potential utility as genetic tags for use in aquaculture and
1959
Table 1. Repeat sequence type and size of microsatellites from mammalian and
teleost fish species. Figures in parentheses are the second highest size class due to abimodal distribution.
Atlantic Rainbow Atlantic
cod trout salmon Human Canine Porcine
n
% perfect
% imperfect
% compound
Most common
size class3
Largest
size classa
64
48.4
45.3
6.3
6-11
(30-35)
>60
51
56.9
31.4
11.7
24-29
>60
45
80
20
0
6-9
(21-24)
>33
114
64
25
11
12-15
27-30
101
80
8
12
14-16
23-25
105
71
19
10
16-18
28-30
"Number of dinucleotide repeats.
□ Perfect
E3 Imperfect
Compound
Table 2. Nucleotide sequence of Atlantic cod microsatellites
and primers for amplification of loci by PCR.
6-11 12r17 18-23 24-29 30-35 36-41 42-47 48-53 54-59 >60
Number of repeat units (longest continuous run)
Fig. 1. Frequency of different size classes of the longest uninter
rupted microsatellite arrays from Atlantic cod. Size classes are rep
resented as number of repeat units.
fisheries biology. They may prove particularly valuable for
stock discrimination and population genetics due to the high
levels of polymorphism compared with conventional allozyme
markers (Bentzen et al. 1991; Wright and Bentzen 1994).
The utility of the PCR approach also makes possible the pro
cessing of large sample numbers required for population sur
veys and large-scale breeding programs. Moreover, the minute
amount of tissue required to genotype individuals allows
larvae as well as retrospective studies of archival samples
such as otoliths to be examined (Wright and Bentzen 1994).
Recently, microsatellites from Atlantic salmon were reported
to differ significantly in length and composition from mam
malian microsatellites (Slettan et al. 1993). In order to deter
mine if this finding is a general trend for microsatellites iso
lated from teleost fishes, we have investigated the organization
of Atlantic cod and, to a lesser extent, rainbow trout
(Oncorhynchus mykiss) microsatellites and compared them
with microsatellites isolated from other species. We also
report the allele distributions and heterozygosities of seven
microsatellite loci that exhibit a wide range of polymorphism
in a sample of 127 cod from the western North Atlantic
Ocean and investigate whether or not these microsatellite
loci are conserved in related gadid species.
i960
Primer Sequence 5' -> 3'
Gmo 1 (GA)4N(GA)2N,(GA)7N2(GA)4N2(GA)5Fa AAA AAC AAA ACG AGA GCG GTRb GGA GTT GCT TGT GGT GGC AT
57
Gmo2
Rb
Gmo9
F
R
Gmo 10
F
R
Gmol23
F
R
Gmol32
F
R
Gmol45
F
R
(GT)I4
CCC TCA GAT TCA AAT GAA GGA
GTG TGA GAT GAC TGT GTC G
(GT)6N2GTN2GTN2(GT)46
GCA GAT TTG AAT CAG CGT GT
CTC TGT GGG TAT GTG CAA AG
(GT)5N2(GT)3N,(GT)2N2(GT)44
GGA GTT TGA CTC ATC AGA TGC
ACA GCG TAT GAC CAT GAT TCG
(GT)gN2(GT)IIN(GT)2N3(GT)43N3(GT)8GAG GGA CAT AAA GAC ACT T
AAA CAT GCA TGC AGG CAA C
(GT)19
GGA ACC CAT TGG ATT CAG GC
CGA AAG GAC GAG CCA ATA AC
GI2(GA)29
GCA TTG TAG GAA CAA CAA TTA AC
GTG CAT GTG CTC ATT ATA GC
51
50
53
56
52
50
aF = forward primer.
bR = reverse primercAnnealing temperature.
Methods and Materials
Isolation of Microsatellites from Cod
Two cod genomic libraries were constructed. The first
employed the vector M13mp9 and was constructed from
purified, size-selected fragments of 200-500 bp. These frag
ments were generated by digestion with enzymes A/wI,
Haelll, Hindi, and Rsal (Pharmacia) according to the man
ufacturer's instructions and cloned into the Smal site of the
vector. Construction of a second library was perfqrmed by
cleaving cod genomic DNA with Alul, Hindi, Pall, and
Rsal, followed by ligation of size-selected fragments of
300-800 bp into the Smal site of the pUC18 plasmid.
Can. J. Fish. Aquat. ScL, Vol. 51, 1994
Table 3. Pedigree analysis of three two-generation families of Atlantic cod using
six polymorphic microsatellite loci.
Fish
AI,AI2
All,
AII2
AII3
AII4
AII5
AII6
AII7
AII8AII9
AII.o
BI,
BI2
BII,
BII2
BII3
BII4
BII5
BH6
CI,
CI2
CII,
CII2
CII3
CII4
CII5
CII6
CII7
Size of
alleles
(bp)
Gmol Gmo2
1»
1»
'»
'»
l »
» ■
» •
» •
» ■
» i
» J
» J
» J
1 3,4
1 2,5*
1 2,4
1 3,5
1 2,3
1 2,4
1 3,5
1 2,4
1 2,4
1 3,5
1 2,3
I 4,5
1 2,3
1 1,5
I 3,5
I 1,3
1 3,5
1 1,2
1 2,5
1 1,3
2,5
1 2,5
2,5
2,2
2,5
2,2
2,5
5,5
5,5
1 = 104 1 = 116
2= 114
3 = 112
4= 110
5= 108
Gmo9
4,7
8,9
4,8
4,9
7,9
4,9
4,9
4,9
4,8
4,9
7,8
7,9
2,3
6,6
3,6
3,6
3,6
2,6
3,6
3,6
1,8
5,7
5,8
1,7
7,8
7,8
1,5
1,7
5,8
1 =236
2 = 220
3 = 216
4 = 212
5 = 208
6 = 206
7 = 202
8 = 194
9= 190
Gmol 23
1,4
2,5
4,5
1,5
4,5
1,2
2,4
2,4
4,5
4,5
1,5
2,4
6, 10
2,4
2, 10
2, 10
2, 10
2,6
4, 10
4, 10
7,9
3,8
3,7
8,9
3,7
3,9
3,9
8,9
7,8
1 =221
2 = 219
3 = 213
4 = 203
5 = 201
6= 199
7= 195
8= 191
9= 187
10= 141
Gmol 32
4,5
4,5
4,4
5,5
4,5
4,4
5,5
4,4
4,4
5,5
5,5
4,5
1,2
3,5
1,5
1,3
2,5
1,5
2,3
1,5
5,5
3,5
3,5
5,5
5,5
5,5
5,5
5,5
5,5
1 = 135
2= 123
3= 119
4= 113
5= 111
Gmol45
2,8
5, 10
2, 10
2, 10
8, 10
8, 10
5,8
2,5
2, 10
8, 10
2, 10
5,8
4,7
3,9
4,9
7,9
4,9
4,9
4,9
3,7
1, 10
5,6
1,5
1,5
1,5
1,6
1,6
5, 10
6, 10
1 = 183
2= 181
3= 179
4= 177
5 = 175
6= 173
7= 171
8= 169
9= 167
10 = 165
Following transformation, bacterial colonies or plaques
were transferred to nylon filters (Amersham) according to the
manufacturer's instructions and the immobilized DNA was
hybridized to (GT)15 synthetic oligonucleotide, radiolabeled
with [7 32P]ATP and T4 polynucleotide kinase. Filters wereincubated in prehybridization solution of Westneat et al.
(1988) at 65°C for 1 h before addition of the radiolabeled
probe. The hybridization reaction was allowed to proceed
overnight. Filters were washed once with 2 X SSC/0.2%
SDS at room temperature for 15 min and once with 2 X
SSC/0.2% SDS at 65°C for 15 min. Dideoxy-chain termi
nation sequencing of DNA from purified positive clones
was accomplished using T7 DNA polymerase (Pharmacia).
PCR Analysis
PCR primers complementary to the sequence flanking the
microsatellite were synthesized. For all PCR analyses, except
those that amplify the locus Gmo9, the reverse primer was
5' end-labeled with [7 32P]ATP (37 kBq/10 pmol primer)using T4 polynucleotide kinase (Pharmacia). PCR was carried
Can. J. Fish. Aquat. Sci., Vol. 51, 1994
out in a Perkin Elmer thermal cycler using Taq Polymerase
(Perkin Elmer). The standard reaction contained 10 ng of
template DNA, 0.6 \xM labeled reverse primer, 0.6 \M for
ward primer, 10 mM Tris-HCl (pH 8.3), 1 mM MgCl2,
50 mM KC1, 0.01% gelatin, 200 \iM dNTPs, and 0.25 unit
of Taq Polymerase in a 10-jjlL volume. For Gmo9, the reverse
primer was not labeled; instead, [a 32P]dATP was includedin the PCR mix along with 20 mM MgCl2. Amplifications
were achieved by running 30 cycles of 1 min at 95°C, 2 min
at the appropriate annealing temperature (see Table 2), and
20 s at 72°C. Stop dye (10 jjiL, Pharmacia) was added to
each reaction and the products were denatured at 80°C for
5 min. Two microlitres of the reaction was subjected to gel
electrophoresis on a 6% denaturing polyacrylamide gel.
Results and Discussion
Characterization of Cod-derived (dG-dA)n and (dG-dT)wSequences
As a result of the two cloning efforts, over 120 clones
1961
A C G T bp
13 112 111112 2
243444443434 ':
Fig. 2. Allelic products generated by PCR of the Gmo9 locus from a two-generation Norwegian
pedigree of Atlantic cod. Offspring demonstrate stable inheritance of parental alleles which
are numbered and shown at the bottom of the figure. Size of alleles are estimated using an
M13 sequence ladder.
that hybridized to the dinucleotide probe were isolated and
sequenced from Atlantic cod, approximately half from each
library. A crude estimate of the number of times (dG-dT),,
arrays occur in the genome and the distance between these
arrays was calculated. A plasmid library of approximately
7000 clones containing an average insert size of 400 bp
was constructed, representing 2.8 X 106 bp of cod genomicDNA. We make the assumption that (GT)rt sequences are
distributed evenly throughout the cod genome and also
that the 54 positive clones that we sequenced were repre
sentative of all the positive clones (about 10%). Thirty-one
of 54 clones, or 57%, contained a (GT)n array. Based on a
"c" value of 0.9 pg, the haploid genome of cod is 821 Mbp
(Hinegardner and Rosen 1972); we estimate that there are
approximately 2.34 X 105 copies of (GT),, microsatellitesequence present in the cod genome at intervals of 7 kbp.
These estimates are comparable with those suggested for
mammalian genomes (Tautz 1989) and approximately three
times more frequent than in brown trout (Estoup et al. 1993).
Forty-nine clones harbored repeats bracketed by nonrepet-
itive sequences. These were classified as perfect, imperfect,
and compound according to Weber (1990). Briefly, perfect
1962
repeats are uninterrupted stretches of the repeat unit, while
imperfect repeats have one to three intervening bases with
repeat sequence on either side. Compound repeats consist
of neighboring blocks of different types of repeat sequence.
Sixty-four independent microsatellite arrays were identi
fied from the 49 clones. The longest perfect stretch of each
of these arrays was plotted according to class and number of
uninterrupted repeat units (Fig. 1). The percentage of each
class of repeat, the most common size class, and the longest
size class for cod were compared with microsatellites from
other species: human (Weber 1990), pig (Wintero et al.
1992), dog (Ostrander et al. 1993), Atlantic salmon (Slettan
et al. 1993), and rainbow trout (unpublished data) (Table 1).
Slightly different procedures have been used to isolate
microsatellites from different species. It is possible that we
may have biased our screening towards longer arrays with
the use of a (GT)I5 oligonucleotide and higher hybridization and
washing temperatures (65°C); however, we feel that this is
unlikely. Less stringent screening efforts for mammals, as
used by Weber (1990), Beckmann and Weber (1992),
Ellegren et al. (1992), Wintero et al. (1992), and Ostrander
et al. (1993) have failed to reveal microsatellites over 30 repeats
Can. J. Fish. Aquat. ScL, Vol. 51, 1994
ACGT ACGT bp
-140
hi30
-120
-110
Fig. 3. PCR amplified alleles from 48 Atlantic cod individuals at microsatellite locus Gmo2. The sizes of the PCR products were esti
mated by co-migration with those of the M13 sequence fragments shown at either side of the cod samples.
in length. Screening at a lower stringency should enhance
detection of long, short, and more degenerate dinucleotide
arrays, but there is no evidence of this in the literature.
In all animal species, (dG-dT),, microsatellites predomi
nate with the next abundant type being (dG-dA)w. For this
reason, we chose to focus our search on (dG-dT)n. We only
succeeded in isolating two (dG-dA),, microsatellites, detected
by a (GT)W probe; these were found in clones containing a
(dG-dT),, array. The proportions of the different classes of
repeat types differ between species, with the highest per
centage of imperfect repeats found in cod and rainbow trout.
Proportions of perfect and imperfect microsatellites isolated
from Atlantic salmon (Slettan et al. 1993) more closely
resemble the mammalian genomes, except for the absence
of compound repeats, than do microsatellites from Atlantic cod
and rainbow trout (Table 1). An even more striking difference
between mammalian and cold-water fish microsatellites is
the most common size class of the microsatellite arrays.
Sixty-four percent of the smallest class of arrays (contain
ing 6-11 repeats) in cod were separated by four or five bases
adjacent to a larger repeat in the same clone. They were
classified as separate repeats because of the arbitrary defin
ition of Weber (1990). If these shorter microsatellites are
viewed as part of a larger degenerate microsatellite array,
and the criteria which define an imperfect repeat are altered
to incorporate four or five intervening bases rather than three,
then the most common size class for cod microsatellites
would be at least 30-35 repeats, approximately twice the
size of those reported in mammalian genomes. Regardless
of this, the next abundant size classes for cod and Atlantic
salmon are 30-35 and 21-24 dinucleotide repeats, respec
tively, significantly larger than the most common size classes
reported for mammals. This conclusion is consistent with
the size of microsatellites found in rainbow trout (unpub
lished data), where smaller adjacent repeats were not found.
The absolute lengths of the largest microsatellite arrays
in cod, rainbow trout, and Atlantic salmon (Slettan et al.
1993) are also greater than those reported in mammalian
genomes. In a study of microsatellite arrays from zebrafish,
although not providing data on all repeat arrays, three of 16
sequences were over 30 repeat units in length, the longest
consisting of 38 dinucleotide units (Goff et al. 1992). Data
were statistically analyzed to see if this difference in
Can. J. Fish. Aquat. Sci, Vol. 51, 1994
Table 4. Allele size and heterozygosity at Atlantic cod
microsatellite loci detected by PCR amplification.
Locus
Gmol
Gmo2
Gmo9
GmolO
Gmol23
Gmol 32
Gmol45
No. of
alleles
8
15
38
1
46
15
26
Size
(bp)a
94-114
106-140
116-246
214
137-239
109-155
123-217
% heterozygosity
14
82
92
0
89
85
92
aEstimated size of the PCR fragment when compared
with M13 sequence fragments of known length.
microsatellite length was significant for fishes and mam
mals. A chi-squared test was inappropriate because of the
varying size classes used by the different authors to group
data from the different species. A new data set was con
structed by taking the midpoint of the microsatellite length
of each size class and then weighting the length data by
frequency of the class. Bartlett's test for homogeneity of
variance showed that the fish data were more variable than
the mammal data (P < 0.001). A Kruskal-Wallis nonpara-
metric analysis on the pooled mammal and fish data demon
strated that the maximum lengths of fish and mammalian
microsatellites are highly different (P < 0.001).
Also noteworthy, the longest microsatellite arrays in the
cod genome consisted of perfect dinucleotide repeats. In
rainbow trout, six of seven microsatellites longer than
47 repeat units were perfect motifs. Cod, rainbow trout, and
possibly Atlantic salmon and zebrafish microsatellites are
longer and more degenerate than similar mammalian
sequences. Cod and rainbow trout microsatellites are similar
in length, but there is a higher proportion of imperfect
repeats in the cod, often with smaller degenerate repeats
adjacent to larger ones.
Inheritance and Allele Frequencies at Seven Microsatellite
Loci f
PCR primers were designed from flanking regions for a
number of microsatellites and conditions such as temperature,
1963
0 I ™l " " I H 1 1 ™ I
94 98 102 106 110 114
l—I—P"H—Fl-l* I
106 114 122 130 138
0.06
D0.4
w 0.2
.III ill ,in-l*l i-i-i-iI109 117 125 133 141 149
Fig. 4. Allele frequency distribution of alleles at microsatellite loci from Atlantic cod from the Northwest Atlantic Ocean (n127). Microsatellite loci: (A) Gmol, (B) Gmo2, (C) Gmo9, (D) Gmol23, (E) Gmol32, and (F) Gmol45.
radiolabeling (end labeling versus incorporation), and mag
nesium concentration were optimized (Table 2). Seven primer
pairs produced a product readily scored. All seven loci exhib
ited stable inheritance of alleles by PCR assay in three unre
lated, two-generation pedigrees of Atlantic cod (Table 3).Figure 2 demonstrates that even the most complex fragments
produced by PCR of the Gmo9 locus can be readily scored.
Genomic DNA samples from 127 fish caught off the coast
of Nova Scotia in the western North Atlantic were used to
1964
amplify seven microsatellites with the primer pairs described
in Table 2. Alleles for each individual were scored by comparing their migration with sequence fragments from M13 of
known length (Fig. 3). The seven microsatellites varied
widely in the degree of polymorphism exhibited (Table 4).Of the seven microsatellites, six were polymorphic, with 8-46
alleles and heterozygosities of 14-92%. Five of th'e micro-satellites exhibited heterozygous arrays in excess of 80%. It is
interesting that the only locus which was monomorphic
Can. J. Fish. Aquat. ScL, Vol. 51, 1994
(GmolO; (GT)5N2(GT)3N2(GT)2N2(GT)44 consists of a long
imperfect repeat, while all of the short repeats such as Gmo2,
Gmol32, and Gmol45 were highly polymorphic. This result
suggests that factors other than overall length of a microsatel
lite array, as suggested by Weber (1990), may determine
whether a microsatellite will exhibit allelic variation (see
Wright 1993).
The allele frequency distributions varied markedly for
the seven microsatellite loci assayed (Fig. 4; Table 4). Some
microsatellites showed wide variation in allele size (e.g.,
Gmo9 and Gmol23). Most allele distributions were uni-
modal, but those of Gmo9 and Gmol32 appeared weakly
and strongly bimodal), respectively (Fig. 4; Table 4). Similar
allele frequency distributions, reported for human microsatel
lites (Valdes et al. 1993), have been attributed to a stepwise
mechanism for the generation of new alleles. Presumably,
a similar mechanism is probably responsible for the gener
ation of allelic variation at microsatellite loci in cod.
In order to determine the evolutionary conservation of
microsatellite loci in related gadids, PCR amplification was
attempted with the seven primer pairs for the Atlantic cod
microsatellite loci on genomic DNA from the following
species: haddock (Melanogrammus aeglefinus), pollock
(Pollachius virens), and white hake {Urophycis tennis). PCR
products were detected in all of the gadid samples for the loci
Gmol, Gmo2, and Gmo9, while GmolO, Gmol32, and
Gmol45 produced fragments in only haddock and pollock.
No products were detected after PCR with the Gmol23
primers. Although only three individuals for each species
were assayed, all of the loci except GmolO and Gmol23
appeared to be polymorphic for at least one of the other
gadids (data not shown).
General Discussion
We report here the characterization of a group of micro-
satellites from Gadus morhua. It appears that the microsatel
lites in teleost fishes differ significantly in length and com
position from those of mammals. We can only speculate as
to why the microsatellites found in Atlantic cod, rainbow
trout, and Atlantic salmon are larger than microsatellites
from mammals. It remains to be seen, with subsequent iso
lation of microsatellites from other species, whether poikilo-
therms consistently have longer arrays than homiothermic
species. The longer, degenerate microsatellite arrays found
in teleost fishes may be caused by a predisposition to slip
page owing to fluctuations in temperature. The DNA poly-
merase may not function efficiently when subjected to
changes in temperature of several degrees, or temperatures
close to freezing, such as occur in the Atlantic cod envi
ronment and that of the salmonid species.
Disparities in genomic composition between warm- and
cold-blooded vertebrates have already been established by
Bernardi and Bernardi (1990). They found that fish iso-
chores were much more homogeneous and lower in GC con
tent than those of warm-blooded vertebrates. A subsequent
study by Cross et al. (1991) makes a further distinction
between fish genomes and other warm- and cold-blooded
vertebrates with regard to the GC-poor CpG islands that
are associated with some genes. It is possible that different
evolutionary forces are acting on the fish genome, gener
ating longer repeat arrays at microsatellite loci, or con
versely, shorter arrays in the genomes of mammals.
Can. J. Fish. Aquat. Sci, Vol. 51, 1994
Using PCR and primers specific to the nonrepetitive
flanks, we have successfully amplified from cod genomic
DNA seven of the cod microsatellites isolated in this study.
Five of these microsatellites exhibit heterozygosities greatly
in excess of those reported for allozyme or mitochondrial
studies. The average heterozygosity of these microsatellite
markers is H = 0.85 compared with H = 0.071 for allozymes
(Mork et al. 1985) and an equivalent measure for nucleon
diversity, h = 0.36, for mitochondrial genotypes (Carr and
Marshall 1991).
Breeding and population studies of Atlantic cod utilizing
these microsatellites as genetic markers are in progress in our
laboratory. The family Gadidae to which Atlantic cod belong
includes many other economically important species. Our
results have shown that the microsatellites reported here
may prove useful as genetic markers in other gadid species,
similar to a study previously carried out on domestic mam
mals (Moore et al. 1991). Furthermore, a comparative study
of the conservation and polymorphism of these microsatellite
loci in related gadids may elucidate the mode and tempo
of mutation at these fascinating sequences (Bentzen and
Wright 1993; Wright 1993, 1994).
Acknowledgements
Technical assistance was provided by Michiko Fllipak, Linda
Aitkinson, Barbara Edgar, and Harvey Domoslai. Rainbow trout
microsatellite data were provided by Dianne Morris prior to
publication. DNA from cod pedigrees was a generous gift of
Geir Dahle, Institute of Marine Research, Bergen, Norway. This
work was supported by funds from the Ocean Production
Enhancement Network (OPEN), the Nova Scotia government,
Department of Industry Trade and Technology, to the Marine
Gene Probe Laboratory, and the Natural Sciences and Engineer
ing Research Council of Canada to J.M.W. OPEN fellowship
support to A.L.B. is also acknowledged.
References
Beckmann, J.S., and J.L. Weber. 1992. Survey of human and rat microsatel
lites. Genomics 12: 627-631.
Bentzen, P., and J.M. Wright. 1993. Nucleotide sequence and evolu
tionary conservation of a minisatellite variable number tandem repeat
cloned from Atlantic salmon, Salmo salar. Genome 36: 271-277.
Bentzen, P., A.S. Harris, and J.M. Wright. 1991. Cloning of hyper-
variable minisatellite and simple sequence microsatellite repeats for
DNA fingerprinting of important aquacultural species of salmonids
and tilapia, p. 243-262. In T. Burke, G. Dolf, A.J. Jeffreys, and R. Wolf
[ed.] DNA fingerprinting: approaches and applications. Birkhauser
Verlag, Basel, Switzerland.
Bernardi, G., and G. Bernardi. 1990. Compositional transitions in the
nuclear genomes of cold-blooded vertebrates. J. Mol. Evol. 31:
2856-2867.
Carr, S.M., and H.D. Marshall. 1991. Detection of intraspecific DNA
sequence variation in the mitochondrial cytochrome b gene of Atlantic
cod {Gadus morhua) by the polymerase chain reaction. Can. J. Fish.
Aquat. Sci. 48: 48-52.
Cross, S., P. Kovarik, J. Schmidtke, and A. Bird. 1991. Nonmethylated
islands in fish genomes are GC-poor. Nucleic Acids Res. 19: 1469-1474.
Ellegren, H., M. Johansson, K. Sandberg, and L. Andersson. 1992.
Cloning of highly polymorphic microsatellites in the horse. Anim.
Genet. 23: 133-142.
Estoup, A., P. Presa, F. Krieg, D. Vaiman, and R. Guyomard. 1993.
(CT)n and (GT),, microsatellites: a new class of genetic markers for
Salmo trutta L. (brown trout). Heredity 71: 488-496.
Estoup, A., M. Solignac, M. Harry, and J-M. Cornuet. 1993. Charac
terization of (GT)n and (CT)n microsatellites in two insect species:
Apis mellifera and Bombus terrestris. Nucleic Acids Res. 21: 1427-1431.
Goff, D.J., K. Galvin, H. Katz, M. Westerfield, E. Lander, and
C.J. Tabin. 1992. Identification of polymorphic simple sequence
repeats in the genome of the zebrafish. Genomics 14: 200-202.
1965
Hinegardner,R.,andD.E.Rosen.1972.CellularDNAcontentandtheevolutionofteleosteanfishes.Am.Nat.106:621-644.
Litt,M.,andJ.A.Luty.1989.Ahypervariablemicrosatelliterevealedbyinvitroamplificationofadinucleotiderepeatwithinthecardiacmuscleactingene.Am.J.Hum.Genet44:397-401.
Love,J.,A.Knight,M.McAleer,andJ.Todd.1990.TowardsconstructionofahighresolutionmapofthemousegenomeusingPCR-analysedmicrosatellites.NucleicAcidsRes.18:4123-4130.
Moore,S.S.,L.L.Sargeant,T.J.King,J.S.Mattick,M.Georges,andD.J.Hetzel.1991.TheconservationofdinucleotidemicrosatellitesamongmammaliangenomesallowstheuseofheterologousPCR
primerpairsincloselyrelatedspecies.Genomics10:654-660.Mork,J.,R.Ryman,G.Stahl,F.Utter,andG.Sundnes.1985.Genetic
variationinAtlanticcod(Gadusmorhua)throughoutitsrange.CanJ.Fish.Aquat.Sci.42:1580-1587.
Ostrander,E.A.,J.F.SpragueJr.,andJ.Rine.1993.Identificationandcharacterizationofdinucleotiderepeat(CA)flmarkersforgeneticmappingindog.Genomics16:207-213.
Slettan,A.,I.Olsaker,andO.Lie.1993.Isolationandcharacterizationof(GT)nrepetitivesequencesfromAtlanticsalmon,SalmosalarLAnim.Genet.24:195-197.
Stallings,R.L.,A.F.Ford,D.C.Torney,C.E.Hildebrand,and
R.K.Moyzis.1991.Evolutionanddistributionof(GT)nrepetitivesequencesinmammaliangenomes.Genomics10:807-815.
Tautz,D.1989.HypervariabilityofsimplesequencesasasourceforpolymorphicDNAmarkers.NucleicAcidsRes.17:6463-6471.
Valdes,A.M.,M.Slatkin,andN.B.Freimer.1993.Allelefrequenciesatmicrosatelliteloci:thestepwisemutationmodelrevisitedGenetics133:737-749.
Weber,J.L.1990.Informativenessofhuman(dC-dA)-(dG-dT)polymorphisms.Genomics7:524-530."
Weber,J.L.,andP.E.May.1989.AbundantclassofhumanDNApolymorphismwhichcanbetypedusingthepolymerasechainreaction.Am.J.Hum.Genet.44:388-396.
Westneat,D.F.,W.A.Noon,H.K.Reeve,andC.F.Aquadro.1988.ImprovedhybridizationconditionsfromDNA"fingerprints"probedwithMl3.NucleicAcidsRes.16:4161.
Wintero,A.K.,M.Fredholm,andP.B.Thomsen.1992.Variable(dG-dT)n-(dC-dA)rtsequencesintheporcinegenome.Genomics12:281—288.
Wright,J.M.1993.DNAfingerprintingoffishes,p.57-91.InP.W.HochachkaandT.Mommsen[ed.]Biochemistryandmolecularbiologyoffishes.Vol.2.ElsevierSciencePublishersB.V.Amsterdam,TheNetherlands.
Wright,J.M.1994.MutationatVNTRs:areminisatellitestheevolutionaryprogenyofmicrosatellites.Genome37:345-347.
Wright,J.M.,andP.Bentzen.1994.Microsatellites:geneticmarkersforthefuture.InG.R.CarvalloandT.J.Pitcher[ed.]Rev.FishBiolFish.4:384-388.
1966Can.J.Fish.Aquat.Sci.,Vol.51,1994
Organization of Microsatellites Differs between Mammals
and Cold-water Teleost Fishes
Amanda L. Brooker, Doug Cook, Paul Bentzen,1 Jonathan M. Wright,2 and Roger W. Doyle
Marine Gene Probe Laboratory, Department of Biology, Dalhousie University, Halifax, NS B3H 4J1, Canada
Brooker, A.L, D. Cook, P. Bentzen, J.M. Wright, and R.W. Doyle. 1994. Organization of microsatellites differs
between mammals and cold-water teleost fishes. Can. J. Fish. Aquat. Sci. 51:1959-1966.
Microsatellites, in particular (dG-dT)n and (dG-dA),, dinucleotide repeats, are abundant and display a high
degree of length polymorphism and heterozygosity in eukaryotic genomes. Here, we report the cloning andcharacterization of 64 microsatellite sequences from Atlantic cod, Cadus morhua. The microsatelliteswere classified as perfect, imperfect, and compound repeats. The length and integrity of these repeats
were compared with microsatellites characterized from two other teleosts, rainbow trout (Oncorhynchusmykiss) and Atlantic salmon {Salmo salar), and from three mammalian genomes, human, porcine, andcanine. Differences were found in the proportions of the repeat classes; however, the most significantdifference between microsatellites from teleost fishes and mammals was the propensity of the former to beof greater length: some cod and rainbow trout microsatellites were more than twice the size of the longestmicrosatellite repeats reported for any mammalian genome. Primers for PCR amplification were constructed for seven of the cod microsatellites. Allele frequencies, degree of polymorphism, and heterozygositywere estimated for a sample population. Amplification with these cod primers was also carried out on
a number of related gadids. These polymorphic microsatellite loci have enormous potential utility asgenetic markers for use in population, breeding, and evolutionary studies.
Les chatnes d'ADN microsatellite, et plus particulierement les sequences rep£titives des dinucleotides(dG-dT)net (dG-dA),,, sont abondantes et leurs genomes eucaryotes pr£sentent un taux eleve de poly-morphisme et d'heterozygosite\ Dans ce rapport, nous pr£sentons les r£sultats du clonage et de la carac-terisation de 60 sequences d'ADN microsatellite de la morue franche, Gadus morhua. Les ADN microsatellites etaient classees en trois categories : parfaites, imparfaites et a sequence de r£p£tition composed. Lalongueur et l'integrit£ de ces sequences de r£p£tition ont £t£ comparers a celles d'ADN microsatellitecaracterisee chez deux autres teleost£ens, la truite arc-en-ciel (Oncorhynchus mykiss) et le saumon deI'Atlantique (Salmo salar), et chez trois genomes de mammiferes, (humain, porcin et canin). On a trouvedes differences dans les proportions des categories de sequences de repetition; toutefois, la difference laplus significative entre les ADN microsatellites de teleosteens et celle de mammiferes etait une tendancea une longueur supe>ieure chez ces derniers : les ADN microsatellites de certaines morues et truites arc-en-ciel Etaient deux fois plus longues que les sequences re>§t§es d'ADN microsatellite observees pourtous les genomes de mammiferes. Des amorces pour I'amplification par la PCR ont £te cr£ees pour sept desADN microsatellites de morue. Les frequences d'alleles, le degre de polymorphisme et I'heterozygosityont £t£ estim£s pour une population £chantillon. On a Sgalement fait I'amplification de ces amorces de
morue pour un certain number de gadid£s connexes. Ces locus d'ADN microsatellites polymorphespeuvent etre extremement utiles comme marqueurs gen&iques dans les eludes de population, de reproductionet devolution.
Received November 24, 1993
Accepted March 16, 1994
(J12177)
Regu le 24 novembre 1993
Accepte le 16 mars 1994
Atlantic cod, Gadus morhua, is one of the most com
mercially important fish in the North Atlantic. Highly
polymorphic genetic markers assayable via the poly-
merase chain reaction (PCR) hold the potential for consid
erable utility in selective breeding, fisheries management,
and genome mapping of cod (Bentzen et al. 1991; Wright
1993; Wright and Bentzen 1994). To this end, we have
cloned and sequenced microsatellite repeats from cod.
Microsatellites consist of tandem repeats of simple
sequence DNA, where the varying number of repeats at a
particular locus creates a polymorphism (Litt and Luty 1989;
'Current address: School of Fisheries HF-10, MAR Building,Room 201, University of Washington, Seattle, WA 98195, USA.
2Author to whom correspondence should be addressed.
Can. J. Fish. Aquat. Sci.. Vol. 51, 1994
Tautz 1989; Weber and May 1989). These arrays of differing
lengths can be amplified by PCR and size fractionated on
acrylamide gels. The most abundant and widely used of these
simple repeats are the dinucleotide sequences (dG-dT)w and
(dG-dA),,. Cloning of these microsatellites has been reported
for a variety of species, although mostly mammals (Stallings
et al. 1991). At present, extensive microsatellite data for
human (Weber 1990), mouse (Love et al. 1990), rat (Beckmann
and Weber 1992), dog (Ostrander et al. 1993), pig (Wintero
et al. 1992), horse (Ellegren et al. 1992), bees (Estoup et al.
1993), Atlantic salmon (Salmo salar) (Slettan et al. 1993),
brown trout (Salmo trutta) (Estoup et al. 1993), and z^rafish
(Goff et al. 1992) have been published.
Highly polymorphic microsatellite markers have great
potential utility as genetic tags for use in aquaculture and
1959
Table 1. Repeat sequence type and size of microsatellites from mammalian and
teleost fish species. Figures in parentheses are the second highest size class due to abimodal distribution.
n
% perfect
% imperfect
% compound
Most common
size classa
Largest
size class3
Atlantic
cod
64
48.4
45.3
6.3
6-11
(30-35)
>60
Rainbow
trout
51
56.9
31.4
11.7
24-29
>60
Atlantic
salmon
45
80
20
0
6-9
(21-24)
>33
Human
114
64
25
11
12-15
27-30
Canine
101
80
8
12
14-16
23-25
Porcine
105
71
19
10
16-18
28-30
aNumber of dinucleotide repeats.
□ Perfect
0 Imperfect
El Compound
Table 2. Nucleotide sequence of Atlantic cod microsatellitesand primers for amplification of loci by PCR.
6-11 12r17 18-23 24-29 30-35 36-41 42-47 48-53 54-59 >60
Number of repeat units (longest continuous run)
Fig. 1. Frequency of different size classes of the longest uninter
rupted microsatellite arrays from Atlantic cod. Size classes are rep
resented as number of repeat units.
fisheries biology. They may prove particularly valuable for
stock discrimination and population genetics due to the high
levels of polymorphism compared with conventional allozyme
markers (Bentzen et al. 1991; Wright and Bentzen 1994).
The utility of the PCR approach also makes possible the pro
cessing of large sample numbers required for population sur
veys and large-scale breeding programs. Moreover, the minute
amount of tissue required to genotype individuals allows
larvae as well as retrospective studies of archival samples
such as otoliths to be examined (Wright and Bentzen 1994).
Recently, microsatellites from Atlantic salmon were reported
to differ significantly in length and composition from mam
malian microsatellites (Slettan et al. 1993). In order to deter
mine if this finding is a general trend for microsatellites iso
lated from teleost fishes, we have investigated the organization
of Atlantic cod and, to a lesser extent, rainbow trout
(Oncorhynchus mykiss) microsatellites and compared them
with microsatellites isolated from other species. We also
report the allele distributions and heterozygosities of seven
microsatellite loci that exhibit a wide range of polymorphism
in a sample of 127 cod from the western North Atlantic
Ocean and investigate whether or not these microsatellite
loci are conserved in related gadid species.
I960
Primer Sequence 5' -> 3' 7-ann <QC
Gmo 1 (GA)4N(GA)2N,(GA)7N2(GA)4N2(GA)5
AAA AAC AAA ACG AGA GCG GT
57
Rb
Gmo2
Rb
Gmo9
F
R
Gmo 10
F
R
Gmol23
F
R
Gmol32
F
R
Gmo 145
F
R
GGA GTT GCT TGT GGT GGC AT
(GT)I4
CCC TCA GAT TCA AAT GAA GGA
GTG TGA GAT GAC TGT GTC G
(GT)6N2GTN2GTN2(GT)46
GCA GAT TTG AAT CAG CGT GT
CTC TGT GGG TAT GTG CAA AG
(GT)5N2(GT)3N2(GT)2N2(GT)44
GGA GTT TGA CTC ATC AGA TGC
ACA GCG TAT GAC CAT GAT TCG
(GT)8N2(GT)nN(GT)2N3(GT)43N3(GT)8GAG GGA CAT AAA GAC ACT T
AAA CAT GCA TGC AGG CAA C
(GT)I9
GGA ACC CAT TGG ATT CAG GC
CGA AAG GAC GAG CCA ATA AC
G12(GA)29
GCA TTG TAG GAA CAA CAA TTA AC
GTG CAT GTG CTC ATT ATA GC
51
50
53
56
52
50
ZF = forward primer.
bR = reverse primercAnnealing temperature.
Methods and Materials
Isolation of Microsatellites from Cod
Two cod genomic libraries were constructed. The first
employed the vector M13mp9 and was constructed from
purified, size-selected fragments of 200-500 bp. These frag
ments were generated by digestion with enzymes Alul,
Haelll, Hindi, and Rsal (Pharmacia) according to the man
ufacturer's instructions and cloned into the Smal site of the
vector. Construction of a second library was perfqrmed by
cleaving cod genomic DNA with Alul, Hindi, Pall, and
Rsal, followed by ligation of size-selected fragments of
300-800 bp into the Smal site of the pUC18 plasmid.
Can. J. Fish. Aquat. ScL, Vol. 51, 1994
Table 3. Pedigree analysis of three two-generation families of Atlantic cod using
six polymorphic microsatellite loci.
Fish
AI,AI2
All,
AII2
AII3
AII4
AII5
AII6
AII7
AH8AII9
AH,o
BI,BI2
BII,
BII2
BII3
BII4
BII5
BH6
CI,
CI2
CII,
CII2
CII3
CII4
CII5
CII6
CII7
Size of
alleles
(bp)
Gmol Gmo2
[ y
[ ?
[ ^
[ ^
L ?
»
»
»
»
, ■
I 3,4
1 2, 5V
1 2,4
1 3,5
1 2,3
1 2,4
I 3,5
I 2,4
I 2,4
I 3,5
1 2,3
I 4,5
I 2,3
I 1,5
I 3,5
I 1,3
I 3,5
I 1,2
I 2,5
I 1, 3
2,5
I 2,5
I 2,5
2,2
2,5
i 2,2
2,5
5,5
5,5
1 = 104 1 = 116
2 = 114
3=112
4= 110
5 = 108
Gmo9
4, 7
8,9
4, 8
4,9
7,9
4,9
4,9
4,9
4,8
4,9
7,8
7,9
2,3
6,6
3,6
3,6
3,6
2,6
3,6
3,6
1,8
5,7
5,8
1,7
7, 8
7,8
1, 5
1,7
5,8
1 =236
2 = 220
3 = 216
4 = 212
5 = 208
6 = 206
7 = 202
8= 194
9= 190
Gmol 23
1,4
2,5
4,5
1, 5
4, 5
1, 2
2,4
2,4
4,5
4,5
1,5
2,4
6, 10
2,4
2, 10
2, 10
2, 10
2,6
4, 10
4, 10
7,9
3,8
3,7
8,9
3,7
3,9
3,9
8,9
7, 8
1 =221
2 = 219
3 = 213
4 = 203
5 = 201
6= 199
7= 195
8= 191
9= 187
10= 141
Gmol32
4, 5
4, 5
4, 4
5,5
4, 5
4,4
5,5
4,4
4,4
5,5
5,5
4, 5
1,2
3,5
1,5
1,3
2,5
1,5
2,3
1,5
5,5
3,5
3,5
5,5
5, 5
5, 5
5,5
5, 5
5,5
1 = 135
2= 123
3 = 119
4= 113
5 = 111
Gmol45
2,8
5, 10
2, 10
2, 10
8, 10
8, 10
5, 8
2,5
2, 10
8, 10
2, 10
5,8
4,7
3,9
4,9
7,9
4,9
4,9
4,9
3,7
1, 10
5,6
1,5
1,5
1,5
1,6
1,6
5, 10
6, 10
1 = 183
2= 181
3 = 179
4= 177
5 = 175
6= 173
7= 171
8= 169
9= 167
10= 165
Following transformation, bacterial colonies or plaques
were transferred to nylon filters (Amersham) according to the
manufacturer's instructions and the immobilized DNA was
hybridized to (GT)15 synthetic oligonucleotide, radiolabeled
with [7 32P]ATP and T4 polynucleotide kinase. Filters wereincubated in prehybridization solution of Westneat et al.
(1988) at 65°C for 1 h before addition of the radiolabeled
probe. The hybridization reaction was allowed to proceed
overnight. Filters were washed once with 2 X SSC/0.2%
SDS at room temperature for 15 min and once with 2 X
SSC/0.2% SDS at 65°C for 15 min. Dideoxy-chain termi
nation sequencing of DNA from purified positive clones
was accomplished using T7 DNA polymerase (Pharmacia).
PCR Analysis
PCR primers complementary to the sequence flanking the
microsatellite were synthesized. For all PCR analyses, except
those that amplify the locus Gmo9, the reverse primer was
5' end-labeled with [7 32P]ATP (37 kBq/10 pmol primer)using T4 polynucleotide kinase (Pharmacia). PCR was carried
Can. J. Fish. Aquat. ScL, Vol. 51, 1994
out in a Perkin Elmer thermal cycler using Taq Polymerase
(Perkin Elmer). The standard reaction contained 10 ng of
template DNA, 0.6 pM labeled reverse primer, 0.6 |nM for
ward primer, 10 mM Tris-HCl (pH 8.3), 1 mM MgCl2,
50 mM KC1, 0.01% gelatin, 200 \xM dNTPs, and 0.25 unit
of Taq Polymerase in a 10-jiL volume. For Gmo9, the reverse
primer was not labeled; instead, [a 32P]dATP was includedin the PCR mix along with 20 mM MgCl2. Amplifications
were achieved by running 30 cycles of 1 min at 95°C, 2 min
at the appropriate annealing temperature (see Table 2), and
20 s at 72°C. Stop dye (10 jxL, Pharmacia) was added to
each reaction and the products were denatured at 80°C for
5 min. Two microlitres of the reaction was subjected to gel
electrophoresis on a 6% denaturing polyacrylamide gel.
Results and Discussion
Characterization of Cod-derived (dG-dA)w and (dG-dT)n
Sequences
As a result of the two cloning efforts, over 120 clones
1961
OtD
13 112 111112 2
243444443434
Fig. 2. Allelic products generated by PCR of the Gmo9 locus from a two-generation Norwegian
pedigree of Atlantic cod. Offspring demonstrate stable inheritance of parental alleles which
are numbered and shown at the bottom of the figure. Size of alleles are estimated using an
M13 sequence ladder.
that hybridized to the dinucleotide probe were isolated and
sequenced from Atlantic cod, approximately half from each
library. A crude estimate of the number of times (dG-dT),,
arrays occur in the genome and the distance between these
arrays was calculated. A plasmid library of approximately
7000 clones containing an average insert size of 400 bp
was constructed, representing 2.8 X 106 bp of cod genomicDNA. We make the assumption that (GT)n sequences are
distributed evenly throughout the cod genome and also
that the 54 positive clones that we sequenced were repre
sentative of all the positive clones (about 10%). Thirty-one
of 54 clones, or 57%, contained a (GT)n array. Based on a
"c" value of 0.9 pg, the haploid genome of cod is 821 Mbp
(Hinegardner and Rosen 1972); we estimate that there are
approximately 2.34 X 105 copies of (GT)n microsatellitesequence present in the cod genome at intervals of 7 kbp.
These estimates are comparable with those suggested for
mammalian genomes (Tautz 1989) and approximately three
times more frequent than in brown trout (Estoup et al. 1993).
Forty-nine clones harbored repeats bracketed by nonrepet-
itive sequences. These were classified as perfect, imperfect,
and compound according to Weber (1990). Briefly, perfect
1962
repeats are uninterrupted stretches of the repeat unit, while
imperfect repeats have one to three intervening bases with
repeat sequence on either side. Compound repeats consist
of neighboring blocks of different types of repeat sequence.
Sixty-four independent microsatellite arrays were identi
fied from the 49 clones. The longest perfect stretch of each
of these arrays was plotted according to class and number of
uninterrupted repeat units (Fig. 1). The percentage of each
class of repeat, the most common size class, and the longest
size class for cod were compared with microsatellites from
other species: human (Weber 1990), pig (Wintero et al.
1992), dog (Ostrander et al. 1993), Atlantic salmon (Slettan
et al. 1993), and rainbow trout (unpublished data) (Table 1).
Slightly different procedures have been used to isolate
microsatellites from different species. It is possible that we
may have biased our screening towards longer arrays with
the use of a (GT)I5 oligonucleotide and higher hybridization and
washing temperatures (65°C); however, we feel that this is
unlikely. Less stringent screening efforts for mammals, as
used by Weber (1990), Beckmann and Weber (1992),
Ellegren et al. (1992), Wintero et al. (1992), and Ostrander
et al. (1993) have failed to reveal microsatellites over 30 repeats
Can. J. Fish. Aquat. Set., Vol. 51. 1994
ACGT
Fig. 3. PCR amplified alleles from 48 Atlantic cod individuals at microsatellite locus Gmo2. The sizes of the PCR products were esti
mated by co-migration with those of the M13 sequence fragments shown at either side of the cod samples.
in length. Screening at a lower stringency should enhance
detection of long, short, and more degenerate dinucleotide
arrays, but there is no evidence of this in the literature.
In all animal species, (dG-dT),, microsatellites predomi
nate with the next abundant type being (dG-dA),,. For this
reason, we chose to focus our search on (dG:dT)w. We only
succeeded in isolating two (dG-dA),, microsatellites, detected
by a (GT)n probe; these were found in clones containing a
(dG-dT)w array. The proportions of the different classes of
repeat types differ between species, with the highest per
centage of imperfect repeats found in cod and rainbow trout.
Proportions of perfect and imperfect microsatellites isolated
from Atlantic salmon (Slettan et al. 1993) more closely
resemble the mammalian genomes, except for the absence
of compound repeats, than do microsatellites from Atlantic cod
and rainbow trout (Table 1). An even more striking difference
between mammalian and cold-water fish microsatellites is
the most common size class of the microsatellite arrays.
Sixty-four percent of the smallest class of arrays (contain
ing 6-11 repeats) in cod were separated by four or five bases
adjacent to a larger repeat in the same clone. They were
classified as separate repeats because of the arbitrary defin
ition of Weber (1990). If these shorter microsatellites are
viewed as part of a larger degenerate microsatellite array,
and the criteria which define an imperfect repeat are altered
to incorporate four or five intervening bases rather than three,
then the most common size class for cod microsatellites
would be at least 30-35 repeats, approximately twice the
size of those reported in mammalian genomes. Regardless
of this, the next abundant size classes for cod and Atlantic
salmon are 30-35 and 21-24 dinucleotide repeats, respec
tively, significantly larger than the most common size classes
reported for mammals. This conclusion is consistent with
the size of microsatellites found in rainbow trout (unpub
lished data), where smaller adjacent repeats were not found.
The absolute lengths of the largest microsatellite arrays
in cod, rainbow trout, and Atlantic salmon (Slettan et al.
1993) are also greater than those reported in mammalian
genomes. In a study of microsatellite arrays from zebrafish,
although not providing data on all repeat arrays, three of 16
sequences were over 30 repeat units in length, the longest
consisting of 38 dinucleotide units (Goff et al. 1992). Data
were statistically analyzed to see if this difference in
Can. J. Fish. Aquat. ScL. Vol. 51, 1994
Table 4. Allele size and heterozygosity at Atlantic cod
microsatellite loci detected by PCR amplification.
Locus
Gmol
Gmo2
Gmo9
GmolO
Gmol 23
Gmol 32
Gmol45
No. of
alleles
8
15
38
1
46
15
26
Size
(bp)a
94-114
106-140
116-246
214
137-239
109-155
123-217
% heterozygosity
14
82
92
0
89
85
92
Estimated size of the PCR fragment when compared
with M13 sequence fragments of known length.
microsatellite length was significant for fishes and mam
mals. A chi-squared test was inappropriate because of the
varying size classes used by the different authors to group
data from the different species. A new data set was con
structed by taking the midpoint of the microsatellite length
of each size class and then weighting the length data by
frequency of the class. Bartlett's test for homogeneity of
variance showed that the fish data were more variable than
the mammal data (P < 0.001). A Kruskal-Wallis nonpara-
metric analysis on the pooled mammal and fish data demon
strated that the maximum lengths of fish and mammalian
microsatellites are highly different (P < 0.001).
Also noteworthy, the longest microsatellite arrays in the
cod genome consisted of perfect dinucleotide repeats. In
rainbow trout, six of seven microsatellites longer than
47 repeat units were perfect motifs. Cod, rainbow trout, and
possibly Atlantic salmon and zebrafish microsatellites are
longer and more degenerate than similar mammalian
sequences. Cod and rainbow trout microsatellites are similar
in length, but there is a higher proportion of imperfect
repeats in the cod, often with smaller degenerate repeats
adjacent to larger ones.
Inheritance and Allele Frequencies at Seven Microsatellite
Loci ,
PCR primers were designed from flanking regions for a
number of microsatellites and conditions such as temperature,
1963
©4 98 102 106 110 114
106
B
Fig. 4. Allele frequency distribution of alleles at microsatellite loci from Atlantic cod from the Northwest Atlantic Ocean (n127). Microsatellite loci: (A) Gmol, (B) Gmo2, (C) Gmo9, (D) Gmol23, (E) Gmol32, and (F) Gmol45.
radiolabeling (end labeling versus incorporation), and mag
nesium concentration were optimized (Table 2). Seven primer
pairs produced a product readily scored. All seven loci exhib
ited stable inheritance of alleles by PCR assay in three unre
lated, two-generation pedigrees of Atlantic cod (Table 3).
Figure 2 demonstrates that even the most complex fragments
produced by PCR of the Gmo9 locus can be readily scored.
Genomic DNA samples from 127 fish caught off the coast
of Nova Scotia in the western North Atlantic were used to
1964
amplify seven microsatellites with the primer pairs described
in Table 2. Alleles for each individual were scored by com
paring their migration with sequence fragments from M13 of
known length (Fig. 3). The seven microsatellites varied
widely in the degree of polymorphism exhibited (Table 4).
Of the seven microsatellites, six were polymorphic, with 8-46
alleles and heterozygosities of 14-92%. Five of th'e micro-
satellites exhibited heterozygous arrays in excess of 80%. It is
interesting that the only locus which was monomorphic
Can. J. Fish. Aquat. ScL, Vol. 51, 1994
(GmolO; (GT)5N2(GT)3N2(GT)2N2(GT)44 consists of a long
imperfect repeat, while all of the short repeats such as Gmo2,
Gmol 32, and Gmol45 were highly polymorphic. This result
suggests that factors other than overall length of a microsatel
lite array, as suggested by Weber (1990), may determine
whether a microsatellite will exhibit allelic variation (see
Wright 1993).
The allele frequency distributions varied markedly for
the seven microsatellite loci assayed (Fig. 4; Table 4). Some
microsatellites showed wide variation in allele size (e.g.,
Gmo9 and Gmol23). Most allele distributions were uni-
modal, but those of Gmo9 and Gmol32 appeared weakly
and strongly bimodai), respectively (Fig. 4; Table 4). Similar
allele frequency distributions, reported for human microsatel
lites (Valdes et al. 1993), have been attributed to a stepwise
mechanism for the generation of new alleles. Presumably,
a similar mechanism is probably responsible for the gener
ation of allelic variation at microsatellite loci in cod.
In order to determine the evolutionary conservation of
microsatellite loci in related gadids, PCR amplification was
attempted with the seven primer pairs for the Atlantic cod
microsatellite loci on genomic DNA from the following
species: haddock (Melanogrammus aeglefinus), pollock
(Pollachius virens), and white hake (Urophycis tenuis). PCR
products were detected in all of the gadid samples for the loci
Gmol, Gmo2, and Gmo9, while GmolO, Gmol32, and
Gmol45 produced fragments in only haddock and pollock.
No products were detected after PCR with the Gmol23
primers. Although only three individuals for each species
were assayed, all of the loci except GmolO and Gmol23
appeared to be polymorphic for at least one of the other
gadids (data not shown).
General Discussion
We report here the characterization of a group of micro-
satellites from Gadus morhua. It appears that the microsatel
lites in teleost fishes differ significantly in length and com
position from those of mammals. We can only speculate as
to why the microsatellites found in Atlantic cod, rainbow
trout, and Atlantic salmon are larger than microsatellites
from mammals. It remains to be seen, with subsequent iso
lation of microsatellites from other species, whether poikilo-
therms consistently have longer arrays than homiothermic
species. The longer, degenerate microsatellite arrays found
in teleost fishes may be caused by a predisposition to slip
page owing to fluctuations in temperature. The DNA poly-
merase may not function efficiently when subjected to
changes in temperature of several degrees, or temperatures
close to freezing, such as occur in the Atlantic cod envi
ronment and that of the salmonid species.
Disparities in genomic composition between warm- and
cold-blooded vertebrates have already been established by
Bernardi and Bernardi (1990). They found that fish iso-
chores were much more homogeneous and lower in GC con
tent than those of warm-blooded vertebrates. A subsequent
study by Cross et al. (1991) makes a further distinction
between fish genomes and other warm- and cold-blooded
vertebrates with regard to the GC-poor CpG islands that
are associated with some genes. It is possible that different
evolutionary forces are acting on the fish genome, gener
ating longer repeat arrays at microsatellite loci, or con
versely, shorter arrays in the genomes of mammals.
Can. J. Fish. Aquat. Sci, Vol. 51. 1994
Using PCR and primers specific to the nonrepetitive
flanks, we have successfully amplified from cod genomic
DNA seven of the cod microsatellites isolated in this study.
Five of these microsatellites exhibit heterozygosities greatly
in excess of those reported for allozyme or mitochondrial
studies. The average heterozygosity of these microsatellite
markers is H = 0.85 compared with H = 0.071 for allozymes
(Mork et al. 1985) and an equivalent measure for nucleon
diversity, h = 0.36, for mitochondrial genotypes (Carr and
Marshall 1991).
Breeding and population studies of Atlantic cod utilizing
these microsatellites as genetic markers are in progress in our
laboratory. The family Gadidae to which Atlantic cod belong
includes many other economically important species. Our
results have shown that the microsatellites reported here
may prove useful as genetic markers in other gadid species,
similar to a study previously carried out on domestic mam
mals (Moore et al. 1991). Furthermore, a comparative study
of the conservation and polymorphism of these microsatellite
loci in related gadids may elucidate the mode and tempo
of mutation at these fascinating sequences (Bentzen and
Wright 1993; Wright 1993, 1994).
Acknowledgements
Technical assistance was provided by Michiko Fllipak, Linda
Aitkinson, Barbara Edgar, and Harvey Domoslai. Rainbow trout
microsatellite data were provided by Dianne Morris prior to
publication. DNA from cod pedigrees was a generous gift of
Geir Dahle, Institute of Marine Research, Bergen, Norway. This
work was supported by funds from the Ocean Production
Enhancement Network (OPEN), the Nova Scotia government,
Department of Industry Trade and Technology, to the Marine
Gene Probe Laboratory, and the Natural Sciences and Engineer
ing Research Council of Canada to J.M.W. OPEN fellowship
support to A.L.B. is also acknowledged.
References
Beckmann, J.S., and J.L. Weber. 1992. Survey of human and rat microsatel
lites. Genomics 12: 627-631.
Bentzen, P., and J.M. Wright. 1993. Nucleotide sequence and evolu
tionary conservation of a minisatellite variable number tandem repeat
cloned from Atlantic salmon, Salmo salar. Genome 36: 271-277.
Bentzen, P., A.S. Harris, and J.M. Wright. 1991. Cloning of hyper-
variable minisatellite and simple sequence microsatellite repeats for
DNA fingerprinting of important aquacultural species of salmonids
and tilapia, p. 243-262. In T. Burke, G. Dolf, A.J. Jeffreys, and R. Wolf
[ed.] DNA fingerprinting: approaches and applications. Birkhauser
Verlag, Basel, Switzerland.
Bernardi, G., and G. Bernardi. 1990. Compositional transitions in the
nuclear genomes of cold-blooded vertebrates. J. Mol. Evol. 31:
2856-2867.
Carr, S.M., and H.D. Marshall. 1991. Detection of intraspecific DNA
sequence variation in the mitochondrial cytochrome b gene of Atlantic
cod {Gadus morhua) by the polymerase chain reaction. Can. J. Fish.
Aquat. Sci. 48: 48-52.
Cross, S., P. Kovarik, J. Schmidtke, and A. Bird. 1991. Nonmethylated
islands in fish genomes are GC-poor. Nucleic Acids Res. 19: 1469-1474.
Ellegren, H., M. Johansson, K. Sandberg, and L. Andersson. 1992.
Cloning of highly polymorphic microsatellites in the horse. Anim.
Genet. 23: 133-142.
Estoup, A., P. Presa, F. Krieg, D. Vaiman, and R. Guyomard. 1993.
(CT)n and (GT)fl microsatellites: a new class of genetic markers for
Salmo trutta L. (brown trout). Heredity 71: 488-496.
Estoup, A., M. Solignac, M. Harry, and J-M. Cornuet. 1993. Charac
terization of (GT)n and (CT)n microsatellites in two insect species:
Apis mellifera and Bombus terrestris. Nucleic Acids Res. 21: 1427-1431.
Goff, D.J., K. Galvin, H. Katz, M. Westerfield, E. Lander, and
C.J. Tabin. 1992. Identification of polymorphic simple sequence
repeats in the genome of the zebrafish. Genomics 14: 200-202.
1965
Hinegardner,RMandD.E.Rosen.1972.CellularDNAcontentandtheevolutionofteleosteanfishes.Am.Nat.106:621-644.
Lmr,M.,andJ.A.Luty.1989.Ahypervariablemicrosatelliterevealedbyinvitroamplificationofadinucleotiderepeatwithinthecardiacmuscleactingene.Am.J.Hum.Genet44:397-401.
Love,J.,A.Knight,M.McAleer,andJ.Todd.1990.TowardsconstructionofahighresolutionmapofthemousegenomeusingPCR-analysedmicrosatellites.NucleicAcidsRes.18:4123-4130.
Moore,S.S.,L.L.Sargeant,T.J.King,J.S.Mattick,M.Georges,andD.J.Hetzel.1991.TheconservationofdinucleotidemicrosatellitesamongmammaliangenomesallowstheuseofheterologousPCR
primerpairsincloselyrelatedspecies.Genomics10:654-660.Mork,J.,R.Ryman,G.Stahl,F.Utter,andG.Sundnes.1985.Genetic
variationinAtlanticcod(Gadusmorhua)throughoutitsrangeCanJ.Fish.Aquat.Sci.42:1580-1587.
Ostrander,E.A.,J.F.SpragueJr.,andJ.Rine.1993.Identificationandcharacterizationofdinucleotiderepeat(CA)nmarkersforgeneticmappingindog.Genomics16:207-213.
Slettan,A.,I.Olsaker,andO.Lie.1993.Isolationandcharacterizationof(GT)flrepetitivesequencesfromAtlanticsalmon,SalmosalarLAnim.Genet.24:195-197.
Stallings,R.L.,A.F.Ford,D.C.Torney,C.E.Hildebrand,andR.K.Moyzis.1991.Evolutionanddistributionof(GT)nrepetitivesequencesinmammaliangenomes.Genomics10:807-815.
Tautz,D.1989.HypervariabilityofsimplesequencesasasourceforpolymorphicDNAmarkers.NucleicAcidsRes.17:6463-6471
Valdes,A.M.,M.Slatkin,andN.B.Freimer.1993.Allelefrequenciesatmicrosatelliteloci:thestepwisemutationmodelrevisited.Genetics
Weber,J.L.1990.Informativenessofhuman(dC-dA)(dG-dT)polymorphisms.Genomics7:524-530."
Weber,J.L.,andP.E.May.1989.AbundantclassofhumanDNApolymorphismwhichcanbetypedusingthepolymerasechainreactionAm.J.Hum.Genet.44:388-396.
Westneat,D.F.,W.A.Noon,H.K.Reeve,andC.F.Aquadro.1988.ImprovedhybridizationconditionsfromDNA"fingerprints"probedwithMl3.NucleicAcidsRes.16:4161.
Wintero,A.K.,M.Fredholm,andP.B.Thomsen.1992.Variable(dG-dT)n.(dC-dA)nsequencesintheporcinegenome.GenomicsP281-288.
Wright,J.M.1993.DNAfingerprintingoffishes,p.57-91.InP.W.HochachkaandT.Mommsen[ed.]BiochemistryandmolecularbiologyoffishesVol.2.ElsevierSciencePublishersB.V.Amsterdam,TheNetherlands
Wright,J.M.1994.MutationatVNTRs:areminisatellitestheevolutionaryprogenyofmicrosatellites.Genome37:345-347
Wright,J.M.,andP.Bentzen.1994.Microsatellites:geneticmarkersforthefuture.InG.R.CarvalloandT.J.Pitcher[ed.]Rev.FishBiolFish.4:384-388.
1966Can.J.Fish.Aquat.Sci.Vol.51,1994