university of groningen divergence and adaptive capacity ... · clapham et al. 1999; baker &...

University of Groningen

Divergence and adaptive capacity of marine keystone speciesFietz, Katharina

IMPORTANT NOTE: You are advised to consult the publisher's version (publisher's PDF) if you wish to cite fromit. Please check the document version below.

Document VersionPublisher's PDF, also known as Version of record

Publication date:2017

Link to publication in University of Groningen/UMCG research database

Citation for published version (APA):Fietz, K. (2017). Divergence and adaptive capacity of marine keystone species. [Groningen]:Rijksuniversiteit Groningen.

CopyrightOther than for strictly personal use, it is not permitted to download or to forward/distribute the text or part of it without the consent of theauthor(s) and/or copyright holder(s), unless the work is under an open content license (like Creative Commons).

Take-down policyIf you believe that this document breaches copyright please contact us providing details, and we will remove access to the work immediatelyand investigate your claim.

Downloaded from the University of Groningen/UMCG research database (Pure): http://www.rug.nl/research/portal. For technical reasons thenumber of authors shown on this cover page is limited to 10 maximum.

Download date: 09-09-2020

https://www.rug.nl/research/portal/en/publications/divergence-and-adaptive-capacity-of-marine-keystone-species(9366d060-bb36-490a-8f7e-c970956ca18a).html

https://www.rug.nl/research/portal/en/publications/divergence-and-adaptive-capacity-of-marine-keystone-species(9366d060-bb36-490a-8f7e-c970956ca18a).html

60

Chapter 3

Genetic insights into the extent of gene flow between the two

known humpback whale (Megaptera novaeangliae) breeding

grounds in the North Atlantic, and into the effective size of

the breeding population in Cape Verde

61

Genetic insights into the extent of gene flow between the two known humpback whale

(Megaptera novaeangliae) breeding grounds in the North Atlantic, and into the effective

size of the breeding population in Cape Verde

Katharina Fietz1, 2*, Martine Bérubé1, Conor Ryan3,4,5, Simon D. Berrow3,4, Pedro López-

Suárez6, Frederick W.Wenzel7, Per J. Palsbøll1*

1 Marine Evolution and Conservation, Faculty of Science and Engineering, University of

Groningen, Nijenborgh 7, 9747 AG Groningen, The Netherlands

2 Natural History Museum of Denmark, University of Copenhagen, Section for Evolutionary

Genomics, Øster Voldgade 5-7, 1350 Copenhagen Denmark 3 Irish Whale and Dolphin Group, Merchant’s Quay, Kilrush, Co. Clare, Ireland 4 Marine and Freshwater Research Group, Galway-Mayo Institute of Technology, Dublin Road,

Galway, Ireland 5 Current address: Hebridean Whale and Dolphin Trust, 28 Main Street, Tobermory, Isle of

Mull, PA75 6NU, United Kingdom 6 BIOS.cv, C.P. 100, Sal Rei, Boa Vista, Republic of Cape Verde 7 NOAA, National Marine Fisheries Service, Northeast Fisheries Science Center, 166 Water

St., Woods Hole, Ma. 02543 USA

Keywords Genetic connectivity - breeding ground - effective population size - Cape Verde -

humpback whale

Corresponding authors*:

Katharina Fietz, [email protected], Natural History Museum of Denmark, University of

Copenhagen, Section for Evolutionary Genomics, Øster Voldgade 5-7, 1350 Copenhagen,

Denmark

Per J. Palsbøll: [email protected], Faculty of Science and Engineering, Groningen Institute for

Evolutionary Life Sciences, Nijenborgh 7, 9747 AG Groningen, The Netherlands

62

Abstract

Knowledge of the genetic connectivity and effective population size is important for

conservation management of species whose populations have undergone extensive declines.

Humpback whales (Megaptera novaeangliae) in the North Atlantic have undergone a drastic

depletion during the 19th and 20th century due to heavy whaling activity and only within the last

60 years have had a chance to recover. Of the two known North Atlantic breeding grounds, the

one in the Cape Verde Archipelago has previously been estimated to be home to only ca. 100

individuals and is thought to be reproductively isolated. Using two types of neutral genetic

markers (up to 20 microsatellite loci and a fragment of the mitochondrial control region), we

inferred the genetic connectivity among Cape Verde and the only other known North Atlantic

breeding ground in the West Indies. We further provide an estimate of Cape Verde's

contemporary effective population size (Ne). Genetic divergence estimates between Cape Verde

and the West Indies are of the same order of magnitude as between different ocean basins, and

the Cape Verde population is currently at or below a minimum viable population size. An

Approximate Bayesian Computation (ABC) investigation of Cape Verdes demographic

population history supports a population decline in the past predating anthropogenic exploitation.

While Cape Verde's humpback whale population is isolated from the West Indies breeding

ground, our data suggest that it might be part of a larger, yet undiscovered Eastern Atlantic

breeding population.

Introduction

Anthropogenic impacts on the earth's ecosystems are steadily expanding and are the main causes

for population declines in many taxa (Morrison et al. 2007; Frankham et al. 2009). Reductions in

population sizes typically entail a (temporary) limitation of gene flow and an associated increase

in isolation. Small isolated populations often experience a loss of genetic diversity and as a result

are particularly prone to the impacts of genetic stochasticity. It is worth noting that there is no

straightforward causal link between decreased genetic diversity and inbreeding depression

following a population reduction (Bouzat 2010). Populations of some wild species display

remarkably low genetic diversity without showing signs of inbreeding depression (Obrien et al.

1985; Merola 1994; Castro-Prieto et al. 2011). Nonetheless, the loss of genetic diversity is a

concern as reduced polymorphism increases the likelihood of inbreeding depression and in the

long term lowers the evolutionary potential of a population to respond to environmental change

(Fisher 1930; Wright 1931; Reed & Frankham 2003). Both the intensity and duration of a

bottleneck thereby impact population survival and fitness. The large changes in allele

frequencies that are associated with small populations will entail an increase in the frequency of

deleterious alleles. The longer a bottleneck lasts, the more frequent these deleterious alleles may

become over time, see e.g. Trimble and Keeler (1938).

63

In order to incorporate information on population size and the level of isolation in a conservation

management framework, critical levels of these parameters need to be quantified. We may draw

on genetic data to do so, and to thereby delineate appropriate population units for conservation

(Palsboll et al. 2007). Regarding population size, an ongoing debate concerns the concept of a

minimum viable population size (MVP), the 'population size required to provide some specified

probability of persistence for a given time period' (Flather et al. 2011). According to Franklin

(1980), as a rule-of-thumb, the effective size of a population should not be smaller than 50 in the

short-term (five generations) to avoid inbreeding depression. This rule has been subjected to

much controversy (Franklin & Frankham 1998; Frankham 2005; Jamieson & Allendorf 2012;

Frankham et al. 2014). A recent perspective recommends a revision of the short-term minimum

estimate of 50, and suggests that a short-term minimum viable Ne ≥ 100 may actually be needed

to avoid inbreeding depression (Frankham et al. 2014). The authors further suggest setting the

approximation to retain evolutionary potential to at least Ne ≥ 1000. Populations whose effective

sizes range around these threshold levels should receive particular attention by conservation

managers. As part of this MVP debate, the level of isolation between populations need be taken

into account as well. Also termed connectivity, its extent impacts population persistency

(Frankham et al. 2014). It has been shown that as little as one migrant per generation is sufficient

to avoid harmful effects of genetic drift and inbreeding (Wright 1951). This one-migrant-per-

generation rule is no threshold above which there will be no harmful effects (Mills & Allendorf

1996). Rather, it is based on the expectation that a significant reduction in the harmful effects of

inbreeding can be achieved with one migrant per generation (Lowe & Allendorf 2010). Lowe

and Allendorf (2010) termed this 'inbreeding connectivity' and it serves the purpose that many

conservation managers will have in mind.

In the marine environment, physical barriers to movement are often absent and organisms with

strong dispersal abilities may be expected to exhibit high connectivity patterns throughout their

range of occurrence. Humpback whales (Megaptera novaeangliae) are a prime example of

marine long-distance travelers with annual migration routes of > 8,000km (Darling et al. 1996;

Rasmussen et al. 2007; Stevick et al. 2011). They spend the summer months in high-latitude

areas for feeding while they migrate to low-latitude areas for breeding and calving during the

winter (Dawbin 1966; Stone et al. 1990). In the North Atlantic, humpback whales during the

winter mating season aggregate in two known subtropical breeding grounds in the Caribbean

West Indies and off the West-African coast in Cape Verde (Jann et al. 2003). During the summer

months, they migrate to several distinct feeding grounds in the Gulf of St. Lawrence, the Gulf of

Maine, off Western Greenland, around Iceland, and in the Barents Sea (Katona & Beard 1990;

Clapham et al. 1993a). Individual humpbacks thereby show a high degree of maternally

transmitted site fidelity to their feeding ground, with little exchange between these areas (Katona

& Beard 1990). Over many generations, such site fidelity can lead to population subdivision

even in highly migratory species due to reproductive isolation; indeed, the complex repertoire of

64

behaviors in humpback whales has been shown lead to significant population structure even

within a single ocean (Baker et al. 1990; Palsbøll et al. 1995). It may therefore be expected that

we find small local populations among humpback whales besides their dispersal abilities.

Small populations are no rarity in baleen whales (suborder Mysticeti), and a range of species

today range in low numbers. The gray whale (Eschrichtius robustus) population in the Western

North Pacific is one example that has been estimated at only 140 individuals (95% confidence

interval (CI) 128 - 152) in 2012 (Cooke et al. 2013a). It is listed as Critically Endangered by the

IUCN Red List of Threatened Species (Reilly et al. 2008b). Similarly, the blue whale

(Balaenoptera musculus) population of the Western North Atlantic is estimated to hold a

minimum of 440 individuals (NOAA 2010). The species as a whole has been estimated to have

been depleted by 70 – 90% over the last three generations, and today is listed as Endangered

(Reilly et al. 2008a). Among humpback whales, the Arabian Sea subpopulation is one of the

smallest estimated populations to date. It displays a unique non-migratory behavior and resides

off Oman year-round. The current abundance estimate is 82 individuals (95% CI 60 – 111)

(Minton et al. 2011).

For humpback whales as for most of the baleen species, the main cause of population decline can

be attributed to the extensive whaling that took place throughout the last centuries (Best 1993;

Clapham et al. 1999; Baker & Clapham 2004). Humpback whales in the North Atlantic have

been subject to whaling activities since at least the 17th century (Stevick et al. 2003). During the

19th century, over 2,000 humpback whales were taken by the American whaling fleet (Smith &

Reeves 2003), and between 1885 and 1910, close to 5,000 individuals were caught off Norway

and Iceland (Ingebrigtsen 1929; Sigurjónsson 1988). Recent estimates assume the numbers of

landings and total removals of humpback whales from the North Atlantic to be 21,476 (95% CI

21,257 – 21,895) and 30,842 (95% CI 29,558 – 32,126), respectively (Smith & Reeves 2010). It

has been estimated that the North Atlantic humpback whale population was reduced to < 1,000

individuals before protection measures were put into action (Mitchell & Reeves 1983; Katona &

Beard 1990). A study based on coalescent models for mitochondrial DNA even estimated a 95%

decline in the North Atlantic humpback whale population pre- versus post-whaling (Roman &

Palumbi 2003). Due to this drastic depletion, humpback whales have been protected since 1955

(Best 1993). Within the last few decades, the census size has steadily increased (Stevick et al.

2003). A series of abundance estimates from capture-recapture data for the breeding population

in the West Indies has shown an increase of 3.5% individuals per year during the period 1979 -

1993 with the most precise estimate of 10,700 individuals (coefficient of variation = 0.068) in

1992/93 (Stevick et al. 2003). From the only other known, possibly genetically differentiated,

North Atlantic breeding population in Cape Verde, only a single census size point estimate for

the entire area is available rating the abundance at around 100 individuals (Punt et al. 2006). Due

to this low census size, the Cape Verde breeding population may be vulnerable to threats faced

65

by small populations unless there is gene flow between Cape Verde and another breeding

ground. To date, no quantification of gene flow between the two known breeding grounds in the

North Atlantic exists, and the degree of genetic differentiation between Cape Verde and West

Indies humpback whales remains unknown.

In this study, we set out to assess the degree of gene flow between individuals sampled at the two

known North Atlantic breeding grounds. We further determined the effective population size

(Ne) of humpback whales frequenting the Cape Verde Archipelago, and estimated their

demographic history. Our results suggest that the amount of genetic exchange between the

known breeding grounds in the West Indies (off the Dominican Republic) and Cape Verde is

indeed very limited. However, some data point to a potentially heterogeneous use of Caribbean

waters, which would entail a yet unknown separation between whales in different parts

throughout the Caribbean. Having likely undergone several historic and more recent population

declines, the Cape Verde population has been small throughout history, and today may be close

to or below a minimum viable size. Some data suggest though that Cape Verde might in fact be

part of a larger Eastern Atlantic breeding range.

Materials and Methods

Sample collection and laboratory methods

Skin biopsy (Lambertsen 1987) and sloughed skin samples (Clapham et al. 1993b) were

collected from free-ranging humpback whales in the West Indies in 2004-2005. In the Cape

Verde Archipelago, individuals were biopsy-sampled in 2012-2013. Samples were conserved in

a saturated NaCl solution with 20% dimethylsulphoxide (Amos & Hoelzel 1991) and were stored

at either -20°C or -80°C.

Total cell-DNA was extracted from all samples following the protocol in Palsbøll et al. (1997).

We PCR-amplified and genotyped 10 microsatellite loci (EV096, GATA028, GATA053,

GATA098, GATA417, GT015, GT211, GT271, GT575, TAA031) in all samples, and an

additional 10 microsatellite loci (AC087, EV001, EV037, EV094, GGAA520, GT011, GT023,

GT101, GT195, GT307) in all Cape Verde samples. The microsatellites had di-, tri- or tetramer

repeat motifs, respectively. For amplification and genotyping of six loci (GATA028, GATA053,

GATA098, GATA417, GGAA 520 and TAA031), we followed the description of Palsbøll et al.

(1997). EV001, EV037, EV094, and EV096 were amplified and genotyped as described by

Valsecchi and Amos (1996). AC087, GT211, GT271, and GT575 were amplified and genotyped

as described by Bérubé et al. (2005). GT023, GT101, GT195, and GT307 were amplified and

genotyped as described by Bérubé et al. (2000). For amplification and genotyping of GT011 we

followed the protocol of Bérubé et al. (1998). GT015 was amplified and genotyped as described

66

by Andersen et al. (2001). Further, part of the mitochondrial control region (mtDNA) was

amplified and sequenced following Palsbøll et al. (1995), and a 288 bp fragment was used for

analyses. Finally, the genetic sex of each sample was determined following the protocol as

described by Bérubé and Palsbøll (1996).

Data analyses

Multiple samples from the same individual were identified by aligning and visually comparing

data with no mismatches allowed in GENEIOUS ver. 6.0.6 (Drummond et al. 2009) (mtDNA) and

in Microsoft EXCEL (Microsoft 2007) (microsatellite data). The number of samples from different

individuals with identical genotypes across 10 loci arising by chance was estimated from the

probability of identity (I) (Paetkau & Strobeck 1994) using the software GENECAP ver. 1.4

(Wilberg & Dreher 2004). If all microsatellite and mtDNA markers were identical, the samples

were then identified as coming from the same individual. Mother-calf-dyads were visually

identified during sample collection, and all duplicate samples and calves were removed from the

dataset before further analysis. All analyses of microsatellite data were conducted with a dataset

of 10 microsatellite loci unless stated otherwise.

Genetic diversity, Hardy-Weinberg proportions, linkage disequilibrium and neutrality

For mtDNA, we determined the number of unique control region haplotypes, haplotype

diversity, nucleotide diversity (π) (Nei & Li 1979), and the neutrality test Tajima's D (Tajima

1989) for the West Indies and Cape Verde separately and combined using ARLEQUIN ver. 3.5

(Excoffier & Lischer 2010). For microsatellite data, we used Fstat 2.9.3 (Goudet 1995) to

estimate allelic richness (AR), a measure of allelic diversity corrected for sample size. The

package adegenet (Jombart 2008) implemented in R Software (R Development Core Team 2015)

was used to determine the observed (Hobs) and expected heterozygosity (Hexp) (Nei 1978) per

locality and overall. We tested differences in genetic diversity between breeding grounds using a

Wilcoxon signed-rank (WSR) test for paired samples (Wilcoxon 1945) also implemented in R

Software. Linkage disequilibrium (LD) and FIS, a measure of deviation from Hardy-Weinberg

equilibrium (HWE), estimated following Weir and Cockerham (1984), were assessed using

GENEPOP ver. 4.2.1 (Rousset 2008). We tested for LD among loci for each population and for

deviations from HWE using a Monte Carlo Markov Chain approach. For each assessment, the

number of dememorizations was set to 10,000, the number of batches to 1,000, and the number

of iterations to 10,000. Sequential Bonferroni corrections (Holm 1979; Rice 1989) were

conducted for both LD and HWE calculations, and statistical significances are reported after

sequential Bonferroni corrections.

67

Genetic differentiation between North Atlantic breeding grounds

We assessed the genetic differentiation between Cape Verde and the West Indies with

hierarchical analyses of molecular variance (AMOVA) conducted in ARLEQUIN 3.5 (Excoffier &

Lischer 2010). We estimated pairwise differentiation using FST (Weir & Cockerham 1984) for

mtDNA and microsatellite data and 95% CIs were computed over 20,000 bootstraps. In order to

determine whether samples from different years and between genders may be pooled, we

estimated pairwise differentiation using FST (Weir & Cockerham 1984) between years and

gender. For each breeding ground, we therefore used randomly drawn equal numbers of samples

per year and per gender, respectively. We tested whether FST estimates were significantly larger

than the range expected under panmixia using 10,000 permutations.

Recent effective population size estimates

We estimated the recent Ne’s of humpback whales at both known North Atlantic breeding

grounds one generation ago from microsatellite data with the LD method (Hill 1981; Waples

2006) implemented in NEESTIMATOR ver.2 (Do et al. 2014). We used the random mating system

setting, and estimated Ne bounded by 95% CI. Critical values of allele frequencies to screen out

rare alleles were set to pcrit > 1/(2S) (S = the number of individuals with data at both pairs of

loci), so as to ensure that alleles which occur in only a single copy in the sample were excluded

(Do et al. 2014). We excluded all samples with incomplete genotypes from the West Indies

dataset, and all loci with incomplete genotypes from the Cape Verde dataset.

Detection and quantification of population size change

We inferred the demographic history of humpback whales in Cape Verde using a coalescent-

based Approximate Bayesian Computation (ABC) approach (Beaumont et al. 2002; Csillery et

al. 2010) implemented in the software DIYABC ver. 2.0.4 (Cornuet et al. 2014). We compared

seven demographic models (Fig. 1). S1 consists of a null hypothesis assuming a constant Ne

through time; S2 – S4 are models that assume a recent bottleneck event. S2 assumes a historic

effective size (N1) that declined to its current Ne t1 generations ago; S3 assumes a historic

bottleneck t2 generations ago with a change from population size N3 to N2, followed by a

second, more recent bottleneck t1 generations ago to current Ne; S4 assumes a historic expansion

t3 generations ago from effective population size N5 to N4, and a recent bottleneck t1

generations ago to current Ne. S5 also assumes a bottleneck, but more historically (t3 generations

ago) from a historic effective population size N1 to current Ne. The final class of models (S6 and

S7) assumes population expansions: S6 assumes a historic effective size (N6) that increased to

its current Ne t1 generations ago; and S7 assumes a historic bottleneck event t3 generations ago

68

with a change from population size N7 to N6, followed by a recent expansion t1 generations ago

to current Ne.

Approximate Bayesian computations

For each model, we simulated 1 million datasets based on the demographic history that describes

the respective model. The parameters defining each model were considered as random variables

that follow a predefined prior distribution (Table 5). We determined these priors based on initial

DIYABC analyses to ensure that the distributions encompassed the parameter distribution. We

jointly and separately analyzed a set of 20 microsatellite loci and a 288 bp fragment of the

mtDNA control region. The microsatellite data were assumed to follow a generalized stepwise

mutation model (Estoup et al. 2002) with two parameters: the mean mutation rate (µ) and the

mean parameter of the geometrical distribution assumed for the length in repeat numbers of

mutation events (P) drawn from uniform distributions (µ: 10-5 – 10-3; P: 0.1 – 0.3). All but two

loci had a possible range of 40 contiguous allelic states; GATA417 and GGAA520 had a larger

possible range of 60 contiguous allelic states. Each locus was characterized by individual µ and

P values drawn from gamma distributions with respective means µ and P, and shape parameters

of 2. For mtDNA, we selected the Hasegawa-Kishino-Yano (HKY) DNA substitution model

(Hasegawa et al. 1985) as empirically determined in JMODELTEST ver. 2.1.5 (Darriba et al. 2012)

and assumed a per-site and generation µ ranging uniformly between 10-7 and 10-5. We further

used the proportion of invariant sites of 63% and the shape of the gamma distribution of

mutations among sites equal to 0.23 likewise determined by JMODELTEST.

Summary statistics

We calculated a range of summary statistics for each competing demographic model: For

mtDNA data, the number of haplotypes, the mean of pairwise differences, Tajima’ D (Tajima

1989), and the mean number of the rarest nucleotide at segregating sites were estimated. For

microsatellite data, the mean number of alleles and the mean Garza-Williamson M-ratio (Garza

& Williamson 2001) were estimated.

Model choice procedure and performance analyses

The posterior probability of each demographic model was estimated using a polychotomous

logistic regression (Cornuet et al. 2008; Cornuet et al. 2010) on the 1% of simulated datasets

closest to the observed dataset. In order to evaluate the confidence in model choice, type I error

rates were estimated following the method described in (Cornuet et al. 2010).

Parameter estimation and model checking

Based on the best demographic model, we estimated posterior parameter distributions using a

logit transformation of parameter values and subsequently conducted a local linear regression on

the 1% of closest simulated parameters. Finally, following Gelman et al. (1995), we evaluated

69

the ability to reproduce the observed data as described in Cornuet et al. (2010). The test

quantities chosen to do so for microsatellite loci were the mean genic diversity (= mean Hexp) and

the mean allelic size variance. For mtDNA data, we calculated the number of segregating sites,

the variance of pairwise differences, the number of private sites, and the variance of numbers of

the rarest nucleotide at segregating sites for mtDNA data.

Results

After removal of duplicate samples and calves, our microsatellite dataset consisted of 1,745

individual humpback whales sampled in the West Indies and 36 individuals sampled in Cape

Verde; our mtDNA dataset consisted of 1,739 individuals from the West Indies and 36

individuals from Cape Verde. The probability of different individuals having identical genotypes

across 10 microsatellite loci by chance was estimated under the assumption of HWE at 1.62*10-

11 for the West Indies, and at 7.29*10-9 for Cape Verde. Allele frequencies between humpback

whales of different gender and sampled in different years were not significantly different in

either sampling locality (Table 1); we therefore per locality pooled samples of different years

and gender in subsequent analyses.

Genetic diversity, Hardy-Weinberg proportions, linkage disequilibrium and neutrality

We identified 27 mtDNA haplotypes among the West Indies samples and seven among the Cape

Verde samples. All Cape Verde haplotypes were shared with the West Indies. Haplotype

diversity was significantly higher in the West Indies than in Cape Verde and π was almost twice

as high in the West Indies (Table 2). Our observed estimates of Tajima's D did not differ

significantly from the expectation under neutrality for either breeding ground (Table 2).

The microsatellite dataset contained very few missing data (Cape Verde = 0.27%, West Indies =

0.05%, Combined = 0.05%). Genetic diversity estimates, Hardy-Weinberg proportions, and

neutrality test estimates per locus for each breeding ground and for both breeding grounds

combined are presented in Table 3. The nuclear genetic diversity was comparable between Cape

Verde and the West Indies (WSP tests, P-values > 0.05 for all pairwise comparisons). AR over

all loci was also comparable between the two breeding grounds, as were Hobs and Hexp. We failed

to reject HWE expectations in all but two loci in the West Indies after sequential Bonferroni

corrections (GATA028 and GT417). No evidence was found for LD among loci after sequential

Bonferroni corrections.

70

Genetic differentiation between North Atlantic breeding grounds

Genetic differentiation between Cape Verde and the West Indies was highly significant based on

mtDNA and microsatellite data (mtDNA: FST = 0.116 (95% CI = 0.077 - 0.172); microsatellites:

FST = 0.02 (95% CI = 0.012 - 0.029)).

Recent effective population size estimates

After exclusion of all samples with incomplete genotypes from the West Indies dataset and all

loci with incomplete genotypes from the Cape Verde dataset, 1738 West Indies samples

genotyped at 10 loci, and 36 Cape Verde samples genotyped at 17 loci remained for analysis. To

evaluate the influence of rare alleles on the Ne results, we excluded rare frequency alleles at

frequencies (pcrit) smaller than 0.02 through 0.05 for Cape Verde, and smaller than 0.01 through

0.05 for the West Indies. Estimates among pcrit values within the same breeding ground

overlapped (Table 4). For Ne evaluation we selected pcrit = 0.02 with estimates bound by 95%

CIs. Estimates of Ne using the LD method differed widely between the two breeding grounds and

were 20 times higher for the West Indies than for Cape Verde (Table 4). We also estimated Ne

from LD for the West Indies for a dataset with five microsatellite loci including 1,594 additional

samples from 1992/93 to investigate potential differences due to sample size and variation

between time periods (data and results not shown). As no differences were detected (all CIs

overlapped), we reported only results based on 1,738 samples.

Detection and quantification of population size change

To compute the timing and the nature of demographic changes in the Cape Verde breeding

population, all ABC analysis steps were conducted with three datasets: one combining

microsatellite and mtDNA data, and two using only one of these, respectively. We first evaluated

the posterior probability of each competing demographic model. This model checking pointed to

the group modeling a population bottleneck (S2 - S4) in the microsatellite and combined dataset

(Table 6). Using mtDNA data alone, none of the models significantly differed from the observed

data. When including microsatellites, models S2 through S4 received significantly greater

support than the other models, but had very similar posterior probabilities among each other

(Table 7). We next estimated the type I error probability by evaluating the power of the model

choice procedure. The model with the highest posterior probability (S3) was selected in slightly

more than 50% of cases (microsatellite dataset: 52.0%; microsatellite & mtDNA dataset: 51.6%),

leaving a high type I error rate. When taking the group of bottleneck models together (S2 - S4),

the proportion of times that those were not selected dropped to 19.8% or 18.6% for the

microsatellite and combined dataset, respectively (Table 8). The posterior parameter estimates

and their 95% CIs were then computed under the best-supported model (S3) (Table 9). CIs

71

between the three datasets largely overlapped. According to the estimation using both nuclear

and mtDNA data, a historic population with an Ne of 175,000 (95% CI 65,200 – 488,000)

declined to 11,300 breeding whales (95% CI 6,630 – 145,000) approximately 20,200 generations

ago (95 % CI 6,560 – 24,600) and subsequently declined further to a current Ne of 2,550 (95 %

CI 1,780 – 5,700) approximately 4,560 generations ago (95 % CI 1,620 – 14,700). The model

checking procedure provided support that the observed data were plausible under the selected

posterior predictive distribution of S3 as the probabilities Prob (Ssimul < Sobs) for all summary

statistics were not significantly different from 0.5 (Table 10).

Discussion

While humpback whales are among the widest-traveling mammals on earth with annual

migration routes of > 8,000km, see e.g. (Rasmussen et al. 2007; Stevick et al. 2011), they have

developed a complex repertoire of behaviors that can lead to significant population structure

even on relatively small spatial scales (Baker et al. 1990; Palsbøll et al. 1995). To date, the

extent of genetic connectivity between the two known humpback whale breeding grounds in the

North Atlantic was not known and it was suspected that the whales in Cape Verde might form a

small, genetically isolated population (Wenzel et al. 2009).

How much gene flow exists between Cape Verde and the West Indies breeding grounds?

Our study revealed that Cape Verdean humpback whales are considerably differentiated from the

West Indies on an evolutionary time scale. Genetic divergence estimates between the two

breeding grounds suggest very low long-term average gene flow; the matrilineal differentiation

is of the same order of magnitude as genetic differentiation between ocean basins that was

estimated from a global humpback whale dataset from breeding and feeding grounds (Jackson et

al. 2014). This degree of isolation even led Jackson et al. (2014) to propose that different

subspecies inhabit different ocean basins. Nuclear differentiation estimates in our study are less

pronounced than in Jackson et al.'s study, and less pronounced than mitochondrial ones.

Nonetheless do they demonstrate the considerable limitation of gene flow. Nuclear divergence

estimates are approximately 10 times higher between the two breeding grounds than between the

North Atlantic feeding areas (unpublished data). The higher divergence at the maternally

inherited mtDNA marker in comparison to divergence at the Mendelian-inherited nuclear

markers suggests male-mediated gene flow (Karl et al. 1992). This is in accordance with

suggestions in previous humpback whale studies (Palumbi & Baker 1994; Palsbøll et al. 1995;

Palsboll et al. 2004), and is supported by the fact that genetic diversity estimates in the two

breeding grounds are comparable in the nuclear dataset while they differ in the maternally

inherited mtDNA.

72

Data encompassing a more recent time scale suggest a similar extent of divergence. A

comparison of 88 individual fluke photographs from the Cape Verde region collected between

1990 and 2009 to over 6,500 individual fluke photographs maintained in the North Atlantic

Humpback Whale Catalogue yielded three matches with whales photographed in the feeding

grounds of the Eastern North Atlantic, but no matches were found with the Western North

Atlantic (Jann et al. 2003; Wenzel et al. 2009). While the low match-rate with the Eastern North

Atlantic might be comprehensible given the limited photo-ID effort throughout the last decades

in this area (Wenzel et al. 2009), the lack of any matches with the extensive data base from the

Western North Atlantic is surprising and designative. It suggests very little migration between

this part of the Atlantic and Cape Verde. Photo identification of some few individuals (four

sightings) that have been sighted in both Cape Verde and Guadeloupe in the Eastern Caribbean

provides the only direct proof of current migration between Cape Verde and the Western North

Atlantic to date (Stevick pers. com.). Along with a high interannual resighting rate suggesting

strong site fidelity (Wenzel et al. 2009), this low match rate points to a high contemporary

degree of isolation between these two known North Atlantic breeding grounds; It hence supports

our findings from genetic data and the hypothesis that Cape Verde might constitute a distinct

breeding population.

A logical next step is to test the degree of genetic differentiation between the humpback whales

within the Caribbean - between the West Indies and Guadeloupe in the East. Given the high

maternally induced population structure between Cape Verde and the West Indies and the

absence of photographic matches between these areas while some whales seem to be migrating

between Guadeloupe and Cape Verde, we might expect to find whales in the West Indies and

Guadeloupe to be distinct from each other at a molecular level.

How many whales belong to the Cape Verde population?

Recent research has suggested that the population size of the potentially isolated Cape Verde

humpback whales might be very small (Jann et al. 2003; Wenzel et al. 2009; Ryan et al. 2014).

On an evolutionary time scale, the small size of Cape Verdes humpback whale population is

supported by our genetic diversity estimates for mitochondrial data. Low genetic diversity can be

caused by a small Ne, see e.g. Fontaine et al. (2014). The small number of haplotypes, as well as

the significantly smaller haplotype and nucleotide diversities in Cape Verde in comparison to the

West Indies suggest that many more whales were part of the West Indies population than of the

Cape Verde one.

It is also possible to infer a direct estimate of the historic Cape Verde Ne from the data we have

available. This can be achieved by assuming, for the purpose of comparability, that the LD Ne

estimate is equivalent to the current Ne estimate of our ABC analyses. Calculating the conversion

73

factor between the LD Ne and ABC estimates of current Ne, and then multiplying the historic

ABC Ne estimate by that same factor provides a historic Ne estimate of 3,836 individuals for the

Cape Verde population. Reason for the initial difference between ABC and LD Ne estimates

likely are the different time scales at which the respective methods are informative. Whereas the

estimate based on LD provides a population size estimate of the parent generation of the sampled

individuals, the ABC estimate represents an average Ne estimate since the last change in

population size (t1), hence applying to a larger temporal scale. This difference in population size

and the time scales which the two estimates apply to supports the depletion of the Cape Verde

humpback whales through heavy whaling in the last 150 years (Mitchell & Reeves 1983; Smith

& Reeves 2003, 2010).

On a more recent time scale, estimates of Ne based on the LD method have been shown to be

reliable for small Ne’s when at least 25-50 individuals are sampled at 10-20 polymorphic loci

(Waples & Do 2010). For larger populations, reliable estimates are more difficult to obtain as the

genetic signal becomes weak in relation to sampling noise (Ansmann et al. 2013). In our study,

we included a similar number of samples and microsatellite loci as recommended by (Waples &

Do 2010). However, the calculation of our CIs is based on the assumption of a homogeneous

population with non-overlapping generations. As our focal population does not meet this

assumption, the true CIs may be larger than our reported ones. The only abundance estimate of

the entire Cape Verde Archipelago to date rates the total abundance at 99 individuals (Punt et al.

2006), while a yet more recent study incorporating recapture data only from around the island of

Boa Vista concluded that the census population size around Boa Vista ranged between 171 and

260 individuals (Ryan et al. 2014). Taking into account that spatial and temporal research efforts

were not completely comparable between studies, our Ne estimates overall support that Cape

Verde hosts few humpback whales. A small and potentially isolated population is further

affirmed by high interannual re-sighting rates of whales in the archipelago (>22%) (Wenzel et al.

2009).

Our estimate of 43 to 76 breeding individuals one generation ago places the Cape Verde

humpback whale breeding population at a precarious state, the small Ne making it vulnerable to

stochastic effects and anthropogenic impacts. It is by a multitude smaller than the West Indies

breeding population, and is within the range of or below a minimum viable short-term population

size that is required to avoid inbreeding depression according to Franklin’s rule-of-thumb. As

such, any management should be conservative and treat the Cape Verde population as one that is

currently potentially at risk.

74

What do genetics reveal about the demographic history of the Cape Verde humpback

whale population?

Our ABC analyses support the hypothesis of a population decline among Cape Verde humpback

whales. Among the three bottleneck models that we included in our analysis, each had an

approximate posterior probability of 1/3. While the demographic model modeling a two-stage

decline was consistently best-supported, the high type I error rate (55%) suggested that our data

did not allow distinguishing between those three most likely models without doubt. We therefore

caution to put too much emphasis on the precise estimates, but to rather take notion of the

presence of a substantial population size reduction throughout the last millennia. This decline

does not however reflect the intense recent decline caused by whaling as we would have

expected, but instead denotes more historic events. Of the best-supported model, the more recent

of the two bottleneck events dates back ca. 66,000 years (95% CI 23,500 - 213,000) when

assuming a generation time of 14.5 years (Taylor et al. 2007). This time period - let alone the

more ancient bottleneck estimate of ca. 290,000 years (95% CI 95,000 - 357,000) - by far pre-

dates any noteworthy anthropogenic impacts; humpback whales in the North Atlantic were

hunted most intensively within the last 150 years (Mitchell & Reeves 1983; Smith & Reeves

2003, 2010), and even early whaling may date back only as far as 6,000 BC (Lee & Robineau

2004). Instead, the detected bottleneck signal possibly relates to a decreased ocean productivity

during the Pleistocene which may have limited the food resources available to humpback whales

at the time and hence may have caused population decline(s) (Thomas et al. 1995).

However, the interpretation of our results need be viewed also in the light of the limitations that

the applied analytical framework poses. Our ABC approach does not take any migration between

Cape Verde and a breeding ground other than the West Indies into account. New information

suggests that indeed this may be inaccurate and that whales from the Northern and Southern

hemispheres might come into contact in the Cape Verde area. In September 2014 two humpback

whale biopsy samples were collected in Cape Verde (unpublished data, not included in this

study) whose mtDNA sequences were identical to whales from the Southern hemisphere

(Rosenbaum et al. 2009). If present in our dataset, whales from the Southern hemisphere likely

add genetic diversity to our local population (Palsbøll et al. 2013). Any derived abundance

estimate will hence reflect the total of these populations rather than of the Cape Verde area

alone. Besides, while we selected an appropriate prior range of mutation rates for mtDNA and

nuclear data, we caution to put too much emphasis in the precise bottleneck timing estimates

given above, as they depend on these selected mutation rates and may hence vary considerably.

75

Conclusions

We investigated the degree of gene flow between the only two known North Atlantic humpback

whale breeding populations and estimated the size of the allegedly very small population in the

Cape Verde Archipelago. While the amount of genetic exchange between the known breeding

grounds in the West Indies (off the Dominican Republic) and Cape Verde is very limited,

confirmed migrations between Cape Verde and the East Indies (Guadeloupe) point to existing

gene flow between this Eastern part of the Caribbean and Cape Verde. If this indeed is the case,

a yet unknown separation might exist between whales in different parts throughout the

Caribbean. A heterogeneous use of Caribbean waters for breeding would have important

implications for conservation management. Consequently, a next step in humpback whale

research should focus on testing the degree of molecular distinction between Guadeloupe and the

main breeding ground around the Dominican Republic in the West Indies.

Having likely undergone several historic and more recent population declines, the Cape Verde

population has been small throughout history, and today may be close to or below a minimum

viable size. If indeed the Cape Verde humpback whales have as little genetic exchange with any

other, yet unknown breeding ground as they have with the West Indies, this population might be

very sensitive to stochastic and anthropogenic effects and should be handled with great care in a

conservation context. This being said, some data suggest that Cape Verde might in fact be part of

a larger Eastern Atlantic breeding range. Genetic distances of mtDNA data collected in the North

Atlantic feeding areas in 1992/93 suggest that only 40 to 60% of Icelandic humpback whales

winter in the West Indies, and that as many as 90% of humpback whales feeding in the Barents

Sea may winter elsewhere than the West Indies (unpublished data). To date this may only be

speculation as sampling efforts in the Eastern Tropical Atlantic have been very limited. Future

research would therefore greatly benefit from increasing sampling efforts in the Eastern Tropical

Atlantic to discover any potential additional breeding grounds and to shed light on the nescience

of how isolated Cape Verdes humpback whales indeed are.

76

Table 1: Estimates of genetic differentiation between samples of humpback whales of different gender, and between samples of different years, for the Cape Verde and the West Indies breeding grounds.

Breeding Ground Molecular Marker FST

P-value

Gender Cape Verde mtDNA 0.000 0.863

West Indies mtDNA 0.002 0.058 Sampling Year Cape Verde mtDNA

0.000 0.430

Microsatellites 0.015 0.510 West Indies mtDNA 0.001 0.184

Microsatellites 0.000 1.000

Table 2: Genetic diversity estimates and neutrality estimates for mtDNA per humpback whale North Atlantic breeding ground and for both breeding grounds combined. Number of individuals (N); number of haplotypes (HT); HT diversity with 95% confidence intervals (CI); nucleotide diversity (π) averaged over loci with 95% CI; Tajima’s D; and significance (P-value) of Tajima’s D.

Parameters Cape Verde West Indies Combined

N 36 1,739 1,775

HT 7 27 27

HT diversity (95% CI) 0.6810 (0.664-0.698) 0.8264 (0.826-0.827) 0.8267 (0.826-0.827)

π (95% CI) 0.0131 (0.011-0.016) 0.0229 (0.022-0.023) 0.0229 (0.022-0.023)

Tajima's D -0.638 3.001 2.982

Tajima's D P-value < 0.303 < 0.995 < 1

77

Table 3: Genetic diversity estimates, Hardy-Weinberg proportions, and neutrality test estimates for 10 microsatellite loci among humpback whales in North Atlantic breeding grounds (Combi = Cape Verde and West Indies combined). Number of samples (N); Number of alleles per locus (NA); Allelic richness (AR) (based on a minimum sample size of 35); Observed heterozygosity (Hobs); Expected heterozygosity (Hexp); P-values with 95% CIs as a measure of deviation from Hardy-Weinberg equilibrium (HWE); and FIS, a measure of deviation from HWE estimated following Weir and Cockerham (1984). * indicate significant P -values at the 0.05 level.

EV096 GATA028 GATA053 GATA098 GATA417 GT015 GT211 GT271 GT575 TAA031

Cape N 36 36 36 36 35 36 36 36 36 36

Verde NA 9 6 7 10 14 9 7 7 8 10

AR 8.972 5.999 6.972 7.972 14 8.943 6.972 6.916 7.944 9.971 Hobs 0.861 0.556 0.778 0.944 0.886 0.639 0.778 0.722 0.833 0.778 Hexp 0.832 0.564 0.779 0.860 0.849 0.743 0.789 0.574 0.758 0.775

HWE (95% CI)

0.883 (0.8827-0.8833)

0.901 (0.9007-0.9013)

0.715 (0.7147-0.7153)

0.243 (0.2423-0.2437)

0.054 (0.0533-0.0547)

0.026 (0.0257-0.0263)

0.103 (0.1027-0.1033)

0.489 (0.4883-0.4897)

0.769 (0.7683-0.7697)

0.053 (0.0527-0.0533)

FIS 0.154 0.010 -0.029 0.028 0.095 -0.245 0.016 -0.085 -0.021 0.030 West N 1,745 1,743 1,745 1,744 1,743 1,743 1,745 1,745 1,745 1,744

Indies NA 10 10 10 10 16 15 7 15 12 16

AR 8.591 6.881 8.146 7.016 11.262 10.711 6.79 7.225 9.065 9.612 Hobs 0.811 0.464 0.831 0.954 0.837 0.801 0.818 0.567 0.708 0.816 Hexp 0.821 0.467 0.832 0.851 0.874 0.796 0.812 0.581 0.703 0.817

HWE (95% CI)

0.397 (0.3968-0.3972)

0* (0.000) 0.975 (0.9750-0.9750)

0.665 (0.6649-0.6651)

0.004* (0.0040-0.0040)

0.630 (0.6298-0.6302)

0.773 (0.7729-0.7731)

0.390 (0.3897-0.3903)

0.950 (0.9500-0.9500)

0.944 (0.9439-0.9441)

FIS 0.007 0.042 0.025 0.012 -0.005 -0.008 -0.006 0.002 -0.007 0.001 Combi N 1,781 1,779 1,781 1,780 1,778 1,779 1,781 1,781 1,781 1,780

NA 10 11 10 10 19 15 7 15 12 16

AR 8.626 7.051 8.128 7.041 11.496 10.697 6.81 7.217 9.053 9.659 Hobs 0.812 0.465 0.830 0.954 0.838 0.798 0.817 0.570 0.710 0.815 Hexp 0.822 0.470 0.831 0.851 0.874 0.796 0.814 0.581 0.705 0.817

78

Table 4: Estimates of contemporary effective population sizes (Ne) with 95% CIs of humpback whales in Cape Verde (based on 17 loci) and in the West Indies (based on 10 loci) for different critical values (pcrit) using the LD method implemented in NEESTIMATOR ver.2 (Do et al. 2014).

Cape Verde West Indies Pcrit Ne Lower

95% CI

Upper 95% CI

Ne Lower 95% CI

Upper 95% CI

0.05 41.6 32.6 55.4 913.7 756.8 1,120.9 0.04 49 37.9 66.7 957.3 796.9 1,168.2 0.03 49 37.9 66.7 956.6 811.8 1,141.4 0.02 55.9 43.3 76.2 1,039 896.1 1,217.6 0.01 - - - 1,028.6 901.5 1,183.5

79

Table 5: Prior distributions for demographic parameters and for locus-specific mutation models, and simulation conditions for demographic parameter estimation implemented in DIYABC ver. 2.0.4 (Cornuet et al. 2014). UN = uniform distribution, GA = gamma distribution.

Priors for demographic parameters Ne UN ̴[10; 6,000] N1 UN ̴[100; 100,000] N2 UN ̴[10; 150,000] N3 UN ̴[100; 500,000] N4 UN ̴[100; 500,000] N5 UN ̴[10; 250,000] N6 UN ̴[10; 3,000] N7 UN ̴[100; 500,000] t1 UN ̴[1; 20,000] t2 UN ̴[10; 25,000] t3 UN ̴[10; 200,000]

Priors for mutation model microsatellites Mean - µmic UN ̴[10-5 - 10-3] Gam - µmic GA ̴[10-5 - 10-3, 2] Mean - P UN ̴[0.1 - 0.3] Gam - P GA ̴[0.01 - 0.9, 2]

mtDNA control region µsec UN ̴[10-7 - 10-5] K1 UN ̴[0.05 - 20] % invar. sites 63 shape 0.23

Conditions N1>Ne t2>t1 N2>Ne t3>t2 N3>N2 N5<N4 N4>Ne N6<Ne N7>N6

80

Table 6: Model checking for each summary statistic for each demographic model (S1 – S7, see Fig. 1) for one dataset including both mtDNA and microsatellite data, as well as datasets that include only mtDNA and only microsatellite data, respectively. The probability Prob (Ssimul < Sobs) was computed from 500 virtual datasets simulated from the posterior distributions of parameters obtained under a given demographic model. Values indicate the proportion of simulated datasets which have a value below the observed one (Ssimul < Sobs). NAL = number of alleles, MGW = Garza-Williamson’s M, NHA = number of haplotypes, MPD = mean of pairwise differences, DTA = Tajima’s D, MNS = mean number of rarest nucleotide at segregating sites. **, *** = tail-area probability < 0.01, < 0.001, respectively.

Microsatellite data

Summary Statisics Observed value S1 S2 S3 S4 S5 S6 S7

NAL 7.45 0.7072 0.6139 0.5498 0.5749 0.7013 0.6797 0.672

MGW 0.5017 0.0002(***) 0.0746 0.3621 0.2327 0.0076(**) 0.0001(***) 0.0035(**)

MtDNA data


NHA 7 0.4169 0.4155 0.4103 0.4078 0.4224 0.3044 0.3143

MPD 3.7825 0.6344 0.5967 0.5627 0.5739 0.6353 0.5605 0.5679

DTA -0.4279 0.2538 0.2426 0.2361 0.2386 0.2527 0.2584 0.2614

MNS 5.1765 0.4794 0.4627 0.454 0.4604 0.4781 0.4682 0.4705

Microsatellite and mtDNA data


NAL 7.45 0.7071 0.6133 0.551 0.5745 0.7015 0.6802 0.6721

MGW 0.5017 0.0001(***) 0.0747 0.3618 0.2331 0.0074(**) 0.0001(***) 0.0036(**)

NHA 7 0.42 0.4184 0.413 0.4118 0.4242 0.3086 0.3079

MPD 3.7825 0.6547 0.6162 0.5798 0.5929 0.6547 0.5857 0.5813

DTA -0.4279 0.2316 0.2232 0.2165 0.2188 0.2311 0.232 0.2316

MNS 5.1765 0.4782 0.4623 0.451 0.4586 0.4773 0.4675 0.4657

81

Table 7: Relative posterior probability for each competing demographic model with 95% CIs based on summary statistics for the model choice analysis using a logistic approach. Results are shown for one dataset including both mtDNA and microsatellite data, as well as for datasets that include only mtDNA and only microsatellite data, respectively. Individual values calculated for each summary statistic per scenario are provided in table 6.

Microsatellite data MtDNA data Microsatellite and mtDNA data Model Post. prob. (95% CI) Model Post. prob. (95% CI) Model Post. prob. (95% CI) S1 0 % (0 - 0) S1 13.6 % (12.9 - 14.3) S1 0.05 % (0 - 0.9) S2 21.8 % (21.5 - 22.1) S2 16.3 % (15.5 - 17.1) S2 26.3 % (25.2 - 27.4) S3 40.0 % (39.6 - 40.1) S3 12.6 % (11.9 - 13.4) S3 37.9 % (36.8 - 39.1) S4 35.2 % (34.8 - 35.6) S4 12.0 % (11.3 - 12.7) S4 32.4 % (31.3 - 33.5) S5 2.1 % (1.9 - 2.4) S5 15.3 % (14.5 - 16.1) S5 2.6 % (1.7 - 3.5) S6 0 % (0 - 0) S6 15.7 % (14.9 - 16.5) S6 0.03 % (0 - 0.9) S7 0.9 % (0.6 - 1.1) S7 14.5 % (13.8 - 15.3) S7 0.7 % (0 - 1.6)

Table 8: Instances (in %) of the selected demographic model exhibiting the highest posterior probability compared with all competing models for 500 simulated datasets that are generated under the best-supported model (S3) using a logistic regression. Results are shown for one dataset including both mtDNA and microsatellite data, as well as for datasets that include only mtDNA and only microsatellite data, respectively.

Model Posterior probability (%) - log reg S1 S2 S3 S4 S5 S6 S7

Microsatellites 11.2 19 52 9.2 7 1.6 0

MtDNA 13 5.2 13.2 1.2 27 31.2 9.2

Microsatellites and mtDNA

7.8 19.8 51.6 10 7.4 2.6 0.8

82

Table 9: Demographic parameters (mode and 95% CI) estimated under the best-supported demographic model (S3) for one dataset including both mtDNA and microsatellite data, as well as for datasets that include only mtDNA and only microsatellite data, respectively.

Microsatellite data Parameter Mode 95% CI Current effective population size (Ne) 2,760 (1,800 - 5,820) Population size before most recent bottleneck (N2) 16,000 (7,780 - 145,000) Population size before historic bottleneck (N3) 218,000 (72,300 - 489,000) Time since recent bottleneck (t1) 6,120 (1,280 - 15,700) Time since historic bottleneck (t2) 19,600 (6,040 - 24,600)

MtDNA data Parameter Mode 95% CI Current effective population size (Ne) 2,250 (1,090 - 5,810) Population size before most recent bottleneck (N2) 15,200 (5,430 - 145,000) Population size before historic bottleneck (N3) 235,000 (52,700 - 487,000) Time since recent bottleneck (t1) 3,530 (963 - 18,500) Time since historic bottleneck (t2) 22,000 (4,830 - 24,600)

Microsatellite and mtDNA data Parameter Mode 95% CI Current effective population size (Ne) 2,550 (1,780 - 5,700) Population size before most recent bottleneck (N2) 11,300 (6,630 - 145,000) Population size before historic bottleneck (N3) 175,000 (65,200 - 488,000) Time since recent bottleneck (t1) 4,560 (1,620 - 14,700) Time since historic bottleneck (t2) 20,200 (6,560 - 24,600)

83

Table 10: Cumulative distribution function values of each test quantity (= Prob (Ssimul < Sobs) to evaluate the fit of the observed data with respect to simulated data from 1000 simulated datasets under the best-supported model (S3). Results are shown for one dataset including both mtDNA and microsatellite data, as well as for datasets that include only mtDNA and only microsatellite data, respectively. HET = mean expected heterozygosity, VAR = mean allelic size variance, NSS = Number of segregating sites, VPD = variance of pairwise differences, PSS = private segregating sites, VNS = variance of numbers of the rarest nucleotide at segregating sites.

Dataset Summary Statisics

Observed value

Proportion (Ssimul < Sobs)

Microsatellite HET 0.716 0.472 data VAR 22.5436 0.282 MtDNA data NSS 18 0.748

VPD 15.4932 0.763 PSS 18 0.748

VNS 36.9689 0.886 Microsatellite HET 0.716 0.507 and mtDNA data VAR 22.5436 0.249

NSS 18 0.783 VPD 15.4932 0.789 PSS 18 0.783

VNS 36.9689 0.905

84

Figure 1: Alternative demographic models of Cape Verde humpback whale demography tested by implementing the ABC approach.

university of groningen divergence and adaptive capacity ... · clapham et al. 1999; baker &...

Documents