intro forensic stats - promega
TRANSCRIPT
![Page 1: Intro Forensic Stats - Promega](https://reader030.vdocuments.mx/reader030/viewer/2022012414/616eb8111be18f00d03bbe49/html5/thumbnails/1.jpg)
Popstats Unplugged
14th International Symposium on Human Identification
John V. Planz, Ph.D. UNT Health Science Center at Fort Worth
![Page 2: Intro Forensic Stats - Promega](https://reader030.vdocuments.mx/reader030/viewer/2022012414/616eb8111be18f00d03bbe49/html5/thumbnails/2.jpg)
Forensic StatisticsFrom the ground up…
![Page 3: Intro Forensic Stats - Promega](https://reader030.vdocuments.mx/reader030/viewer/2022012414/616eb8111be18f00d03bbe49/html5/thumbnails/3.jpg)
Why so much attention to
statistics?Exclusions don’t require numbers
Matches do require statistics
Problem of verbal expression of
numbers
![Page 4: Intro Forensic Stats - Promega](https://reader030.vdocuments.mx/reader030/viewer/2022012414/616eb8111be18f00d03bbe49/html5/thumbnails/4.jpg)
Transfer evidence
Laboratory result
1. Non-match - exclusion
2. Inconclusive- no decision
3. Match - estimate frequency
![Page 5: Intro Forensic Stats - Promega](https://reader030.vdocuments.mx/reader030/viewer/2022012414/616eb8111be18f00d03bbe49/html5/thumbnails/5.jpg)
Statistical Analysis
Focus on the question being asked…
About “Q” sample
“K” matches “Q”
Who else could match “Q"
partial profile, mixtures
![Page 6: Intro Forensic Stats - Promega](https://reader030.vdocuments.mx/reader030/viewer/2022012414/616eb8111be18f00d03bbe49/html5/thumbnails/6.jpg)
Match – estimate frequency of:
Match to forensic evidence
NOT suspect DNA profile
Who is in suspect population?
![Page 7: Intro Forensic Stats - Promega](https://reader030.vdocuments.mx/reader030/viewer/2022012414/616eb8111be18f00d03bbe49/html5/thumbnails/7.jpg)
So, what are we really after?
Quantitative statement that expresses the rarity of the DNA
profile
![Page 8: Intro Forensic Stats - Promega](https://reader030.vdocuments.mx/reader030/viewer/2022012414/616eb8111be18f00d03bbe49/html5/thumbnails/8.jpg)
Estimate genotype frequency
1. Frequency at each locus
Hardy-Weinberg Equilibrium
2. Frequency across all loci
Linkage Equilibrium
![Page 9: Intro Forensic Stats - Promega](https://reader030.vdocuments.mx/reader030/viewer/2022012414/616eb8111be18f00d03bbe49/html5/thumbnails/9.jpg)
Terminology
Genetic marker variant = allele
DNA profile = genotype
Database = table that provides frequency of alleles in a population
Population = some assemblage of individuals based on some criteria for inclusion
![Page 10: Intro Forensic Stats - Promega](https://reader030.vdocuments.mx/reader030/viewer/2022012414/616eb8111be18f00d03bbe49/html5/thumbnails/10.jpg)
Where Do We Get These Numbers?
1 in 1,000,000
1 in 110,000,000
![Page 11: Intro Forensic Stats - Promega](https://reader030.vdocuments.mx/reader030/viewer/2022012414/616eb8111be18f00d03bbe49/html5/thumbnails/11.jpg)
POPULATION DATAand
Statistics
DNA databases are needed for placing statistical weight on DNA profiles
![Page 12: Intro Forensic Stats - Promega](https://reader030.vdocuments.mx/reader030/viewer/2022012414/616eb8111be18f00d03bbe49/html5/thumbnails/12.jpg)
vWA data (N=129)
14 15 16 17 18 19 20
14 915 3 016 19 1 117 23 1 14 918 6 0 3 10 4 19 6 1 7 3 2 220 0 0 0 3 2 0 0
freq
75
6
46
72
31
23
5258
![Page 13: Intro Forensic Stats - Promega](https://reader030.vdocuments.mx/reader030/viewer/2022012414/616eb8111be18f00d03bbe49/html5/thumbnails/13.jpg)
Because data are not available for every genotype possible,
We use allele frequencies instead of genotype frequencies to estimate rarity.
![Page 14: Intro Forensic Stats - Promega](https://reader030.vdocuments.mx/reader030/viewer/2022012414/616eb8111be18f00d03bbe49/html5/thumbnails/14.jpg)
Estimate allele frequencies by “gene counting”
A1 n1
A2 n2
A3 n3
A4 n4
: :Ak nk
n1 + n2 + n3 + n4 + ... nk = 2N
p1 = n1 /2N
p2 = n2 /2N
p3 = n3 /2N
![Page 15: Intro Forensic Stats - Promega](https://reader030.vdocuments.mx/reader030/viewer/2022012414/616eb8111be18f00d03bbe49/html5/thumbnails/15.jpg)
Population database
Look up how often each allele occurs at the locus in a population (or populations)
AKA looking up the “allele” frequency
![Page 16: Intro Forensic Stats - Promega](https://reader030.vdocuments.mx/reader030/viewer/2022012414/616eb8111be18f00d03bbe49/html5/thumbnails/16.jpg)
![Page 17: Intro Forensic Stats - Promega](https://reader030.vdocuments.mx/reader030/viewer/2022012414/616eb8111be18f00d03bbe49/html5/thumbnails/17.jpg)
FBI population Data
![Page 18: Intro Forensic Stats - Promega](https://reader030.vdocuments.mx/reader030/viewer/2022012414/616eb8111be18f00d03bbe49/html5/thumbnails/18.jpg)
![Page 19: Intro Forensic Stats - Promega](https://reader030.vdocuments.mx/reader030/viewer/2022012414/616eb8111be18f00d03bbe49/html5/thumbnails/19.jpg)
You can get other population databases to be used by Popstats if needed
![Page 20: Intro Forensic Stats - Promega](https://reader030.vdocuments.mx/reader030/viewer/2022012414/616eb8111be18f00d03bbe49/html5/thumbnails/20.jpg)
![Page 21: Intro Forensic Stats - Promega](https://reader030.vdocuments.mx/reader030/viewer/2022012414/616eb8111be18f00d03bbe49/html5/thumbnails/21.jpg)
![Page 22: Intro Forensic Stats - Promega](https://reader030.vdocuments.mx/reader030/viewer/2022012414/616eb8111be18f00d03bbe49/html5/thumbnails/22.jpg)
![Page 23: Intro Forensic Stats - Promega](https://reader030.vdocuments.mx/reader030/viewer/2022012414/616eb8111be18f00d03bbe49/html5/thumbnails/23.jpg)
OK
So now we have our population databases
How do they work for us in Popstats??
![Page 24: Intro Forensic Stats - Promega](https://reader030.vdocuments.mx/reader030/viewer/2022012414/616eb8111be18f00d03bbe49/html5/thumbnails/24.jpg)
Anchor principle
Analysis of genetic makeup in individuals is based on the Genotype at the locus being queried
To remove “individual variation” so that we can focus on population-wide variation we must meld all the genotypes into a pool…separated as alleles
![Page 25: Intro Forensic Stats - Promega](https://reader030.vdocuments.mx/reader030/viewer/2022012414/616eb8111be18f00d03bbe49/html5/thumbnails/25.jpg)
Alleles in populations –The Hardy-Weinberg Theory
Basis: Allele frequencies are inherited in a Mendelian fashion and frequencies of occurrence follow apredictable pattern of probability
![Page 26: Intro Forensic Stats - Promega](https://reader030.vdocuments.mx/reader030/viewer/2022012414/616eb8111be18f00d03bbe49/html5/thumbnails/26.jpg)
The Hardy-Weinberg principle states: that single-locus genotype frequencies after one generation of random mating can be represented by a binomial (with two alleles) or multinomial (with multiple alleles) function of the alleles frequencies.
This mathematical model serves as our Null Hypothesis when comparing populations, demes etc.
![Page 27: Intro Forensic Stats - Promega](https://reader030.vdocuments.mx/reader030/viewer/2022012414/616eb8111be18f00d03bbe49/html5/thumbnails/27.jpg)
Hardy - Weinberg Equilibrium
A1A1 A1A2 A2A2
A1 A2
A1
A2 A1A2
A1A2
A2A2
A1A1
p12 p1p2
p1p2 p22
p12 2p1p2 p2
2
freq(A1) = p1
freq(A2) = p2
(p1 + p2 )2 = p12 + 2p1p2 + p2
2
![Page 28: Intro Forensic Stats - Promega](https://reader030.vdocuments.mx/reader030/viewer/2022012414/616eb8111be18f00d03bbe49/html5/thumbnails/28.jpg)
A Hardy-Weinberg Population
LARGE POPULATIONNO NATURAL SELECTIONNO MUTATIONNO IMMIGRATION / EMIGRATION
RANDOM MATING
![Page 29: Intro Forensic Stats - Promega](https://reader030.vdocuments.mx/reader030/viewer/2022012414/616eb8111be18f00d03bbe49/html5/thumbnails/29.jpg)
Estimate genotype frequency:
1. Frequency at each locus
2. Frequency across all loci
Product Rule
![Page 30: Intro Forensic Stats - Promega](https://reader030.vdocuments.mx/reader030/viewer/2022012414/616eb8111be18f00d03bbe49/html5/thumbnails/30.jpg)
Product Rule
The frequency of a multi-locus STR profile is the product of the genotype frequencies at the individual loci
ƒ locus1 x ƒ locus2 x ƒ locusn = ƒcombined
![Page 31: Intro Forensic Stats - Promega](https://reader030.vdocuments.mx/reader030/viewer/2022012414/616eb8111be18f00d03bbe49/html5/thumbnails/31.jpg)
Item D3S1358 D16S539 TH01 TPOX CSF1P0 D7S820
Q1 16,16 10,12 8,9.3 9,10 12,12 8,11
Item D3S1358 vWA FGA D8S1179 D21S11 D18S51 D5S818 D13S317 D7S820
Q1 16,16 15,17 21,22 13,13 29,30 16,20 8,12 12,12 8,11
CoFIler
ProfIler Plus
![Page 32: Intro Forensic Stats - Promega](https://reader030.vdocuments.mx/reader030/viewer/2022012414/616eb8111be18f00d03bbe49/html5/thumbnails/32.jpg)
D3S1358 = 16, 16 (homozygote)
Frequency of 16 allele = ??
![Page 33: Intro Forensic Stats - Promega](https://reader030.vdocuments.mx/reader030/viewer/2022012414/616eb8111be18f00d03bbe49/html5/thumbnails/33.jpg)
![Page 34: Intro Forensic Stats - Promega](https://reader030.vdocuments.mx/reader030/viewer/2022012414/616eb8111be18f00d03bbe49/html5/thumbnails/34.jpg)
D3S1358 = 16, 16 (homozygote)
Frequency of 16 allele = 0.3071
When same allele:
Frequency = genotype frequency (p2)(for now!)
Genotype freq = 0.3071 x 0.3071 = 0.0943
![Page 35: Intro Forensic Stats - Promega](https://reader030.vdocuments.mx/reader030/viewer/2022012414/616eb8111be18f00d03bbe49/html5/thumbnails/35.jpg)
Item D3S1358 D16S539 TH01 TPOX CSF1P0 D7S820
Q1 16,16 10,12 8,9.3 9,10 12,12 8,11
Item D3S1358 vWA FGA D8S1179 D21S11 D18S51 D5S818 D13S317 D7S820
Q1 16,16 15,17 21,22 13,13 29,30 16,20 8,12 12,12 8,11
CoFIler
ProfIler Plus
![Page 36: Intro Forensic Stats - Promega](https://reader030.vdocuments.mx/reader030/viewer/2022012414/616eb8111be18f00d03bbe49/html5/thumbnails/36.jpg)
VWA = 15, 17 (heterozygote)
Frequency of 15 allele = ??
Frequency of 17 allele = ??
![Page 37: Intro Forensic Stats - Promega](https://reader030.vdocuments.mx/reader030/viewer/2022012414/616eb8111be18f00d03bbe49/html5/thumbnails/37.jpg)
![Page 38: Intro Forensic Stats - Promega](https://reader030.vdocuments.mx/reader030/viewer/2022012414/616eb8111be18f00d03bbe49/html5/thumbnails/38.jpg)
VWA = 15, 17 (heterozygote)
Frequency of 15 allele = 0.2361
Frequency of 17 allele = 0.1833When heterozygous:
Frequency = 2 X allele 1 freq X allele 2 freq(2pq)
Genotype freq = 2 x 0.2361 x 0.18331 = 0.0866
![Page 39: Intro Forensic Stats - Promega](https://reader030.vdocuments.mx/reader030/viewer/2022012414/616eb8111be18f00d03bbe49/html5/thumbnails/39.jpg)
Overall profile frequency =
Frequency D3S1358 X Frequency vWA
0.0943 x 0.0866 = 0.00817
This is basically what Popstats does for us in it's simplest task
![Page 40: Intro Forensic Stats - Promega](https://reader030.vdocuments.mx/reader030/viewer/2022012414/616eb8111be18f00d03bbe49/html5/thumbnails/40.jpg)
Steps – Single Sample Target Profile
enter alleles of target profilelook up allele frequencies at all
loci for all populationsdetermine if homozygous or
heterozygous at each locuscalculate genotype frequency at
each locuscalculate profile frequency with
product rule
But this doesn't address all of the issues!
![Page 41: Intro Forensic Stats - Promega](https://reader030.vdocuments.mx/reader030/viewer/2022012414/616eb8111be18f00d03bbe49/html5/thumbnails/41.jpg)
What if…
We encounter alleles not represented in the population database…
…or alleles that are extremely rare in the database???
![Page 42: Intro Forensic Stats - Promega](https://reader030.vdocuments.mx/reader030/viewer/2022012414/616eb8111be18f00d03bbe49/html5/thumbnails/42.jpg)
Minimum allele frequencyThe first NRC report proposed a minimum
allele frequency based on NO empirical data and without any statistical basis!
10 % or 0.1What…you are surprised??
Ceiling Principle
![Page 43: Intro Forensic Stats - Promega](https://reader030.vdocuments.mx/reader030/viewer/2022012414/616eb8111be18f00d03bbe49/html5/thumbnails/43.jpg)
Minimum allele frequency
Weir, B.S. 1992. minfreq = 1 - α1/2N
Budowle, B., K. Monson, R. Chakraborty, 1996. minfreq = 1 - [ 1 - ( 1 - α)1/C ]1/2N
NRC II, 1996. minfreq = 5/2N
![Page 44: Intro Forensic Stats - Promega](https://reader030.vdocuments.mx/reader030/viewer/2022012414/616eb8111be18f00d03bbe49/html5/thumbnails/44.jpg)
Minimum allele frequency
This method requires a minimum of This method requires a minimum of 55copies of an allele before the allele copies of an allele before the allele frequency can be used for calculation of frequency can be used for calculation of genotype frequencygenotype frequency
![Page 45: Intro Forensic Stats - Promega](https://reader030.vdocuments.mx/reader030/viewer/2022012414/616eb8111be18f00d03bbe49/html5/thumbnails/45.jpg)
55
Total number of alleles at locus
For the 13 allele at For the 13 allele at vWAvWA: : Actual Freq = 2 / 392 = 0.0051Actual Freq = 2 / 392 = 0.0051Minimal Freq = 5 / 392 = 0.0128Minimal Freq = 5 / 392 = 0.0128
Conservatism & also addresses Conservatism & also addresses some substructure effectssome substructure effects
![Page 46: Intro Forensic Stats - Promega](https://reader030.vdocuments.mx/reader030/viewer/2022012414/616eb8111be18f00d03bbe49/html5/thumbnails/46.jpg)
This estimate is strictly driven by database size:
N min allele freq
100 2.50 % (0.025)
150 1.67 % (0.0167)
200 1.25 % (0.0125)
250 1.00 % (0.01)
300 0.83 % (0.0083)
Where N is the number of individuals in database
![Page 47: Intro Forensic Stats - Promega](https://reader030.vdocuments.mx/reader030/viewer/2022012414/616eb8111be18f00d03bbe49/html5/thumbnails/47.jpg)
![Page 48: Intro Forensic Stats - Promega](https://reader030.vdocuments.mx/reader030/viewer/2022012414/616eb8111be18f00d03bbe49/html5/thumbnails/48.jpg)
![Page 49: Intro Forensic Stats - Promega](https://reader030.vdocuments.mx/reader030/viewer/2022012414/616eb8111be18f00d03bbe49/html5/thumbnails/49.jpg)
![Page 50: Intro Forensic Stats - Promega](https://reader030.vdocuments.mx/reader030/viewer/2022012414/616eb8111be18f00d03bbe49/html5/thumbnails/50.jpg)
So the only other real thing left to consider regarding the NRC concerns is population subdivision.
Population Structure
Racial, ethnic subgroups
Excess of homozygotes
What is “theta” θ
Why modify just homozygous calculation?
NRC Formula 4.1 vs 4.4 vs 4.10
![Page 51: Intro Forensic Stats - Promega](https://reader030.vdocuments.mx/reader030/viewer/2022012414/616eb8111be18f00d03bbe49/html5/thumbnails/51.jpg)
Population Subdivision
We've always surmised…
Racial / ethnic group composed of distinct Racial / ethnic group composed of distinct subsub--groups within the sample populationgroups within the sample population
Only a concern if subOnly a concern if sub--groups differ groups differ substantially at allele frequencies at the substantially at allele frequencies at the lociloci
![Page 52: Intro Forensic Stats - Promega](https://reader030.vdocuments.mx/reader030/viewer/2022012414/616eb8111be18f00d03bbe49/html5/thumbnails/52.jpg)
Human Genetic Variation
between populations within racial groups …
between racial groups ……………..……………………
within populations within racial groups ……
- Barbujani, Magagni, Minch, Cavalli-Sforza. 1997. An apportionment of human DNA diversity. PNAS 94:4516-4519.
![Page 53: Intro Forensic Stats - Promega](https://reader030.vdocuments.mx/reader030/viewer/2022012414/616eb8111be18f00d03bbe49/html5/thumbnails/53.jpg)
Problems created by population subdivision
Genotype frequencies calculated Genotype frequencies calculated from population average allele from population average allele
frequencies frequencies couldcould lead to:lead to:
Wrong estimates!Wrong estimates!
![Page 54: Intro Forensic Stats - Promega](https://reader030.vdocuments.mx/reader030/viewer/2022012414/616eb8111be18f00d03bbe49/html5/thumbnails/54.jpg)
Employ a Theta (θ) Correction
θθ is used as a measure of the effects of is used as a measure of the effects of population subdivision (inbreeding)population subdivision (inbreeding)
How many Great, Great, Great, Great, Great, How many Great, Great, Great, Great, Great, Great, Great… Grandparents do you have?Great, Great… Grandparents do you have?
![Page 55: Intro Forensic Stats - Promega](https://reader030.vdocuments.mx/reader030/viewer/2022012414/616eb8111be18f00d03bbe49/html5/thumbnails/55.jpg)
National Research Council Report II
National Academyof Sciences
Data support the recommendation that FST of 0.01 is conservative
Issued in May 1996
![Page 56: Intro Forensic Stats - Promega](https://reader030.vdocuments.mx/reader030/viewer/2022012414/616eb8111be18f00d03bbe49/html5/thumbnails/56.jpg)
National Research Council Report II
That Hardy-Weinberg Expectations do not have to be met
The significance of this FSTis
![Page 57: Intro Forensic Stats - Promega](https://reader030.vdocuments.mx/reader030/viewer/2022012414/616eb8111be18f00d03bbe49/html5/thumbnails/57.jpg)
![Page 58: Intro Forensic Stats - Promega](https://reader030.vdocuments.mx/reader030/viewer/2022012414/616eb8111be18f00d03bbe49/html5/thumbnails/58.jpg)
Modifying the product rule
Use correction factor for homozygotes
P)θ-P(1P2 +
Intermediate to the FST that you would find in populations with 1st and 2nd
cousin matings
spopulationNativefor 0.030.01=
use 2pipj for heterozygotes (ie: no correction)
θ
Really, this is more than ten fold more conservative
![Page 59: Intro Forensic Stats - Promega](https://reader030.vdocuments.mx/reader030/viewer/2022012414/616eb8111be18f00d03bbe49/html5/thumbnails/59.jpg)
Modifying the product rule
Formula 4.1 - HW
Formula 4.4 - Simple subdivision
Formula 4.10 - assumption of population
Conditional vs Unconditional Probability
![Page 60: Intro Forensic Stats - Promega](https://reader030.vdocuments.mx/reader030/viewer/2022012414/616eb8111be18f00d03bbe49/html5/thumbnails/60.jpg)
2pHWE:
p)p(1p 2 −+ θNRC II, 4.4a:
])2)(1(1
i)p(1[3]i)p(1[2
++
−+−+θ θ θ θ
θ θNRC II, 4.10a:
This last formula addresses a conditional probability of the suspect genotype, given that of the perpetrator, P(AiAi | AiAi), considering the person contributing the evidence and the suspect are from the same subgroup.
![Page 61: Intro Forensic Stats - Promega](https://reader030.vdocuments.mx/reader030/viewer/2022012414/616eb8111be18f00d03bbe49/html5/thumbnails/61.jpg)
Although we CAN correct the heterozygote genotype estimate…it is not generally necessary.
2pqHWE:
)θ2pq(1−NRC II, 4.4a:
NRC II, 4.10b:)2+)(1+(1
])p-(1+[]i)p-(1+2[jθ θ θ θ
θ θ
P(AiAj | AiAj)
![Page 62: Intro Forensic Stats - Promega](https://reader030.vdocuments.mx/reader030/viewer/2022012414/616eb8111be18f00d03bbe49/html5/thumbnails/62.jpg)
Theta Values Commonly Employed
0.01 for 0.01 for CauCau, AA, SEH, and , AA, SEH, and SWHSWH
0.03 for Native American 0.03 for Native American groupsgroups
Conservative ValuesConservative Values
![Page 63: Intro Forensic Stats - Promega](https://reader030.vdocuments.mx/reader030/viewer/2022012414/616eb8111be18f00d03bbe49/html5/thumbnails/63.jpg)
So how do we deal with these concepts in PopStats??
![Page 64: Intro Forensic Stats - Promega](https://reader030.vdocuments.mx/reader030/viewer/2022012414/616eb8111be18f00d03bbe49/html5/thumbnails/64.jpg)
This is the default using 0.01 θ
![Page 65: Intro Forensic Stats - Promega](https://reader030.vdocuments.mx/reader030/viewer/2022012414/616eb8111be18f00d03bbe49/html5/thumbnails/65.jpg)
![Page 66: Intro Forensic Stats - Promega](https://reader030.vdocuments.mx/reader030/viewer/2022012414/616eb8111be18f00d03bbe49/html5/thumbnails/66.jpg)
If we want to use 0.03 θ
![Page 67: Intro Forensic Stats - Promega](https://reader030.vdocuments.mx/reader030/viewer/2022012414/616eb8111be18f00d03bbe49/html5/thumbnails/67.jpg)
![Page 68: Intro Forensic Stats - Promega](https://reader030.vdocuments.mx/reader030/viewer/2022012414/616eb8111be18f00d03bbe49/html5/thumbnails/68.jpg)
θ = 0.01
θ = 0.03
![Page 69: Intro Forensic Stats - Promega](https://reader030.vdocuments.mx/reader030/viewer/2022012414/616eb8111be18f00d03bbe49/html5/thumbnails/69.jpg)
WHERE CAN WE FIND INFORMATION ON WHAT
POPSTATS CAN DO?
![Page 70: Intro Forensic Stats - Promega](https://reader030.vdocuments.mx/reader030/viewer/2022012414/616eb8111be18f00d03bbe49/html5/thumbnails/70.jpg)
![Page 71: Intro Forensic Stats - Promega](https://reader030.vdocuments.mx/reader030/viewer/2022012414/616eb8111be18f00d03bbe49/html5/thumbnails/71.jpg)
![Page 72: Intro Forensic Stats - Promega](https://reader030.vdocuments.mx/reader030/viewer/2022012414/616eb8111be18f00d03bbe49/html5/thumbnails/72.jpg)
![Page 73: Intro Forensic Stats - Promega](https://reader030.vdocuments.mx/reader030/viewer/2022012414/616eb8111be18f00d03bbe49/html5/thumbnails/73.jpg)
Ok…so lets look at our other main option
)2)(1(1i)p(1[3]i)p(1[2
++
−+−+θ θ θ θ
θ θNRC II, 4.10a:
NRC II, 4.10b:)2+)(1+(1
])p-(1+[]i)p-(1+2[jθ θ θ θ
θ θ
Note: both homozygotes and heterozygotes are treated in this application
![Page 74: Intro Forensic Stats - Promega](https://reader030.vdocuments.mx/reader030/viewer/2022012414/616eb8111be18f00d03bbe49/html5/thumbnails/74.jpg)
When and why should we consider this??Takes into account the assumptionthat the person contributing the evidence and the suspect are from the same subgroup
What it gives us is a conditional probability of the suspect genotype
given that of the perpetrator.
Example… use if the suspect and all possible perpetrators are from the same small isolated town i.e. religious sects, native communities
![Page 75: Intro Forensic Stats - Promega](https://reader030.vdocuments.mx/reader030/viewer/2022012414/616eb8111be18f00d03bbe49/html5/thumbnails/75.jpg)
Here it is using the default of 0.01 θ
![Page 76: Intro Forensic Stats - Promega](https://reader030.vdocuments.mx/reader030/viewer/2022012414/616eb8111be18f00d03bbe49/html5/thumbnails/76.jpg)
![Page 77: Intro Forensic Stats - Promega](https://reader030.vdocuments.mx/reader030/viewer/2022012414/616eb8111be18f00d03bbe49/html5/thumbnails/77.jpg)
If we want to use 0.03 θ
![Page 78: Intro Forensic Stats - Promega](https://reader030.vdocuments.mx/reader030/viewer/2022012414/616eb8111be18f00d03bbe49/html5/thumbnails/78.jpg)
![Page 79: Intro Forensic Stats - Promega](https://reader030.vdocuments.mx/reader030/viewer/2022012414/616eb8111be18f00d03bbe49/html5/thumbnails/79.jpg)
θ = 0.01
θ = 0.03
![Page 80: Intro Forensic Stats - Promega](https://reader030.vdocuments.mx/reader030/viewer/2022012414/616eb8111be18f00d03bbe49/html5/thumbnails/80.jpg)
So,
PopStats has given us the numbers we desired…
What do we do with them???
![Page 81: Intro Forensic Stats - Promega](https://reader030.vdocuments.mx/reader030/viewer/2022012414/616eb8111be18f00d03bbe49/html5/thumbnails/81.jpg)
Well,
We report them of course!
But we should consider what we are reporting and the information we are
conveying in our "statistics"
![Page 82: Intro Forensic Stats - Promega](https://reader030.vdocuments.mx/reader030/viewer/2022012414/616eb8111be18f00d03bbe49/html5/thumbnails/82.jpg)
Source attribution
Hot topic for statistical debate
With the current panel of genetic markers available to forensic testing, it is not uncommon for the reciprocal of the random match probability determined for a genetic profile to exceed the worlds population several fold.
![Page 83: Intro Forensic Stats - Promega](https://reader030.vdocuments.mx/reader030/viewer/2022012414/616eb8111be18f00d03bbe49/html5/thumbnails/83.jpg)
So, how do you want to express this fact in your reports and testimony?
What do these numbers mean to you?the prosecutor?the defense?the judge?the jury?
This is what really matters!!!
![Page 84: Intro Forensic Stats - Promega](https://reader030.vdocuments.mx/reader030/viewer/2022012414/616eb8111be18f00d03bbe49/html5/thumbnails/84.jpg)
Big Number Names:1,000,000 million
1,000,000,000 billion1,000,000,000,000 trillion
1 x 1015 quadrillion1 x 1018 quintillion1 x 1021 sextillion1 x 1024 septillion1 x 1027 octillion1 x 1030 nonillion1 x 1033 decillion
![Page 85: Intro Forensic Stats - Promega](https://reader030.vdocuments.mx/reader030/viewer/2022012414/616eb8111be18f00d03bbe49/html5/thumbnails/85.jpg)
Even Bigger Number Names:
undecillionduodecilliontredecillionquattordecillionquindecillionsexdecillionseptendecillionoctodecillionnovemdecillionvigintillion
1 x 1036
1 x 1039
1 x 1042
1 x 1045
1 x 1048
1 x 1051
1 x 1054
1 x 1057
1 x 1060
1 x 1063
![Page 86: Intro Forensic Stats - Promega](https://reader030.vdocuments.mx/reader030/viewer/2022012414/616eb8111be18f00d03bbe49/html5/thumbnails/86.jpg)
NRC II May 1996NRC II May 1996
“...that profile might be said to“...that profile might be said tobe unique if it is so rare that it be unique if it is so rare that it becomes unreasonable to supposebecomes unreasonable to supposethat a second person in the that a second person in the population might have the samepopulation might have the sameprofile.”profile.”
![Page 87: Intro Forensic Stats - Promega](https://reader030.vdocuments.mx/reader030/viewer/2022012414/616eb8111be18f00d03bbe49/html5/thumbnails/87.jpg)
To address uniqueness we are back to the same old question… population sample size
Here the population size differs from what we discussed when calculating
allele frequencies…
The relevant population is at issue here
![Page 88: Intro Forensic Stats - Promega](https://reader030.vdocuments.mx/reader030/viewer/2022012414/616eb8111be18f00d03bbe49/html5/thumbnails/88.jpg)
Define the Question(or at least make sure you know what question
you are answering)
![Page 89: Intro Forensic Stats - Promega](https://reader030.vdocuments.mx/reader030/viewer/2022012414/616eb8111be18f00d03bbe49/html5/thumbnails/89.jpg)
Define the QuestionEstimates of the Rarity of
a DNA Profile:
Based on unrelated individuals
1 in 130 million
1 in 128Based on brothers
![Page 90: Intro Forensic Stats - Promega](https://reader030.vdocuments.mx/reader030/viewer/2022012414/616eb8111be18f00d03bbe49/html5/thumbnails/90.jpg)
Uniqueness / Source AttributionWebster’s Definitions
only one
Unusual
Some [circumstance] that is the only one of it kind
![Page 91: Intro Forensic Stats - Promega](https://reader030.vdocuments.mx/reader030/viewer/2022012414/616eb8111be18f00d03bbe49/html5/thumbnails/91.jpg)
Uniqueness / Source AttributionWebster’s Definitions
Attribution evaluated within context of case
Rarely is the world’s population the appropriate context
Thus, a circumstance that is the only one of its kind is appropriate context
![Page 92: Intro Forensic Stats - Promega](https://reader030.vdocuments.mx/reader030/viewer/2022012414/616eb8111be18f00d03bbe49/html5/thumbnails/92.jpg)
Uniqueness
A profile that exists in one person and no other (excluding identical twins)
Context?
• Population of the world…maybe• Population of the US….there is a thought!• Population with access to a crime scene…
![Page 93: Intro Forensic Stats - Promega](https://reader030.vdocuments.mx/reader030/viewer/2022012414/616eb8111be18f00d03bbe49/html5/thumbnails/93.jpg)
UniquenessA profile that exists in one person and no other (excluding identical twins)
Actually we are interested in source attribution, not whether the profile is unique in the world
Is it reasonable to consider the profile to be so rare that one can opine about the source of the evidence?
![Page 94: Intro Forensic Stats - Promega](https://reader030.vdocuments.mx/reader030/viewer/2022012414/616eb8111be18f00d03bbe49/html5/thumbnails/94.jpg)
Let the RMP of a given evidentiary profile X be px
(Calculate using NRC II Report Recommendations)
Then (1-px)N
is the probability of not observing the profile in a population of N unrelated individuals
This probability should be greater than or equal to a 1- α confidence level
(1-px)N ≥ 1- α
px ≤ 1 - (1- α)1/N
![Page 95: Intro Forensic Stats - Promega](https://reader030.vdocuments.mx/reader030/viewer/2022012414/616eb8111be18f00d03bbe49/html5/thumbnails/95.jpg)
Source Attribution• Specify (1- α) confidence level of 95% or 99%
(uses an α of 0.05 or 0.01, respectively)
• Determine RMP threshold to assert with a specific degree of confidence that the particular evidence profile is unique with a population of N unrelated individuals
What population????
![Page 96: Intro Forensic Stats - Promega](https://reader030.vdocuments.mx/reader030/viewer/2022012414/616eb8111be18f00d03bbe49/html5/thumbnails/96.jpg)
Source AttributionValues
Calculate p for major population groups
θ = 0.01or 0.03
Take the most common value for p Increase p by factor of 10Determine if p < 1- (1- a) 1/N
What N ??
![Page 97: Intro Forensic Stats - Promega](https://reader030.vdocuments.mx/reader030/viewer/2022012414/616eb8111be18f00d03bbe49/html5/thumbnails/97.jpg)
The standard basis that is used here in the US is an estimate of US population of approximately 260 million people
So, taking this and if we accept an α of 0.01 (99% confidence level) with
px ≤ 1-(1-α)1/N
A random match probability less than 3.9 x 10-11
would convey at least 99% confidence that the evidentiary profile is unique in the population
![Page 98: Intro Forensic Stats - Promega](https://reader030.vdocuments.mx/reader030/viewer/2022012414/616eb8111be18f00d03bbe49/html5/thumbnails/98.jpg)
RMP thresholds for source attribution at various population sizes and confidence levels
SAMPLE CONFIDENCE LEVELSSIZE N 0.90 0.95 0.99 0.9992 5.1x10-2 2.5x10-2 5.0x10-3 5.0x10-43 3.5x10-2 1.7x10-2 3.3x10-3 3.3x10-44 2.6x10-2 1.3x10-2 2.5x10-3 2.5x10-45 2.1x10-2 1.0x10-2 2.0x10-3 2.0x10-46 1.7x10-2 8.5x10-3 1.7x10-3 1.7x10-47 1.5x10-2 7.3x10-3 1.4x10-3 1.4x10-48 1.3x10-2 6.4x10-3 1.3x10-3 1.3x10-49 1.2x10-2 5.7x10-3 1.1x10-3 1.1x10-410 1.1x10-2 5.1x10-3 1.0x10-3 1.0x10-425 4.2x10-3 2.1x10-3 4.0x10-4 4.0x10-550 2.1x10-3 1.0x10-3 2.0x10-4 2.0x10-5100 1.1x10-3 5.1x10-4 1.0x10-4 1.0x10-51x103 1.1x10-4 5.1x10-5 1.0x10-5 1.0x10-61x105 1.1x10-6 5.1x10-7 1.0x10-7 1.0x10-81x106 1.1x10-7 5.1x10-8 1.0x10-8 1.0x10-91x107 1.1x10-8 5.1x10-9 1.0x10-9 1.0x10-105x107 2.1x10-9 1.0x10-9 2.0x10-10 2.0x10-112.6x108 4.1x10-10 2.0x10-10 3.9x10-11 3.9x10-121x109 1.1x10-10 5.1x10-11 1.0x10-11 1.0x10-125x109 2.1x10-11 1.0x10-11 2.0x10-12 2.0x10-13
![Page 99: Intro Forensic Stats - Promega](https://reader030.vdocuments.mx/reader030/viewer/2022012414/616eb8111be18f00d03bbe49/html5/thumbnails/99.jpg)
So with our PopStats results obtained for a θ = 0.01
Profile frequency is less than 99% threshold
3.9 x 10-11
![Page 100: Intro Forensic Stats - Promega](https://reader030.vdocuments.mx/reader030/viewer/2022012414/616eb8111be18f00d03bbe49/html5/thumbnails/100.jpg)
So with our PopStats results obtained for a θ = 0.03
Profile frequency is less than 99% threshold
3.9 x 10-11
![Page 101: Intro Forensic Stats - Promega](https://reader030.vdocuments.mx/reader030/viewer/2022012414/616eb8111be18f00d03bbe49/html5/thumbnails/101.jpg)
“To a reasonable degree of scientific certainty, ________is the source of the
DNA in specimen Q2.”
“I have a high degree of confidence, ________is the source of the DNA in
specimen Q2.”
"Based on an estimate of 260 million people resident in the population of the United States, there is 99% confidence
that ________is the source of the DNA in specimen Q2.”
![Page 102: Intro Forensic Stats - Promega](https://reader030.vdocuments.mx/reader030/viewer/2022012414/616eb8111be18f00d03bbe49/html5/thumbnails/102.jpg)
We are not stating that ________ is the only person to possess that profile. We are stating that we would not expect to find it in a
population of N individuals.
![Page 103: Intro Forensic Stats - Promega](https://reader030.vdocuments.mx/reader030/viewer/2022012414/616eb8111be18f00d03bbe49/html5/thumbnails/103.jpg)
Source Attribution
• Method is simple
• Conservative because N is so large (260,000,000)
• If N = 260,000,000, then RMP threshold is 3.9 x 10-11
• Most of the time the RMP is far less, so confidence is greater than 0.99
![Page 104: Intro Forensic Stats - Promega](https://reader030.vdocuments.mx/reader030/viewer/2022012414/616eb8111be18f00d03bbe49/html5/thumbnails/104.jpg)
Source Attribution
• N can be configured to context of the case
• Two individuals to entire town, state, or whatever
• Laboratory policy to set N
![Page 105: Intro Forensic Stats - Promega](https://reader030.vdocuments.mx/reader030/viewer/2022012414/616eb8111be18f00d03bbe49/html5/thumbnails/105.jpg)
Random match probability is NOT
Chance that someone else is guilty
Chance that someone else left the bloodstain
Chance of defendant not being guilty
![Page 106: Intro Forensic Stats - Promega](https://reader030.vdocuments.mx/reader030/viewer/2022012414/616eb8111be18f00d03bbe49/html5/thumbnails/106.jpg)
![Page 107: Intro Forensic Stats - Promega](https://reader030.vdocuments.mx/reader030/viewer/2022012414/616eb8111be18f00d03bbe49/html5/thumbnails/107.jpg)