probability definitions of statistics

Upload: shihab-khan

Post on 02-Apr-2018

217 views

Category:

Documents


0 download

TRANSCRIPT

  • 7/27/2019 Probability definitions of statistics

    1/19

    Probability

  • 7/27/2019 Probability definitions of statistics

    2/19

    Lets start with some definitions

    Suppose you throw a pair of dice and add the numbers facing up (experiment)

    When you repeat this experiment long enough you will find out that the set of allpossible outcomes is S = {2,3,4,,12}. S is called the sample space.

    Sometimes you are interested in a subset of S. For example, you may want toknow, in the example above, the set of all possible even outcomes. We usually

    refer to such subsets as events (E).In this case E = {2,4,6,8,10,12}.

    Question: A coin is tossed three times and number of Heads are noted. What is thesample space for this experiment? What is the set of all outcomes where heads

    come up at least twice?

    S = {0,1,2,3}

    E = {HHH, HHT, HTH, THH} and E = {TTT, THT, TTH, HTT}

    E U E = S and E E =

  • 7/27/2019 Probability definitions of statistics

    3/19

    Another example

    A pair of dice (one red, one green) is thrown and the number facing up on each die is

    noted, let E be the event that the sum of the numbers is 4, and let F be the event that the

    sum is an odd number.

    a) The event F' is:

    {(1, 1), (1, 3), (1, 5), (2, 2),(2, 4), (2, 6), (3, 1), (3, 3),

    (3, 5), (4, 2), (4, 4), (4, 6),

    (5, 1), (5, 3), (5, 5), (6, 2),

    (6, 4), (6, 6)}

    b) E F' is the event:

    {(1, 3), (2, 2), (3, 1)}

    c) E' U F is the event:

    The sum of the numbers is any number other than 4

  • 7/27/2019 Probability definitions of statistics

    4/19

    Estimated probability

    a) The frequencyof the event E [denoted as fr(E)] is the number of times the event E

    occurs.

    Example: Toss a coin 20 times. If heads comes up 13 times, then the frequency of the

    event that heads comes up is fr(E) = 13.

    b) Therelative frequencyorestimated probabilityof the event E is the fraction of

    times E occurs: P(E) = fr(E) / N.

    Referring to the example above, the estimated probability of the event that heads comes

    up is 13/20=65%.

    c) The number of times that the experiment is performed is called the sample size (N).

    The experiment above was performed 20 times, so the sample size is N = 20.

  • 7/27/2019 Probability definitions of statistics

    5/19

    Estimated probability distribution

    d) The collection of the estimated probabilities ofallthe outcomes is the estimated

    probability distribution.

    Example: If 10 rolls of a die resulted in the outcomes 2, 1, 4, 4, 5, 6, 1, 2, 2, 1, the

    associated estimated probability distribution is the one shown in the following table.

    Outcome 1 2 3 4 5 6

    Probability 0.3 0.3 0 0.2 0.1 0.1

  • 7/27/2019 Probability definitions of statistics

    6/19

    Properties of probability distributions

    Let S = {s1, s2, ... , sn} be a sample space and let P(s i) be the estimated probability of the

    event {si}. Then:

    0 P(si) 1

    The estimated probability of each outcome is a number between 0 and 1.

    P(s1) + P(s2) + .. + P(sn)=1

    The estimated probabilities of all the outcomes add up to 1.

    If E = {e1, e2, ..., er}, then P(E) = P(e1) + P(e2) + ... + P(er).

    The estimated probability of an event E is the sum of the estimated probabilities of the

    individual outcomes in E.

  • 7/27/2019 Probability definitions of statistics

    7/19

    Theoretical probability

    Definition:The theoretical probability, orprobability, P(E), of an event E is the fractionof times we expectE to occur if we repeat the same experiment over and over.

    Relationship with estimated probability:The estimated probability approaches thetheoretical probability as the number of trials gets larger and larger. Thus, estimatedprobability is an approximation, or estimate, of theoretical probability.

    Determining theoretical probability:Theoretical probability is determined analytically, thatis, by using our knowledge about the nature of the experiment rather than through actualexperimentation. The best we can obtain through actual experimentation is an estimate ofthe theoretical probability (hence the term "estimated probability").

    Example: Toss a fair coin and observe the uppermost side. Since we expect that heads isas likely to come up as tails, we conclude that the theoretical probability distribution is:P(H) = P(T) = 1/2.

    Similarly, if you roll a fair dice: P(1)=P(2)= . . . = P(6) = 1/6.

  • 7/27/2019 Probability definitions of statistics

    8/19

    Probability distributions

    Formal def ini t ion:A finite sample space is just any old finite set S = {s1, s2, ..., sn}. Aprobability distribution is an assignment of a number P(s i) to each outcome si in asample space S ={s1, s2, ... , sn} such that

    0 P(si) 1The estimated probability of each outcome is a number between 0 and 1.

    P(s1) + P(s2) + .. + P(sn)=1

    The estimated probabilities of all the outcomes add up to 1.

    A few more facts to note:

    To get the probability of an event(recall that an event is a set of outcomes) all we dois add the probabilities of the individual outcomes.

    If P(E) = 0, we call E an impossible event.

  • 7/27/2019 Probability definitions of statistics

    9/19

    An example

    Fill in the missing probabilities, given that P(6) = 3P(1).

    What is the probability that an even number will come up?

    Outcome 1 2 3 4 5 6

    Probability ? 0.125 0.125 0.125 0.125 ?

  • 7/27/2019 Probability definitions of statistics

    10/19

    Addition principle

    a) If E and F are mutually exclusive events, then P(E U F) = P(E) + P(F).

    Example: Cast two dice and add the numbers facing up. E: the sum is 5; F: the sum is

    even. Since these are mutually exclusive events, the probability that the sum is either 5

    or even is: P(E U F) = P(E) + P(F) = 1/9 + 1/2 = 11/18.

    b) If E and F are NOT mutually exclusive events, then we must use the more general

    formula: P(E U F) = P(E) + P(F) - P(E F).

    Example: Cast a single dice and note the number facing up. E: the outcome is 1, 2 or 3;

    so P(E) = 1/2; F: the outcome is even; P(E) = 1/2

    E F = {2}; P(E F) = 1/6.

    P(E U F)=P(E)+P(F)-P(E F) = 1/9 + 1/2 1/6 = 11/18.

  • 7/27/2019 Probability definitions of statistics

    11/19

    Other principles

    P(E') = 1 - P(E)

    P(S) = 1

    P(O) = 1 P(S) = 0

  • 7/27/2019 Probability definitions of statistics

    12/19

    Conditional probability

    Quick i l lust rat ion: Suppose you cast two dice; one red and one green.

    Then the probability of getting "bulls eyes" (two ones) is 1/36.However, if, after casting the dice, you ascertain that the green die

    shows a one (but know nothing about the red die), then there is a 1/6

    change that both of them will be one. In other words, the probability of

    "bulls eyes" changes if you have partial information, and we refer tothis (altered) probability as "conditional probability".

  • 7/27/2019 Probability definitions of statistics

    13/19

    Example

    Used

    cream Placebo TOTAL

    Skin improved 800 600 1,400

    No improvement 400 200 600

    TOTAL 1,200 800 2,000

    The table above shows fictitious trial results of a new acne cream.

    The sample size (total number of people in the study) is: 2,000

    The estimated probability that someone's skin improved (regardless of which skin

    cream was used) is: 1,400/2,000 = 70%. (unconditional probability)

    The experimental probability that someone's skin improved, given that they used the

    new acne cream is: 800/1,200 = 66.7% (conditional probability)

  • 7/27/2019 Probability definitions of statistics

    14/19

    Formally

    The probability of the event E, given the event F, and written P(E|F) (probability of E,given F)

    If E and F are not mutually exclusive events, then the probability of E given F is

    If all outcomes are equally likely, then we can also use the alternative formula

    )(

    )()|(

    FP

    FEPFEP

    )(

    )()|(

    Ffr

    FEfrFEP

  • 7/27/2019 Probability definitions of statistics

    15/19

    Another example

    You have invested in Home-Clone Inc. stocks, as you suspect that the company's "Clone-

    a-Sibling" kit will shortly be approved by the FDA. There is an 80% chance that FDA

    approval will be given, and a 95% chance that the value of the stock you hold will double

    if FDA approval is given. What is the probability that the value of the stock you hold will

    double?

    F: FDA approval, E: Price doubles

    P(E|F) = 95%, P(F) = 80%, P(E) = ?

    Using the formula above: 0.95 = 0.8 * P(E)

    P(E) = 76%.

  • 7/27/2019 Probability definitions of statistics

    16/19

    Independent Events

    If E and F are independent if any one of the following equations holds:

    P(E|F) = P(E)

    P(F|E) = P(F)

    P(EF) = P(E) * P(F)

    Example: You throw two fair dice, one green and one red, and observe the numbersuppermost.

    A: the event that their sum is 7; P(E) = n(E)/36 = 6/36 = 1/6

    B: the event that the red die shows an even number; P(F) = 1/2

    P(AB) = P((1, 6), (3, 4), (5, 2)) = 3/36 = 1/12.

    Since P(AB) = P(A) * P(B), A and B are independent.

  • 7/27/2019 Probability definitions of statistics

    17/19

    Bayes Theorem

    Bayes' theorem gives us a way of calculating P(E|F) from a knowledge of P(F|E).

    )'(*)'|()(*)|(

    )(*)|()|(

    FPFEPFPFEP

    FPFEPEFP

    Example: A manufacturer claims that its drug test will detect steroid use (that is, showpositive for an athlete who uses steroids) 95% of the time. Your friend on the rugby

    team has just tested positive. What is the probability that he uses steroids is?

    E = the event that an anabolic steroid detection test gives a positive result

    F = the event that the athlete uses steroids

    P(E|F) = 95%, P(F|E) = ?

    To answer this, we need to know:

    a) P(F) (the probability that an athlete on the team is using steroids)

    b) P(E|F') (the probability of a false positive result).

  • 7/27/2019 Probability definitions of statistics

    18/19

    Example continues

    Suppose, 15% of all steroid-free individuals also test positive [P(E|F') =15%]. Also,10% of the rugby team members use steroids [P(F)=10%]. Your friend on the rugby

    team has just tested positive. The probability that he uses steroids is:

    413.0

    90.0*15.010.0*95.0

    10.0*95.0)|(

    EFP

  • 7/27/2019 Probability definitions of statistics

    19/19