introduction statistics introduction professor ke-sheng cheng department of bioenvironmental systems...
TRANSCRIPT
![Page 1: Introduction STATISTICS Introduction Professor Ke-Sheng Cheng Department of Bioenvironmental Systems Engineering National Taiwan University](https://reader033.vdocuments.mx/reader033/viewer/2022061306/55147f33550346b0158b5723/html5/thumbnails/1.jpg)
STATISTICS IntroductionIntroduction
Professor Ke-Sheng ChengDepartment of Bioenvironmental Systems Engineering
National Taiwan University
![Page 2: Introduction STATISTICS Introduction Professor Ke-Sheng Cheng Department of Bioenvironmental Systems Engineering National Taiwan University](https://reader033.vdocuments.mx/reader033/viewer/2022061306/55147f33550346b0158b5723/html5/thumbnails/2.jpg)
• Lecture notes will be posted on class website– www.rslabntu.net– Supplementary material: IRSUR by Kerns
• Grades– Homeworks (60%) [No homework copying.]– Midterm (20%), Final (20%)
• The R language will be used for data analysis.• A tutorial session is arranged on Tuesday (6:00
– 7:00 pm).• Office hour: Thursday 2:30 – 3:30 pm
04/10/23Lab for Remote Sensing Hydrology and Spatial Modeling Department of Bioenvironmental Systems Engineering, National Taiwan University
2
![Page 3: Introduction STATISTICS Introduction Professor Ke-Sheng Cheng Department of Bioenvironmental Systems Engineering National Taiwan University](https://reader033.vdocuments.mx/reader033/viewer/2022061306/55147f33550346b0158b5723/html5/thumbnails/3.jpg)
What is “statistics”?
• Statistics is a science of “reasoning” from data.
• A body of principles and methods for extracting useful information from data, for assessing the reliability of that information, for measuring and managing risk, and for making decisions in the face of uncertainty.
04/10/23 3Lab for Remote Sensing Hydrology and Spatial Modeling Department of Bioenvironmental Systems Engineering, National Taiwan University
![Page 4: Introduction STATISTICS Introduction Professor Ke-Sheng Cheng Department of Bioenvironmental Systems Engineering National Taiwan University](https://reader033.vdocuments.mx/reader033/viewer/2022061306/55147f33550346b0158b5723/html5/thumbnails/4.jpg)
• The major difference between statistics and mathematics is that statistics always needs “observed” data, while mathematics does not.
• An important feature of statistical methods is the “uncertainty” involved in analysis.
04/10/23 4Lab for Remote Sensing Hydrology and Spatial Modeling Department of Bioenvironmental Systems Engineering, National Taiwan University
![Page 5: Introduction STATISTICS Introduction Professor Ke-Sheng Cheng Department of Bioenvironmental Systems Engineering National Taiwan University](https://reader033.vdocuments.mx/reader033/viewer/2022061306/55147f33550346b0158b5723/html5/thumbnails/5.jpg)
• Statistics is the discipline concerned with the study of variability, with the study of uncertainty and with the study of decision-making in the face of uncertainty. As these are issues that are crucial throughout the sciences and engineering, statistics is an inherently interdisciplinary science.
04/10/23 5Lab for Remote Sensing Hydrology and Spatial Modeling Department of Bioenvironmental Systems Engineering, National Taiwan University
![Page 6: Introduction STATISTICS Introduction Professor Ke-Sheng Cheng Department of Bioenvironmental Systems Engineering National Taiwan University](https://reader033.vdocuments.mx/reader033/viewer/2022061306/55147f33550346b0158b5723/html5/thumbnails/6.jpg)
• Extracting useful information from data• Assessing the reliability of that information– How much are we sure about our claim based on
the data?
04/10/23 6Lab for Remote Sensing Hydrology and Spatial Modeling Department of Bioenvironmental Systems Engineering, National Taiwan University
![Page 7: Introduction STATISTICS Introduction Professor Ke-Sheng Cheng Department of Bioenvironmental Systems Engineering National Taiwan University](https://reader033.vdocuments.mx/reader033/viewer/2022061306/55147f33550346b0158b5723/html5/thumbnails/7.jpg)
• One of the objectives of this course is to facilitate students with a critical way of thinking.– Accuracy of weather forecasting– Accuracy of flood forecasting– Not to be fooled by the surface meaning of
statistical terminologies.
04/10/23 7Lab for Remote Sensing Hydrology and Spatial Modeling Department of Bioenvironmental Systems Engineering, National Taiwan University
![Page 8: Introduction STATISTICS Introduction Professor Ke-Sheng Cheng Department of Bioenvironmental Systems Engineering National Taiwan University](https://reader033.vdocuments.mx/reader033/viewer/2022061306/55147f33550346b0158b5723/html5/thumbnails/8.jpg)
Sources of uncertainties
• Data uncertainty• Parameter uncertainty• Model structure uncertainty– An exemplar illustration
04/10/23 8Lab for Remote Sensing Hydrology and Spatial Modeling Department of Bioenvironmental Systems Engineering, National Taiwan University
![Page 9: Introduction STATISTICS Introduction Professor Ke-Sheng Cheng Department of Bioenvironmental Systems Engineering National Taiwan University](https://reader033.vdocuments.mx/reader033/viewer/2022061306/55147f33550346b0158b5723/html5/thumbnails/9.jpg)
-100
0
100
200
300
400
500
600
700
800
0 10 20 30 40 50 60 70 80
You are given a set of (x,y) data. Apparently, Y is dependent on X.
04/10/23 9Lab for Remote Sensing Hydrology and Spatial Modeling Department of Bioenvironmental Systems Engineering, National Taiwan University
![Page 10: Introduction STATISTICS Introduction Professor Ke-Sheng Cheng Department of Bioenvironmental Systems Engineering National Taiwan University](https://reader033.vdocuments.mx/reader033/viewer/2022061306/55147f33550346b0158b5723/html5/thumbnails/10.jpg)
Observed data with uncertainties (Linear model)
y = 9.9507x - 48.343
R2 = 0.9534
-100
0
100
200
300
400
500
600
700
800
0 10 20 30 40 50 60 70 80
04/10/23 10Lab for Remote Sensing Hydrology and Spatial Modeling Department of Bioenvironmental Systems Engineering, National Taiwan University
![Page 11: Introduction STATISTICS Introduction Professor Ke-Sheng Cheng Department of Bioenvironmental Systems Engineering National Taiwan University](https://reader033.vdocuments.mx/reader033/viewer/2022061306/55147f33550346b0158b5723/html5/thumbnails/11.jpg)
Observed data with uncertainties(Power model)
y = 3.5218x1.2335
R2 = 0.8852
-100
0
100
200
300
400
500
600
700
800
0 10 20 30 40 50 60 70 80
The linear model fits the data better than the power model.
04/10/23 11Lab for Remote Sensing Hydrology and Spatial Modeling Department of Bioenvironmental Systems Engineering, National Taiwan University
![Page 12: Introduction STATISTICS Introduction Professor Ke-Sheng Cheng Department of Bioenvironmental Systems Engineering National Taiwan University](https://reader033.vdocuments.mx/reader033/viewer/2022061306/55147f33550346b0158b5723/html5/thumbnails/12.jpg)
Theoretical model:
y = 3.5218x1.2335
R2 = 0.8852
-100
0
100
200
300
400
500
600
700
800
0 10 20 30 40 50 60 70 80
)60,0(~,8.2 3.1 NiidXY
Sum of squared errors (SSE) of estimates of the linear and power models (with respect to the theoretical model) are 12011.7 and 8950.08, respectively.
Theoretical model
The power model performs better than the linear model.
04/10/23 12Lab for Remote Sensing Hydrology and Spatial Modeling Department of Bioenvironmental Systems Engineering, National Taiwan University
![Page 13: Introduction STATISTICS Introduction Professor Ke-Sheng Cheng Department of Bioenvironmental Systems Engineering National Taiwan University](https://reader033.vdocuments.mx/reader033/viewer/2022061306/55147f33550346b0158b5723/html5/thumbnails/13.jpg)
Key topics in statistics
• Probability• Estimation• Test of hypotheses• Regression• Forecasting• Quality control• Simulation
04/10/23 13Lab for Remote Sensing Hydrology and Spatial Modeling Department of Bioenvironmental Systems Engineering, National Taiwan University
![Page 14: Introduction STATISTICS Introduction Professor Ke-Sheng Cheng Department of Bioenvironmental Systems Engineering National Taiwan University](https://reader033.vdocuments.mx/reader033/viewer/2022061306/55147f33550346b0158b5723/html5/thumbnails/14.jpg)
Deterministic vs Stochastic Models
• An abstract model is a description of the essential properties of a phenomenon that is formulated in mathematical terms. – An abstract model is used as a theoretical
approximation of reality to help us understand the world around us.
All models are wrong!
04/10/23 14Lab for Remote Sensing Hydrology and Spatial Modeling Department of Bioenvironmental Systems Engineering, National Taiwan University
![Page 15: Introduction STATISTICS Introduction Professor Ke-Sheng Cheng Department of Bioenvironmental Systems Engineering National Taiwan University](https://reader033.vdocuments.mx/reader033/viewer/2022061306/55147f33550346b0158b5723/html5/thumbnails/15.jpg)
• Essentially, all models are wrong, but some are useful.
• Remember that all models are wrong; the practical question is how wrong do they have to be to not be useful. (George E. P. Box)– Normal distribution for men’s height, grades in a
statistics class, etc.
04/10/23 15Lab for Remote Sensing Hydrology and Spatial Modeling Department of Bioenvironmental Systems Engineering, National Taiwan University
![Page 16: Introduction STATISTICS Introduction Professor Ke-Sheng Cheng Department of Bioenvironmental Systems Engineering National Taiwan University](https://reader033.vdocuments.mx/reader033/viewer/2022061306/55147f33550346b0158b5723/html5/thumbnails/16.jpg)
Types of abstract models
• Deterministic model– A deterministic model describes a phenomenon
whose outcome is fixed.
• Stochastic model– A random/stochastic model describes the
unpredictable variation of the outcomes of a random experiment.
04/10/23 16Lab for Remote Sensing Hydrology and Spatial Modeling Department of Bioenvironmental Systems Engineering, National Taiwan University
![Page 17: Introduction STATISTICS Introduction Professor Ke-Sheng Cheng Department of Bioenvironmental Systems Engineering National Taiwan University](https://reader033.vdocuments.mx/reader033/viewer/2022061306/55147f33550346b0158b5723/html5/thumbnails/17.jpg)
Examples
• Deterministic model– Suppose we wish to measure the area covered by
a lake that, for all practical purposes, appears to have a circular shoreline. Since we know the area A=r2, where r is the radius, we would attempt to measure the radius and substitute it in the formula.
04/10/23 17Lab for Remote Sensing Hydrology and Spatial Modeling Department of Bioenvironmental Systems Engineering, National Taiwan University
![Page 18: Introduction STATISTICS Introduction Professor Ke-Sheng Cheng Department of Bioenvironmental Systems Engineering National Taiwan University](https://reader033.vdocuments.mx/reader033/viewer/2022061306/55147f33550346b0158b5723/html5/thumbnails/18.jpg)
• Stochastic model– Consider the experiment of tossing a balanced coin and
observing the upper face. It is not possible to predict with absolute accuracy what the upper face will be even if we repeat the experiment so many times. However, it is possible to predict what will happen in the long run. We can say that the probability of heads on a single toss is ½.
– P(more than 60 heads in 100 trials)
04/10/23 18Lab for Remote Sensing Hydrology and Spatial Modeling Department of Bioenvironmental Systems Engineering, National Taiwan University
![Page 19: Introduction STATISTICS Introduction Professor Ke-Sheng Cheng Department of Bioenvironmental Systems Engineering National Taiwan University](https://reader033.vdocuments.mx/reader033/viewer/2022061306/55147f33550346b0158b5723/html5/thumbnails/19.jpg)
Random Experiment and Sample Space
• An experiment that can be repeated under the same (or uniform) conditions, but whose outcome cannot be predicted in advance, even when the same experiment has been performed many times, is called a random experiment.
• Can the lotto draw be considered as a random experiment?
04/10/23 19Lab for Remote Sensing Hydrology and Spatial Modeling Department of Bioenvironmental Systems Engineering, National Taiwan University
![Page 20: Introduction STATISTICS Introduction Professor Ke-Sheng Cheng Department of Bioenvironmental Systems Engineering National Taiwan University](https://reader033.vdocuments.mx/reader033/viewer/2022061306/55147f33550346b0158b5723/html5/thumbnails/20.jpg)
• Examples of random experiments– The tossing of a coin. – The roll of a die. – The selection of a numbered ball (1-50) in an urn.
(selection with replacement) – The time interval between the occurrences of two
higher than scale 6 earthquakes. – The amount of rainfalls produced by typhoons in
one year (yearly typhoon rainfalls).
04/10/23Lab for Remote Sensing Hydrology and Spatial Modeling Department of Bioenvironmental Systems Engineering, National Taiwan University
20
![Page 21: Introduction STATISTICS Introduction Professor Ke-Sheng Cheng Department of Bioenvironmental Systems Engineering National Taiwan University](https://reader033.vdocuments.mx/reader033/viewer/2022061306/55147f33550346b0158b5723/html5/thumbnails/21.jpg)
• The following items are always associated with a random experiment: – Sample space. The set of all possible outcomes,
denoted by . – Outcomes. Elements of the sample space,
denoted by . These are also referred to as sample points or realizations.
– Events. Subsets of for which the probability is defined. Events are denoted by capital Latin letters (e.g., A,B,C).
04/10/23 21Lab for Remote Sensing Hydrology and Spatial Modeling Department of Bioenvironmental Systems Engineering, National Taiwan University
![Page 22: Introduction STATISTICS Introduction Professor Ke-Sheng Cheng Department of Bioenvironmental Systems Engineering National Taiwan University](https://reader033.vdocuments.mx/reader033/viewer/2022061306/55147f33550346b0158b5723/html5/thumbnails/22.jpg)
Definition of Probability
• Classical probability• Frequency probability• Probability model
04/10/23 22Lab for Remote Sensing Hydrology and Spatial Modeling Department of Bioenvironmental Systems Engineering, National Taiwan University
![Page 23: Introduction STATISTICS Introduction Professor Ke-Sheng Cheng Department of Bioenvironmental Systems Engineering National Taiwan University](https://reader033.vdocuments.mx/reader033/viewer/2022061306/55147f33550346b0158b5723/html5/thumbnails/23.jpg)
Classical (or a priori) probability
• If a random experiment can result in n mutually exclusive and equally likely outcomes and if nA of these outcomes have
an attribute A, then the probability of A is the fraction nA/n .
04/10/23 23Lab for Remote Sensing Hydrology and Spatial Modeling Department of Bioenvironmental Systems Engineering, National Taiwan University
![Page 24: Introduction STATISTICS Introduction Professor Ke-Sheng Cheng Department of Bioenvironmental Systems Engineering National Taiwan University](https://reader033.vdocuments.mx/reader033/viewer/2022061306/55147f33550346b0158b5723/html5/thumbnails/24.jpg)
• Example 1.
Compute the probability of getting two heads if a fair coin is tossed twice. (1/4)
• Example 2.
The probability that a card drawn from an ordinary well-shuffled deck will be an ace or a spade. (16/52)
04/10/23 24Lab for Remote Sensing Hydrology and Spatial Modeling Department of Bioenvironmental Systems Engineering, National Taiwan University
![Page 25: Introduction STATISTICS Introduction Professor Ke-Sheng Cheng Department of Bioenvironmental Systems Engineering National Taiwan University](https://reader033.vdocuments.mx/reader033/viewer/2022061306/55147f33550346b0158b5723/html5/thumbnails/25.jpg)
Remarks
• The probabilities determined by the classical definition are called “a priori” probabilities since they can be derived purely by deductive reasoning.
04/10/23 25Lab for Remote Sensing Hydrology and Spatial Modeling Department of Bioenvironmental Systems Engineering, National Taiwan University
![Page 26: Introduction STATISTICS Introduction Professor Ke-Sheng Cheng Department of Bioenvironmental Systems Engineering National Taiwan University](https://reader033.vdocuments.mx/reader033/viewer/2022061306/55147f33550346b0158b5723/html5/thumbnails/26.jpg)
• The “equally likely” assumption requires the experiment to be carried out in such a way that the assumption is realistic; such as, using a balanced coin, using a die that is not loaded, using a well-shuffled deck of cards, using random sampling, and so forth. This assumption also requires that the sample space is appropriately defined.
04/10/23Lab for Remote Sensing Hydrology and Spatial Modeling Department of Bioenvironmental Systems Engineering, National Taiwan University
26
![Page 27: Introduction STATISTICS Introduction Professor Ke-Sheng Cheng Department of Bioenvironmental Systems Engineering National Taiwan University](https://reader033.vdocuments.mx/reader033/viewer/2022061306/55147f33550346b0158b5723/html5/thumbnails/27.jpg)
• Troublesome limitations in the classical definition of probability: – If the number of possible outcomes is infinite; – If possible outcomes are not equally likely.
04/10/23Lab for Remote Sensing Hydrology and Spatial Modeling Department of Bioenvironmental Systems Engineering, National Taiwan University
27
![Page 28: Introduction STATISTICS Introduction Professor Ke-Sheng Cheng Department of Bioenvironmental Systems Engineering National Taiwan University](https://reader033.vdocuments.mx/reader033/viewer/2022061306/55147f33550346b0158b5723/html5/thumbnails/28.jpg)
Relative frequency(or a posteriori) probability
• We observe outcomes of a random experiment which is repeated many times. We postulate a number p which is the probability of an event, and approximate p by the relative frequency f with which the repeated observations satisfy the event.
04/10/23 28Lab for Remote Sensing Hydrology and Spatial Modeling Department of Bioenvironmental Systems Engineering, National Taiwan University
![Page 29: Introduction STATISTICS Introduction Professor Ke-Sheng Cheng Department of Bioenvironmental Systems Engineering National Taiwan University](https://reader033.vdocuments.mx/reader033/viewer/2022061306/55147f33550346b0158b5723/html5/thumbnails/29.jpg)
• Suppose a random experiment is repeated n times under uniform conditions, and if event A occurred nA times, then the relative frequency for which A occurs is fn(A) = nA/n. If the limit of fn(A) as n approaches infinity exists then one can assign the probability of A by:
P(A)= .)(lim Af n
n
04/10/23 29Lab for Remote Sensing Hydrology and Spatial Modeling Department of Bioenvironmental Systems Engineering, National Taiwan University
![Page 30: Introduction STATISTICS Introduction Professor Ke-Sheng Cheng Department of Bioenvironmental Systems Engineering National Taiwan University](https://reader033.vdocuments.mx/reader033/viewer/2022061306/55147f33550346b0158b5723/html5/thumbnails/30.jpg)
• This method requires the existence of the limit of the relative frequencies. This property is known as statistical regularity. This property will be satisfied if the trials are independent and are performed under uniform conditions.
04/10/23 30Lab for Remote Sensing Hydrology and Spatial Modeling Department of Bioenvironmental Systems Engineering, National Taiwan University
![Page 31: Introduction STATISTICS Introduction Professor Ke-Sheng Cheng Department of Bioenvironmental Systems Engineering National Taiwan University](https://reader033.vdocuments.mx/reader033/viewer/2022061306/55147f33550346b0158b5723/html5/thumbnails/31.jpg)
• Example 3
A fair coin was tossed 100 times with 54 occurrences of head. The probability of head occurrence for each toss is estimated to be 0.54.
04/10/23 31Lab for Remote Sensing Hydrology and Spatial Modeling Department of Bioenvironmental Systems Engineering, National Taiwan University
![Page 32: Introduction STATISTICS Introduction Professor Ke-Sheng Cheng Department of Bioenvironmental Systems Engineering National Taiwan University](https://reader033.vdocuments.mx/reader033/viewer/2022061306/55147f33550346b0158b5723/html5/thumbnails/32.jpg)
• Example 4– Randomly draw three balls in the box at one time. • What is the sample space of the random experiment?• What is the probability of having two or more blue balls
in a draw?• What if …
04/10/23Lab for Remote Sensing Hydrology and Spatial Modeling Department of Bioenvironmental Systems Engineering, National Taiwan University
32
1
2
1
32
![Page 33: Introduction STATISTICS Introduction Professor Ke-Sheng Cheng Department of Bioenvironmental Systems Engineering National Taiwan University](https://reader033.vdocuments.mx/reader033/viewer/2022061306/55147f33550346b0158b5723/html5/thumbnails/33.jpg)
• The chain of probability definition
04/10/23Lab for Remote Sensing Hydrology and Spatial Modeling Department of Bioenvironmental Systems Engineering, National Taiwan University
33
Random experimen
t
Sample space
Event space
Probability space
![Page 34: Introduction STATISTICS Introduction Professor Ke-Sheng Cheng Department of Bioenvironmental Systems Engineering National Taiwan University](https://reader033.vdocuments.mx/reader033/viewer/2022061306/55147f33550346b0158b5723/html5/thumbnails/34.jpg)
Probability Model
04/10/23 34Lab for Remote Sensing Hydrology and Spatial Modeling Department of Bioenvironmental Systems Engineering, National Taiwan University
![Page 35: Introduction STATISTICS Introduction Professor Ke-Sheng Cheng Department of Bioenvironmental Systems Engineering National Taiwan University](https://reader033.vdocuments.mx/reader033/viewer/2022061306/55147f33550346b0158b5723/html5/thumbnails/35.jpg)
Event and event spaceEvent and event spaceAn event is a subset of the sample space. The class of all events associated with a given random experiment is defined to be the event space.
04/10/23 35Lab for Remote Sensing Hydrology and Spatial Modeling Department of Bioenvironmental Systems Engineering, National Taiwan University
![Page 36: Introduction STATISTICS Introduction Professor Ke-Sheng Cheng Department of Bioenvironmental Systems Engineering National Taiwan University](https://reader033.vdocuments.mx/reader033/viewer/2022061306/55147f33550346b0158b5723/html5/thumbnails/36.jpg)
Remarks
04/10/23 36Lab for Remote Sensing Hydrology and Spatial Modeling Department of Bioenvironmental Systems Engineering, National Taiwan University
![Page 37: Introduction STATISTICS Introduction Professor Ke-Sheng Cheng Department of Bioenvironmental Systems Engineering National Taiwan University](https://reader033.vdocuments.mx/reader033/viewer/2022061306/55147f33550346b0158b5723/html5/thumbnails/37.jpg)
• Probability is a mapping of sets to numbers. • Probability is not a mapping of the sample
space to numbers. – The expression is not defined.
However, for a singleton event , is defined.
for )(P}{ })({P
04/10/23 37Lab for Remote Sensing Hydrology and Spatial Modeling Department of Bioenvironmental Systems Engineering, National Taiwan University
![Page 38: Introduction STATISTICS Introduction Professor Ke-Sheng Cheng Department of Bioenvironmental Systems Engineering National Taiwan University](https://reader033.vdocuments.mx/reader033/viewer/2022061306/55147f33550346b0158b5723/html5/thumbnails/38.jpg)
Probability space
• A probability space is the triplet (, , P[]), where is a sample space, is an event space, and P[] is a probability function with domain .
• A probability space constitutes a complete probabilistic description of a random experiment. – The sample space defines all of the possible
outcomes, the event space defines all possible things that could be observed as a result of an experiment, and the probability P defines the degree of belief or evidential support associated with the experiment.
04/10/23 38Lab for Remote Sensing Hydrology and Spatial Modeling Department of Bioenvironmental Systems Engineering, National Taiwan University
![Page 39: Introduction STATISTICS Introduction Professor Ke-Sheng Cheng Department of Bioenvironmental Systems Engineering National Taiwan University](https://reader033.vdocuments.mx/reader033/viewer/2022061306/55147f33550346b0158b5723/html5/thumbnails/39.jpg)
Conditional probability
04/10/23 39Lab for Remote Sensing Hydrology and Spatial Modeling Department of Bioenvironmental Systems Engineering, National Taiwan University
![Page 40: Introduction STATISTICS Introduction Professor Ke-Sheng Cheng Department of Bioenvironmental Systems Engineering National Taiwan University](https://reader033.vdocuments.mx/reader033/viewer/2022061306/55147f33550346b0158b5723/html5/thumbnails/40.jpg)
Bayes’ theorem
04/10/23 40Lab for Remote Sensing Hydrology and Spatial Modeling Department of Bioenvironmental Systems Engineering, National Taiwan University
![Page 41: Introduction STATISTICS Introduction Professor Ke-Sheng Cheng Department of Bioenvironmental Systems Engineering National Taiwan University](https://reader033.vdocuments.mx/reader033/viewer/2022061306/55147f33550346b0158b5723/html5/thumbnails/41.jpg)
Multiplication rule
04/10/23 41Lab for Remote Sensing Hydrology and Spatial Modeling Department of Bioenvironmental Systems Engineering, National Taiwan University
![Page 42: Introduction STATISTICS Introduction Professor Ke-Sheng Cheng Department of Bioenvironmental Systems Engineering National Taiwan University](https://reader033.vdocuments.mx/reader033/viewer/2022061306/55147f33550346b0158b5723/html5/thumbnails/42.jpg)
Independent events
04/10/23 42Lab for Remote Sensing Hydrology and Spatial Modeling Department of Bioenvironmental Systems Engineering, National Taiwan University
![Page 43: Introduction STATISTICS Introduction Professor Ke-Sheng Cheng Department of Bioenvironmental Systems Engineering National Taiwan University](https://reader033.vdocuments.mx/reader033/viewer/2022061306/55147f33550346b0158b5723/html5/thumbnails/43.jpg)
• The property of independence of two events A and B and the property that A and B are mutually exclusive are distinct, though related, properties.
• If A and B are mutually exclusive events then AB=. Therefore, P(AB) = 0. Whereas, if A and B are independent events then P(AB) = P(A)P(B). Events A and B will be mutually exclusive and independent events only if P(AB)=P(A)P(B)=0, that is, at least one of A or B has zero probability.
04/10/23 43Lab for Remote Sensing Hydrology and Spatial Modeling Department of Bioenvironmental Systems Engineering, National Taiwan University
![Page 44: Introduction STATISTICS Introduction Professor Ke-Sheng Cheng Department of Bioenvironmental Systems Engineering National Taiwan University](https://reader033.vdocuments.mx/reader033/viewer/2022061306/55147f33550346b0158b5723/html5/thumbnails/44.jpg)
• But if A and B are mutually exclusive events and both have nonzero probabilities then it is impossible for them to be independent events.
• Likewise, if A and B are independent events and both have nonzero probabilities then it is impossible for them to be mutually exclusive.
04/10/23 44Lab for Remote Sensing Hydrology and Spatial Modeling Department of Bioenvironmental Systems Engineering, National Taiwan University
![Page 45: Introduction STATISTICS Introduction Professor Ke-Sheng Cheng Department of Bioenvironmental Systems Engineering National Taiwan University](https://reader033.vdocuments.mx/reader033/viewer/2022061306/55147f33550346b0158b5723/html5/thumbnails/45.jpg)
Summarizing data
• Qualitative data– Frequency table
Freq Relative freqRed 14 0.156
Green 16 0.178Blue 21 0.233
Yellow 9 0.100White 25 0.278Pink 5 0.056
04/10/23 45Lab for Remote Sensing Hydrology and Spatial Modeling Department of Bioenvironmental Systems Engineering, National Taiwan University
![Page 46: Introduction STATISTICS Introduction Professor Ke-Sheng Cheng Department of Bioenvironmental Systems Engineering National Taiwan University](https://reader033.vdocuments.mx/reader033/viewer/2022061306/55147f33550346b0158b5723/html5/thumbnails/46.jpg)
– Bar chart
0
5
10
15
20
25
30
Red Green Blue Yellow White Pink
04/10/23 46Lab for Remote Sensing Hydrology and Spatial Modeling Department of Bioenvironmental Systems Engineering, National Taiwan University
![Page 47: Introduction STATISTICS Introduction Professor Ke-Sheng Cheng Department of Bioenvironmental Systems Engineering National Taiwan University](https://reader033.vdocuments.mx/reader033/viewer/2022061306/55147f33550346b0158b5723/html5/thumbnails/47.jpg)
• Quantitative data
– Histogram
35 57 43 78 77 7775 88 86 78 79 4133 88 75 72 75 7773 50 50 24 60 4060 87 59 73 83 9085 88 33 65 82 3178 95
04/10/23 47Lab for Remote Sensing Hydrology and Spatial Modeling Department of Bioenvironmental Systems Engineering, National Taiwan University
![Page 48: Introduction STATISTICS Introduction Professor Ke-Sheng Cheng Department of Bioenvironmental Systems Engineering National Taiwan University](https://reader033.vdocuments.mx/reader033/viewer/2022061306/55147f33550346b0158b5723/html5/thumbnails/48.jpg)
• Boxplot
04/10/23 48Lab for Remote Sensing Hydrology and Spatial Modeling Department of Bioenvironmental Systems Engineering, National Taiwan University
![Page 49: Introduction STATISTICS Introduction Professor Ke-Sheng Cheng Department of Bioenvironmental Systems Engineering National Taiwan University](https://reader033.vdocuments.mx/reader033/viewer/2022061306/55147f33550346b0158b5723/html5/thumbnails/49.jpg)
04/10/23 49Lab for Remote Sensing Hydrology and Spatial Modeling Department of Bioenvironmental Systems Engineering, National Taiwan University
![Page 50: Introduction STATISTICS Introduction Professor Ke-Sheng Cheng Department of Bioenvironmental Systems Engineering National Taiwan University](https://reader033.vdocuments.mx/reader033/viewer/2022061306/55147f33550346b0158b5723/html5/thumbnails/50.jpg)
04/10/23 50Lab for Remote Sensing Hydrology and Spatial Modeling Department of Bioenvironmental Systems Engineering, National Taiwan University
![Page 51: Introduction STATISTICS Introduction Professor Ke-Sheng Cheng Department of Bioenvironmental Systems Engineering National Taiwan University](https://reader033.vdocuments.mx/reader033/viewer/2022061306/55147f33550346b0158b5723/html5/thumbnails/51.jpg)
04/10/23Lab for Remote Sensing Hydrology and Spatial Modeling Department of Bioenvironmental Systems Engineering, National Taiwan University
51
![Page 52: Introduction STATISTICS Introduction Professor Ke-Sheng Cheng Department of Bioenvironmental Systems Engineering National Taiwan University](https://reader033.vdocuments.mx/reader033/viewer/2022061306/55147f33550346b0158b5723/html5/thumbnails/52.jpg)
• Dealing with outliers– Should the outliers be discarded or should they be
retained?– An example of outlier presence• Typhoon Morakot
04/10/23 52Lab for Remote Sensing Hydrology and Spatial Modeling Department of Bioenvironmental Systems Engineering, National Taiwan University
![Page 53: Introduction STATISTICS Introduction Professor Ke-Sheng Cheng Department of Bioenvironmental Systems Engineering National Taiwan University](https://reader033.vdocuments.mx/reader033/viewer/2022061306/55147f33550346b0158b5723/html5/thumbnails/53.jpg)
Typhoon Morakot
• Cumulative rainfall (Aug 7, 0:00 – 24:00)
04/10/23 53Lab for Remote Sensing Hydrology and Spatial Modeling Department of Bioenvironmental Systems Engineering, National Taiwan University
![Page 54: Introduction STATISTICS Introduction Professor Ke-Sheng Cheng Department of Bioenvironmental Systems Engineering National Taiwan University](https://reader033.vdocuments.mx/reader033/viewer/2022061306/55147f33550346b0158b5723/html5/thumbnails/54.jpg)
• Cumulative rainfall (Aug 8, 0:00 – 24:00)
04/10/23 54Lab for Remote Sensing Hydrology and Spatial Modeling Department of Bioenvironmental Systems Engineering, National Taiwan University
![Page 55: Introduction STATISTICS Introduction Professor Ke-Sheng Cheng Department of Bioenvironmental Systems Engineering National Taiwan University](https://reader033.vdocuments.mx/reader033/viewer/2022061306/55147f33550346b0158b5723/html5/thumbnails/55.jpg)
• Cumulative rainfall (Aug 9, 0:00 – 24:00)
04/10/23 55Lab for Remote Sensing Hydrology and Spatial Modeling Department of Bioenvironmental Systems Engineering, National Taiwan University
![Page 56: Introduction STATISTICS Introduction Professor Ke-Sheng Cheng Department of Bioenvironmental Systems Engineering National Taiwan University](https://reader033.vdocuments.mx/reader033/viewer/2022061306/55147f33550346b0158b5723/html5/thumbnails/56.jpg)
Cumulative rainfall in mm
• 2009/08/07 00:00 ~ 2009/08/09 17:00
04/10/23 56Lab for Remote Sensing Hydrology and Spatial Modeling Department of Bioenvironmental Systems Engineering, National Taiwan University
![Page 57: Introduction STATISTICS Introduction Professor Ke-Sheng Cheng Department of Bioenvironmental Systems Engineering National Taiwan University](https://reader033.vdocuments.mx/reader033/viewer/2022061306/55147f33550346b0158b5723/html5/thumbnails/57.jpg)
Measures of Central Tendency
• Mean– Sum of measurements divided by the number of
measurements.• Median– Middle value when the data are sorted.
• Mode– Value or category that occurs most frequently.
04/10/23 57Lab for Remote Sensing Hydrology and Spatial Modeling Department of Bioenvironmental Systems Engineering, National Taiwan University
![Page 58: Introduction STATISTICS Introduction Professor Ke-Sheng Cheng Department of Bioenvironmental Systems Engineering National Taiwan University](https://reader033.vdocuments.mx/reader033/viewer/2022061306/55147f33550346b0158b5723/html5/thumbnails/58.jpg)
Measures of Variation
• Standard Deviation - summarizes how far away from the mean the data value typically are.
• Range
n
iin xx
ns
1
2
)1(
1
minmax xxR
04/10/23 58Lab for Remote Sensing Hydrology and Spatial Modeling Department of Bioenvironmental Systems Engineering, National Taiwan University
![Page 59: Introduction STATISTICS Introduction Professor Ke-Sheng Cheng Department of Bioenvironmental Systems Engineering National Taiwan University](https://reader033.vdocuments.mx/reader033/viewer/2022061306/55147f33550346b0158b5723/html5/thumbnails/59.jpg)
Reading assignment
• IPSUR (Will be covered in the tutor session)– Chapt. 2– Chapt. 3• 3.1.1, 3.1.3, 3.1.4• 3.3• 3.4.3, 3.4.4, 3.4.5, 3.4.6, 3.4.7
04/10/23Lab for Remote Sensing Hydrology and Spatial Modeling Department of Bioenvironmental Systems Engineering, National Taiwan University
59