fall 2011 csc 446/546 part 7: random-variate generation

CSC 446/546

Part 7: Random-Variate Generation

CSC 446/546

Agenda

1. Purpose

2. Prerequisite

3. Inverse-transform technique

4. Acceptance-rejection technique

5. Special Properties

CSC 446/546

1. Purpose

To generate samples from a specified distribution as input to a simulation model.

CSC 446/546

2.PrerequisiteAll the techniques assume that a source of uniform [0,1] random numbers R1, R2, … is readily available, where each Ri has pdf

and cdf

otherwise ,0

10 ,1 xxfR

CSC 446/546

3. Inverse-transform Technique (1)

The concept:• For cdf function: r = F(x)

• Generate r from uniform (0,1)

• Find x:

x = F-1(r)r1

r = F(x)

Note 0F(x)1

CSC 446/546

3. Inverse-transform Technique (2:) Steps to followCompute the cdf of the desired random variable X

Set F(X)=R on the range of X

Solve the equation F(X)=R for X in terms of R

Generate (as needed) uniform Random Numbers R1, R2, R3, …. and compute the desired random variates by Xi=F-1(Ri)

CSC 446/546

3. Inverse-transform Technique (3) Exponential Distribution (1)Exponential Distribution:

• Exponential cdf:

• To generate X1, X2, X3 …

One simplification is use to Replace (1-Ri) with Ri as both Ri and (1-Ri) are uniformly distributed on [0,1]

Xi = -(1/) ln(Ri)

r = F(x) = 1 – e-x for x 0

Xi = F-1(Ri) = -(1/ ln(1-Ri)

Density f(x)=e-x

CSC 446/546

3. Inverse-transform Technique (3) Exponential Distribution (2) (=1)

i Ri Xi

1 0.1306 0.1400

2 0.0422 0.0431

3 0.6597 1.078

4 0.7965 1.592

5 0.7696 1.468

CSC 446/546

3. Inverse-transform Technique (3) Exponential Distribution (3)

Example: Generate 200 variates Xi with distribution exp(= 1)

• Generate 200 Ri with U(0,1) and utilize eq’n 8.3, the histogram of Xi becomes:

CSC 446/546

3. Inverse-transform Technique (4) Does X have the desired distribution??

Check: Does the random variable X1 generated through transformation have the desired distribution?

)())(()(

[0,1]on ddistribute Uniformlyis

xFxFRPxXP

Remember for Uniform Distribution U(0,1)P(Y≤y)=y for 0 ≤y ≤1

CSC 446/546

3. Inverse-transform Technique (5) Uniform distribution (1)Consider a random variable X uniformly distributed on the interval [a,b].

The pdf of X is:

The cdf is given by:

Set F(X)=(X-a)/(b-a)=R

Solving for X yields X=a+(b-a)R

Therefore the random variate is X=a+(b-a)R

otherwise ,0

,1 bxaabxf

bxaabax

CSC 446/546

3. Inverse-transform Technique (6) Triangular DistributionConsider a random variable X that has the pdf as:

The cdf is given by:

For 0X 1, R=X2/2 and for 1X2, R=1-((2-X)2/2)

Thus X is generated by:

otherwise ,0

21 ,221

10 ,2/

12/1 ,)1(22

2/10 ,2

CSC 446/546

3. Inverse-transform Technique (7) Empirical Continuous Dist. (1)

When theoretical distribution is not applicable that provides a good model, then it is necessary to use empirical distribution of data

To collect empirical data:

• One possibility is to resample the observed data itself– This is known as using the empirical distribution– It makes sense if the input process takes a finite number

of values

• If the data is drawn from a continuous valued input process, then we can interpolate between the observed data points to fill in the gaps

This section looks into defining and generating data from a continuous empirical distribution

CSC 446/546

Given a small sample set (size n):• Arrange the data from smallest to largest• Define x(0) = 0• Assign the probability 1/n to each interval• The resulting empirical cdf has a ith line segment slope as

• The inverse cdf is calculated by

where (i-1)/n<R i/n; R is the random number generated

(n))()( x x x 21

(i))(i- x xx 1

iRaxRFX ii

)1()(ˆ

xxa iiii

i /1/)1(/1)1()()1()(

CSC 446/546

3. Inverse-transform Technique (7) Empirical Continuous Dist. (3)Five observations of fire crew response times (in minutes) to incoming alarms are collected to be used in a simulation investigating possible alternative staffing and crew-scheduling policies. The data are: 2.76, 1.83, 0.80, 1.45, 1.24

Arranging in ascending order: 0.80, 1.24, 1.45, 1.83, 2.76

iInterval

(minutes) Probability 1/nCumulative

Probability, i/n Slope, a i

1 0 ≤ x ≤ 0.80 0.2 0.20 4.002 0.80 ≤ x ≤ 1.24 0.2 0.40 2.2

3 1.24 ≤ x ≤ 1.45 0.2 0.60 1.1

4 1.45 ≤ x ≤ 1.83 0.2 0.80 1.95 1.83 ≤ x ≤ 2.76 0.2 1.00 4.65

If a random number R1=0.71 is generated, then R1 lies in the fourth interval (between 3/5 and 4/5. Therefore, X1=x (4-1) +a4(R1-(4-1)/n) = 1.66

CSC 446/546

3. Inverse-transform Technique (7) Empirical Continuous Dist. (4)In general, given ci is the cumulative probability of first i intervals, is the ith interval, then the slope of the ith line segment is:

The inverse cdf is calculated as:1

iii cc

(i))(i- x xx 1

cRaxRFX

CSC 446/546

Example: Suppose the data collected for 100 broken-widget repair times are:

iInterval (Hours) Frequency

Relative Frequency

Cumulative Frequency, c i

Slope, a i

1 0.25 ≤ x ≤ 0.5 31 0.31 0.31 0.81

2 0.5 ≤ x ≤ 1.0 10 0.10 0.41 5.0

3 1.0 ≤ x ≤ 1.5 25 0.25 0.66 2.0

4 1.5 ≤ x ≤ 2.0 34 0.34 1.00 1.47

Consider R 1 = 0.83:

c 3 = 0.66 < R 1 < c 4 = 1.00

X 1 = x (4-1) + a 4(R 1 – c (4-1)) = 1.5 + 1.47(0.83-0.66) = 1.75

CSC 446/546

3. Inverse-transform Technique (8) Continuous Distributions with no closed-form inverse

A number of useful continuous distributions do not have a closed form expression for their cdf or inverse

Examples are: Normal, Gamma, Beta

Approximations are possible to inverse cdf

A simple approximation to inverse cdf of normal is given by

The problems with approximate inverse cdf is that some of them can be computationally intensive

1975.0

135.0135.01 RR

CSC 446/546

3. Inverse-transform Technique (9) Discrete Distribution (1)

All discrete distributions can be generated via inverse-transform technique, either numerically through a table-lookup procedure, or algebraically using a formula

Examples of application:• Empirical

• Discrete uniform

• Geometric

• Gamma

CSC 446/546

3. Inverse-transform Technique (9) Discrete Distribution (2) EmpiricalExample: Suppose the number of shipments, x, on the loading dock of a company is either 0, 1, or 2

• Data - Probability distribution:

• Method - Given R, the generation

scheme becomes:

0.18.0

8.05.0

xConsider R 1 = 0.73:

F(x i-1) < R <= F(x i)F(x 0) < 0.73 <= F(x 1)

Hence, x 1 = 1

x p(x) F(x)0 0.50 0.501 0.30 0.802 0.20 1.00

In general, generate Rif F(xi-1) < R <= F(xi)set X=xi

CSC 446/546

3. Inverse-transform Technique (9) Discrete Distribution (3) Geometric Distribution

Consider the Geometric distribution with pmf

It’s cdf is given by

Using the inverse transform technique, Geometric RV assume the value x whenever,

,....2,1,0 ,1)( xppxp x

ppppxF

1ln function, up-round theusing

1ln1ln1ln1

)(111)1(1

xFpRpxFxx

CSC 446/546

Since p is a fixed parameter, let =-1/ln(1-p).

Then X=- ln(1-R)-1. This is nothing but exponentially distributed RV with mean .

Therefore, one way to generate a geometric variate with parameter p is to generate exponential variate with parameter -1=-ln(1-p), subtract 1 and Round-up!!

Occasionally, there is a need to generate Geometric variate X that can assume values {q, q+1, q+2, …}

This can be generated by

Most common case is q=1

CSC 446/546

Generate three values from a Geometric distribution on the range {X1} with mean 2.

This has pmf as

Thus X can be generated with q=1, p=1/2 and 1/ln(1-p)=-1.443

R1=0.932, X1=1+ -1.443ln(1-0.932)-1=4

R2=0.105, X1=1+ -1.443ln(1-0.105)-1=1

R3=0.687, X1=1+ -1.443ln(1-0.687)-1=2

2/1or 2/1mean with ,....;2,1,0 ,1)( ppxppxp x

CSC 446/546

4. Acceptance-Rejection Technique(1) Useful particularly when inverse cdf does not exist in closed

form, a.k.a. thinning

Illustration: To generate random variates, X ~ U(1/4, 1)

R does not have the desired distribution, but R conditioned (R’) on the event {R ¼} does.

Efficiency: Depends heavily on the ability to minimize the number of rejections.

Procedures:

Step 1. Generate R ~ U[0,1]

Step 2a. If R >= ¼, accept X=R.

Step 2b. If R < ¼, reject R, return to Step 1

Generate R

Condition

Output R’

CSC 446/546

4. Acceptance-Rejection technique (2) Poisson Distribution (1)Recall a Poisson RV, N with mean >0 has pmf

Recall that inter-arrival times A1, A2, … of successive customers are exponentially distributed with rate (i.e., is the mean arrivals per unit time)

,...1,0 ,!

enNPnp

12121 1 iff nnn AAAAAAAnN

Relation N=n states that there were exactly n arrivals during one unit of timeSecond relation states that the nth arrival occurred before time 1 while the (n+1) arrival occurred after time 1. Clearly these two are equivalent

CSC 446/546

4. Acceptance-Rejection technique (2) Poisson Distribution (2)Recall Ai can be generated by Ai=(-1/)lnRi

Therefore,

get tonumber any for relation theUse

lnlnlnln

get product to a of logarithm theis

logarithms of sumfact that theuse and -by Multiply

AAAAAAA

CSC 446/546

4. Acceptance-Rejection technique (2) Poisson Distribution (3)

Therefore, the procedure for generating a Poisson random variate N is:• Step 1: Set n=0; P=1

• Step 2: Generate a random number Rn+1 and replace P by P.Rn+1

• Step 3: If P < e-, then accept N=n. Otherwise, reject the current n, increase n by one and return to step 2.

How many random numbers are required to generate one Poisson variate N?• Answer: If N=n, then n+1 random numbers are

required. So the average # is: E(N+1)=+1.

CSC 446/546

4. Acceptance-Rejection technique (2) Poisson Distribution (5) ExampleGenerate three Poisson variates with mean =0.2

First compute e-=0.8187

Step 1: Set n=0, P=1

Step 2: R1=0.4357, P=1.R1=0.4357

Step 3: Since P=0.4357< e-=0.8187, accept N=0

Step 1-3. (R1=0.4146 leads to N=0)

Step 1. Set n=0, P=1

Step 2. R1=0.8352, P=1.R1=0.8353

Step 3 Since P e- reject n=0 and return to Step 2 with n=1

Step 2. R2=0.9952, P=R1.R2=0.8313

Step 3 Since P e- reject n=1 and return to Step 2 with n=2

Step 2 R3=0.8004, P=R1.R2.R3=0.6654

Step 3. Since P < e-, accept N=2

CSC 446/546

4. Acceptance-Rejection technique (2) Poisson Distribution (5) Example

The calculations for the generation of these Poisson random variates is summarized as:

n Rn+1 P Accept/Reject Result

0 0.4357 0.4357 Accept N=0

0 0.4146 0.4146 Accept N=0

0 0.8353 0.8353 Reject

1 0.9952 0.8313 Reject

2 0.8004 0.6654 Accept N=2

CSC 446/546

4. Acceptance-Rejection technique (3) NSPP (1)

Non-stationary Poisson Process (NSPP): a Possion arrival process with an arrival rate that varies with time

Idea behind thinning:

• Generate a stationary Poisson arrival process at the fastest rate, * = max (t)

• But “accept” only a portion of arrivals, thinning out just enough to get the desired time-varying rate

Generate E ~ Exp(*) t = t + E

Condition R <= (t)/*

Output E ’~ t

CSC 446/546

4. Acceptance-Rejection technique (3) NSPP (2)

Generic algorithm that generates Ti as the time of ith arrival

• Step 1: Let * = max (t) be the maximum arrival rate function and set t=0 and i=1

• Step 2: Generate E from the exponential distribution with rate * and let t=t+E (this is the arrival time of the stationary Poisson process)

• Step 3: Generate random number R from the U(0,1) distribution. If R (t)/ * , then Ti = t and i=i+1

CSC 446/546

4. Acceptance-Rejection technique (3) NSPP (3)Example: Generate a random variate for a NSPP

Procedures:Step 1. * = max (t) = 1/5, t = 0 and i = 1.

Step 2. For random number R = 0.2130,

E = -5ln(0.213) = 13.13

t = 13.13

Step 3. Generate R = 0.8830

(13.13)/*=(1/15)/(1/5)=1/3

Since R>1/3, do not generate the arrival

Step 2. For random number R = 0.5530,

E = -5ln(0.553) = 2.96

t = 13.13 + 2.96 = 16.09

Step 3. Generate R = 0.0240

(16.09)/*=(1/15)/(1/5)=1/3

Since R<1/3, T1 = t = 16.09,

and i = i + 1 = 2

t (min)

Mean Time Between Arrivals

Arrival Rate (t) (#/min)

0 15 1/15

60 12 1/12

120 7 1/7

180 5 1/5

240 8 1/8

300 10 1/10

360 15 1/15

420 20 1/20

480 20 1/20

Data: Arrival Rates

CSC 446/546

5. Special Properties (1)

Variate generate techniques that are based on features of particular family of probability distributions

For example:• Direct Transformation for normal and lognormal

distributions

• Convolution

• Beta distribution (from gamma distribution)

CSC 446/546

5. Special Properties (2) Direct Transformation (1)Approach for Normal(0,1):• Consider two standard normal random variables, Z1 and Z2,

plotted as a point in the plane:

• B2 = Z21 + Z2

2 ~ chi-square distribution with 2 degrees of freedom that is equivalent to Exp( = 2). Hence,

• The radius B and angle are mutually independent.

2/1)ln2( RB

In polar coordinates:Z1 = B cos Z2 = B sin

)2sin()ln2(

)2cos()ln2(

CSC 446/546

5. Special Properties (2) Direct Transformation (2)Approach for Normal(,), N(,:

• Generate Zi ~ N(0,1)

Approach for lognormal(,):• Generate X ~ N(,)

Yi = eXi

Xi = + Zi

fall 2011 csc 446/546 part 7: random-variate generation

pdf of x

variates x

range of x

histogram of x

e x slide

r r1r1 x1x1 r

terms of r

desired random variable

Documents

fall 2011 csc 446/546 part 4: statistical models in...

web analytics tom wojciechowski dms 446/546 - interface...

an overview 1 pamela harrod, dms 546/446 presentation, march...

uni variate distributions

fall 2011 csc 446/546 part 6: random number generation

multi variate analysis slides

dms 546/446 interface design

celesta light engines · 2021. 1. 26. · features details...

collinearity in multi-variate analyses

dms 546/446 interface design al larsen – spring 2008 paper...

chapter 8 random-variate generation - الصفحات...

random variate generation (44 slides)

visualization of multi-variate scientiﬁc data ·...

pembangkit random variate -...

chapter 8 random-variate generation

multi-variate stochastic volatility modelling

04-random-variate generation.ppt

fall 20101 csc 446/546 part 10: estimation of absolute...

multi variate analysis

multi variate