presentation 04
TRANSCRIPT
-
8/13/2019 Presentation 04
1/62
CIVE1219 Transport Engineering 3
Presentation File 4
Sara Moridpour
-
8/13/2019 Presentation 04
2/62
2
Objectives
Understand the meaning of trip production and tripattraction,
Understand the influencing factors on trip production andtrip attraction,
Appreciate the approaches used in trip generation models,
Be aware of the advantages and disadvantages of usingeach approach in trip generation models,
Understand how to evaluate the trip generation models.
After completing this subject, you should be able to:
-
8/13/2019 Presentation 04
3/62
3
Trip Generation Models
Trip generation models relate the intensity of trip makingto, and from, land use parcels (zones) to measures of the
type and intensity of land use.
Trip generation models develop a relationship betweenthe number of trips attracted to, or produced by, a zone
and the characteristics of the zone.
Trip generation models can then be used to predict thefuture number of trips.
-
8/13/2019 Presentation 04
4/62
4
Trip Productions and Attractions
A 'trip' is defined as travel between two places of activity.
Trips are classified as either: home based trips and non
home based trips.
Trip productions are trips generated by residential zonesto serve a need,
Trip attractions are trips generated by activities such
as employment, retail services etc. and are related tosatisfaction of a need.
-
8/13/2019 Presentation 04
5/62
5
It is important to distinguish between whether the tripgeneration analysis considers only trips made by private
vehicle (person trips by car) or total trips regardless of
mode (total person trips).
It is not uncommon for trips made by foot or by bicycleto be excluded from the datasets used to estimate trip
generation models.
Excluding bike and walk trips reduces the complexity ofsubsequent models, i.e. trips distribution and mode choice.
Trip Productions and Attractions Continued
-
8/13/2019 Presentation 04
6/62
6
Home Based Trips
Home Based (HB) trips, meaning that one end of the tripis at home, for instance home based work trips, home
based shopping trips, and trips from work to home.
Finer classifications may be used (e.g. blue collar work,
white collar (office) work, tertiary student) since underling
relationships governing trip generation are expected to be
different for different trip types and therefore more accurate
predictions are obtained.
-
8/13/2019 Presentation 04
7/627
Home Based Trips Continued
White Collar
Managers & Administrators
Professionals
Associate Professionals Clerical, Sales & Service
Blue Collar
Tradespersons & Related
Production & Transport
Laborers
Home Based trips
Secondary
PrimaryTertiary
Shopping
Recreation
OtherWhite
Collar
Blue
Collar
-
8/13/2019 Presentation 04
8/628
Non Home Based Trips
Non Home Based (NHB) trips, where neither end of thetrip is at home, for instance, a trip from work to shop.
Non Home Based
trips
Work -
Other
Shopping -
Shopping
Shopping -
Other
OtherWork -
Work
Work -
Shopping
-
8/13/2019 Presentation 04
9/629
Influencing Factors in Trip Productions
A variety of factors are likely to influence trip production
including:
Population,
Income levels,
Car ownership,
Quality of transport facilities and frequency of service,
Residential density,
Age structure or stage in the family life cycle.
-
8/13/2019 Presentation 04
10/6210
Question 1
What are the following factors have the main influence on
trip production of a family (can be more than one option)?
1. Number of students in family, Property type, Number of
people aged 65+,
2. Number of people aged 65+, Family size, Property type,
3. Number of students in family, Public transport access,
Property postcode,
4. Price of property, Property type, Property postcode.
-
8/13/2019 Presentation 04
11/6211
Influencing Factors in Trip Attraction
A variety of factors are likely to influence trip attraction
including:
Employment,
Levels broken down by type of employment,
Educational institution enrolments,
Floor space,
The accessibility provided by the transport system,
Parks and recreation areas.
-
8/13/2019 Presentation 04
12/6212
Question 2
Which of the following sets are most important in trip
attraction of a traffic zone (can be more than one option)?
1. Number of schools, Number of people aged 18+,
2. Public transport access, Number of shopping centres,
3. Area of shopping centres, Number of employees,
4. Area of medical centres, Number of cinemas.
-
8/13/2019 Presentation 04
13/6213
Trip Generation Approaches
There are two common approaches to trip generation
analysis:
Category analysis, also known as cross classification
analysis,
Regression modelling.
These two procedures represent the state of practice inrelation to trip generation modelling.
-
8/13/2019 Presentation 04
14/6214
Category Analysis or Cross Classification Analysis
This approach usually focuses on the household or asmaller unit than the zone.
Households are categorised on the basis of generalcharacteristics (e.g. car ownership, household size and
age structure). Then, the trip rate is calculated for each
category.
Zonal trips are then predicted as the sum of the trips foreach household in the zone.
Category analysis can be applied to the analysis of tripattractions with the categories defined by land use or
employment.
-
8/13/2019 Presentation 04
15/6215
Cross Classification Analysis Continued
The steps in calibrating a category analysis model are as
follows:
Group the households by the desired characteristics(number of persons, car ownership etc.),
Calculate a trip rate (trips/household) for each category.
The accuracy of the model depends on the number ofhouseholds in each category. At least 20 households are
required in each category to have confidence in the results.
-
8/13/2019 Presentation 04
16/6216
Cross Classification Analysis Continued
To predict the future trips for a zone:
Determine the number of households in each category,
Multiply the appropriate trip rate by the number ofhouseholds in the category,
Sum the results across all categories to obtain an
estimate of the total number of trips for the zone.
-
8/13/2019 Presentation 04
17/6217
Cross Classification Analysis Continued
To determine the appropriate variables in the model, anumber of variables are initially considered. These are:
Gender (male, female),
Car availability (always, sometimes, never),
Age (< 18, 18 - 40, 41 - 65, > 65),
Employment (employed, unemployed),
Income (low, medium, high),
Type of employment (blue collar, white collar, other),
Family type (single, childless couple, with children).
-
8/13/2019 Presentation 04
18/6218
Cross Classification Analysis Continued
The following table shows the results of trip generationmodelling from a category analysis of data from Madison in
the USA (FHWA, 1975):
Cars Owned
Family Size 0 1 2+
1 1.0 2.7 3.1
2 1.5 5.1 7.0
3 3.1 7.2 9.4
4 3.2 8.0 11.7
5+ 5.2 9.2 13.4
-
8/13/2019 Presentation 04
19/6219
Cross Classification Analysis Continued
Category analysis is simple to apply and is capable of
capturing the primary influences on trip generation.
It can be estimated using dummy variable regressiontechniques.
This provides the added advantage of access tostatistical tests to assess the significance of the
relationships and the goodness of fit of the model.
Cross classification analysis has a number of advantages:
-
8/13/2019 Presentation 04
20/62
20
Cross Classification Analysis Continued
While simple to apply, classification analysis has a number
of disadvantages:
The variation in trip-making within an individual category
can be higher than the variation between categories,
It is difficult to find the combination of categories unlesswhere each category is homogeneous enough,
It is hard to include more than just the basic populationand demographic information given the data processing
requirements.
-
8/13/2019 Presentation 04
21/62
21
Question 3
What is not correct about cross classification method?
1. This approach is very complicated for traffic zone analysis,
2.This method can easily be applied at aggregate and
disaggregate levels (for households and traffic zones),
3. The method is less costly than other methods but not very
accurate,
4. The variables used in this method are generally simple ones.
-
8/13/2019 Presentation 04
22/62
22
Regression Modelling
Regression modelling is a common technique for analysingtrip generation. This approach relies on estimating the
parameters of an underlying model relating the number of
trips from/to a zone to the characteristics of the zone. For
instance:
ZTA = 1560 + 14.3 X1+ 10.5 X2+ 3.7 X3
Where,
ZTA = zonal trips attracted,
X1= retail floor area (m),
X2= service and office floor area (m),
X3= manufacturing floor area (m).
-
8/13/2019 Presentation 04
23/62
23
Regression Modelling Continued
Th= 0.53 + 1.53 NPh- 1.42 CH4h+ 0.67 Wh+ 1.72 CAh
Where,
Th= number of person trips for household h,
NPh= number of persons in the household,CH4h= number of children under 4 in the household,
Wh= number of workers in the household
CAh= number of cars available in the household.
The following is an example of a household level model for
home-based trips:
-
8/13/2019 Presentation 04
24/62
24
Traditionally the emphasis has been on linear regression,since it has been more readily available in statistical
analysis packages.
There are strong reasons for exploring non-linearfunctional forms particularly when empirical evidence
suggests that variables such as car ownership have a
non-linear impact on trip generation.
Dramatic improvements in the capabilities of statisticalpackages make it feasible to explore non-linear
formulations and to test these against linear models.
Regression Modelling Characteristics
-
8/13/2019 Presentation 04
25/62
25
Regression modelling provides the advantage of beingable to test the significance of the relationship between
independent variables and trip generation rates using
statistical tests as well as providing a goodness-of-fitmeasure for the model.
The calibration of a regression model requires a set ofdata containing data for the dependent and independent
variables.
Regression Modelling Advantages/Disadvantages
-
8/13/2019 Presentation 04
26/62
26
Question 4
What is not correct about regression method?
1.This approach is applied at aggregate and disaggregatelevels (for households and traffic zones),
2. Different types of variables can be defined in this method,
3. The accuracy of the results depend on the accuracy of the
collected information,
4. There is no limitation for using variables in this method.
-
8/13/2019 Presentation 04
27/62
27
Evaluation of Regression Modelling
Ye= a + b X
Ye= an estimate of Y,
a, b = parameters of the model.
In a simple case, we have information on the number of trips
generated by a number of zones (Y) and the corresponding
zonal population (X). A linear model has the following form:
The Y and X data are fed into a statistics package and the
values for 'a' and 'b are estimated to give the best fit to the data
(Y, X). The approach minimises the prediction error of model
and it aims to minimise the squared errors.
-
8/13/2019 Presentation 04
28/62
28
0 500 1,000 1,500 2,000 2,500 3,000Population
0
1,0002,000
3,000
4,000
5,000
6,000
7,000
Trip
Production
Series 1
Series 2
Evaluation of Regression Modelling Continued
e1
e2
e3
e4
e5
Error = min (e1 + e2 + e3 + e4 + e5)
Error = min [(Y1 - Ye1)+(Y2 - Ye2)+(Y3 - Ye3)+(Y4 - Ye4)+(Y5 - Ye5)]
-
8/13/2019 Presentation 04
29/62
29
Evaluation of Regression Modelling Continued
Three important tests to evaluate the model are as follows:
First, it is important to reflect on whether we would logically
expect a relationship between the X variable(s) and Y.
Second, the signs of the parameters should be examined. Arethey consistent with expectations?
For instance, we would expect that the number of trips wouldincrease as the zonal population increases and so a positive
value would be expected for the parameter 'b'.
1. Logic Test (Ye= a + b X)
-
8/13/2019 Presentation 04
30/62
30
One of the classic cases is wheresomeone developed a model that
predicted the number of murders occurred
in a city as a function of how many hats
were sold in that city.
Evaluation of Regression Modelling Continued
What is a good regression model?
While that model may have done well in terms of goodness-of-fit
or statistical tests, the underlying logic would have to be seriously
questioned.
-
8/13/2019 Presentation 04
31/62
31
The goodness of fit statistics (R2) provides a measure ofhow well the model predicts the values of Y.
A perfect model, that is one which predicts the value of Yperfectly for each value of X, would have an R2of 1.0.
The closer the value of R2is to 1.0 the better the fit. Thevalue is also useful for comparing the goodness of fit of
two models.
2. Goodness of Fit (Ye= a + b X)
Evaluation of Regression Modelling Continued
-
8/13/2019 Presentation 04
32/62
32
The inherent scatter in the data produces someuncertainty in the value of the parameters of the model.
The most popular statistical test relates to the slopeparameter in the model (parameter 'b'). If the true slope
parameter were equal to zero then the explanatory
variable 'X' would not help in predicting 'Y'. This is usually
done with a 't-test.
3. Statistical Tests (Ye= a + b X)
Evaluation of Regression Modelling Continued
-
8/13/2019 Presentation 04
33/62
33
It is usually of interest to test the null hypothesis that theslope parameter is equal to zero. This is usually done with
a 't-test' and the value of the t-statistic is usually calculated
by the statistics package.
T-test Analysis
Uses t-tests of individual variable slopes,
Shows if there is a linear relationship between thevariable(s) X and Y,
Hypotheses:
H0: b = 0 (no linear relationship)
H1: b 0 (linear relationship does exist between X and Y)
-
8/13/2019 Presentation 04
34/62
34
T-test Analysis
Compare the calculated t-statistic value with the critical
value from statistical tables.
The critical value depends on the size of the data set(n=number of observations), the number of parameters
(in the case of the simple model considered above this is
two: 'a' and 'b') and the confidence level selected.
It is common to work with a 95% confidence interval, sothe critical t value has n - 2 degrees of freedom (for the
simple model considered above) and you would look up a
0.025 level for the statistic (tcrit).
For large sample sizes it is common to use a rule ofthumb that if the t-test value is greater than 2.0 the
variable's coefficient is statistically significant from zero.
-
8/13/2019 Presentation 04
35/62
35
T-test Analysis Continued
For small samples, the critical t-value must be read from thestatistics table and then:
If t > tcrit, reject the null hypothesis and conclude the
slope parameter is significantly different from zero (that isthere is a statistically significant relationship between 'X'
and 'Y'),
If t < tcrit, then it is not possible to reject the null
hypothesis and conclude that the coefficient on the 'X'
variable is not statistically different from zero.
-
8/13/2019 Presentation 04
36/62
36
T-test Analysis Continued
If the signs of the parameters are consistent with
expectations, the parameters are statistically significant and
the goodness-of-fit is reasonable. Then we can have someconfidence that we have a 'good' model and should be able
to use it to predict for future conditions.
a/2 = 0.025 a/2 = 0.025
Reject H0Reject H0 Do not reject H0
-
8/13/2019 Presentation 04
37/62
37
Regression Modelling in Excel
Excel can be used to estimate a regression model.
Plot the raw data and the regression line on a graph.
Data Analysis is an add-in option which you may need to install if you have notalready done so.
When using Excel 2003, from Tools menu, select Add-Ins, and in the Manage
box, select Analysis ToolPack and click OK. Then, select Data Analysis fromthe Tools menu and then select Regression.
When using Excel 2007, click the Microsoft Office Button and then click ExcelOptions. Click Add-Ins, and in the Manage box, select Excel Add-ins and click Go.
Then, in the Add-Ins available box, select the Analysis ToolPak check box, and
then click OK. Select Data Analysis and then select Regression.
When using Excel 2010, Click the File tab, click Options, and then click the Add-Ins category. In the Manage box, click Excel Add-ins, and then click Go. The Add-
Ins dialog box appears. In the Add-Ins available box, select Analysis Toolpack
and then click OK. Click the Data tab, then select Data Analysis and then
select Regression.
Using this approach will generate t-test results for the parameters of the model.
-
8/13/2019 Presentation 04
38/62
38
Multiple Linear Regression Output
Total Trips = 210.835 + 0.247 (Population) + 0.037 (Number of Cars)
-
8/13/2019 Presentation 04
39/62
39
Multiple Linear Regression Output
Total Trips = 210.835 + 0.247 (Population) + 0.037 (Number of Cars)
b1= + 0.247
Total Trips will increase by0.247 for each one person
increase in Population.
b2= + 0.037
Total Trips will increase, onaverage, by 0.037 for each
increase in Number of Cars.
Where
Total Trips is the number of trips produced in each zone.
-
8/13/2019 Presentation 04
40/62
40
Coefficient of Multiple Determination (R2)
Reports the proportion of total variation in Y explained by
all X variables taken together.
squaresofsumtotal
squaresofsumregression
SST
SSRR2Y ==
-
8/13/2019 Presentation 04
41/62
41
squaresofsumtotal
squaresofsumregression
SST
SSRR2Y ==
SSR = Regression sum of squares =the sum of the differences
of each estimation from the overall mean over all observations.
2n
1i
i )Y(ESSR
=
=
Coefficient of Multiple Determination (R2) Continued
SST = total sum of squares = the sum of the differences ofeach observation from the overall mean over all observations.
2n
1i
i )Y(YSST
=
=
E is the predicted value
Y is the observed value
-
8/13/2019 Presentation 04
42/62
42
0.996333173737083.
862173086971.
SST
SSR
R
2
Y ===
99.6% of the variation in Total Trips is
explained by the variation in Population
and Number of Cars
Coefficient of Multiple Determination (R2) Continued
Total Trips = 210.835 + 0.247 (Population) + 0.037 (Number of Cars)
-
8/13/2019 Presentation 04
43/62
43
Adjusted R2
R2 never decreases when a new X variable is added tothe model.
This can be a disadvantage when comparing models.
We lose a degree of freedom when a new X variable isadded.
Did the new X variable add enough explanatory power to
offset the loss of one degree of freedom?
-
8/13/2019 Presentation 04
44/62
44
Adjusted R2 Continued
Shows the proportion of variation in Y explained by all Xvariables adjusted for the number of Xvariables used.
Penalize excessive use of unimportant independent
variables.
Smaller than R2.
Useful in comparing among models.
=
1kn
1n)R(11R 22adj
(n = sample size, k = number of independent variables)
-
8/13/2019 Presentation 04
45/62
45
99.4% of the variation in Total Trips is
explained by the variation in Population
and Number of Cars, taking into
account the sample size and number of
independent variables.
9940R2adj
.=
Adjusted R2 Continued
Total Trips = 210.835 + 0.247 (Population) + 0.037 (Number of Cars)
-
8/13/2019 Presentation 04
46/62
46
Is the Model Significant?
F-test is used for Overall Significance of the Model.
F-test shows if there is a linear relationship between all ofthe X variables considered together and Y.
Use F-test statistic.
Hypotheses:
H0: b1= b2= = bk= 0 (no linear relationship) i = 1 to k
H1: at least one bi 0 (at least one independent variable affects Y)
Y = b0+ b1 x1+ b2 x2+ + bk xk
-
8/13/2019 Presentation 04
47/62
47
F-test for Overall Significance
399.363216703.824
3186543485.9
1-k-n
SSEk
SSR
MSE
MSRF ====
With 2 and 3 degrees of
freedom
P-value for the
F-Test
Total Trips = 210.835 + 0.247 (Population) + 0.037 (Number of Cars)
-
8/13/2019 Presentation 04
48/62
48
F-test for Overall Significance Continued
H0: b1= b2= 0
H1: b1and b2not both zero
= 0.05
df1 = 2 df2= 3
Test Statistic:
Since F test statistic is in
the rejection area (p-value
< 0.05), reject H0
There is evidence that at
least one independent
variable affects Y
399.363MSE
MSRF ==
Critical Value: F= 9.552
Decision:
Conclusion:
F0.05
= 9.552
0
= 0.05
Reject H0Do not reject H0
F
-
8/13/2019 Presentation 04
49/62
49
Are Individual Variables Significant?
Use t-tests of individual variable slopes.
Shows if there is a linear relationship between the variable X iand Y.
Hypotheses:
H0: bi= 0 (no linear relationship) i = 1 to k
H1: bi 0 (linear relationship does exist between Xiand Y)
ib
i
S
bt
0=Test Statistic:
(df = n - k - 1)
Where, biis the Coefficient of variable Xiand Sbiis the Standard Error of
variable Xi.
Y = b0+ b1 x1+ b2 x2+ + bk xk
-
8/13/2019 Presentation 04
50/62
50
Are Individual Variables Significant? Continued
t-value for Population is t = 5.104, with
p-value 0.015
t-value for Number of Cars is t = 0.561,
with p-value 0.614
Total Trips = 210.835 + 0.247 (Population) + 0.037 (Number of Cars)
-
8/13/2019 Presentation 04
51/62
51
H0: bi= 0 i = 1 to k
H1: bi0
= 0.05 df = 6 - 2 - 1 = 3
Reject H0 for Population and do
not reject H0for Number of Cars.
There is evidence that Populationaffect and Number of Cars does
not affect Y (Total Trips).
Critical Value: t/2 = 3.182
Decision:
Conclusion:
Are Individual Variables Significant? Continued
Reject H0Reject H0Do not reject
H0
3.182-3.182
a/2=0.025 a/2=0.025
Note: Number of Cars is generally is a very important variable. However, in
the available dataset, it is insignificant.
Total Trips = 210.835 + 0.247 (Population) + 0.037 (Number of Cars)
-
8/13/2019 Presentation 04
52/62
52
Next Learning Activities
Examples of Trip Generation Models (problem set 3)will be examined in the next tutorial class (please printproblem set 3 as well as lecture week 4 and bring to the
next tutorial class).
Please bring one Laptop per group for the next tutorialclass. You will need it for regression analysis.
Trip Distribution Models will be comprehensively
explained in the next lecture class.
Different approaches in developing Trip DistributionModels are considered and the strengths and
weaknesses of each model type will be explained.
-
8/13/2019 Presentation 04
53/62
53
Questions?
-
8/13/2019 Presentation 04
54/62
-
8/13/2019 Presentation 04
55/62
55
Assessment Task 2 Continued
Table 1: Population and employment predictions for the year 2020.
-
8/13/2019 Presentation 04
56/62
56
Assessment Task 2 Continued
Table 2: Trip generation data.
-
8/13/2019 Presentation 04
57/62
57
Assessment Task 2 Continued
-
8/13/2019 Presentation 04
58/62
58
Assessment Task 2 Continued
Table 3: Current characteristics of each mode.
-
8/13/2019 Presentation 04
59/62
59
Assessment Task 2 Continued
1. Peak period directional trips over the current bridge (prior to
construction of the second bridge).
2. Peak period directional trips on both bridges once the new
bridge is built.
3. Write out the utility expression for each mode. Are the signs
and relative magnitudes of the coefficients logical?
4. Use the model to predict the bus ridership and revenue for the
morning service.
5. The bus operator believes it may be possible to lobby the State
Government to install express bus lanes between zones 1 and
7. Under this scenario the in-vehicle time for the bus would be
reduced by 12%. Predict the effects of this on bus ridership and
revenue.
-
8/13/2019 Presentation 04
60/62
60
Assessment Task 2 Continued
t14= 22 + 0.015V14 t24= 22 + 0.015V24
t13= 10 + 0.005V13 t34= 8 + 0.005V34
t23= 15 + 0.005V23
V14= 13000 V24= 11000
Part B:
-
8/13/2019 Presentation 04
61/62
61
Assessment Task 2 Continued
1. Using all-or-nothing assignment, find the link volumes and
travel times.
2. Using incremental assignment with five increments of 20%,
find the link volumes and travel times.
3. Compare and comment on the results from parts 1 and 2
above.
-
8/13/2019 Presentation 04
62/62
Questions?