econometric projek lithem

Upload: wai-leong-edward-low

Post on 02-Jun-2018

229 views

Category:

Documents


0 download

TRANSCRIPT

  • 8/10/2019 Econometric Projek Lithem

    1/31

    1

    SCHOOL OF MATHEMATICAL SCIENCESFIRST SEMESTER, ACADEMIC YEAR 2013/2014

    SEP 221: Applied Statistics and Econometric

    TITLE: THE ROLE OF GOVERNMENT IN

    HUMAN DEVELOPMENT

    Prepared by

    LIM LI THEM 113786

    Checked by

    Dr. Che Normee Che Sab

  • 8/10/2019 Econometric Projek Lithem

    2/31

    2

    Table of Contents

    Chapter Title Page

    Abstract 1

    1 IntroductionProblem StatementObjectiveScope of StudySignificance of StudyDistribution of Chapter

    2 - 4

    2 Literature Review 5 - 6

    3 Methodology3.1 Descriptive Statistics

    3.2 Unit Root Test3.3 Correlation3.4 Multiple Regression Analysis3.5 Autocorrelation3.6 Heteroscedasticity3.7 Newey-West Test3.8 Specification Error Test3.9 Multicollinearity

    7 - 14

    4 Result Analysis and Discussion

    4.1 Descriptive Statistics4.2 Unit Root Test4.3 Unit Root Test for Residual4.4 Correlation4.5 Autocorrelation

    - Durbin-Watson test- Breusch-Godfrey Serial Correlation LM Test

    4.6 Heteroscedasticity- Graphical Method- White s Heteroscedasticity Test

    4.7 Newey-West Test4.8 Specification Error Test- Ramsey RESET Test- Recursive Estimates (CUSUM-Squared Test)- Chow s Breakpoint Test- Wald Test

    3.9 Multicollinearity- Variance Inflation Factor (VIF) Test

    15 - 25

    5 Conclusion 26 -27

    Reference 28

    Appendix 29

  • 8/10/2019 Econometric Projek Lithem

    3/31

    1

    ABSTRACT

    This paper examines the role of government in human development by using multiple

    regression model. The effect of government expenditure in education, health,

    infrastructure, security and transport sectors, from the year 1981 until year 2010, onthe value of Human Development Index is investigated and their relationship is

    studied. The result of regression model, after taking into account of autocorrelation,

    heteroscedasticity and multicollinearity, suggests that only the government

    expenditure in transport sector has a significant positive relationship with HDI while

    the government expenditure in internal security sector has significant negative

    relationship with HDI. The government spending in sectors like education, health and

    infrastructure do not have a significant effect on HDI.

    Keywords: Government expenditure, HDI, multiple regression model

  • 8/10/2019 Econometric Projek Lithem

    4/31

    2

    CHAPTER 1: INTRODUCTION

    In an economic planning, human development always is the main concern for both

    developed and developing countries. According to the 1990 Human Development

    Report, hu man development is defined as a process of enlarging peoples choices.

    This means that people have the freedom to make their own choices and have the

    opportunities to realize them. These choices include political freedom, guaranteed

    human rights, accessibility to education, a long and a healthy life, a good standard of

    living and so on. Human Development Index (HDI) is introduced by Mahbub ul Haq,

    a Pakistan economist and Amartya Sen, an Indian economist in 1990. Its used by

    United Nations Development Programme in Human Development Report as one of

    the tools in measuring the human development level of the countries in the world.HDI is a composite index of life expectancy, education and income indices (refer to

    figure 1) . Its served as a frame of reference for both social and economic

    development and is expressed as a value between 0 and 1.

    Based on this index, countries with HDI greater than 0.8 is considered as high human

    development, countries with HDI between 0.5 to 0.8 is medium human development

    while countries with HDI less than 0.5 is classified as low human development.

    According to the Human Development Report 2012, Malaysias HDI value is 0.769

    which is in the high human development category, ranked at the position of 64 out of

    187 countries and territories in the world. Between 1980 and 2012, Malaysias HDI

    value had increased from 0.563 to 0.769, an increase of 37% or average annual

    increase of about 1.0%.

    Government expenditure or government spending, on the other hand, is government

    acquisition of goods and services for current use to satisfy individual or collective

    needs (government final consumption expenditure) or for future benefits (government

    investment). These two types of government expenditure, on final consumption and

    on gross capital formation, together constitute one of the major components of gross

    domestic product. In Malaysia, the federal government expenditure is divided into

    operating expenditure and development expenditure. Operating expenditure are

    expenses that incurred to maintain the operations and other regular activities of the

    Government, for example, the wages of public service officers and operating grants to

    statutory boards and aided educational institutions. While the development

    http://en.wikipedia.org/wiki/Mahbub_ul_Haqhttp://en.wikipedia.org/wiki/Mahbub_ul_Haq
  • 8/10/2019 Econometric Projek Lithem

    5/31

    3

    expenditure are expenses that represent a longer-term investment and results in the

    formation of asset of the Government. Examples of such spending are the investment

    of housing and building of roads. There are four main categories under the

    government expenditure, which are security, social service, economic service and

    general administration. Since year 1970, the federal government expenditure is on the

    upward trend, especially the government development expenditure. This shows that

    Malaysia is trying to move from developing country status towards a developed and

    industrialised nation.

    Problem Statement

    Its not doubt that when a national budget is planned, the Government will consider

    both economic growth and social benefit. The government spending is planned

    carefully so that it can achieve both goals of economic growth and human

    development. However, is the government expenditure that spent really improving

    human development? Or it just brings to the growth of country economic? Which

    sectors spending can affect the human development? Well find out the answers

    through the study.

    Objective and Significant of Study

    The main objective of this paper is to study the role of government expenditure on the

    human development. By determining the role of government in improving the human

    development, the social benefit can be increased and the spending of government can

    be more efficient. And knowing that which sectors spending can affect the human

    development can help the government in planning their budget with the main goal to

    improve the social benefit. The study also helps to determine whether the government

    spending is efficiency or not. If the overall government expenditure is efficient,

    definitely it can increase the value of HDI.

    Scope of Study

    The study focuses on the development expenditure in education sector, health,

    internal security, public infrastructure and transport sectors. The government

    expenditure in internal security means that the spending of government in fighting the

    local crime rate and increase social security, for example, the increase in number of

    police. The government expenditure in public infrastructure and transport sectors

  • 8/10/2019 Econometric Projek Lithem

    6/31

    4

    means that the government investment in providing and maintenance the public

    utilities and public transports. The human development is measured by using the HDI

    value from year 1970 until year 2012.

    Distribution of Chapter

    Chapter 1: Introduction of the Human Development Index. Objective, significant

    of study and scope of study is stated in this chapter.

    Chapter 2: In this chapter, a selection of previous studies which examine the

    influence of government expenditure in different sectors on human

    development.

    Chapter 3: Several methods are used to analyze the data and will be discussed in

    this chapter.

    Chapter 4: A few tests are carried out using Eviews and results are shown in this

    chapter. The effect of the independent variables (lnEDU, lnHEALTH,

    lnINFRA, lnSAFETY and lnTPT) towards the ln HDI is studied. A

    regression model is generated at the end to investigate the relationship

    between the independent variables and the dependent variables.

    Chapter 5: Conclusion is drawn based on the results of analysis.

  • 8/10/2019 Econometric Projek Lithem

    7/31

    5

    CHAPTER 2: LITERATURE REVIEW

    A few studies had been done regarding the relationship between government

    expenditure and the human development. Social indicators are improving in many

    developing countries as public spending, especially the expenditures in health and

    education sectors, are increasing.

    In A.D Prasetyo and U. Zuhdi paper, they investigated the efficiency level of

    government expenditure per capital in health and education sectors towards the

    human development in 81 countries by using Data Envelopment Analysis (DEA)

    approach from 2006 to 2010. Government expenditure per capita on education and

    health sector were used as the inputs while HDI as the output. Results showed that

    how efficiency of government expenditure could affect the human development

    progress. Countries that were efficient in managing their expenditure tend to have

    high HDI value, for example, Singapore. Another paper by Andrew and Vinaya (2007)

    also showed that public spending was more effective and had impact on the related

    sectors with good governance from the government. Study found that the health

    spending lowers child mortality rates more in countries with good governance.

    Similarly, public spending on primary education became more effective in increasing

    the primary education attainment in countries with good governance. The difference

    in the efficacy of public spending, therefore, can be explained by the quality of

    governance.

    Same research was carried out in Iran to examine the effect of government health

    expenditure on HDI by using ordinary least square method (OLS) over the period

    1990-2009 in Iran (M.J. Razmi, 2012). The investigation revealed that there was a

    positive and significant relationship between government health expenditure and HDI.

    However, Granger Causality Test in that research indicated that there was no bilateral

    relationship between the government health expenditure and HDI.

    A related study had been done also in Malaysia by R.A. Mohd Dzubaidi, Rahmah

    Ismail and Tamat Sarmidi in 2013. The study showed that there was a positive

    relationship between government expenditure and human development and at the

    same time impacts the growth of economics. Variables like HDI, total government

    expenditure, government expenditure in health and education sectors, Gross Domestic

  • 8/10/2019 Econometric Projek Lithem

    8/31

    6

    Product per capital, urban population, accessibility to water source and sanitation

    facilities.

    Other similar studies were Afonso and St. Aubyn (2004) paper that measured the

    efficiency in education and health sectors in OECD countries. Several inputs such asthe expenditure per student, average class size, in-patient beds, medical technology

    indicators and etc. are used. While for outputs, performance of students on reading,

    mathematics and science literacy scales, life expectancy, infant and maternal mortality

    rate are used in the paper. Herrera and Pang (2005) did the same study but their scope

    of study was in the developing countries. They used government expenditure per

    capital on education and health sectors ranged from 1996 to 2002 as the inputs.

    Whereas, school enrolment, literacy of youth, average years of school, learning scoresare the outputs for education sectors and life expectancy at birth are the outputs for

    health sectors.

    .

  • 8/10/2019 Econometric Projek Lithem

    9/31

    7

    CHAPTER 3: METHODOLOGY

    In this project, Eviews is used to perform our analysis. A regression model is

    developed to test the effect few selected variables on the value of HDI. The estimated

    regression model is constructed with time frame from year 1981 until year 2010 as

    below:

    l n H DI ln EDU l n HEALTH+ l n I NF RA ln SAFETY+ ln TPT

    HDI=Human Development Index

    EDU= Expenditure that spent in education sector (RM millions)

    HEALTH= Expenditure that spent in health sector (RM millions)

    INFRA= Expenditure that spent in building infrastructure (RM millions)

    SAFETY= Expenditure that spent in internal security (RM millions)

    TPT= Expenditure that spent in transport sector (RM millions)

    A few tests are carried out on the estimated regression model above in order to

    develop the most suitable regression model by taking into account the possibility of

    autocorrelation, heteroscedasticity and multicollinearity. The tests and analyses arecarried at the significant level of 5% or = 0.05.

    3.1 Descriptive Statistics

    The mean, median, maximum, minimum and standard deviation of each data series is

    summarised in a table for the easy comparison. The skewness and kurtosis of each

    data series show the distribution of the data. The normality of each variable is testedtoo through probability of the Jarque-Bera test. The hypotheses of Jarque-Bera test are

    made as below:

  • 8/10/2019 Econometric Projek Lithem

    10/31

    8

    3.2 Unit Root Test

    A unit root test tests is a test that use to determine whether the time series variable is

    non-stationary using an autoregressive model. There are few unit roots test, for

    example, Dickey-Fuller test, Phillips-Perron KPSS test and so on. A well-known

    test that is valid in large samples is the augmented Dickey Fuller (ADF) test. A data

    series that has unit root is not stationary while data series that do not have unit root is

    stationary. The hypotheses below are made for unit root test:

    (The variable is not stationary or has unit root)

    (The variable is stationary)

    The null hypothesis is rejected if p-value < 0.05 or 1, k nt t or 1, k nt t , which

    means the variable is stationary at = 0.05. The null hypothesis is not rejected if p -

    value > 0.05 or 1, k nt t or 1, k nt t , which means the variable is not stationary

    at = 0.05. In this study, the unit root test is conducted at both level and 1 st difference.

    3.3 Correlation

    Correlation is the statistical relationship involving dependence of two or more random

    variables or data sets. When two sets of data are strongly linked together we say they

    have a high correlation. When the values increase together, there is positive

    correlation; when the values decrease together, there is negative correlation. A closer

    value of the Pearsons correlation coefficient to -1 or 1 indicated there was a higher

    negative or positive relationship between the variables respectively (refer to the

    diagram 1 below).

    Diagram 1 Positive and Negative Correlation

    http://en.wikipedia.org/wiki/Time_serieshttp://en.wikipedia.org/wiki/Autoregressivehttp://en.wikipedia.org/wiki/Augmented_Dickey%E2%80%93Fuller_testhttp://en.wikipedia.org/wiki/Augmented_Dickey%E2%80%93Fuller_testhttp://en.wikipedia.org/wiki/Augmented_Dickey%E2%80%93Fuller_testhttp://en.wikipedia.org/wiki/Augmented_Dickey%E2%80%93Fuller_testhttp://en.wikipedia.org/wiki/Autoregressivehttp://en.wikipedia.org/wiki/Time_series
  • 8/10/2019 Econometric Projek Lithem

    11/31

    9

    3.4 Multiple Regression Analysis

    Regression analysis is a statistical process for estimating the relationships among the

    variables. It helps us understand how the dependent variable changes when any one of

    the independent variables is varied, assuming that the other independent variables areheld fixed. A multiple regression function is a function that contains two or more

    independent variables, and one dependent variable, . The general form of

    multiple regressions is

    where is responding variable of ith experimental unit

    is the ith observation on the jth independent variable

    is the regression intercept

    is a random error term

    There are a few assumptions for the regression analysis:

    1. There is a linear relationship between independent and dependent variables.

    2. are n independent variables and ), i=1,2,.,n 3. The mean error, is denoted as zero. E( =0.

    4. ,.., are independent.

    5. are independent and is normally distributed.

    6. The variance of the error is constant across observations (homoscedasticity) .

    3.5 Autocorrelation

    Autocorrelation is a mathematical representation of the degree of similarity between a

    given time series and a lagged version of itself over successive time intervals. It is the

    same as calculating the correlation between two different time series, .

    The same time series is used twice - once in its original form and once lagged one or

    more time periods. It also known as serial correlation. It is used to detect the non-

    randomness in a given data or to identify an appropriate time series model if the data

    are not random.

    http://en.wikipedia.org/wiki/Homoscedasticityhttp://en.wikipedia.org/wiki/Homoscedasticity
  • 8/10/2019 Econometric Projek Lithem

    12/31

    10

    3.5.1 Durbin-Watson Test

    Durbin-Watson (DW) Test is a test statistics that used to detect the presence of the

    first order autocorrelation. The DW statistics has range from 0 to 4 and is calculated

    using the formula below:

    where T is the number of observation and d is approximately equal to 2(1- ). The

    hypothesis testing for this statistical test can be written as below:

    H0: = 0 (no autocorrelation)

    H1: >0 (for positive autocorrelation) or

  • 8/10/2019 Econometric Projek Lithem

    13/31

    11

    test for the presence of serial dependence that has not been included in a proposed

    model structure and which, if present, would mean that incorrect conclusions would

    be drawn from other tests.

    3.6 Heteroscedasticity

    In a regression analysis, heteroscedasticity refer to a situation in which the variance of

    the dependent variable varies across the data . The presence of heteroscedasticity is a

    major concern in the application of regression analysis as it can invalidate statistical

    tests of significance that assume that the modelling errors are uncorrelated and

    normally distributed . Whites Heteroscedasticity is used in this study to detect the

    presence of heteroscedasticity in the regression model with the following hypotheses:

    : Hoteroscedasticity does not exist (Homoscedasticity exists)

    : Heteroscedasticity exists

    If the p-value obtained from F-statistic is larger than 5% level, is not rejected and

    homoscedasticity exists. The alternative method to detect the presence of

    heteroscedasticity is by graphical method.

    3.7 Newey-West (HAC) Test

    Newey-West (HAC) Test is used when autocorrelation and heteroscedasticity is

    detected in a regression model. It is used to try to overcome autocorrelation, or correlation,

    and heteroscedasticity in the error terms of the models. This is often used to correct the

    effects of correlation in the error terms in regressions applied to time series data.

    3.8 Specification Error Tests

    Specification is the process of converting a theory into a regression model. This

    process consists of selecting an appropriate functional form for the model and

    choosing which variables to include. The specification errors include:

    Incorrect functional form

    Omitted-variable bias Irrelevant variable that is included in the model

    http://en.wikipedia.org/wiki/Serial_dependencehttp://en.wikipedia.org/wiki/Serial_dependence
  • 8/10/2019 Econometric Projek Lithem

    14/31

    12

    The system has dependent variables

    Measurement error

    3.8.1 Ramsey REST Test

    In a multiple regression analysis, a regression equation can be nonlinear in any or all

    the explanatory variables. Ramsey Regression Equation Specification Error Test

    (RESET) test, therefore, is used to detect the nonlinearity in a model . If the newly

    added non-linear explanatory variables have any power in explaining the response

    variable, the model is said to be mis-specified or in other word, suffers from

    specification error. The following hypotheses are made for the Ramsey RESET test:

    H 0: 0654321 (the model does not have specification error)

    H 1: Not all 0 j ; j=1, 2, 3, 4, 5, 6 (the model has specification error).

    3.8.2 Recursive Estimates (CUSUM-Squared Test)

    The cumulative sum squared test (CUSUM-squared test) is used to test the

    consistency of the coefficient in a particular model. The trend of the model will be

    displayed in a graphical representation and the two bounds will be constructed parallelto each other that represent the 5% significance level. If the trend touches or exceeds

    the bound at any point of time, it implies that structural break occurs at that particular

    time and correction need to be done to reduce the influence of this structural break

    towards the predictability of the model.

    3.8.3 The Chows Breakpoint Test

    A series of data can often contain a structural break, due to a change in policy or

    sudden shock to the economy, i.e. Asian Economic Crisis (1997-1998). To carry out

    the test, we partition the data into two or more sub-samples. Each sub-sample must

    contain more observations than the number of coefficients in the equation so that the

    equation can be estimated using each sub-sample. F statistics is used to determine

    whether a single regression is more efficient than two separate regressions involving

    splitting the data into two sub-samples.

  • 8/10/2019 Econometric Projek Lithem

    15/31

    13

    22,121

    21 ~22/

    1/k nk

    R F k n ESS ESS

    k ESS ESS ESS F

    3.8.4 Wald Test Coefficient Restriction

    The Wald test computes the test statistic by estimating the unrestricted regression

    without imposing the coefficient restrictions specified by the null hypothesis. The

    Wald statistic measures how close the unrestricted estimates come to satisfying the

    restrictions under the null hypothesis. If the restrictions are in fact true, then the

    unrestricted estimates should come close to satisfying the restrictions. The hypotheses

    for the joint coefficients test can be formulated as follows:

    H 0: 1 = 2 = = k = 0

    H 1: At least one of j is non-zero, j = 1, 2, , k.

    The test statistic used to test the above hypotheses is:

    11

    1

    2

    22

    k n Rmk R R

    k n ESS mk ESS ESS

    df ESS df df ESS ESS

    F

    U

    RU

    U

    U R

    U U

    U RU R

    3.9 Multicollinearity

    Multicollinearity occurs when two or more independent variables in a multiple

    regression model are highly correlated. There is an approximate linear relationship

    between the explanatory variables, which could lead to unreliable regression estimates,

    although the OLS estimates are still BLUE. In general it leads to the standard errors of

    the parameters being too large, therefore the t-statistics tend to be insignificant. The

    problem of multicollinearity can lead to inaccurate estimation from the regression

    model and thus reduce the trustworthy of the model. Variance Inflation Factor (VIF)

    test can be used to detect and examine the seriousness of the multicollinearity

    problem between the independent variables by using the formula below:

  • 8/10/2019 Econometric Projek Lithem

    16/31

    14

    211

    j RVIF

    If the VIF value of a variable is higher than the selected value of 10, we can say that

    multicollinearity is exist.

  • 8/10/2019 Econometric Projek Lithem

    17/31

    15

    CHAPTER 4: RESULT ANALYSIS AND DISCUSSION

    4.1 Descriptive Statistics

    Table 4.1.1 Statistics Table

    HDI EDUCATION HEALTH INFRA SAFETY TRANSPORTMean 0.673033 3997.933 925.9667 1359.167 591.2000 4007.967Median 0.678500 2067.500 516.5000 932.5000 433.5000 2977.500Maximum 0.763000 12436.00 3780.000 5286.000 1659.000 9450.000Minimum 0.563000 791.0000 53.00000 468.0000 133.0000 1046.000Std. Dev. 0.062555 3805.637 938.3989 1002.103 444.6788 2838.623Skewness -0.130540 1.070335 1.342629 2.264631 1.132522 0.674561Kurtosis 1.706293 2.710527 4.245541 8.929413 3.078749 1.976300

    Jarque-Bera 2.177301 5.832830 10.95248 69.59020 6.420787 3.585113Probability 0.336671 0.054127 0.004185 0.000000 0.040341 0.166534

    Sum 20.19100 119938.0 27779.00 40775.00 17736.00 120239.0Sum Sq. Dev. 0.113479 4.20E+08 25537183 29122126 5734437. 2.34E+08

    Observations 30 30 30 30 30 30

    Based on the table 4.1.1, the mean of HDI from 1981 to year 2010 is 0.673033 with

    standard deviation 0.062555. This shows that Malaysia has medium human

    development for the 30 years. Value of skewness of HDI is -0.130540 and Kurtosis is

    1.706293. From the Jarque-Bera Test, the p-value obtained is larger than = 0.05 at

    5% significant level. The null hypothesis of normal distribution is not rejected. Thus,

    HDI is normally distributed. Same goes to the variable EDUCATION and

    TRANSPORT which has p-values larger than 0.05. Both of the variables, therefore,

    are normally distributed. EDUCATION and TRANSPORT have means RM3997.933

    millions and RM 4007.967 millions respectively with standard deviation RM

    3805.637 millions and RM 444.6788 millions each.

    For the variable HEALTH, it has mean of RM 925.9667 millions and standard

    deviation of RM 938.3989 millions. It has positive skewness of 1.342629 and kurtosis

    of 4.245541. Whereas for variable INFRA, it has mean of RM 1359.167 millions and

    standard deviation of 1002.103 with value of skewness of 2.264631 and kurtosis of

    8.929413. And for variable SAFETY, it has mean of RM 591.200 millions and

    standard deviation of RM 2838.623 millions. These variables are not normally

    distributed as they have p- value less than = 0.05 at 5% significant level.

  • 8/10/2019 Econometric Projek Lithem

    18/31

    16

    4.2 Unit Root Tests Before constructing a regression model and proceeding to estimate the co-integration

    model, its necessary to determine if the variables used for the regression model are

    stationary. Thus, the variables are tested for unit root to determine the integrationorder of the variables. A stationary test known as Augmented Dickey Fuller (ADF)

    with =0.05 is carried out with the hypothesis as below:

    (The variable is not stationary or has unit root)

    (The variable is stationary)

    Table 4.2.1 Result of Unit Root Test

    Variables

    Level 1 st Difference

    None No Trend With Trend None No Trend With Trend

    ln HDI -1.952910*** -2.967767 -3.574244 -1.953381*** -2.976263*** -3.587527***

    ln EDU -1.952910 -2.971853 -3.580623* -1.953381*** -2.971853*** -3.580623**

    ln HEALTH -1.955020 -2.986255 -3.595026*** -1.953381*** -2.971853** -3.580623**

    ln INFRA -1.952910 -2.967767 -3.574244 -1.953381*** -2.971853*** -3.580623***

    ln SAFETY -1.952910 -2.967677 -3.574244** -1.953381*** -2.986225*** -3.603202**

    ln TPT -1.952910 -2.967677 -3.574244 -1.953381*** -2.971853*** -3.580623***

    Table 4.2.2 Summary on Unit Root Test

    Variables Level 1 st Difference I orderln HDI non-stationary stationary*** 1ln EDU non-stationary stationary*** 1

    ln HEALTH non-stationary stationary** 1ln INFRA non-stationary stationary*** 1

    ln SAFETY non-stationary stationary*** 1ln TPT non-stationary stationary*** 1

    From tables 4.2.1 and 4.2.2, its found that the null hypothesis for the data series

    cannot be rejected at the level of the series. Mostly of the data in the regression form

    without intercept and trend, with intercept and regression form with intercept and

    trend do not reject null hypothesis. The computed ADF t-statistics value (absolute

    value) is smaller than the critical values at = 0.01, 0.05 and 0.10 significant level.

    There is enough evidence to show that the variables ln HDI, ln EDUCATION, ln

  • 8/10/2019 Econometric Projek Lithem

    19/31

    17

    HEALTH, ln INFRA, ln SAFETY and ln TPT have unit roots. The same test on the

    first differences of the data series is carried out and result shows that the null

    hypothesis can be rejected at = 0.01, 0.05 and 0.10 significant level . There is no

    enough evidence to show that the variables ln HDI, ln EDUCATION, ln HEALTH, ln

    INFRA, ln SAFETY and ln TPT have unit roots. Thus, this implies that the series are

    integrated of order one [ I (1)] and are stationary in their first differences.

    4.3 Unit Root Test for Residual

    Table 4.3.1 Result of Unit Root Test of Residual

    Null Hypothesis: D(RESID) has a unit rootExogenous: ConstantLag Length: 0 (Automatic - based on SIC, maxlag=2)

    t-Statistic Prob.*

    Augmented Dickey-Fuller test statistic -5.023344 0.0029Test critical values: 1% level -4.200056

    5% level -3.17535210% level -2.728985

    *MacKinnon (1996) one-sided p-values.

    Now that it has been established that the variables are integrated of the same order.We need to test whether the regression model is a spurious regression. A spurious

    regression refers to a regression that shows significant result due to the presence of a

    unit root in the variables. The series is co-integrated if the residual in the series is

    stationary. The residual is tested for the presence of a unit root. The ADF statistics

    (absolute value) is found to be greater than the critical values at = 0.01, 0 .05 and

    0.10 significant level. The null hypothesis, thus, can be rejected and the residual is

    proved that it do not have unit root or is stationary. The regressions model is not

    sprious and co-integration exists.

    4.4 Correlation

    Table 4.4.1 General Correlation between HDI and Other Variables

    LNHDI LNEDU LNHEALTH LNINFRA LNSAFETY LNTPT

    LNHDI 1.000000 0.915413 0.909630 0.719905 0.752611 0.926196

    LNEDU 0.915413 1.000000 0.915935 0.754489 0.797160 0.908854LNHEALTH 0.909630 0.915935 1.000000 0.684609 0.820601 0.926238

  • 8/10/2019 Econometric Projek Lithem

    20/31

    18

    LNINFRA 0.719905 0.754489 0.684609 1.000000 0.699408 0.723428

    LNSAFETY 0.752611 0.797160 0.820601 0.699408 1.000000 0.885159

    LNTPT 0.926196 0.908854 0.926238 0.723428 0.885159 1.000000

    From the table 4.3.1, its found that ln HDI has strong positive correlation relationshipwith ln EDUCATION, ln HEALTH, ln INFRA, ln SAFETY and ln TPT variables,

    with values 0.915413, 0.909630, 0.719905, 0.752611 and 0.926196 respectively. That

    means the value of ln HDI increases with the values of ln EDUCATION, ln HEALTH,

    ln INFRA, ln SAFETY and ln TPT.

    4.5 Autocorrelation

    4.5.1 Durbin Watson Test

    Table 4.5.1 Multiple Regression Model

    Dependent Variable: LNHDIMethod: Least SquaresDate: 12/16/13 Time: 18:21Sample: 1981 2010Included observations: 30

    Variable Coefficient Std. Error t-Statistic Prob.

    C -1.272230 0.102793 -12.37666 0.0000

    LNEDU 0.026928 0.017288 1.557625 0.1324LNHEALTH 0.017042 0.014265 1.194677 0.2439LNINFRA 0.013528 0.015386 0.879282 0.3880

    LNSAFETY -0.041885 0.016910 -2.476933 0.0207LNTPT 0.089002 0.026153 3.403086 0.0023

    R-squared 0.914958 Mean dependent var -0.400200 Adjusted R-squared 0.897241 S.D. dependent var 0.094028S.E. of regression 0.030142 Akaike info criterion -3.988966Sum squared resid 0.021804 Schwarz criterion -3.708727Log likelihood 65.83450 Hannan-Quinn criter. -3.899315F-statistic 51.64261 Durbin-Watson stat 0.852741Prob(F-statistic) 0.000000

    The regression model is shown in the table above. The expected positive relationship

    between ln HDI and other variables are consistent with the theory except the

    independent variable ln SAFETY which has a negative sign. This implies that In HDI

    has a negative relationship with ln SAFETY which is inconsistent with the theory.

    The p-values of both ln EDU, ln HEALTH and ln INFRA are larger than = 0.05

    whereas the p-values of both ln TPT and ln SAFETY are less than = 0 .05. This

    means that ln EDU, ln HEALTH and ln INFRA are less significant while ln TPT and

  • 8/10/2019 Econometric Projek Lithem

    21/31

    19

    ln SAFETY are significant under 5% significant level. The high value in the table

    shows that the dependent variables and independent variables are closely correlated.

    The 91.50% of the changes in ln HDI can be predicted by the current model and

    explained by the other independent variables.

    With the hypothesis below:

    (There is no autocorrelation)

    (There is positive or negative autocorrelation)

    By referring to the statistics table on Durbin-Watson, there is no autocorrelation if

    Durbin-Watson statistic is 2, has positive autocorrelation if less than 2 or negative

    autocorrelation if more than 2. From the data series, the Durbin-Watson statistics

    obtained is 0.852741. The lower and upper critical values, and are 1.071 and

    1.833. Null hypothesis is rejected at 5% significant level and the series data exhibits

    positive autocorrelation.

    4.5.2 Breusch-Godfrey Serial Correlation LM Test

    Table 4.5.2.1 Result of Serial Correlation LM Test

    Breusch-Godfrey Serial Correlation LM Test:

    F-statistic 5.436148 Prob. F(2,22) 0.0121Obs*R-squared 9.922303 Prob. Chi-Square(2) 0.0070

    Breusch-Godfrey Serial Correlation LM Test is used to test the existence of

    autocorrelation. Result shows that the p-value = 0.0121 which is less than = 0.05.

    Therefore, null hypothesis is rejected, there is autocorrelation exists in the model at

    = 0.05 significant level. The probability value strongly indicates the presence of serial

    correlation in the residuals.

    4.6 Heteroscedasticity Test

    The presence of heteroscedasticity in the regression model is tested by observing the

    graph of residual square over the time frame of year 1981 until year 2010 and using

    Whites Heteroscedasticity Test.

  • 8/10/2019 Econometric Projek Lithem

    22/31

    20

    Graph 4.6.1 Plot of Residual Square of Estimated Regression

    The graph above is the graphical representation of the residual square of the estimated

    regression model from year 1981 until year 2010. From the graph, we can see there is

    fluctuation over the time period and the residual square is not parallel to the x-axis.

    This shows that there is no heteroscedasticity exist within the regression model. In

    order to get a more concrete proof regarding the existence of heteroscedasticity withinthe regression model, Whites heteroscedasticity test is used with the fo llowing

    hypothesis:

    Heteroscedasticity does not exist

    Heteroscedasticity exist

    Table 4.6.1 Result of White Test

    Heteroskedasticity Test: White

    F-statistic 1.576167 Prob. F(20,9) 0.2451Obs*R-squared 23.33717 Prob. Chi-Square(20) 0.2725Scaled explained SS 17.98610 Prob. Chi-Square(20) 0.5883

    The result of the white test is shown in the table above. The p-value= 0.2451 which is

    larger than = 0.05. The null hypothesis is not rejected at 5% significant level. There

    is enough evidence to show that is no heteroscedasticity exist in our regression model

    at = 0.05.

  • 8/10/2019 Econometric Projek Lithem

    23/31

    21

    4.7 Newey-West (HAC) Test

    Table 4.7.1 Result of Newey-West Test

    Dependent Variable: LNHDIMethod: Least SquaresDate: 12/16/13 Time: 18:30Sample: 1981 2010Included observations: 30HAC standard errors & covariance (Bartlett kernel, Newey-West fixed

    bandwidth = 4.0000)

    Variable Coefficient Std. Error t-Statistic Prob.

    C -1.201980 0.096558 -12.44831 0.0000LNEDU 0.028289 0.013579 2.083214 0.0481

    LNHEALTH 0.016091 0.019274 0.834819 0.4121INFRA 7.00E-06 6.91E-06 1.013081 0.3211

    LNSAFETY -0.042261 0.019407 -2.177578 0.0395LNTPT 0.090609 0.017333 5.227565 0.0000

    R-squared 0.914997 Mean dependent var -0.400200 Adjusted R-squared 0.897288 S.D. dependent var 0.094028S.E. of regression 0.030135 Akaike info criterion -3.989429Sum squared resid 0.021794 Schwarz criterion -3.709190Log likelihood 65.84144 Hannan-Quinn criter. -3.899778F-statistic 51.66875 Durbin-Watson stat 0.889316Prob(F-statistic) 0.000000 Wald F-statistic 37.23538Prob(Wald F-statistic) 0.000000

    Newey-West (HAC) Test is used to improve the regression model and to reduce the

    effect of autocorrelation in the model. After conducting the Newey-West Test on the

    variables, we can see that there are changes in the significant level of the variables. The

    significant level of the variable ln EDU has decreased and become significant at = 0.05.

    The R-squared value, Durbin-Watson statistic and Standard Error remain almost the

    same as the one before the regression model is corrected.

    4.8 Specification Error Test

    After checking for the problem of autocorrelation and heteroscedasticity, the

    regression model is still not the best yet. More specification errors tests need to be

    carried out in order to make the regression model becomes more accurate and precise.

    4.8.1 Ramsey RESET Test

    Table 4.8.1.1 Result of Ramsey RESET TestRamsey RESET Test

  • 8/10/2019 Econometric Projek Lithem

    24/31

    22

    Equation: UNTITLEDSpecification: LNHDI C LNEDU LNHEALTH LNINFRA LNSAFETY LNTPTOmitted Variables: Powers of fitted values from 2 to 3

    Value df ProbabilityF-statistic 1.362658 (2, 22) 0.2768

    Likelihood ratio 3.503557 2 0.1735

    From the result of Ramsey RESET Test, its found that the value of F -statistic is

    3.500796 with probability of 0.2768 which is greater than = 0.05. Therefore the null

    of hypothesis that the newly added variables, = 0 is not rejected at 5%

    significant level. The newly added variables do not the explanatory power toward the

    dependent variable ln HDI. This implies that the estimated regression model is not

    suffered from specification error.

    4.8.2 Recursive Estimates (CUSUM-Squared Test)

    Graph 4.8.2.1 Result of CUSUM-Squared Test

    A CUSUM test is used to test the consistency of the coefficients in the regression

    model. From the graph, we can see that there is consistency of the coefficients in the

    model for the time frame of year 1981 until 2010. The estimated regression model is

    stable.

  • 8/10/2019 Econometric Projek Lithem

    25/31

    23

    4.8.3 Chow s Breakpoint Test

    In order to verify the existence of structural change in the regression model for the

    certain period, Chow Breakpoint Test is used. Important date like year 1998 (Asian

    Economic Crisis) is identified and the following hypothesis is made:

    Table 4.8.3.1 Result of Chow Breakpoint Test

    Chow Breakpoint Test: 1998Null Hypothesis: No breaks at specified breakpointsVarying regressors: All equation variablesEquation Sample: 1981 2010

    F-statistic 2.489226 Prob. F(6,18) 0.0624Log likelihood ratio 18.12525 Prob. Chi-Square(6) 0.0059Wald Statistic 14.93536 Prob. Chi-Square(6) 0.0208

    Result found that the there is a structural change in the regression model in year 1998

    as the F-statistic has a small probability of 0.0624 which is less than = 0.05. The

    null hypothesis is not rejected at 5% significant level. A dummy variable is not

    required to add into the regression model.

    4.8.4 Wald Test

    The estimated equation is shown as below:Estimation Equation:=========================LNHDI = C(1) + C(2)*LNEDU + C(3)*LNHEALTH + C(4)*LNINFRA + C(5)*LNSAFETY + C(6)*LNTPT

    Table 4.8.4.1 Result of Wald Test

    Wald Test:Equation: Untitled

    Test Statistic Value df Probability

    F-statistic 3.409866 (3, 24) 0.0337Chi-square 10.22960 3 0.0167

    Null Hypothesis: C(2)=0, C(3)=0, C(4)=0Null Hypothesis Summary:

    Normalized Restriction (= 0) Value Std. Err.

    C(2) 0.026928 0.017288C(3) 0.017042 0.014265

    C(4) 0.013528 0.015386

  • 8/10/2019 Econometric Projek Lithem

    26/31

    24

    From the table 4.5.1, w notice that the p-value for the independent variables, ln EDU

    and ln HEALTH and ln INFRA are 0.1324, 0.2439 and 0.3880 which are larger than

    = 0.05. Both of the variables are insignificant at 5% significant level. Thus, these

    three variables are subjected to Wald Test to determine if these variables can be

    omitted from the current regression model. The hypothesis below is made:

    The result implies that the null hypothesis cannot be rejected at 5% significant level as

    the p-value = 0.0337 which is less than = 0.05. Thus, it can be concluded that the

    three independent variables, ln EDU, ln HEALTH and ln INFRA cannot be omitted

    from the current regression model.

    4.9 Multicollinearity

    The regression model at last, will test for multicollinearity. Multicollinearity refers to

    a situation in which two or more explanatory variables in a multiple regression model

    are highly linearly related. It means that the independent variables are highly

    correlated to an extent that in which the variables are dependent on each other. This

    can lead to a serious problem of inaccurate estimation from the regression model and

    thus reduce the trustworthy of the model even though it has high value.

    As shown in the table 4.4.1, the general correlation between the variable ln TPT and

    ln HEALTH has the highest value of 0.926238. There is very high collinearity level

    between these variables, hence further analyses was made to test the seriousness of

    the multicollinearity problem for each and every independent variable by using theVIF (Variance Inflation Factor) Test. The range of VIF is started from 1. The

    multicollinearity problem is very serious if VIF exceed 10.

    http://en.wikipedia.org/wiki/Multiple_regressionhttp://en.wikipedia.org/wiki/Multiple_regression
  • 8/10/2019 Econometric Projek Lithem

    27/31

    25

    Table 4.9.2 Result of VIF Test II

    Variance Inflation Factors

    Date: 12/16/13 Time: 19:50

    Sample: 1981 2010

    Included observations: 30

    Coefficient Uncentered Centered

    Variable Variance VIF VIF

    C 0.008617 199.9143 NA

    LNEDU 0.000382 555.5256 7.694700

    LNHEALTH 0.000231 216.5798 7.337885

    LNINFRA 0.000336 388.4226 2.515110

    LNSAFETY 0.000290 255.6127 3.451071

    From the table 4.9.1 above, it shows that the VIF of ln TPT is higher than 10.

    Therefore, the variable ln TPT is omitted and the VIF test is carried out again in order

    to get a better regression model. In the VIF Test II, all the VIF values of the variables

    obtained are less than 10 which show that there is no serious multicorrelinearity

    problem.

    After went through a series of tests, the suitable regression model is developed as

    below:

    ln HDI= -1.272230 + 0.026928 lnEDU + 0.017042 lnHEALTH + 0.013528

    lnINFRA - 0.041885 lnSAFETY + 0.089002 lnTPT

    Table 4.9.1 Result of VIF Test I

    Variance Inflation Factors

    Date: 12/16/13 Time: 19:39

    Sample: 1981 2010

    Included observations: 30

    Coefficient Uncentered Centered

    Variable Variance VIF VIF

    C 0.010566 348.9104 NA

    LNEDU 0.000299 618.0921 8.561322

    LNHEALTH 0.000203 272.0020 9.215631

    LNINFRA 0.000237 388.9255 2.518367

    LNSAFETY 0.000286 359.2304 4.850030

    LNTPT 0.000684 1471.113 12.18056

  • 8/10/2019 Econometric Projek Lithem

    28/31

    26

    CHAPTER 5: CONCLUSION

    After conducting a various tests and diagnostics, we manage to develop the most

    suitable regression model for our study. The regression model obtained is:

    ln HDI= -1.272230 + 0.026928 lnEDU + 0.017042 lnHEALTH + 0.013528

    lnINFRA - 0.041885 lnSAFETY + 0.089002 lnTPT

    From the study, we can know that how is the government expenditure affect the value

    of Human Development Index. Result shows that the government spending in internal

    security and transport sectors has significant influences on HDI while the spending in

    education, health and infrastructure sectors does not has significant impact on HDI.

    The study also shows that the government expenditure in education, health,

    infrastructure and transport sector has positive relationship with HDI as what we

    expected in the initial prediction. This is because HDI takes into account of life

    expectancy, expected and means year of schooling and gross national income per

    capital. The more government spending in educational and health sectors, for example,

    the building of hospital and school, the increase in the number of teachers and health

    care workers and the increase in the usage of technology can improve the life

    expectancy and literacy rate among the Malaysian. The government investments in

    infrastructure and transport sector also increase the value of HDI as it improves the

    living standard of the residents. On the other hand, its not consistent with the theory

    that the government spending in internal security sector affect the HDI negatively. It

    might be due to the inefficiency of the polices in fighting against the upward trend of

    crime rate and the residents have the feeling of unsecure.

    Throughout the study, the estimated regression model doesnt have heteroscedasticity

    and serious multicollinearity problem but it contains positive autocorrelation.

    Autocorreltaion that happens in the model might due to the natural of the data series

    and is har to avoid. Besides that, there is no structural change across the sub-sample in

    the regression model and the model, thus, doesnt suffer from specification error.

    Following is the summary of results of a few main tests that we had carried out.

  • 8/10/2019 Econometric Projek Lithem

    29/31

    27

    Tests ResultAutocorrelation:

    Positive autocorrelation exists.Durbin-Watson TestBreusch- Godfrey Serial Correlation LM Test

    Heteroscedasticity:There is no heteroscedasticity.Graphical Test

    Whites Heteroscedasticity test

    Model Specification Error:

    Structural change across the sub-samples does not exist.

    Ramsey RESET TestRecursive Estimates (CUSUM-Squared Test)Chows Breakpoint Test Wald Test

    Multicollinearity:Serious multicollinearity does notexist between independent variable.Variance Inflation Factor (VIF) Test

    In conclusion, government expenditure in internal security and transport sectors can

    affect the value HDI significantly as shown in the tests above whereby ln SAFETY

    and ln TPT can explain and predict the changes in the value of HDI up to 90%.

    Government should focus more in these two sectors and plan well for the coming

    budget so that the human development can be improved. However, governmentshould not ignore the other sectors as they affect HDI positively although the impact

    induced is not significant.

    Even though a satisfactory result is obtained throughout the study, some improvement

    can be done for further research and to obtain a better regression model. More

    independent variables can be added to the model to improve its analysis. More detail

    tests can be carried out toward the model to reconfirm the results obtained, for

    example ARCH test can be used to reconfirm the result of Whites Heteroscedasticity

    Test.

  • 8/10/2019 Econometric Projek Lithem

    30/31

    28

    REFERENCE

    Afonso, A., & St Aubyn, M. (2005). Non-parametric approaches to education andhealth efficiency in OECD countries. Journal of Applied Economics , 8(2), 227-246.

    Gupta, S., Clements, B., & Tiongson, E. (1998). Public spending on humandevelopment. Finance and Development , 35, 10-13.

    Ministry of Finance Malaysia, Federal Government Development Expenditure 1970-2012, viewed on 11 December 2013,

    Pang, G., & Herrera, S. (2005). Efficiency of public spending in developing countries:

    an efficiency frontier approach. World Bank Policy Research Working Paper , (3645).

    Prasetyo, A. D., & Zuhdi, U. (2013). The Government Expenditure Efficiencytowards the Human Development. Procedia Economics and Finance . pp 615-622.

    Rajkumar, A. S., & Swaroop, V. (2008). Public spending and outcomes: Doesgovernance matter?. Journal of development economics , 86 (1), 96-111.

    R.A.M. Dzubaidi, Rahmah Ismail, Tamat Sarmidi. (2013). Peranan Perbelanjaan

    Kerajaan Terhadap Pembangunan dan Pertumbuhan Ekonomi. Universiti KebangsaanMalaysia. ISSN: 2231-962X, PROSIDING PERKEM VIII, JILID 2, page 872 879.

    Razmi, M. J. (2012). Investigating the Effect of Government Health Expenditure onHDI in Iran. Journal of Knowledge Management, Economics and InformationTechnology , 2.

    United Nation Development Programme, Human Development Report, viewed on 11December 2013, < http://hdr.undp.org/en/statistics/hdi/ >

    http://www.treasury.gov.my/pdf/ekonomi/dataekonomi/2013/timeseries/FGDE1970_2012.pdfhttp://www.treasury.gov.my/pdf/ekonomi/dataekonomi/2013/timeseries/FGDE1970_2012.pdfhttp://www.treasury.gov.my/pdf/ekonomi/dataekonomi/2013/timeseries/FGDE1970_2012.pdfhttp://hdr.undp.org/en/statistics/hdi/http://hdr.undp.org/en/statistics/hdi/http://hdr.undp.org/en/statistics/hdi/http://hdr.undp.org/en/statistics/hdi/http://www.treasury.gov.my/pdf/ekonomi/dataekonomi/2013/timeseries/FGDE1970_2012.pdfhttp://www.treasury.gov.my/pdf/ekonomi/dataekonomi/2013/timeseries/FGDE1970_2012.pdf
  • 8/10/2019 Econometric Projek Lithem

    31/31

    APPENDIX

    (Source: Human Development Report)

    Figure 1 Component of HDI