the effect of risk aversion on self- employment status

42
The Effect of Risk Aversion on Self- Employment Status A review of this relationship using risk aversion measures derived from a partly incentive compatible lottery experiment. by YORAM VANMAEKELBERGH 10003416 Supervisors: ZHENXING HUANG JAN TUINSTRA Bachelor thesis econometrics June, 2014 Abstract In this study is examined whether a high degree of risk aversion has a negative effect on the propensity to become self-employed. Three different measures of risk aversion are derived from a partly incentive compatible lottery experiment. Each measure is included in different reduced form logit regressions proposed by previous research. None of the measures seems to have a significant effect, contradictory to most previous research. Insignificance appears invariant to the definition of self- employment, the included control variables, splitting up the sample by different payoff conditions and leaving out individuals whose job status is unknown. Moreover, the final model is clearly misspecified. This may be caused by reverse causality issues, heteroskedasticity because of an omitted main control variable, the design of the experiment or construct invalidity of the risk aversion measures. Keywords: Risk Aversion, Self-Employment, Entrepreneurship, Lottery Experiment, Incentives. In this study use is made of data of the Longitudinal Internet Studies for the Social sciences (LISS) panel administered by CentERdata (Tilburg University, The Netherlands). Acknowledgements I would like to thank Zhenxing Huang and Jan Tuinstra for their feedback on earlier drafts of this thesis. I would like to thank Rob van Hemert for teaching me academic writing skills and his extensive feedback on the theoretical part of this thesis. I would like to thank Thomas Dohmen from the University of Maastricht for sending me an important discussion paper for this thesis. Finally I would like to thank Joppe Arnold for peer reviewing my work.

Upload: others

Post on 18-Dec-2021

1 views

Category:

Documents


0 download

TRANSCRIPT

The Effect of Risk Aversion on Self-

Employment Status A review of this relationship using risk aversion measures derived from a partly incentive

compatible lottery experiment.

by YORAM VANMAEKELBERGH 10003416

Supervisors:

ZHENXING HUANG JAN TUINSTRA

Bachelor thesis econometrics

June, 2014

Abstract In this study is examined whether a high degree of risk aversion has a negative effect on the

propensity to become self-employed. Three different measures of risk aversion are derived from a

partly incentive compatible lottery experiment. Each measure is included in different reduced form logit

regressions proposed by previous research. None of the measures seems to have a significant effect,

contradictory to most previous research. Insignificance appears invariant to the definition of self-

employment, the included control variables, splitting up the sample by different payoff conditions and

leaving out individuals whose job status is unknown. Moreover, the final model is clearly misspecified.

This may be caused by reverse causality issues, heteroskedasticity because of an omitted main

control variable, the design of the experiment or construct invalidity of the risk aversion measures.

Keywords: Risk Aversion, Self-Employment, Entrepreneurship, Lottery Experiment, Incentives. In this study use is made of data of the Longitudinal Internet Studies for the Social sciences (LISS) panel administered by CentERdata (Tilburg University, The Netherlands). Acknowledgements I would like to thank Zhenxing Huang and Jan Tuinstra for their feedback on earlier drafts of this thesis. I would like to thank Rob van Hemert for teaching me academic writing skills and his extensive feedback on the theoretical part of this thesis. I would like to thank Thomas Dohmen from the University of Maastricht for sending me an important discussion paper for this thesis. Finally I would like to thank Joppe Arnold for peer reviewing my work.

1

Contents 1. Introduction ....................................................................................................................... 2

2. Theory ............................................................................................................................... 5

2.1 Relating risk aversion to utility functions ....................................................................... 5

2.2 Relating the derived degree of risk aversion to risk taking behavior ............................. 6

2.3 Possible relationships between risk aversion and self-employment ............................. 8

2.3.1 A theoretical model ................................................................................................ 8

2.3.2 Risk aversion as a determinant of self-employment probability .............................. 8

2.3.3 Available variables and their effect on self-employment probability ......................10

2.3.3.1 Gender ...........................................................................................................10

2.3.3.2 Age ................................................................................................................11

2.3.3.3 Marital status and number of children ............................................................11

2.3.3.4 Level of education ..........................................................................................12

2.3.3.5 Wealth and capital constraints .......................................................................12

2.3.3.6 Family background, not in the LISS panel ......................................................13

2.3.3.7 Unemployment status ....................................................................................13

2.4 Research question and hypotheses ............................................................................14

3. Data and methods ............................................................................................................14

3.1 Data: the LISS panel ...................................................................................................14

3.2 Deriving different measure of risk aversion based on the LISS lottery experiment ......15

3.3 Procedure to derive a model for the relationship between risk and self-employment ...17

3.4 Operationalization of the variables ..............................................................................17

3.5 creating subsamples for robustness checks ................................................................18

4. Results .............................................................................................................................20

4.1 Comparing the derived risk aversion measures by payoff condition ............................20

4.2 Differences between self-employed and not self-employed individuals .......................21

4.3 The final model ...........................................................................................................23

4.4 Results found using the final model ............................................................................25

4.5 Detecting influential observations ................................................................................29

4.6 Heteroskedasticity and its possible solutions ..............................................................31

5. Conclusion .......................................................................................................................33

6. Discussion ........................................................................................................................33

References ...........................................................................................................................38

Appendix I ............................................................................................................................... i

2

1. Introduction Recently, a lot of research has been done on why individuals choose for self-employment

rather than a normal job. This is interesting for society because starting an own business

potentially creates new jobs and can therefore decrease currently high unemployment rates.

Moreover, the topic is interesting for the academic world, because of the complex nature of

self-employment. This interest is reflected in the fast increasing number of studies on this

topic.

Research on self-employment has resulted in a wide range of explanations for this

phenomenon. In the past, self-employment was explained by access to capital, links between

generations and fiscal reasons (Brown et al., 2011). Nowadays research on this topic

focuses on the differences in personal characteristics between self-employed individuals and

individuals who are not. Previous studies found evidence that one of the main differences is

that self-employed workers are more able and more willing to take risks (see for example,

Brown et al., 2011; Ekelund et al., 2005; Dohmen et al., 2011). Modern economics regards

an individual’s attitude towards risk as a main determinant for all kinds of behavioral choices,

including the choice to be(come) self-employed.

For example, Hartog et al. (2002) and Masclet et al. (2009) provide evidence that self-

employed workers have a significantly lower degree of risk aversion than their not-self-

employed counterparts. This view is supported by Ekelund et al. (2005) and Brown et al.

(2011), who find that a low degree of risk aversion increases the probability of self-

employment, ceteris paribus. However, Miner and Raju (2004) find evidence for the reverse

relationship, i.e. self-employed workers are more risk averse than wage employed workers.

Contrary to this, Tucker (1988) and Parker (2008) find that risk attitude has no significant

effect on the decision for self-employment. Finally, Caliendo et al. (2009) find that risk

aversion is only a significant explanatory variable in explaining the self-employment decision

when an individual has already been employed before becoming self-employed.

Although more supportive than counter supportive literature can be found,

significance of the risk aversion variable is not invariant to the used models and the way the

degree of risk aversion is measured. The models differ with respect to the direction of the

relationship between the two variables. In some models the degree of risk aversion is used

as an explanatory variable for the probability of self-employment (see for example, Dohmen

et al., 2011), while in other models self-employment status is used as an explanatory variable

for the degree of risk aversion (see for example, Masclet et al., 2009). Previous research

also differs in the way in which risk aversion is measured. Four possible derivations of the

degree of risk aversion are used most frequently; based on a hypothetical gamble, based on

experiments with real payoffs, based on a psychological measure and based on a self-

assessment risk question.

3

All the above measures of risk aversion have one thing in common: they are all

proxies for an individual’s degree of risk taking behavior. Whether the derived proxy for risk

aversion is a good predictor for the decision of self-employment is examined in Dohmen et

al. (2005). One of their results is that responses to lottery questions with hypothetical payoffs

are not able to predict self-employment status. This finding contradicts many of the results of

the aforementioned researches, but comparison is complicated due to the different used

models and different measures of risk aversion.

A possible explanation for the contradictory result of Dohmen et al. (2005) is that

responses to their used hypothetical risk question are not incentive compatible. This makes it

unclear to what extent the derived degrees of risk aversion translate into actual risk-taking

behavior, like the choice to become self-employed (Dohmen et al., 2011). Therefore, in this

study a risk aversion measure derived from a partly incentive compatible lottery experiment

is included in a model explaining the probability of self-employment. The question is whether

this degree of risk aversion is a significant explanatory variable in such a model. A negative

answer to this question is found in the present study, as will be shown in the results section

in paragraph 4.4.

However, on beforehand an affirmative answer to this question is expected because

of three reasons. Firstly, Masclet et al. (2009) find that a degree of risk aversion based on an

incentivized lottery experiment is significantly explained by self-employment status.

Therefore, it seems natural to expect significance in a model in which the dependent and

explanatory variables are exchanged, as used in this paper. Secondly, Dohmen et al. (2011)

find that the results of an incentive compatible lottery experiment are strongly correlated with

an individual’s general risk measure, which proved able to predict self-employment status.

This together implies that it should as well be possible to predict self-employment status from

the experimental measures used in this study directly. Thirdly, Camerer and Hogarth (1999)

state that adding incentives to a risky choice question reduces the variance of the results.

This reduced variance may increase the likelihood of significant results.

In this study three different degrees of risk aversion are derived from a partly

incentive compatible lottery experiment of the LISS panel1. These degrees of risk aversion

are used among other variables of the LISS panel to estimate an appropriate reduced form

model to answer the question whether the derived risk aversion coefficient is a significant

explanatory variable for the probability of self-employment. To the author’s knowledge, the

combination of risk aversion derived by a partly incentive compatible lottery experiment and

risk aversion as an explanatory variable for the probability of self-employment is unique and

never researched before.

1 40% of the participants were in a condition that they were able to make money.

4

The proceeding of this thesis is organized as follows. In chapter two the theory about

risk aversion and its influence on self-employment status found in previous research are

addressed. Chapter three discusses the data, methods, used model and used samples. In

chapter four the results of this study can be found. Chapter five summarizes the conclusions.

Finally, in chapter six an extensive explanation for the found results and forthcoming advises

for future research are discussed.

5

2. Theory

In this chapter the theory of risk aversion is related to utility functions firstly. Secondly, risk

aversion derived by the used lottery experiment is linked to risk taking behavior. Finally,

existing models to explain self-employment status and determinants of self-employment

status are reviewed.

2.1 Relating risk aversion to utility functions

This paragraph gives a brief explanation of all relevant aspects of utility theory and risk

aversion necessary for this research. It’s important to note that all statements in this

paragraph are only valid within the expected utility framework. All concepts in this paragraph

are retrieved from Wakker (2010).

A well-known measure of risk aversion is the Pratt-Arrow measure, also called the

absolute measure of risk aversion:

(1)

In this measure U is a utility function which is twice continuously differentiable. Another

frequently used measure is the relative measure of risk aversion:

(2)

In this measure x is an individual’s wealth.

An individual A is said to be more risk averse than another individual B if and only if

A’s absolute (relative) measure of risk aversion, exceeds B’s everywhere. This also applies

to all the risk aversion measures used in this study: a higher value means a higher degree of

risk aversion.

Empirical observations often have properties which can be derived from two different

families of utility functions, introduced in what follows. The first family of utility functions is the

power family, which has the property of constant relative risk aversion. That family is defined

on as follows:

(3)

6

In the family of equations of (3), θ can be considered as an anti-index of risk aversion: the

smaller its value, the more concave the utility function, the more risk aversion is generated.

Based on this family a relative degree of risk aversion, necessary for the present study, is

computed (see 1) to correspond with the common empirical finding from previous research of

increasing relative risk aversion with increasing wealth.

The second family of utility functions is the exponential family which has the property

of constant absolute risk aversion. That family is defined on as follows:

(4)

In the family of equations of (4), θ can be considered as an index of risk aversion: θ > 0

results in a concave utility function implying risk aversion, θ = 0 results in a linear utility

function implying risk neutrality, θ < 0 results in a convex utility function implying risk seeking.

Based on this family an absolute degree of risk aversion, necessary for the present study, is

computed (see 2) to correspond with the common empirical finding of decreasing absolute

risk aversion with increasing wealth.

2.2 Relating the derived degree of risk aversion to risk taking behavior

Previous research about the relationship between risk aversion and self-employment status

uses different measures of risk aversion to examine their potential relationship. For example,

Hartog et al. (2002) deduce their measures of risk aversion from an individual’s reservation

price for a lottery ticket and from the answer provided on an income shift question. In

addition, Brown et al. (2011) use a six point risk aversion scale which is based on classifying

individuals by their risk attitude according to answers they provided on a hypothetical gamble

with their income. Furthermore, Ekelund et al. (2005) use a psychological measure of risk

aversion, based on harm avoidance. Finally, Masclet et al. (2009) and Dohmen et al. (2011)

use an incentive compatible lottery to reveal an individual’s degree of risk aversion.

All these measures for risk aversion are quite complicated to obtain: they require

understanding of probability theory by the participants, strong imaginary power or they are

expensive to carry out. However, risk aversion measures which are easy to obtain also exist.

For example Dohmen et al. (2011) use a question which directly asks survey participants

about their willingness to take risks: “How do you see yourself: are you generally a person

who is fully prepared to take risks or do you try to avoid taking risks?”. Participants provided

answers to this general risk question on a 10-ladder Likert scale (i.e. 0 means ‘Not at all

willing to take risks’ and 10 means ‘Very willing to take risks’).

7

An interesting question for this paper arises with respect to measuring the degree of

risk aversion: what is the relationship between the answer provided on the above general risk

measurement question and the measure derived by an incentive compatible lottery

experiment? Dohmen et al. (2005) find that the degree of risk aversion derived from a

hypothetical lottery question is highly correlated with the degree of risk aversion derived from

this general risk question. Moreover, they find that this last mentioned risk measure is able to

predict self-employment status, while the first measure is not. This is noteworthy because the

general risk question aims to measure an individual’s degree of risk perception, which can be

related to an individual’s risk taking behavior in real life according to Weber et al. (2002), like

the choice to become self-employed (Dohmen et al., 2005). The same variable, to wit the

individual’s degree of risk aversion, should therefore lie underneath both the answer of the

general risk question, the decision to become self-employed and the results of the

hypothetical lottery experiment.

In Dohmen et al. (2011) is found that, when the above lottery question is incentivized,

its results are not only highly correlated with the general risk measure, but also able to

predict the general risk measure and vice versa. Combining this last result with the

aforementioned result that the general risk measure is able to predict self-employment

status, it seems natural to assume that a risk aversion measure derived from an incentive

compatible lottery experiment is able to predict self-employment status directly. This study

sought to examine this last question. The complete reasoning of this paragraph is

summarized in figure 1.

Figure 1

Left: Results of Dohmen et al. (2005). Right: hypothesis of this paper (in green) combined

with the results of Dohmen et al. (2011).

8

2.3 Possible relationships between risk aversion and self-employment

Previous research provides many explanations why an individual chooses to become self-

employed. A low degree of risk aversion appeared to be one of the main determinants in this

choice. In general two approaches to model this relationship can be distinguished, a

theoretical model and a reduced form equation. These models and their included variables

are elaborated respectively in the next paragraphs.

Of course no real model for the choice of self-employment exists. Previous research

dealt with this fact by considering reduced form equations for this relationship, which do not

take all influencing variables into account. In this study the same strategy is employed: the

purpose is not to find a structural model for the probability of self-employment2, but to find an

appropriate reduced form equation containing risk aversion to review the relationship

between risk aversion and self-employment.

2.3.1 A theoretical model

Praag and Cramer (2001) developed a theoretical model in which an individual chooses for

self-employment based on his risk attitude and entrepreneurial ability. An individual is said to

know the labor demand and profit for both occupational choices and compares the expected

utility of the profit of self-employment with the utility of wage employment based on these

values. He occupies himself in the sector with the highest utility.

This model’s strong and unrealistic assumptions, e.g. single wage rate, only full-time

employment and a closed economy, make it not suitable for empirical analysis with data from

a real economy as in the present study. But the model does contribute to this research by

giving insight in which factors influence the probability of self-employment. By their model,

including the variables level of education, gender and risk attitude to model the probability of

self-employment is both theoretically sound and empirically underbuilt. The inclusion of other

influencing variables relies on common sense, sociological arguments and empirical

evidence, rather than on a theoretical model, as will be discussed in paragraph 2.3.3.

2.3.2 Risk aversion as a determinant of self-employment probability

The paper of Cramer et al. (2002) was the first that examined the relationship between risk

aversion and self-employment in an empirical way. Previous research has suggested that

individuals with a low degree of risk aversion are more likely to choose for self-employment.

To test this hypothesis empirically they built a relatively simple model with self-employment

status as the dependent variable and a risk aversion measure among the independent

variables in a probit model. Their results show that risk aversion is a significant explanatory

2 The words self-employment status, probability for self-employment and decision to become self-

employed are all used together without distinction in meaning.

9

variable with a negative sign, but because of different timing of the data3, nothing concerning

causality is claimed.

In addition, Ekelund et al. (2005) state that, because earnings of a self-employed

individual have greater variance than the earnings of a wage employed individual, it seems

natural to assume that self-employed workers are less risk averse. In both the logit model

with and without possible endogenous variables, their risk aversion measure based on harm

avoidance appeared to be a significant explanatory variable with the expected negative sign.

Furthermore, Brown et al. (2011) include even more possible explanatory variables to

model the choice for self-employment. Finally they estimate a probit model with many control

variables, family background covariates and a risk aversion measure based on a gamble with

an individual’s income. They find a significant negative relationship between risk aversion

and the probability of being self-employed.

Finally, Dohmen et al. (2005) use a probit model with self-employment status as the

dependent variable and a risk aversion measure based on a hypothetical investment

question among the independent variables. They find that this risk aversion measure does

not have a significant influence on self-employment status. A possible explanation for

insignificance of this risk aversion measure is the aforementioned lack of real payoffs

resulting in high variance of the derived risk measures.

In the aforementioned studies endogeneity of the risk aversion measure may be a

problem. This problem is addressed in Brown et al. (2011), who wonder whether their risk

aversion measure is not simply capturing the individual’s unobserved characteristics which

are not included in their model. In accordance with the assumptions of previous literature

they find that this is not the case.

This is not the only endogeneity issue; the direction of causality between the degree

of risk aversion and self-employment status is not self-evident. Perhaps an individual

chooses to become self-employed and this decision influences his degree of risk aversion,

instead of the other way round, as assumed in most previous studies. This reverse causality

issue is investigated in different ways in Brown et al. (2011) and Ekelund et al. (2005)4. In the

end both conclude that the risk aversion coefficient measured in the past is influencing

current employment status. This is consistent with risk aversion as a causal determinant for

future self-employment status, as is also assumed in the present study. Therefore, any

endogeneity issues will not be taken into account in the final model of this study.

3 In the study of Cramer et al. (2002) the degree of risk aversion is derived long after the occupational

choice has been made 4 Ekelund et al. (2005) investigate whether a longer tenure in self-employment would result in a lower

degree of risk aversion. Because data of tenure in self-employment was not available, they approximated this relationship with tenure in current employment as a proxy for tenure in self-employment. Brown et al. (2011) were able to measure an individual’s risk attitude prior and after becoming self-employed.

10

However, research based on the reverse relationship, i.e. in which way self-

employment status influences risk attitude cannot be ignored at this point (see for example,

Hartog et al. (2002) and Masclet et al. (2005)). The main difference between the two

approaches can be explained by different research interests in the papers employing the

different models. When self-employment status is used as a determinant for risk aversion, it

is just a control variable in order to get a clear view of the effect of another variable on risk

aversion5. This is not the case in the present study, but the model is used in the discussion of

chapter 6.

2.3.3 Available variables and their effect on self-employment probability

After the above literature investigation, the following variables (among a degree of risk

aversion) are selected from the LISS panel to be included in a reduced form equation to

estimate the probability of self-employment: gender, age, marital status, number of children,

level of education and whether an individual owns a dwelling. Why these variables are

included is discussed in what follows and how these variables are included is addressed in

paragraph 3.4.

2.3.3.1 Gender

In previous research, gender appeared to be a main determinant of the probability to become

self-employed: men are significantly more often self-employed than women. This is explained

in many ways; only the main reasons are highlighted here. Rosa et al. (1995) find that

businesses of self-employed females underperform compared to men-owned businesses by

looking at standard performance measures. This may decrease the likelihood to survive for a

self-employed female and therefore decreases the probability that a female is self-employed.

Burke et al. (2001) suggest that the social implications of gender (still) play an important role:

women are regarded as responsible for the upbringing of the children and other domestic

commitments. Women make occupational choices to complement with these duties, which is

better possible with (part-time) wage-employment than with self-employment. Another

argument is given by Georgellis and Wall (2005): men are more willing to give up wage-

employment just because of the difference in earnings with self-employment, while women

attach more value to non-wage aspects of a job. Simoes et al. (2013) complement this

argument by stating that women are engaged in different sectors with fewer possibilities to

become self-employed and with a higher job satisfaction, resulting in a lower probability of

self-employment. Finally, Koellinger et al. (2014, in press) state that even after controlling for

all the aforementioned effects women still show a lower propensity to become self-employed

because of different subjective perceptions, for example about their entrepreneurial ability,

5 In these papers the reverse causality issue is not mentioned.

11

than men. All in all a gender dummy variable should be included in the final model of the

present study.

2.3.3.2 Age

Age is a possible determinant for the probability of self-employment as an approximation for

labor market experience: self-employment probabilities increase with labor market

experience. Le (1999) gives two possible explanations for this effect. Firstly, as labor market

experience increases, accumulated capital also increases. This makes it possible to

overcome the liquidity constraints associated with setting up one’s own business. Secondly,

labor market experience increases entrepreneurial ability and hence, according to the model

by Praag and Cramer (2001), an individual is more likely to become self-employed. Previous

research did indeed provide evidence that labor market experience, approximated by age,

has a significant positive effect on the probability of self-employment.

Moreover, according to Simoes et al. (2013) a real age effect can also be attached to

the age variable. A positive sign for this variable is explained by a strong desire for more

flexible employment, a way to avoid mandatory retirement and limiting health status that

makes it impossible to have a full-time job. In some research a negative sign above a certain

threshold value is also found. This can be explained by a higher level of risk aversion, not

being able to work many hours associated with self-employment and less time to recover the

invested capital in one’s own business. The different found signs could imply a non-linear

relationship and therefore encourage including both an age term and a quadratic age term in

the final model of the present study.

2.3.3.3 Marital status and number of children

In previous research a married individual appeared to have a higher probability to be self-

employed. This can be explained in several ways according to Le (1999). Firstly, marriage is

associated with stability and emotional support and therefore a good background for the risk

bearing entrepreneur. Secondly, entrepreneurs face the possibility that their employees shirk.

Employing one’s spouse in the business prevents this last phenomenon because both the

spouse and the entrepreneur have the same incentive of maximizing family profit. Thirdly, a

married individual faces less financial constraints to start a business because of possible

sharing of their accumulated capital.

However, also a negative effect for married individuals on self-employment probability

is found. According to Simoes et al. (2013) this can be explained by the specialization

hypothesis of the neoclassical theory of family interpreted in a self-employment context.

According to this hypothesis, a married couple maximizes joint utility if one of them

specializes in domestic work and the other in self-employment. This implies a negative sign

for the individual who is specializing in domestic work and may encourage including an

12

interaction term between gender and marriage, next to a married dummy variable, in the

used model of the present study.

Previous research found another family characteristic that affects the probability of

self-employment, namely the number of children. A positive sign for this variable is found by

Connelly (1992): women with young children are more likely to choose for self-employment

to lower the cost for child care and to be able to spend time with their children while still

engaging in the labor market. Besides, teenage children are able to help in the parent’s self-

employed business (Georgellis & Wall, 2005). However, a negative sign is also found, for

example by Hundley (2001): when the number of children increases, the amount of time

related to bringing them up may increase, and less time remains to run a business. These

studies encourage that the number of children is included in the final model of the present

study.

2.3.3.4 Level of education

In previous research the level of education comes out as a determinant for the probability of

self-employment. Several channels exist through which this variable affects this probability

(Le, 1999). One possibility, implying a positive sign, is that if the level of education increases,

an individual’s entrepreneurial ability might also increase. According to the model of Praag

and Cramer (2002) this then raises the probability of self-employment. Another possibility,

implying a negative sign, is that higher educated individuals earn higher salaries in the labor

market, which decreases their probability to choose for self-employment. Hence, the net

effect of the level of education variable for the propensity to become self-employed is not

clear at first sight. This is reflected by previous research that reports ambivalent results for

the sign of the level of education variable. According to Simoes et al. (2013) a u-shaped

relationship is found most often: self-employment probability is high in both ends of the level

of education distribution. This implies that in the present study dummy variables should be

included for different levels of education.

2.3.3.5 Wealth and capital constraints

An individual’s availability to capital influences the decision to become self-employed: an

individual who wishes to start up a business has to overcome the financial constraints

associated with establishing one’s own business6. The importance of this has been shown in

previous research by significance of variables approximating an individual’s wealth (Le,

1999). Perhaps this effect is also partly captured by the age variable because a higher age

implies more accumulated capital to overcome the financial constraints.

6 For example high initial investment, uncertainty in the first period etc.

13

In the present study, inclusion of the financial constraints associated with self-

employment is done by including a dwelling owner dummy variable: owning a house can be

regarded as a proxy for accumulation of necessary wealth to become self-employed.

Moreover, an own dwelling can be used as collateral which increase possibilities for external

funding (Simoes et al., 2013). The same strategy is also carried out in Ekelund et al. (2005)

who find that home ownership is a significant variable with a positive sign in explaining the

probability of self-employment. However, they note that it is hard to distinguish between

home ownership as a cause or as a consequence of self-employment.

Wealth can also be included directly in the model, in the form of the accumulated

value of assets. However, Hartog et al. (2002) showed that this variable is prone to severe

measurement errors. This issue is discussed more extensively in the discussion of chapter 6.

Another problem with including wealth in the model is that wealth influences risk

aversion, as was also mentioned in paragraph 2.1. In this study this problem is partly solved

by comparing the results found using an absolute measure of risk aversion with the results

found using a relative measure of risk aversion, because both measures display different

reactions to changing wealth. This will be discussed more extensively in paragraph 3.2.

2.3.3.6 Family background, not in the LISS panel

Previous research often includes variables about an individual’s family background (see for

example Brown et al., (2011), Dohmen et al., (2011) and Ekelund et al., (2005)). However

these variables appear insignificant most of the times. This suggests that either family

background variables do not approximate one’s character well or that these variables have

also other influences on self-employment status in the opposite direction. However, self-

employment status of an individual’s father is an exception to this. An explanation for

significance of this variable is that either the individual takes over the family business of his

father or that a self-employed father reduces the psychological barrier to become self-

employed. Nonetheless, according to Skriabikova et al. (2014) it is not a problem that the

self-employment status of an individual’s father is not available in the present study. They

find that there is a strong causal relationship between risk attitude and the decision to

become self-employed and that this relationship is not driven by parental background

variables, like a self-employed father.

2.3.3.7 Unemployment status

Caliendo et al. (2009) find that the effect of risk aversion differs between individuals

becoming self-employed out of salaried work and individuals becoming self - employed out of

unemployment. They find that risk aversion is a significant variable only for the first group.

This can possibly be explained by the fact that already employed workers have more to lose

by giving up their job for self-employment and they can therefore be considered as more

14

willing to bear risk. In the present study this is partly taken into account by comparing the

results found using subsamples with and without unemployed individuals, as will be

discussed in paragraph 3.5.

2.4 Research question and hypotheses

The present study sought whether it is possible to predict an individual’s self-employment

status based on the aforementioned characteristics and a measure of risk aversion derived

from a partly incentive compatible lottery experiment. An affirmative answer can be expected

from previous research, as can be read in paragraph 2.3.2 and from the close relation

between risk taking behavior and an experimentally derived risk aversion measure, as is

discussed in paragraph 2.2. This induces the following two hypotheses:

A risk aversion measure from an incentive compatible lottery experiment is a significant

explanatory variable in predicting self-employment status (I) with a negative sign (II) in a

reduced form equation for this relationship.

3. Data and methods

Chapter 2 gives a good theoretical base to estimate a reduced form equation with the data

available in the LISS survey. In this section firstly the used LISS data sample is described.

Subsequently, the design of the conducted experiment and how a risk aversion measure can

be derived from that experiment is discussed. Thereafter follows the maintained approach to

find the final model for the probability of self-employment and the operationalization of the

included variables in that model. Lastly, the creation of subsamples for robustness checks is

considered.

3.1 Data: the LISS panel7

In this paper use is made of data of the LISS (Longitudinal Internet Studies for the

Social sciences) panel administered by CentERdata (Tilburg University, The

Netherlands). The LISS panel is a representative sample of Dutch individuals who participate

in monthly internet surveys. The panel is based on a true probability sample of households

drawn from the population register. Households that could not otherwise participate are

provided with a computer and internet connection. This procedure increases external validity

of research results based on the LISS panel by trying to prevent a biased main sample. In

this paper the data of the surveys of the year 2010 about background characteristics, work,

education, housing and assets are used. Moreover, the results of an experiment of the LISS

7 Based on obligatory description from CentERdata retrieved from:

http://www.lissdata.nl/assets/uploaded/References_LISS.pdf

15

panel conducted in December 2009 are used to derive three degrees of risk aversion for

each individual, which is discussed in paragraph 3.28.

3.2 Deriving different measure of risk aversion based on the LISS lottery

experiment

The experiment of the LISS panel aims to measure a participant’s attitude towards risk by

letting him choose five times between a varying sure payoff and a lottery that pays €65 or €5

with equal probability, so with an expected pay off of €359. The sure payoff varied from €20

to €40 in steps of €5. It was not possible to select indifference between the lottery and the

sure payoff.

This experiment is varied in a few ways to make sure that the derived degrees of risk

aversion are not measured with any systematic errors. Firstly, approximately 30% (10%) of

the participants made their choices for potentially normal (low) real payoffs and 30% {30%}

for hypothetical pay offs {scaled up by a factor of 150}. At the beginning of the experiment

the participants in the real payoff condition were encouraged to choose the option they really

preferred by a message that the decision in every option could determine the compensation

for the experiment. Only at the end of the experiment the participants in the real payoff

condition were told whether they won a prize and how much the prize was, this to ensure

incentive compatibility during the entire experiment. Noussair et al. (2013) use the data of the

same experiment and state that there are no differences between the results of the different

payoff conditions, which is in line with the results of previous incentive compatible lottery

experiments (for example Dohmen et al, 2011). This is discussed more extensively in

paragraph 4.1.

Moreover the side of the screen on which the lottery and sure payoff appeared is

varied. The lottery was displayed on the right side for one half of the subjects and on the left

side for the other half of the subjects. Furthermore one half of the subjects started with the

choice between the lottery and a sure payoff of €40 (€20) which decreased (increased) in

five steps to the sure payoff of €20 (€40).

From the choices of each subject three measures for the degree of risk aversion are

derived. The first measure of risk aversion is the frequency a participant chose the sure

payoff. This number can vary from zero to five, because the participant has to decide five

times between the lottery and the sure payoff. The higher the number, the more risk averse

the participant is. This measure is a simple measure, i.e. it is not derived using the utility

8 Data are retrieved from panel 1, 6, 9, 11, 38 and 81.

9 Harrison and Rutström (2008) find that this kind of experiment is reliable to determine a participant’s

coefficient of risk aversion. However, some discrepancy between the derived degree of risk aversion and an individual’s “real” degree of risk aversion will of course exist.

16

theory in paragraph 2.1. However, Kapteyn and Teppa (2011) show that a simple measure

may represent an individuals’ degree of risk aversion better than a sophisticated measure

based on economic theory, because the last measure makes unlikely assumptions about the

financial capabilities of an individual.

The other two measures are based on the following principle. Each participant

switches between the lottery and the sure payoff at a specific amount. For example, a

random selected participant preferred the lottery above a sure payoff of €25, but preferred a

sure payoff of €30 above the lottery. Due to the fact that the experiment did not collect data

about the sure payoff which made a participant indifferent between the lottery and the sure

payoff, the participant’s certainty equivalent is set to €27.5. This is the average of the last

value at which the participant preferred the lottery and the first value at which the participant

preferred the sure payoff. The bias which is created by this procedure can be considered as

normally distributed and is therefore not influential to the results.

Perhaps some extreme risk averse (risk loving) participants preferred the sure payoff

rather than the lottery in all instances. For these participants the certainty equivalent is set to

€42.5 (€17.5), the average of the highest (lowest) sure payoff and the logical next sure payoff

which was not included in the experiment. It’s unknown whether the switching point of these

participants would still be the same if the next option had been included.

Subsequently, the following equation is solved for each participant10:

(5)

The chosen utility function which is substituted for u in equation 5 is from both the power

utility family (see 3) and the exponential utility family (see 4). Equation 5 is solved for the

parameter θ. Then, using the found θ of the exponential (power) utility family, the coefficient

of absolute (relative) risk aversion is computed. These two different measures are derived to

take into account the common empirical finding of increasing relative risk aversion and

decreasing absolute risk aversion in wealth (Pratt, 1964). Moreover, all three risk aversion

measures can be used to check the robustness of the findings. This is useful because

Kapteyn and Teppa (2011) state that the way risk is measured may influence the found

results.

10

For participants in the high hypothetical payoff condition these numbers are scaled up with a factor k=150.

17

3.3 Procedure to derive a model for the relationship between risk and self-

employment

A top-down approach is used to see which of the explanatory variables of paragraph 2.3 are

included in the final model: the first model includes all variables which are meaningful

according to that paragraph and at the same time available in the LISS panel. This model is

refined to the final model with diagnostic tests indicating whether these refinements are

improvements.

The choice to become self-employed is modeled in the following way:

[ ] ) with )

(6)

In equation 6 is the systematic part in which is a vector summarizing the

different control variables and is one of the above three measures of risk aversion11. is

the individual specific effect which takes the possibility into account that individuals with the

same value of the systematic part may choose differently about becoming self-employed. For

the density function F either the standard normal density function (probit model) or the logit

density function (logit model) is chosen. Both models are non-linear and therefore the

method of maximum likelihood is used to estimate the coefficients.

3.4 Operationalization of the variables

To estimate equation 6, the dependent, control, and risk aversion variables are

operationalized as is shown in table 1.

11

So each equation is estimated three times, each time with a different measure of risk aversion for r i.

18

Table 1

Variables used to estimate the reduced form equation

Dependent variable Operationalization

SE Self-employment dummy: is one if an individual

answered affirmative on the question “Are you self-

employed or do

you work in a family business, or are you a

freelancer?

Independent variables: control

variables

MALE Dummy variable which is one for males.

AGE The age of an individual.

AGESQ The quadratic age of an individual.

MARRIED Is one for a married individual.

NUMCHILD12 The number of children of an individual.

VMBO, HAVOVWO, MBO, HBO,

WO 13

Dummy variables for the highest attained grade; is

one when the highest level of education is of that

type. Reference group: primary school.

HOMEOWNER Is one when an individual owns a home.

Independent variable: risk aversion

measures

NUMSAFE, ARE, RRP Degree of risk aversion, respectively: NUMber of

SAFE choices, Absolute Risk aversion measure

based on Exponential utility, Relative Risk aversion

measure based on Power utility.

3.5 Creating subsamples for robustness checks

5788 LISS panel data members were selected to participate in the lottery experiment

described in paragraph 3.2. 3457 members completed the part of the experiment (59.2%)

used in this paper. The 464 participants (13.4% of the complete responses) who switched

more than once between the sure payoff and the lottery are deleted from the sample.

Furthermore, the 103 participants (2.9%) who preferred a certain payoff above the lottery at

12

As will be discussed in paragraph 4.3 this variable is not included in the final model because it does not measure the number of children of the individual i but the number of children of the household head. 13

Dutch educational grades, respectively: preparatory intermediate vocational school (VMBO), junior/senior high school (HAVOVWO), intermediate professional education (MBO), higher professional education (HBO), academic education (WO).

19

one point in the experiment, but switched to the lottery when the sure payoff increased are

eliminated as well. Both behaviors violate the assumption of rational preferences. This

procedure results in 2890 useful observations. After merging the experimental results with

the participant’s personal characteristics necessary for the analysis, 2651 observations

(76.7% of the complete responses) are left. There are no signs that the unmerged and

eliminated observations differ in some important way from the kept observations. The sample

contains sufficient observations to say that the study is set up successfully for testing the

hypotheses.

From the primary sample of 2651 observations, five subsamples are created. First,

the sample is split by creating two subsamples consisting of 1043 (1608) participants who

were in the real (hypothetical) payoff condition. Another three subsamples are created out of

these two subsamples and the primary sample by eliminating the individuals whose job

status is unknown14. In the primary (real payoff condition) {hypothetical payoff condition}

sample 819 (322) {497} of those individuals are present. For 693 (272) {421} of those

individuals can be derived that they can be classified as unemployed. This means that they

have not had a job for at least four years or that they are studying and older than 26, but that

they used to work in the past.

As discussed above, many observations are lost by creating the subsamples. In order

to take advantage of the high number of observations in the primary sample, this sample is

considered as the basis of a more thorough analysis for the relationship between self-

employment status and risk aversion15.

The reason for the creation of the subsamples is threefold. Firstly, Caliendo et al.

(2009) found that the effect of risk aversion on the decision to become self-employed is

different for unemployed individuals as compared to wage-employed individuals16. Because

the group whose job status is unknown consists mainly of unemployed individuals, a different

result might be found by analyzing a subsample without them. Secondly, several reasons

exist for the fact that an individual does not report its job status. This makes the interpretation

of the effect of the risk aversion measure on self-employment status unclear in the full

sample, because in that case the group of individuals who is not self-employed is a mixture

of wage-employed individuals, unemployed persons, pensioners, workers in the black

14

This means that both unemployed individuals and individuals that did not report their job status are left out. 15

However, by using the full sample the incentive compatible aspect of the risk aversion measures is partly decayed. This implies that the argument based on incentives in favor of the hypotheses becomes less strong. 16

In the present study the real influence of this cannot be measures because no data are available of individuals transitioning to self-employment from unemployment. However, the effect may still be present by a smaller degree of risk aversion of the unemployed individuals.

20

economy etcetera17. Thirdly, it may be possible that the degrees of risk aversion of the

participants are influenced by the payoff condition of the experiment, as was also found by

Holt and Laury (2002), and hence the observations from the different conditions are

separated. The differences between the samples are summarized in table 2, which can be

found in the appendix.

4. Results

Now that the methods are discussed, it is time to see what results are found using these

methods. Firstly, the derived degrees of risk aversion are compared between the

experimental payoff conditions. Secondly, sample characteristics of self-employed individuals

are compared with those of not self-employed individuals. Thirdly, the final model is derived

and some diagnostic tests are carried out to review the performance of that model. Fourthly,

it is checked whether the found results hold in the created subsamples and robustness with

respect to the operationalization of the self-employment variable is discussed. Fifthly, the

influence of potential outliers on the results is considered. Finally, the problem of and

potential solutions for heteroskedasticity are addressed.

4.1 Comparing the derived risk aversion measures by payoff condition

Regarding the derived degrees of risk aversion is expected that these may differ between the

experimental payoff conditions. This expectation is confirmed18 by t-tests as displayed in

table 3. Moreover, results of the non-parametric Wilcoxon rank-sum test, carried out

because normality of the data is doubtful, agree with this (r = number of safe choices: W = -

4.660, p < .000, r = relative risk aversion: W = 4.636, p < .000, r = absolute risk aversion W =

-13.426, p < .000).

In the introduction was mentioned that Camerer and Hogarth (1999) found that

adding incentives to a risky choice question reduces the variance of the results. This

statement implies a lower variance for the degrees of risk aversion derived from the real

payoff condition in comparison with those from the hypothetical payoff condition. This

statement appears untrue for all derived risk aversion measures, as can be seen in the last

column of table 6 (r = number of safe choices: F(1043, 1608) = 1.079, p < .914, left-tailed, r =

relative risk aversion: F(1043, 1608) = 1.310, p < 1, left-tailed, r = absolute risk aversion

F(1043, 1608) = 1.088, p < .935, left-tailed). Results of the non-parametric Kruskal Wallis

variance test, again carried out because normality of the data is doubtful, show that the

variance does differ between the payoff conditions (r = number of safe choices: K = 19.112, p

17

So eliminating them makes the group of not self-employed more homogenous, which is also done in for example Brown et al. (2011). 18

Assuming the 5% significance level for this entire paper.

21

< .000, r = relative risk aversion: K = 173.767, p < .000, r = absolute risk aversion K =

18.912, p < .000). However, by the bigger standard errors in the real payoff condition

displayed in table 3 can be derived that this difference is not in the direction expected from

the research of Camerer and Hogarth (1999).

The above results have the following implications for this research. Testing whether

the results found in the primary sample still hold in a subsample split up by payoff condition

matters, because both parametric and non-parametric test statistics indicate that there may

be differences between the samples. However, results with a higher level of significance

because of incentives in the lottery experiment are not expected anymore since the incentive

compatibility appears not to have reduced the variance of the derived risk aversion

measures, as confirmed by both parametric and non-parametric test statistics.

Table 319

Summary statistics of the risk aversion measures for the different payoff conditions

4.2 Differences between self-employed and not self-employed individuals

In table 4 within sample differences between self-employed and not self-employed

individuals are summarized. In the last column of this table the p value for equal means

based on the Wilcoxon rank sum test is displayed. This test is used instead of the regular t-

test because the number of self-employed individuals is small and therefore it is risky to

assume normality. Moreover, in the third column is displayed whether the difference between

the means of self-employed and not self-employed individuals are in line with the theoretical

part of this paper. In this case the words “Yes and No” mean that previous research found

ambivalent results with respect to the sign of this variable.

19

* means significant at a 10% level, ** means significant at a 5% level, *** means significant at a 1% level.

Only hypothetical Only Real

N 1608 1043

Risk aversion measure Mean Std. Err. t (∞) p Mean Std. Err. F (1043, 1608) p (H1: F<1 )

Number of safe choices 3,632 1,730 4,139*** 0,000 3,340 1,797 1,079 0,914

Absolute (exponential utility) 0,014 0,001 -13,371*** 0,000 0,027 0,001 1,310 1,000

Relative (power utility) -0,615 0,014 4,075*** 0,000 0,525 0,018 1,088 0,935

22

Table 4

Comparison between self-employed and not self-employed individuals20

From table 4 can be derived that the means of all background characteristics between

the group of self-employed individuals and the group of not self-employed individuals are in

line with theory. However, none of the differences is significant, except for the difference in

level of education (p < .005, two-tailed).

It will be recalled that self-employed individuals are expected to be less risk averse

than their not self-employed counterparts. As can be seen in figure 2, the distribution of the

relative risk aversion measure tends to be slightly more left skewed than for not self-

employed individuals, indicating that this expectation may be right. The means of all risk

aversion measures in table 4 are also in accordance with that expectation: self-employed

individuals do make less safe choices in the lottery experiment (3.33) than those who are not

(3.53) and have a slightly lower degree of absolute and relative risk aversion (-0.002 and -

0.062 difference respectively). But as mentioned before the differences are not statistically

significant. However, fitting the data in a binary choice model may give different results for

the signs and significance of these variables, because this technique takes into account all

side effects caused by the background characteristics.

20

Level of education is measured as an ordinal variable as maintained by Statistics Netherlands, ranging from 1 (primary school) to 6 (university degree).

Not self-employed Self-employed Expected from theory? p value Wilcoxon statistic

N 2522 129

% Male 48,4% 56,6% Yes 0,0687*

Age 48,955 49,829 Yes and No 0,654

% Married 59,4% 62,0% Yes and No 0,555

% Partnered 76,5% 74,4% Yes and No 0,590

% With children 39,1% 40,3% Yes and No 0,769

Number of children 0,798 0,829 Yes and No 0,784

Education level 3,508 3,891 Yes and No 0,005***

% Own dwelling 74,8% 79,1% Yes 0,281

Number of safe choices 3,527 3,333 Yes 0,221

Absolute risk aversion 0,019 0,017 Yes 0,515

Relative risk aversion 0,582 0,520 Yes 0,219

23

Figure 2

Distribution of the relative risk aversion measure21 for self-employed and not self-employed

individuals

4.3 The final model

Before the final model can be used, some decisions have to be made. Firstly, a decision

about which distribution function F to use has to be made. Unreported results show that in a

model with only the exogenous covariates22 age, squared age and a male dummy the logit

and probit model fit the data on average equally well, regardless the used measure of risk

aversion. Because the choices are very unbalanced, i.e. there are far more not self-

employed individuals in the sample than self-employed individuals, the logit function is

chosen as the cumulative density function in the final model23, since this function has larger

values in its both tails.

Secondly, a decision regarding whether to use normal or robust standard errors has

to be made. Unreported results show that in none of the used models a variable changes

from significant to insignificant or the other way round by changing the used type of standard

errors. Moreover, using normal standard errors increases joint significance and goodness of

fit of the models in some cases, but decreases those in other cases. Because of a lack of a

compelling reason to use robust standard errors, normal standard errors are used. This topic

is elaborated more extensively in paragraph 4.6.

Thirdly, a top down approach is used to derive which control variables to include in

the vector X of equation 6. This approach requires that in the first estimate X contains all

control variables from table 1. The estimation is carried out separately for each measure of

21

The result that the distribution is more left skewed for self-employed individuals is valid for all risk aversion measures. 22

Only purely exogenous variables are used in order to prevent any interfering influences on the model shape function from possibly endogenous variables. 23

Unreported results show that all results of the present study are the same if the probit model had been used.

0.1

.2.3

.4.5

Frac

tion

-.5 0 .5 1pown

self-employed not self-employed

24

risk aversion24. The estimation results of this model, from now on called the full model, are

shown in column 1 to 3 of table 5. From this table can be deduced that only the age and

squared age variable are significant in explaining self-employment status. The other control

variables in X and the risk aversion measure are highly insignificant and some have the

wrong sign. The dummy variables for level of education all have a different effect which was

expected from the theory. However, a likelihood ratio test on joint significance shows that

these dummies are jointly insignificant regardless the risk aversion measure used (r =

number of safe choices, χ2 (5) = 7.17, p < .21, r = absolute risk aversion, χ2 (5) = 7.28, p <

.20, r = relative risk aversion, χ2 (5) = 7.18, p < .21). As can be seen in column 4 to 6 in table

5, leaving out these dummy variables results in a significant male dummy variable at a 10%-

level with insignificant other control variables, regardless the used measure of risk aversion.

The top down approach now requires that the own dwelling dummy is eliminated. As can be

seen in column 7 to 9 this doesn’t result in a significant married variable, so lastly this

variable is left out as well, resulting in the purely exogenous model.

Table 5

Results of the different models proposed by the top down approach

To see whether the top-down approach has resulted in a well-specified model, a logit

regression of self-employment status on the by the exogenous model predicted value (ŷ) and

on the squared predicted value (ŷ2) is carried out. The results of this test are striking: ŷ is

insignificant in the exogenous model, regardless the used degree of risk aversion (r =

number of safe choices, Z = 0.01, p < .99, two-tailed, r = absolute risk aversion, Z = -0.31, p

< .67, two-tailed, r = relative risk aversion, Z = 0.03, p < .97, two-tailed). This indicates that

there is a severe misspecification problem present in the exogenous model. Adding back all

the control variables from the first estimation slightly improves the results of this test, but ŷ

remains highly insignificant. Another possible solution is to add some interaction variables to

account for different effects of the risk aversion measure between groups, as was also done

24

Please note that the number of children variable is not in X because after inspection this variable appeared insolvably distorted.

1 2 3 4 5 6 7 8 9

Risk aversion measure NUMSAFE ARE RRP NUMSAFE ARE RRP NUMSAFE ARE RRP

Coefficient -0,0301733 -0,9298816 -0,1034671 -0,03662 -1,32409 -0,12255 -0,036849 -1,41454 -0,1228

p - value risk aversion measure 0,552 0,806 0,519 0,468 0,726 0,441 0,464 0,708 0,439

Covariate included? Significance(1%/5%/10%-level)

Exogenous variables (AGE, AGESQ, MALE) Yes; Except MALE Yes; Except MALE Yes; Except MALE Yes; 10% Yes; 10% Yes;10% Yes; 10% Yes; 10% Yes; 10%

5 level of education dummies Yes; No Yes; No Yes; No No No No No No No

Married dummy Yes; No Yes; No Yes; No Yes; No Yes; No Yes; No Yes; No Yes; No Yes; No

Own dwelling dummy Yes; No Yes; No Yes; No Yes; No Yes; No Yes; No No No No

MC Fadden's R squared 0,0296 0,0293 0,0297 0,0227 0,0223 0,0228 0,0221 0,0218 0,0228

25

in previous research (see for example, Hundley, 2001). Based on previous research

interaction terms between the risk aversion measure and the different educational

categories, degree of urbanization categories, civil status and gender are added.

Nonetheless, this still results in a misspecified model according to insignificance of the

predicted value of ŷ (results of these tests are available upon request). This might indicate

that the real explanatory variable for predicting self-employment status is missing in all

previous tried out models. Considering the theoretical part this may be a variable which

measures entrepreneurial drive, the ability to overcome the financial constraints associated

with becoming self-employed or some family background characteristic, which are all not

available for this sample. Another explanation could be that the risk aversion measures

derived from the lottery experiment are not a good proxy for real risk taking behavior. This is

discussed more extensively in chapter 6.

But not all is lost for the full model. Perhaps its goodness of data fit is better than that

of the exogenous model. In table 6 both models are compared by several data fit diagnostics.

As can be derived from this table, some statistics support the exogenous model and other

statistics support the full model. But all in all a conclusion can be drawn from table 6: the

difference in the Bayesian information criterion of approximately 47 between the two models

gives strong support for using the exogenous model (Kass & Raftery, 1995), regardless the

used risk aversion coefficient.

Table 6

Comparison of goodness of data fit between the full model and the exogenous model

4.4 Results found using the final model

The results found by using the exogenous model proposed by the top down approach are

displayed in table 7. As can be seen, the signs of the three different risk aversion measures

are in line with theory and confirm the second part of the hypothesis. This could already be

NUMSAFE ARE RRP

Full model Exogenous model Difference Full model Exogenous model Difference Full model Exogenous model Difference

Log-Likelihood Full Model -500,499 -504,652 4,153 -500,644 -504,855 4,211 -500,468 -504,621 4,153

Likelihood ratio (d.f.) 30,524 (11) 22,217(4) 8,307(7) 32,070(11) 23,707(4) 8,362(7) 30,585(11) 22,280(4) 8,305(7)

Prob > LR 0,001 0,000 0,001 0,001 0,000 0,001 0,001 0,000 0,001

McFadden's R2 0,030 0,022 0,008 0,029 0,021 0,008 0,030 0,022 0,008

McFadden's Adj R2 0,006 0,012 -0,006 0,006 0,011 -0,005 0,006 0,012 -0,006

Maximum Likelihood R2 0,011 0,008 0,003 0,011 0,008 0,003 0,011 0,008 0,003

Cragg & Uhler's R2 0,036 0,026 0,010 0,035 0,025 0,010 0,036 0,026 0,010

McKelvey and Zavoina's R20,087 0,071 0,016 0,087 0,070 0,017 0,087 0,071 0,016

Efron's R2 0,011 0,008 0,004 0,011 0,007 0,004 0,011 0,008 0,003

Variance of ŷ 3,604 3,540 0,063 3,602 3,538 0,065 3,604 3,541 0,063

AIC 0,387 0,384 0,002 0,387 0,385 0,002 0,387 0,384 0,003

BIC -19801,427 -19848,299 46,872 -1980,137 -19847,849 46,757 -19801,489 -19848,362 46,874

26

expected from the results in table 4, which display the same pattern of lower risk aversion for

self-employed individuals. However, all risk aversion coefficients are insignificant when

tested with the regular t-test, which rejects the first part of the hypothesis. This may be

caused by the large standard errors due to the misspecified model. Furthermore, MC

Fadden’s R2 is very low while the likelihood ratio test on joint significance indicates that the

used model is statistically significant (r = number of safe choices, χ2 (4) = 22.22, p < .000, r =

absolute risk aversion, χ2 (4) = 22.28, p < .000, r = relative risk aversion, χ2 (4) = 22.41, p <

.000). This combination of statistical significance with a low R2 is typical for binary choice

models about individual behavior: the model does not work well in describing individual

decisions but is able to show an overall pattern of the behavior. Besides, the (unreported)

classification table of this model reflects very small predictive power: the complete sample is

classified as not self-employed, i.e. none of the self-employed individuals is classified as self-

employed, regardless the used measure of risk aversion.

Table 7

Results final model

It is interesting to see whether the aforementioned results are robust when the final

model is estimated using the different subsamples described in paragraph 3.5. As shown by

table 8, eliminating the observations with missing job status does not alter the signs of the

risk aversion measures. However, not displayed in table 8, this makes the exogenous

variables insignificant.

Table 8

Robustness checks for the results of the final model using a subsample without individuals

with missing job status

Risk aversion measure NUMSAFE ARE RRP

Coefficient risk aversion measure -0,039 -1,613 -0,129

Sign in line with theory Yes Yes Yes

p value risk aversion measure 0,439 0,668 0,415

Joint significance (p value LR-test) 0,000 0,000 0,000

Goodness of fit (p value Pearson chi squared) 0,968 0,979 0,846

Mc Fadden's R squared 0,022 0,021 0,022

Subsample Full no job status excl. Full no job status excl. Full no job status excl.

Risk aversion measure NUMSAFE ARE RRP

Coefficient risk aversion measure -0,035 -0,636 -0,119

p value risk aversion measure 0,501 0,707 0,464

Mc Fadden's R squared 0,033 0,032 0,033

Result in line with results of the full sample yes yes yes

27

The same applies when only participants of the real payoff condition are considered25,

as can be seen in table 9. Deleting the observations with missing job status from this sample

does not make a difference with respect to the sign or significance of the risk aversion

measure26, as the values in the table 9 show.

Table 9

Robustness checks for the results of the final model using the real payoff condition

subsample

If merely the participants within the hypothetical payoff condition are considered the

age and squared age variables remain significant27, but the male dummy variable becomes

insignificant. In addition, all signs of the risk aversion measures change from negative to

positive, but still all risk aversion measures remain insignificant, as can be seen in table 10.

Deleting the observations with missing job status from this sample does not alter these

results, except for an insignificant squared age variable.

Table 10

Robustness checks for the results of the final model using the hypothetical payoff condition

subsample

Because the number of safe choices variable has a straightforward interpretation and

showed the best performance in previous research, this variable is investigated more

thoroughly. From this risk aversion measure three dummy variables are created. The first

(second) {third} one has the value one when a participant makes less than three (two) {one}

25

However, the exogenous variables age, squared age and the male dummy variable remain significant at the 10% significance level using this subsample. 26

But the exogenous variables are now also not significant anymore using the 10% significance level. 27

At a 5% significance level in contrast to the 1% significance level in the full sample.

Subsample Real Real Real Real no job status excl. Real no job status excl. Real no job status excl.

Risk aversion measure NUMSAFE ARE RRP NUMSAFE ARE RRP

Coefficient risk aversion measure -0,070 -4,752 -0,236 -0,057 -3,711 -0,205

p value risk aversion measure 0,341 0,369 0,306 0,453 0,495 0,391

Mc Fadden's R squared 0,049 0,049 0,050 0,046 0,046 0,047

Result in line with results of the full sample yes yes yes yes yes yes

Subsample Hypothetical Hypothetical Hypothetical Hypothetical no job status excl. Hypothetical no job status excl. Hypothetical no job status excl.

Risk aversion measure NUMSAFE ARE RRP NUMSAFE ARE RRP

Coefficient risk aversion measure 0,010 0,612 0,036 0,006 1,318 0,021

p value risk aversion measure 0,883 0,911 0,087 0,938 0,811 0,926

Mc Fadden's R squared 0,009 0,009 0,009 0,029 0,029 0,289

Result in line with results of the full sample no no no no no no

28

safe choice(s), because a participant can to a certain extent be considered as risk loving in

that case. Unfortunately, unreported results show that all these three dummy variables are

also insignificant in explaining self-employment status regardless which control variables

from table 1 are included.

The last thing which is tried to improve the post estimation performance of the

exogenous model is using a different definition of self-employment. In all previous models an

individual was characterized as self-employed when working as a self-employed, a

freelancer or when owning a family business. Now two new self-employment variables are

created. In the first one the freelancers from the original group are eliminated in order to

ensure homogeneity of the investigated group, namely only self-employed individuals with

staff. In the second one, independent professionals28 are added to the original group of self-

employed individuals to see whether including them alters the results. A third one could

exclude family business owners from the group of self-employed. These individuals might

have inherited the family business and therefore their self-employment status is not the result

of a low degree of risk aversion. Because only three family business owners are present in

the sample, this definition is not put to the test.

Unreported sample statistics show that there are only extremely small differences

between the original group and the two new created groups of self-employed individuals.

Table 9 shows the results of the logit regression using the exogenous model with the

different operationalizations of self-employment. The signs of the risk aversion measures

when independent professionals are included are similar to those of the original

operationalization. However, when the freelancers are left out, all risk aversion measures are

predicted with the wrong sign as shown in table 11. This last result is exactly the opposite of

the result when the original definition of self-employment is used. This implies that the results

are extremely sensitive to the definition of self-employment. Please note that still all risk

aversion measures appear insignificant.

Table 11

Robustness checks for the results of the final model for different operationalization of self-

employment

28

A clear description of this group lacks in current literature.

Operationalization of self-employment Minus freelancers Plus independent professionals

Risk aversion measure NUMSAFE ARE RRP NUMSAFE ARE RRP

Coefficient risk aversion measure 0,038 1,525 0,128 -0,032 -2,196 -0,103

p value risk aversion measure 0,064 0,740 0,532 0,484 0,518 0,475

Mc Fadden's R squared 0,025 0,025 0,025 0,025 0,025 0,025

Result in line with results of the original definition no no no yes yes yes

29

4.5 Detecting influential observations

The unsatisfactory results of paragraph 4.4 may be caused by outliers and influential

observations, because logit models are very sensitive to both. Whether these are present in

the full sample is checked using the exogenous model with the absolute risk aversion

measure in this section29.

Looking to observations which are more far away from other observations may gain

insight in whether outliers are present30. Pearson residuals are displayed in figure 3,

deviance residuals are displayed in figure 4 and the observations’ leverages are displayed in

figure 5, all against their id number. From these graphs can be concluded that there are

indeed some observations with high residual values, but that their influence on the estimates

is quite small as can be read from figure 5 (leverage < 0.03).

Figure 3 Figure 4

Pearson residuals against id number31 Deviance residuals against id number31

Some observations may be outliers in the sense that they have big impact on the

goodness of fit of the model. To detect them, an approximation of the difference in the

Pearson χ2-statistic for each observation is computed. In figure 6 each dot is an observation,

and the belonging y-value represents the (approximation of the) reduction of the value of the

Pearson χ2-statistic if that observation is eliminated. As a rule of thumb is used that an

observation is regarded as an outlier if its influence on the χ2-statistic is twice as big as the

mean influence. Unreported results show that leaving out these observations slightly

improves the goodness of fit of the exogenous model.

29

Unreported results show that the results of this paragraph are invariant to the used degree of risk aversion. 30

The more common method of looking to observations which are far away from their predicted probability is not used because of the bad post estimation performance of the exogenous model. 31

A one (zero) next to an observation indicates that an individual is (not) self-employed.

0000000

0

00

00

00000000000000

000000000

0

1

00000000

00000

0000

0

0000000000000

0

000

0

000000

0

000000

0

0

0

000000

000

0

00

0

0

00

0

0

000000

1

00000

0

00

0

0000

0

1

00

0

000000000000

0

00

0

000

0

0000000

0

00

0

1

0

0

1

0

000000000

1

0000

000

0

00000

00

0

00

1

0

0

1

000

1

0

000

0

0

0

0

0

00

00000

0

000

0

00

0

1

0

0

0

0

000

0

000

0

00000

000

1

0

1

00

0

0

0

0

0

0

0

0

0

0

0

0

000000000000000

0

0000000

0000000

0

00000

0

0

00

00000000

0

00

000

1

000000000

0000

11

0000

0

00

0

0

00

000000000000000

1

00

0

000

0

0000

0

00000

0

0000

0

0

1

000

0

0

0

1

00

0

0

00

01

0

1

0

000

1

0

00000000000

0

000

0

0000000

0000

0

000

0

00

1

0

00

0

0

00

1

00

000000

0

00

0

0

0

0

000

1

00

0

1

000

0

00

00

000000000

0

00

0

000

0

0

00

0

000000000

1

0

0000

0

000

0

1

000000

1

00

0

0

0

000

000

0

000000000000

00

1

0

1

000

1

0000

0000

0

00

0

0000

1

00

1

00

0

00

0

1

0

0

0

0

0

000

0

00

1

00

000

0

000

0

0

00

0

0

00

0

0

0

00

00000

0000

0

0000

0

00000

0

0

000

0

0

0

00000000000

0

0

1

00

0

0

00

0

0

0

0

0

00

0

0

000000000

0

00

0

00

0

00

10

0

0

0

1

000

0

00

0

0000000

0

0

0

0

00

0000

0

0000

0

1

00

1

000000

0

00

0

000

000

0

0

0

1

000

1

0

0

000

0

0000

0

00

0

0

00

0

0

000

0

0

0

00

1

00

00

000000

0

00000

00

0000

0

00

00000000

1

1

000

0

000

0

000000

0

0

00

0

0

1

0

0

0

000

0

0

1

000

0

0

000

0

000

0

0

1

00

0

0

000

00

001

0

1

0000000

0

0

1

00

0

00

0

0

00

0

0

00

00

000000

0

0

0

00

0

0

0

0

0

0

0000000

0

1

00

0

0

00

0

1

0

000000

0

0

00

0

1

0

1

00000

0

0

000

00

0

0000000

0

0

0

00

1

00

0000

0

000

1

000

1

00000

0

0

01

00

0

000

0

1

0

0

0

00

0

0000000000000000000

0

000

0

000

0

00

0

00000

0

000

0

0

0

00

0

0

000000000000000

0

00000000000

0

0000000000

0

0

0

00000

0

0

0

0

0

0

0

00000

0

0

00

1

0

1

000

0

0000

0

00000000

00

0

000

0

00

0

000

0

0

0

0

0

0

0

00000

0

000

0

00

0

1

0

0

0000000000000

0

0

000000

1

000000000

0

000000

0

0

0

00

0

0

0

000

0

0000

0

1

0

000

0

00

0

000

00

0

000

0

0

0

1

000

00

0

1

000

00000000

00

0

00000

1

0

0

0

1

0

0

0

0

0

00000

0

0

0

00000

0

00000

0

0

00

0

00

0

0

0

00

1

00000000

0

0

000

0

00000000000

0000

0

1

0

0

0000

000

0

00

0

0000

00000

0

0000

0

0

0

0

00

0

0

0

0

0

0

0

0000

0000

00

000000000000

1

0

0

0

0

0

1

0

0

00000

0

00

0

1

00

0

0

000

0

000000

0

00

10

000000

1

000

1

0000000

0

0

0

0

00000

0

0000000

0

0000000

0

00000

0

000

0

0

0

000

0

0

0

0

1

000

0

000

000

0

00

00

0

00

1

00000000

0

0000

0

000

0

00000

0

0000

0

000

0000

0

000

00000000

1

0

0

00

0

0

0

0

000

0

1

000

0

0000

0

0

0

00

0000000

0

000

0

00

0

0

0

00000

000

000

1

0

0

00

0

0000000

0

00

0

0

1

00000

0

1

0

0000

0

0

0

1

00

1

0

00

0

00

1

0

0

0

1

00000

0

0

1

0

0

0

0

0

0

0000000

0

000000

0

00

0

0000

000000000000

1

0

0

00

00000000000

0

0000000

0

0000

0000

1

000

1

0000

0

0

0

0

000000000000

0

1

0

00

0

000

1

00

00

0

00

0

0

1

1

0000

00

0

000

0

00000000

0

0

00

0

00000000

0

0

0000000

0

0

00

00

0

0

00

0

00

0

0

0

0

00

0

1

0000000

1

0

0

000

0

000

0

0000

0

0

0

000000

0

0

0000

0

00000

0

00

0

0000

000000

0

00

0

0

00000

00

000

0

00000

0

0

00000

0

000

0

0

00000

0

00

000

1

000

0

00

0

0

0

000

0

1

00000000000000

0

0

00

0

0

0

0

00

0

0

00

1

0

0

0

0

0

00

0

000

0

0

0

0

00000

1

0

1

0

0

0

000000000000

0

00000000000

0

00

01

0

0

0

0000

0

00

0

000

0

0

0

000

1

0

00000000000

0

0000

0

000000

0

00

0

000

0

00000000000000000

1

0

00

0

0

00

1

00

00000

0

000

0

0000

0

00000

00000000000

0

000

0

00000

0

0

0

000

0000

00

00000

0

0

00

0

0

0

000

0

0

000000

1

00

000

1

0

1

0

1

0

0

0

00000000

1

0

0000

1

000

0

0000

0

000000

0

000000000

00

0

00

0

0

0

0

0

000

0

0

0

0

00

0

00

0

0

0

0

0

1

1

0

0

00

00

0

1

0

0

0

00000000

0

0

00000

0

0

0000

0

00

0

0000

0

0000

0

0

0

00

0

0

0

00000000000000

0

00000

1

000

1

00000

0

00

000

0

000000000

0

00000

0

0000

0

0

0000

0

000000

1

0

0

000000

0

0

0

000

0000

0

0

0

000

0

0

0

0

0

100

0

00000000

0

00

0

0

0

0

1

0

0

000

00000

0

0

0

0

00

0000

0

0

0

0

0

0

0

0

0000

0

0

01

000000

0

000

0

000

1

1

0

0

0

0

1

0

0

0

000

0000

1

00000000

00

1

000000

0

00

0

00

00

0

0

00000

000

0

00

1

0

000

000

0

0

1

0

0

0

000000

0

00

0000

0

0000000000

0

-20

24

6

sta

nda

rdiz

ed

Pe

ars

on r

esid

ual

800000 820000 840000 860000 880000 900000Number of the household member encrypted

0

0000

0

0

0

0

0

0

0

00

0

0

0

0000

0000

0

0

0

0000

0

00

0

1

0000

00

00

000

00

0

000

0

000000000

0

000

0

0

0

0

0

00

000

0

0

0

00

0

0

0

0

0

0

0

0

0

000

0

0

0

0

00

0

0

0

0

0

0

0

00

0

0

0

1

0

00

0

0

0

0

0

0

00

0

0

0

1

0

0

0

000

00

00

00

0

00

0

0

0

0

0

0

0

0

0

0

0

0

0

0

0

0

00

0

1

0

0

1

0

0

0

0

00

0000

1

00

0

0

0

00

0

000

00

0

0

0

0

0

1

0

0

1

0

0

0

1

0

0

00

0

0

0

0

0

0

0

0

0

000

0

00

0

0

00

0

1

0

0

0

0

000

0

0

00

0

0

000

0

0

0

0

1

0

1

00

0

0

0

0

0

0

0

0

0

0

0

0

00

00

0

00

0

00

0000

0

0

0000

000

0

000

000

0

0

0

0

00

0

0

0

0

0

000

000

0

0

00

00

0

1

0

0

0

00000

0

0000

11

0

000

0

0

0

0

0

0

0

00

0000

00

0

0

0

0

0

0

0

1

00

0

00

0

0

0

00

0

0

0000

0

0

0

0

0

0

0

0

1

0

0

0

0

0

0

1

0

0

0

0

0

0

01

0

1

0

0

00

1

0

0

0

0000

00

0

0

0

0

0

0

0

0

000

00

00

0

000

0

000

0

00

1

0

0

0

0

0

00

1

00

000

000

0

00

0

0

0

0

0

00

1

00

0

1

0

0

0

0

00

00

000

00

000

0

0

0

0

0

000

0

0

00

0

0

0000

0

000

1

0

0

0

0

0

0

00

0

0

1

00

0

0

00

1

00

0

0

0

0

0

0

00

0

0

000000

0

00000

0

0

1

0

1

0

00

1

0

00

0

0

000

0

0

0

0

0000

1

00

1

0

0

0

0

0

0

1

0

0

0

0

0

00

0

0

00

1

00

00

0

0

000

0

0

00

0

0

0

0

0

0

0

00

0

0

0

0

0

0

000

0

0000

0

0

000

0

0

0

000

0

0

0

00000

0000

00

0

0

1

0

0

0

0

00

0

0

0

0

0

00

0

0

00

00

000

00

0

00

0

00

0

00

10

0

0

0

1

0

0

0

0

0

0

0

0000000

0

0

0

0

0

0

0

0

0

0

0

0000

0

1

0

0

1

0

0

0

0

0

0

0

0

0

0

000

00

0

0

0

0

1

000

1

0

0

000

0

0000

0

0

0

0

0

0

0

0

0

000

0

0

0

0

0

1

00

0

0

0

0

00

00

0

0

0

000

0

0

000

0

0

00

00000

0

0

0

1

1

00

0

0

00

0

0

00

00

0

0

0

0

0

0

0

0

1

0

0

0

0

0

0

0

0

1

00

0

0

0

0

00

0

000

0

0

1

00

0

0

00

0

00

0

0

1

0

1

00

0

0

0

0

0

0

0

1

0

0

0

0

0

0

0

00

0

0

00

0

0

0

00

0

00

0

0

0

00

0

0

0

0

0

0

0

000

000

0

1

00

0

0

0

0

0

1

0

000000

0

0

0

0

0

1

0

1

0

0

000

0

0

00

0

0

0

0

0

000

000

0

0

0

00

1

00

00

00

0

0

0

0

1

0

00

1

000

0

0

0

0

01

0

0

0

0

0

0

0

1

0

0

0

0

0

0

00

0

000

0000

00

00

0

000

0

0

000

0

000

0

00

0

00000

0

00

0

0

0

0

0

0

0

0

00

000000

00000

00

0

0

0000

00

00

0

0

0

0

000

00000

0

0

0

0

00

0

0

0

0

0

0

0

0

0

0

00

0

00

0

0

0

0

1

0

1

000

0

0000

0

0

00

0

0

0

0

0

0

0

0

00

0

0

00

0

00

0

0

0

0

0

0

0

0

0

0

00

0

0

0

00

0

0

0

0

1

0

0

000

000

0

00

0

00

0

0

0

000000

1

00

00

000

00

0

0

0

00

0

0

0

0

0

00

0

0

0

000

0

00

00

0

1

0

0

00

0

0

0

0

00

0

0

0

0

0

0

0

0

0

0

1

000

0

0

0

1

000

0

0

0000

00

00

0

0

000

0

1

0

0

0

1

0

0

0

0

0

0

0

000

0

0

0

0

0

0

0

0

0

00

000

0

0

0

0

0

00

0

0

0

0

0

1

0

0

0

00

0

00

0

0

000

0

00

0

000

0

0000

0

0

00

0

1

0

0

00

00

000

0

00

0

0

000

0

0000

0

00

0

0

0

0

0

0

0

0

0

0

0

0

0

0

0

000

0

00

00

0

0

00

00

00000

0

00

1

0

0

0

0

0

1

0

0

0000

0

0

0

0

0

1

00

0

0

0

0

0

0

000

0

00

0

0

0

1

0

0

00

0

0

0

1

0

0

0

1

00

0

000

0

0

0

0

0

0

0

000

0

00

000

00

0

00

000

0

0

0

00

0

00

0

0

00

0

0

0

000

0

0

0

0

1

0

0

0

0

0

0

0

00

0

0

00

0

0

0

0

0

1

00000

0

0

0

0

0

0

0

0

0

00

0

0

0000

0

0

0

00

0

0

00

0

000

0

0

000

0

0

00

0

000

1

0

0

0

0

0

0

0

0

0

00

0

1

0

00

0

000

0

0

0

0

00

0

0

00000

0

000

0

0

0

0

0

0

0

00

00

00

0

00

0

1

0

0

00

0

0

000

000

0

00

0

0

1

0

00

00

0

1

0

00

0

0

0

0

0

1

00

1

0

0

0

0

0

0

1

0

0

0

1

0000

0

0

0

1

0

0

0

0

0

0

0

0

00

00

0

0

0

0

000

0

0

00

0

0

00

0

000

00

0

0

00000

1

0

0

0

0

00

0

0

00

00000

0

0

0

000

00

0

000

0

0

0

0

0

1

000

1

00

00

0

0

0

0

0

0

00

0

00

0

00

0

0

0

1

0

0

0

0

000

1

00

0

0

0

00

0

0

1

1

00

0

0

00

0

0

0

0

0

00

0

0

000

0

0

0

0

0

0

0000

000

0

0

0

0

0

0

0

000

0

0

0

0

0

0

0

0

0

0

0

00

0

0

0

0

00

0

1

00

0

0000

1

0

0

0

00

0

000

0

00

0

0

0

0

0

00000

0

0

0

0

00

0

0

0

00

00

0

00

0

0

00

0

0

00

000

0

0

0

0

0

0000

0

0

0

0

0

0

0

0

00

0

0

0

0

0

0

0

00

0

000

0

0

0

0

000

0

0

0

0

00

1

0

0

0

0

00

0

0

0

0

00

0

1

000

0

000

00

00

0

00

0

0

0

0

0

0

0

0

0

0

0

0

00

1

0

0

0

0

0

00

0

0

00

0

0

0

0

00

0

0

0

1

0

1

0

0

0

000

0

0

0

0

0

00

0

0

0

0

0

0

0

0

0

0

00

0

0

0

0

0

01

0

0

0

000

0

0

00

0

0

0

0

0

0

0

00

0

1

0

0

0

00

00

0

00

00

0

0000

0

0

0

0

000

0

0

0

0

0

00

0

0

0

0

0

00

00000

000

0

00

1

0

0

0

0

0

0

0

1

00

0

0

00

0

0

0

00

0

0

0

00

0

0

0

00

0

0

0

00

00

00000

0

00

0

0

00

000

0

0

0

0

00

00

0

0

00

0

0

000

0

0

0

0

0

0

0

000

0

0

00

0

00

0

1

00

00

0

1

0

1

0

1

0

0

0

000

000

00

1

0

000

0

1

000

0

0000

0

0

000

0

0

0

0

000

00000

0

0

0

00

0

0

0

0

0

00

0

0

0

0

0

00

0

00

0

0

0

0

0

1

1

0

0

0

0

0

0

01

0

0

0

0

0

0

00

000

0

0

0

0

0

0

0

0

0

0

00

0

0

00

0

00

00

0

0

00

0

0

0

0

0

0

0

0

0

0

0

0

0

00

00

000

000

0

00

000

1

000

1

0

0

0

0

0

0

0

0

0

0

0

0

0000

00

00

0

0

0

0

0

00

0

000

0

0

0

0

000

0

00

0

00

0

1

0

0

00

0

000

0

0

0

000

0000

0

0

0

000

0

0

0

0

0

1

0

0

0

000

0

0

0

0

0

0

00

0

0

0

0

1

0

0

000

0

0

000

0

0

0

0

0

0

0

0

00

0

0

0

0

0

0

0

0

0000

0

0

01

000

000

0

0

0

0

0

00

0

1

1

0

0

0

0

1

0

0

0

00

0

00

00

1

0000

0

00

0

00

1

00

00

0

0

0

0

0

0

00

00

0

0

0

0

0

00

00

0

0

0

0

1

0

0

0

0

0

0

0

0

0

1

0

0

0

0

0

0

0

00

0

0

0

00

0

0

0

00

000

000

00

0

-2-1

01

23

devia

nce r

esid

ual

800000 820000 840000 860000 880000 900000Number of the household member encrypted

30

But before eliminating these observations, it may as well be interesting to see what is

special about them. They all turn out to be self-employed individuals with a low or high age32.

This is indeed not what is expected from the theory, but these observations cannot be

considered as outliers in the sense that they must be data entry errors and hence these

observations are not eliminated.

Figure 5 Figure 6

Pregibon leverage against id number31 Δχ2-statistic against id number31

Another possibility is that there are some observations with big impact on the

estimated coefficients. This is investigated by looking to an observation’s Pregibon’s dbeta,

which measures the influence of a covariate pattern on a coefficient estimate. A rule of

thumb for this statistic is that an observation can be considered as influential if it has a

Pregibon’s beta larger than 0.2. In figure 6 can be seen that according to this rule eight

observations have influential impact on the estimates. Further inspection of these

observations reveals that those observations are all 17 year-old males with 3 safe choices in

the lottery experiment and a certainty equivalent of 27.5. There is nothing suspicious about

this covariate pattern so there is no reason to eliminate these individuals.

32

Observation number 848934 is a male aged 19 who is self-employed, but can be considered as risk averse. Observation number 863528 is a female aged 69 who is self-employed and risk loving. Observation number 891308 is a 24 year old male who is self-employed and risk averse.

0

00

00

0

0

0

0

0

0

0

00

0

0

000

00

0

0

0

0

0

0

0

000

000

0

0

10

00000

0

0

00

0

0

00

0

00

0

00

0

0

0

0

0

00

0

0000

0

0

0

0

000

0

00

0

0

00

00

0

0

0

0

0

0

0

0

0

0

0

00

0

0

0

0

00

0

0

0

00

0

0

0

0

1000

0

0

0

00

0

00

0

0

0

1

000

000

0

0

0

0

0

00

0

0

0

0

00

00

0

0

0

0

00

0

0

0

0

0

0

0

1

0

01

0

0

0

0

0

0

0

0

00

1

0

0

0

0

0

0

0

0

0

000

0

00

0

0

0

1

0

0

1

0

001

0

0

0

0

0

0

0

0

0

0

0

000

0

0

00

00

0

0

0

0

1

0

0

0

0

00

0

0

0

00

0

00

0

0

0

0

0

01

0

1

00

0

0

0

00

0

0

00

0

0

0

0

00

0

00

0

0

0

0

00

0

0

0

00

00

0

0

0

0

00

0

0

00

0

0

0

0

0

0

0

0

0

00

0

000

00

0

0

0

0

0

0

0

0

1

0

0

0

0

0

0

0

0

0

0

0

00

1

1

0

000

0

0

0

0

0

0

0

0

0

0

0

0

0

00

00

0

0

0

0

0

1

0

0

0

0

0

0

000

0

0

0

0

000

0

0

0

000

0

01

0

00

0

0

010

0

0

0

0

00

1

0

1

0

0

0

01

0

0

00

0

0

00

0

0

0

0

0

0

0

0

0

0

0

0

0

00

0

0

0

0

0

0

0

0

00

0010

0

00

0

0

0

1

0

0

0

0

0

0

0

0

0

0

0

00

0

0

0

0

0

1

00

0

10

0

00

0

0

0

0

0

0

0

0

0

0

00

0

0

00

0

0

0

0

0

0

00

0

00

0

00

0

0

0

01

0

0

0

00

0

00

0

0

1

0

0

0

00

0

1

0

0

0

0

000

0

00

0

0

0

0

00

00

0

00

0

0

0

00

1

0

1

00

0

1

0

00

00

00

0

00

0

0

0

0

0

0

1

0

0

10000

0

0

1

00

0

0

0

0

0

0

0

0

0

1

00

0

0

0

0

000

0

0

0

0

0

0

0

0

0

0

0

0

0

0

0

0

0

000

0

0

0

0

0

00

0

0

00

00

0

0

000

0

0

0

0

0

00

0

0

00

0

0

0

0

0

1

0

0

0

0

0

0

0

0

0

0

0

0

0

0

0

0

0

00

0

000

0

0

0

0

0

0

0

0

0

0

1

0

00

0

1

0

0

0

0

0

0

0

0

000

0

00

0

0

0

0

0

0

00

00

0000

0

0

10

0

10

0

0

0

0

0

0

0

0

0

000

0

00

0

0

0

1

0

0

0

1

0

0

00

0

0

0

00

0

0

0

0

00

0

00

0

00

0

0

0

0

0

0

1

00

0

0

0

0

00

0

0

0

0

0

0

00

0

0

000

0

0

00

0

0

0

0

00

0

0

1

1

00

0

0

00

0

0

0

0

00

0

0

0

00

0

0

0

1

0

00

00

0

0

0

1

0

0

0

00

0

0

0

0

00

0

00

1

00

0

00

00

0

0

0

0

1

0

1

0000

0

0

00

0

1

00

0

0

0

0

0

0

0

0

0

00

0

0

0

0

0

0

000

00

00

0

0

0

00

0

0

0

00

00

0

0

1

0

0

0

0

0

0

0

1

0

0

0

0

0

00

0

0

0

0

0

1

0

1

0

0

0

0

0

0

0

00

0

0

0

0

0

00

0

000

0

0

0

00

1

00

0

0

0

0

0

0

00

1

0

0

0

1

000

0

0

0

00

1

0

0

0

0

0

0

0

1

0

0

0

0

000

0

0

0

00

00

00

0

0

00

0

0

0

0

000

0

0

0

0

00

0

0

0

0

0

0

0

0

0

00

0

0

0

0

0

0

0

0

0

0

0

0

0

0

00

0

0

00

0

0

0

0

00

000

0

0

0

00

0

0

0

0

0000

00000

0

0

0

00000

0

0

0

0

0

0

00

0

00

0

00

0

0

1

0

1000

0

00

00

0

000

0

0

0

0

0

0

0

0

0

0

0

00

0

0

0

0

0

0000

0

0

0

0

00

0

0

0

0

0

0

0

0

0

0

1

00

0

0

0

0

0

0

0

0

0

0

0

00

0

0

0

00

0

0

0

1

00

00

00

0

0

0

0

0

0

00

0

0

0

0

0

00

0

0

00

0

0

0

0

0

0

0

0

1

0

0

0

0

0

0

0

0

0

0

0

0

0

0

0

0

0

0

0

01

0

00

0

0

01

000

0

0

0

0

00

00

0

00

0

0

0

0

01

0

0

0

1

0

0

0

0

0

0

0

0

0

0

0

0

0

0

0

0

0

0

0

0

00

0

0

0

0

0

0

0

00

0

0

00

0

1

0

0

0

0

0

0

0

0

0

0

0

0

0

0

0

0

0

00

0

0

00

0

0

0

0

0

0

01

0

0

0

00

0

00

0

0

00

0

0

0

0

00

0

0

0

0

0

00

0

0

0

0

000

0

0

0

0

0

0

0

0

0

00

0

0

0

00

00

0

0

0

0

000

0

0

0

0

0

1

0

0

0

0

0

1

0

0

0000

0

0

0

0

0

10

0

0

0

0

0

0

0

000

000

00

0

1

0

0

0

00

0

0

1

0

0

0

1

0

0000

0

00

0

0

0

0

0

0

00

0

0

0

0

00

0

00

0

0

0

00

0

00

0

0

0

00

0

0

0

0

0

0

0

00

0

00

0

0

1

0

0

0

0

00

0

0

0

0

0

0

0

0

0

0

0

0

10

0

00

0

0

0

0

0

0

000

0

00

0

0

0

000

0

0

0

0

00

0

00

0

0

00

0

0

0

0

0

0

0

0

0

0

00

01

0

0

0

00

0

0

0

0

00

0

1

0

00

0

0

000

0

0

0

0

0

0

0

00

0

00

0

0

0

0

0

0

0

00

0

0

0

0

0

0

0

0

0

0

0

0

10

0

00

0

0

0

0

0

0

00

00

00

0

1

0

0

0

00

0

1

0

00

0

0

0

0

0

1

00

10

00

0

001

0

0

01

0

0

00

0

0

0

1

0

0

0

0

0

0

0

0

00

0

00

0

00

000

00

0

0

0

0

00

0

0

00

00

0

0

0

000

01

00

0

0

0

00

0

0

00

0

0

0

0

0000

00

0

0

0

0

0

00

0

0

0

0

10

0

0

1

0

0

00

0

00

0

0

0

0

00

0

0

0

0

0

00

01

0

0

0

0

00

0

1

00

0

0

0

0

0

0

0

1

1

0

0

0

0

0

0

00

000

00

0

0

00

0

0

0

0

00

0

0

0

0

0

0

0

0

0

0

0

00

0

00

00

0

00

0

00

0

0

0

0

0

0

0

0

0

0

0

0

0

0

1

00

0

00

0

0

1

0

0

00

0

0

00

0

0

00

0

0

0

0

0

000000

0

00

0

00

0

0

00

0

0

0

00

0

0

00

0

0

0

0

0

0

0

0

0

00

0

00

0

00

0

0

0

0

0

0

0

00

0

0

0

0

0

0

0

0

0

0

0

00

0

00

0

000

0

00

0

0

0

1

0

00

0

0

00

0

0

0

00

0

1

0

00

0

0

0

0

0

0

0

00

0

0

0

0

0

00

0

0

0

0

0

0

0

0

0

1

00

0

0

0

0

0

00

0

0

0

0

0

0

0

0

0

0

0

1

0

1

00

0

0

00

0

00

0

0

0

0

0

0

0

0

0

0

0

0

0

0

0

0

0

0

0

0

00

1

0

0

0

0

0

0

0

0

00

0

0

0

00

0

0

0

0

01

0

00

0

0

0

00

0

0

0

0

0

0

00

0

0

0

0

0

000

0

0

0

0

0

00

0

0

0

0

0

00

0

0

0

0

00000

00

1

0

0

0

0

0

0

0

1

00

0

0

0

0

00

0

00

0

0

0

0

00

0

0

0

0

0

0

0

00

00

00

000

0

0

0

0

0

0

0

00

00

0

0

0

00

0

0

0

00

0

0

0

00

0

0

0

0

0

0

0

0

0

0

0

0

0

000

0

0

01

0

0

0

0

0

1

0

1

0

1

0

0

0

0

00

0

0

0

00

1

0

0

0

0

0

1

0

00

0

0

0

00

0

0

0

0

0

0

0

00

0

00

0

0

00

0

0

000

0

0

0

0

0

000

0

0

0

0

0

0

0

00

0

0

0

0

0

0

1

1

0

0

0

0

0

001

0

00

0

0

0

0

0

0

0

0

0

0

0

0

0

0

0

0

0

0

00

0

0

0

0

0

0

0

00

0

00

0

0

0

0

0

0

0

0

0

0

0

0

00

0

0

0

0

0

0

0

0

0

0

0

0

0

0

00

100

01

0

0

0

0

0

0

00

0

0

0

0

0

00

0

0

0

0

0

0

0

0

00

0

0

0

0

0

0

0

0

00

0

0

0

0

00

0

0

0

0

10

0

00

0

0

00

0

0

0

0

0

0

00

0

0

0

0

0

0

0

0

0

0

0

0

0

1

00

0

0

00

0

0

000

0

00

000

010

0

00

000

0

0

0

0

0

0

0

0

0

0

0

00

0

0

0

0

00000

0

00

0

00

1

0

00

0

0

0

00

0

0

0

0

0

0

1

1

0

0

0

0

10

0

0

0

0

0

0

0

0

0

1

0

000

0000

00

1

0

00

0

0

0

0

00

0

00

0

0

0

0

0

0

0

0

0

0

0

0

0

0

0

10

00

0

0

0

0

0

0

1

0

0

0

0

00

0

00

0

0

0

00

0

0

0

0

0

0

000

00

00

0

0

.01

.02

.03

.04

leve

rag

e

800000 820000 840000 860000 880000 900000Number of the household member encrypted

000000000000000000000000000000000000

1

00000000000000000

0

0000000000000

0

0000000000000000000000000000000000000

0

000000

1

00000000

0

000001000000000000000

0

000000000000000000

1

00

1

000000000010000000

0

0000000

0

00

1

00

1

000

1

0

000

0

000000000000000

0

000

1

0000000000000000000010

1

000

0

0

0

000

0

0000000000000000000

0

00000000000000000000000000000000000000

1

0000000000000

11

00000000

0

000000000000000001

000000

0

0000000000000000010000

0

0

1

000

0

00

01

0

1

00001

000000000000

0

000

0000000000000000000

1

0000000

1

0000000000000000001000

1

000

0

0000000000000000

0

0000

0

000000000000

1

0

00000000

0

1

00000010000

0

00000000000000000000010

1

000100000000

0

0000000

1

001000000

1

0

0

00

0

000

0

00

1

00000000000000000

0

0

0

00000000000000000000000

0

00000

0

0000000000000

1

0000

00

0

00000000000000000000000000

10

000

1

000

0

00

0

00000000000000000

0

00000

1

001000000

0

000000000

0

00

1

000

1

00000000000000

0

00000000

0

000

1

0000000000000000000000

0

000000000011000000000000000

0

0000

1

00000000

1

000

0

0

000

0

000

0

0

1

000000000

001

010000000

0

0

1

00

0

000

0

00

0

0000000000000

0

00000

0

00000000001000000

0

1

000000000000

1

0

1

000000000000000000000

0000

1

000000

0

000

1

000

1

00000000100

0

0000

1

0000000000000000000000000

0

0000000

0

00000000

0

0000000000000000000000000

0

00000000000

0

000000000000000000000000

0

00000

0

00010

1

000

0

0000

0

00000000000000

0

0000000000

0

00000000000

000

0

1

0

0

000000000000000000000

1

00000000000000000000000

0

000000000100000000000000000

0

001000000

1

000

00000000

00

000000

1

0001000

0

0

0000000000000

0

00000000000000000

1

00000000000000000000000000000010000000000000000000000

0

0000

0

00

0

00000

0

0000000000000000000000000

1

000

0

0

1

0000000

0

000100000000000000000

10

000000

1

00010000000000000000000000000

0000000

0

0000000000000000000

1

000

0

00000000000000

1

00000000000000000000000

0

000000000000000000000000

1

0

0

00

0

00000001

000

0

000000

0

0000000000000

0

00

0

0

0

00000000000

1

000000000000

0

00

0

0

1

000000100000000

1

00

1

000000

10

00

1

0000000

1

00

0

0000000000

0000000000

0

00000000000000001000000000000000

0

0000000000000000

1

000

1

000000

0

00000000000000

1

0

000000

1

0000

0

0000

1

1

0000000000

0

00000000

0

0

00

0

0000000000000000000

00

000000

0

00000000010000000

1

00000000000000

0

00000000

0

00000000000

0

0000000000000000

0

000000

00

0000000000

0

00000

0

000000000000000010000000000000

1

00000000000000

0

000000000000010

0

0

0

000

0

000

0

0000000010

1

00

0

000000000000000000000000000

01

00000000000000

0

00000

1

000000000000

0000000000000000000000000000000000000

1

0000000

1

0000000

0

00000000

0

0000000000000000

0

000

0

00000

0

00000000000000000

0

00000

0000

0

000000

1

00000

1

0

1

01

0

00000000001000001000000000000000

0

00000000000

0

000000

0

000000

0

00

0

000000

0

1

10

0

0000010

0

0000000000000000000000

0

0000000000000000000000000000000000000000

1

000

1

00000

0

000000000000000

0

00000000000

0

0000

0

000000

1

00000000

0

00000000000000000

0

001000000000000000000

1

0

0

00000000000

00000000

0

0

0

0

0

00000000

01

000000

0

0000000

1

1

00

0

0

1

0

0

00000000

1

0000000000

1

000000000000

00

0000000000

0

00

1

0000000001000000000000000000000000000001

02

03

04

05

0

H-L

dX

^2

800000 820000 840000 860000 880000 900000Number of the household member encrypted

31

Figure 7

Pregibon's dbeta against id number24

All in all can be concluded that there are no real outliers present with respect to the

combination of the exogenous covariates and the derived degrees of risk aversion. There are

some influential observations in terms of influence on the estimated coefficients or influence

on goodness of fit statistics but further inspection of these observations shows that there is

no reason to exclude them from the analysis.

4.6 Heteroskedasticity and its possible solutions

The bad post estimation performance of the exogenous model can also have another cause:

heteroskedasticity. By inspecting the estimation results for heteroskedastic residuals, a

severe problem is encountered: in a misspecified logit model, which the used exogenous

model clearly is, it is impossible to distinguish between heteroskedastic residuals due to this

misspecification and non-constant residuals due to different variances between groups in the

sample. Because no variables to solve the misspecification problem are available, as is

discussed in paragraph 4.3, it is tried to solve the heteroskedasticity problem with the

available variables.

0000000

0

0000000000000000000000000000

1

0000000000000000000000000000000000000000000000000000000000000

0

00000000000000100000000

0

0000010000000000000000000000000000000000100

1

000000000010000000

0

0000000

0

00

1

00100010000

0

000000000000000

0

00010000000000000000000010

1

000

0

0

0

000

0

0000000000000000000000000000000000000000000000000000

0

0000010000000000000

1

1

0000

0

000

0

00000000000000000

1

0000000

0000000000000000010000

0

01000

0

00010

1

00001000000000000

0

000000000000000000000010000000100000000000000000010001000

0

0000000000000000

0

0000

0

000000000000

10

000000000100000010000000000000000000000000010100010000000000000000

1

0010000001

0

0

00

0

000

0

00

1

000000000

0

0000000

0

0

0

00000000000000000000000

0

0000000000000000000

1

000000

0

00000000000000000000000000

1

0

000

1

000

0

00

0

000000000000000000000001

00

1000000

0

000000000

0

00100010

0

00000000

0

000000000000

00

001000

0

000000000000000000

0

0000000000110000000000000000000010000000010000

0

000

0

000

0

0

1000000000

0

0

1

010000000

0

0

1

00

0

000

0

0000000000000000

0

00000000000000001000000

0

1

000000000000

1

0

1

000000000000

0

000000000000

1

0000000000

1

000100000000100

0

0000

1

000000000000000000000000000000000

0

00000000

0

0000000000000000000000000000000000000

0

0000000000

0

0000000000000000000

0

00010100000000

0

00000000000000

0

0000000000

0

000000000000000

1

0

0

000000000000000000000

1

00000000000000000000000

0

000000000100000000000000000

0

001000000

1

00000000000

0

000000010

0

01000

0

00000000000000

0

00000

0

000

0

0000000

1

0000000000000000000000000000001000000000000

0

00000000000000

0

00

0

00000

0

0000000000000000000000000

1

000

0

0

1

0000000

0

000100000000000000000

1

0

0000001

0001000000000000000000000000

0

00000000000000000000000

0

000

1

00000000000000000

0

1

00000000000000000000000

0

0000000000000000000000001000

0

0

0

00000010000000000

0

0000000000000000

0

0000000000000

1000000000000000

0

01000000100000000

1

00

1

00000010

0010000000100

0

0000000000

0

00000000000000000000000000100

0

000000000000000000000000000001000

1

00000000000000000000010

000000

1

0000

0

0000

1

10000000000

0

00000000

0

000

0

00000000000000000000

0

000000

0

00000000010000000100000000000000

0

00000000

0

00000000000

0

000000000000000000000000

0

0000000000000000

0

000

0

000000

0

000001000

0

000000000100000000000000000000000000001000

0

000

0

000

0

0000000010100

0

000000000000000000000000

0

000

10

000000000000000

0

000

1

000000000000

0

00000000000000000000000000000000000010000000

1

0000000

0

00000000

0

00000000000000000000000000000000000000000000

0

0000

0

0000

0

00000010000010

1

01

0

00000000001000001000000000000000000000000000

0

00000000000000

00

0

00

0

0000

1

10

0

0000010

0

000000000000000000

0000

0

0000000000000000000000000000000000000000

1

000

100000

0

00000

0

000000000

0

00000000000

0

0000000000010

0

000000

0

000000000000000

0

0

0

001000000000000000000

1

0

0

00000000000

0000000000

0

00000000

0

00

1

000000

0

0000000

1

1

00

0

0

1

0

0

00000000

1

00000000001000000000000000000000000000

1

000000000100000000000000000000000000000

.2.4

.6

Pre

gib

on's

db

eta

800000 820000 840000 860000 880000 900000Number of the household member encrypted

32

Figure 8 Figure 9

Pearson residuals against level of education33 Pearson residuals against the degree of urbanization34

Previous research indicates that the level of education has a different effect on the

decision to become self-employed for individuals belonging to different educational

categories (see e.g. Le, 1991). Self-employed individuals are most present at both ends of

the level of education distribution (Simoes et al., 2013). Therefore it can be expected that the

variance for individuals belonging to the highest and lowest educational categories is

different from the variance of individuals belonging to the other categories. Moreover, it can

be argued that other variables which are not included in the model but influential on self-

employment status, e.g. available wealth and entrepreneurial ability, could differ between

individuals with different educational degrees as well. Figure 7 provides some very weak

support that the variance differs between the levels of education. However, unreported

results show no noteworthy differences between regression results when the variance is

allowed to differ between educational categories35.

Furthermore previous studies include regional dummies in their reduced form

equation for self-employment (see for example, Brown et al., 2011), because labor markets

tend to vary between regions. Unfortunately, the region of the participants of the LISS panel

is not available. However, a classification in the categories extremely urban to not urban is

available. The probability to become self-employed might be higher in an extremely urban

region because there is a larger market for services offered by self-employed individuals.

However, the probability of self-employment might also be lower in extremely urban regions

because more jobs are available and hence there is less need to become self-employed.

Both explanations state that variances differ between individuals belonging to regions with

33

Level of education in categories maintained by Statistics Netherlands, ranging from 1 (primary school) to 6 (university degree). 34

1 means extremely urban, 5 means not urban at all 35

This result is invariant to whether the educational degree dummies are included in the model or not.

-20

24

6

sta

nda

rdiz

ed

Pe

ars

on r

esid

ual

1 2 3 4 5 6Level of education in CBS (Statistics Netherlands) categories

-20

24

6

sta

nda

rdiz

ed

Pe

ars

on r

esid

ual

1 2 3 4 5Urban character of place of residence

33

different degrees of urbanization. However, figure 8 shows only very weak support for this

reasoning and unreported results show no noteworthy differences between regression

results when the variance is allowed to differ between regions with a different degree of

urbanization36.

5. Conclusion

Summarizing, the derived degrees of risk aversion turned out not to be significant

explanatory variables for self-employment status, as was also found in Tucker (1988), Parker

(2008) and Dohmen et al. (2011) and therefore the first hypothesis of this paper is rejected. A

negative sign for all risk aversion measures is found, so the second hypothesis is not

rejected. Moreover, the consistency of the negative signs for the different used measures of

risk aversion implies that there might be at least some relationship between risk aversion and

self-employment. However, it appeared impossible to find a non-misspecified model to

explain the relationship between self-employment status and risk aversion, probably because

the main determinant is missing in the available data. The exogenous model fits the data the

best, but still has bad predictive quality. This is not caused by outliers in the data or

extremely influential observations. From a theoretical point of view it seems plausible that the

error terms in the exogenous model are heteroskedastic, but because of miss specification it

is impossible to detect its source. The results are invariantly insignificant to the used

measure of risk aversion, only considering participants in the real or hypothetical payoff

condition and whether individuals with unknown job status are left out of the sample. In fact,

the results are even invariant to all different possible combinations of these factors.

Moreover, maintaining a different definition of self-employment does not generate results

which are significant and even shows that the signs of the risk aversion measures are

extremely sensitive to this defintion.

6. Discussion

From the results in chapter four follows that a theoretically underbuilt model works worse

than a model with only exogenous variables, using the data of the LISS panel. In this section

is discussed what probably causes this and how these flaws can be prevented in future

research within this field.

One part of the answer lies within the used model: the logit binary choice model is

very sensitive to misspecification because the estimates become biased, inefficient and

inconsistent in that case. This may be one of the reasons that the risk aversion measures are

36

This result is invariant to whether the different degree of urbanization dummies are included in the model or not.

34

insignificant in the exogenous model. This is tested by using the less sensitive linear

regression technique in a model based on the reverse relationship between self-employment

and risk aversion, i.e. in which way self-employment status influences risk attitude37, as is

discussed in paragraph 2.3.2. If the coefficient of the self-employment dummy variable is

significant with the right sign in this reverse model, this could be a not-waterproof38 signal

that insignificance of the risk aversion measures is caused by the sensitivity to

misspecification of the logit model. However, table 12 shows that the self-employment

dummy variables are still insignificant in the reverse relationship estimated using linear

regression. This result weakly indicates that the sensitivity of the logit model cannot be fully

blamed for the found results.

Table 12

Results of applying linear regression on a model based on the reverse relationship between

risk aversion and self-employment

Another possible explanation for the insignificant results could be that the derived

degrees of risk aversion are not a good proxy for risk taking behavior in real life, as Dohmen

et al. (2011) mention. This constructional invalidity can be examined by the combining the

results of the used lottery experiment with the results of another experiment of the LISS

panel, in which participants had to answer a self-assessment risk question39, as is discussed

in paragraph 2.2. 84 LISS panel members participated in both experiments and from them

both risk aversion measures are available. As can be seen in table 13, the correlation

between the answer on the self-assessment risk question and the from the experiment

derived degrees of risk aversion is in the right direction, albeit modest in magnitude. This

may be a sign that the used lottery experiment is not the best way to approximate real life

risk taking behavior. Another sign for this is the found difference in risk aversion between

participants in the real payoff condition and the hypothetical payoff condition: employing both

payoff conditions in the same experiment might have resulted in inconsistent degrees of risk

aversion between the two payoff conditions. Hence, a tentative conclusion of the present

37

Among the regular exogenous control variables age, squared age and a male dummy. 38

Assumptions necessary for reliable OLS estimates are highly violated, for example because the dependent variable is not continuous etc. 39

This question is: “how do you see yourself: are you generally a person who is fully prepared to take risks or do you try to avoid taking risks”. Answers are given on a 10-point Likert scale: 0 means ‘Not at all willing to take risks’ and 10 means ‘Very willing to take risks’.

Dependent variable NUMSAFE ARE RRP

Coefficient of the self-employment dummy variable -0,125 -0,001 -0,042

p value self-employment dummy variable 0,429 0,671 0,405

Sign corresponds to sign of the risk aversion measure in the exogenous model Yes Yes Yes

35

study is that a human’s risk taking behavior is too complicated to capture in a partly incentive

lottery experiment with just five choices, as the one used in the present study. In future

research the design of the lottery experiment should be revised: it is interesting to see

whether adding larger and smaller sure payoffs and an indifference option to the experiment

alters the found degrees of risk aversion.

Table 13

Correlation between the general risk measure and the derived degrees of risk aversion

A last explanation for the insignificant results is the lack of a main determinant of self-

employment status, as suggested by a logit regression of self-employment status on ŷ and

ŷ2, as is discussed in paragraph 4.4. The theoretical model of Praag and Cramer (2001)

states that this missing determinant of the decision to become self-employed might be

entrepreneurial ability, but none of the available variables is a good proxy for this.

Moreover, another potential missing determinant of self-employment status might be

the ability to overcome the financial constraints associated with setting up an own business.

Perhaps an individual’s accumulated wealth is a better proxy for this, than whether the

individual has an own dwelling, as is included in the model from paragraph 4.3. An

accumulated wealth variable was not used in the analysis because firstly previous research

found that this variable is prone to severe measurement errors, secondly it is hard to

distinguish between high wealth as a cause of self-employment and high wealth as a

consequence of self-employment and thirdly wealth variables were only available for 1541

participants of the lottery experiment. Deleting the observations without available wealth

status would have decreased the sample size with roughly 40% and could have created a

biased main sample because there might be a link between risk aversion and willingness or

ability to complete the questions in a financial assets survey40. Table 14 shows what

happens if logged accumulated wealth41 is added to the analysis. Again all risk aversion

measures are predicted with the insignificant right sign, regardless whether mortgages and

financial assets are included in the logged wealth variable. It is noteworthy that the logged

wealth variable appears insignificant, but increases the significance of the simple and relative

40

Self-employed individuals may have more financial literacy for example, other explanations are also possible. 41

Operationalized as the total value of total assets (money, investments, real estate, expensive valuables, loans, life insurance) minus outstanding debts.

Correlation with the answer of the general risk measure

Number of safe choices -0,334

Absolute risk aversion -0,198

Relative risk aversion -0,330

36

risk aversion measure as compared to the exogenous model, as can be seen by comparing

table 14 with table 7. This may indicate that the ability to overcome the financial constraints is

indeed an important omitted variable in the exogenous model, but that the operationalization

of this variable is not right yet or that measurement errors disturb the model improvement of

including this variable.

Table 14

Results of the exogenous model when a logged wealth variable is added

Other shortcomings of this research are the big amount of irrational participants in the

experiment potentially creating bias in the sample. Of course, an individual may display

irrational behavior in both real life and experimental settings. However, possibly some

participants made mistakes in the experiment because they did not understand the

questions: 46.5% of the participants reported that the questions were difficult or very difficult

to answer and 28.3% addressed that the questions were unclear or very unclear. Irrationality

caused by unclearness of the questions should be minimized in a future experiment42. This

can probably be reached by reformulating the questions, explanations by a real person,

different graphics and control questions whether the participant understood the experiment.

Furthermore, the lack of family background characteristics, as used in other research

about this topic, makes it hard to compare the found results of the present study with those of

other studies. Moreover, the sample appears not to be representative with respect to the

percentage of self-employed individuals it contains: 7.0% of the sample’s labor force43 is self-

employed, compared to 14% of the Dutch labor force in 2009 (Centraal Bureau voor de

Statistiek [CBS], 2014). In addition, because the signs of the risk aversion measure appear

sensitive to the definition of self-employment, it may be interesting to examine the differences

among self-employed individuals in different categories of self-employment more thoroughly,

e.g. “Do freelancers have different characteristics than owners of family businesses?”.

42

For example by reformulating the questions, explanations by a real person, different graphics and control questions whether the participant understood the experiment. 43

It seems more logical to exclude the individuals whose job status is missing from the sample’s labor force, but this might not be appropriate for all those individuals: some of them might actually be looking for a job or did just not fill in the answer on this question and are therefore part of the labor force according to the standard definition of labor force. Therefore 7% is an underestimation of the real experimental fraction.

Logged wealth variable Total wealth Minus mortgages Minus mortgages and assets

Measure of risk aversion NUMSAFE ARE RRP NUMSAFE ARE RRP NUMSAFE ARE RRP

Coefficient risk aversion measure -0,726 -1,747 -0,236 -0,073 -1,762 -0,238 -0,073 -1,752 -0,024

P-value risk aversion measure 0,262 0,726 0,245 0,259 0,724 0,243 0,258 0,726 0,242

Mc Fadden's R squared 0,024 0,0222 0,0241 0,0237 0,0218 0,0238 0,0237 0,0219 0,0239

Result in line with results without logged wealth included Yes Yes Yes Yes Yes Yes Yes Yes Yes

37

Finally, replicating this research with another dataset is interesting because this makes it

possible to determine whether insignificance of most control variables in the full model is

based on random factors or not.

Due to these flaws and the insignificant results there is no basis for extending the

findings of this study in general. Future research which takes into account the

aforementioned recommendations is necessary to investigate whether the effect of risk

aversion on self-employment status remains insignificant.

38

References Burke, A.E., Fitzroy, F. R., & Nolan, M. A. (2002). Self-employment wealth and job creation: the roles of gender, non-pecuniary motivation and entrepreneurial ability. Small Business Economics, 19, 255–270. Caliendo, M., Frank, F., & Kritikos, A. (2009). Risk attitudes of nascent

entrepreneurs: new evidence from an experimentally-validated survey. Small Business Economics, 32, 153–167.

Camerer, C. F., & Hogarth, R. M. (1999). The effects of financial incentives in experiments: A review and capital-labor production framework. Journal of Risk and Uncertainty, 19, 7-42.

Centraal Bureau voor de Statistiek. (2014). Data zelfstandigen met en zonder personeel. Retrievedfrom:http://statline.cbs.nl/StatWeb/publication/?VW=T&DM=SLNL&PA=80 50NED&D1=0 ,5&D2=0,28-32&D3=11,16,21,26,31,36,41,46,51,56&HD=121218 1635&HDR=T&STB=G1,G2 Connelly, R. (1992). Self-employment and providing child care. Demography, 29,17-

29. Dohmen, T., Falk, A., Huffman, D., Sunde, U., Schupp, j. & Wagner, W. (2005). Individual risk attitudes: new evidence from a large, representative, experimentally-validated survey (IZA working paper No. 1730). Retrieved from http://hdl.handle.net/10419/33253 Dohmen, T., Falk, A., Huffman, D., Sunde, U., Schupp, j. & Wagner, W. (2011). Individual risk attitudes: measurement, determinants and behavioral consequences. Journal of the European Economic Association, 9, 522-550. Ekelund, J., Johansson, E., Jarvelin, M., & Lichtermann, D. (2005). Self-employment and risk aversion – evidence from psychological test data. Labour Economics, 12, 649–659. Georgellis, Y. & Wall, H. J. (2005). Gender differences in self‐employment. International Review of Applied Economics, 19(3), 321-342. Harrison, G. W., & Rutström, E. E. (2008). Experimental evidence on the existence of hypothetical bias in value elicitation methods. Handbook of experimental economics results, 1(5), 752-767. Hartog, J., Ferrer-i-Carbonell, A., & Jonker, N. (2002). Linking measured risk aversion to individual characteristics. Kyklos, 55, 3–26. Heij, C., De Boer, P., Franses, , Kloek, T., & van Dijk, H. K. (2004). Econometric methods with applications in business and economics. Oxford, England: Oxford University Press. Holt, C. A., & Laury, S. K. (2002). Risk aversion and incentive effects. American economic review, 92(5), 1644-1655. Hundley, G. (2001). Why women earn less than men in self-employment. Journal of Labor Research, 22 (4), 817-829. Kapteyn, A., & Teppa, F. (2011). Subjective measures of risk aversion, fixed costs, and portfolio choice. Journal of Economic Psychology, 32(4), 564-580. Kass, R. E., & Raftery, A. E. (1995). Bayes factors. Journal of the American Statistical Association, 90(430), 773-795. Koellinger, P., Minniti, M., & Schade, C. (2013). Gender differences

in entrepreneurial propensity. Oxford Bulletin of Economics and Statistics, 75(2), 213- 234. Le, A. T. (1999). Empirical Studies of Self-employment. Journal of Economic Surveys, 13, 381-416. Masclet, D., Colombier, N., Denant-Boemont, L., & Loheac, Y. (2009). Group and individual risk preferences: A lottery-choice experiment with self-employed and salaried workers. Journal of Economic Behavior and Organization, 70(3), 470–484. Miner, J. B., & Raju, N. S. (2004). Risk propensity differences between managers and entrepreneurs and between low- and high-growth entrepreneurs: A reply in a more conservative vein. Journal of Applied Psychology, 89(1), 3–13.

39

Noussair, C.N., Trautmann, S. T., van de Kuilen, G., Vellekoop, N. (2013). Risk aversion and religion. Journal of Risk and Uncertainty, 47, 165-183.

Parker, S. C. (1997). The effects of risk on self-employment. Small Business Economics, 9,515–522.

Parker, S. C. (2008). Entrepreneurship among married couples in the United States: A simultaneous probit approach. Labour Economics, 15, 459–481. Rosa, P., Carter, s., & Hamilton, D. (1996). Gender as a determinant of small business performance: insights from a British study. Small Business Economics, 8(4), 463 - 478. Skriabikova, O. J., Dohmen, T., Kriechel, B. (in press). New evidence

on the relationship between risk attitudes and self-employment, Labour Economics. Retrieved from: http://www.sciencedirect.com/science/article/pii/S092753711400044X Simoes, N., Moreira, S. B., & Crespo, N. (2013). Individual determinants of self-employment entry: what do we really know? (MPRA paper, no. 48403). Retrieved from University of Munich, Munich personal RePEc archive website: http://mpra.ub.uni muenchen.de/48403/ Tucker, I. B. (1988). Entrepreneurs and public-sector employees: The role of achievement motivation and risk in occupational choice. The Journal of Economic Education, 19(3), 259–268. Van Praag, C. M., & Cramer, J. S. (2001). The roots of entrepreneurship and labour demand: Individual ability and low risk aversion. Economica, 68, 45–62. Pratt, J. W. (1964). Risk aversion in the small and in the large. Econometrica, 32,122-136. Wakker, P. P. (2010). Prospect theory: for risk and ambiguity. Cambridge, England: Cambridge University Press. Weber, E. U., Blais A. R., & Betz. N.E. (2002). A domain-specific risk-attitude scale: measuring risk perceptions and risk behaviors. Journal of Behavioral Decision Making, 15(4), 263-90.

Appendix I

Table 2

Differences in characteristics between subsamples

Pay-off condition Both

Included individuals All Only employed Only known job status

N 2651

1958

1832 Mean Std. Err. Mean Std. Err. Mean Std. Err.

% Male 48,8% 50,0% 50,4% 50,0% 51,3% 50,0%

Age 49,002 17,068 43,554 14,681 43,746 13,823

% Married 59,5% 49,1% 55,6% 49,7% 56,8% 49,6%

% Partnered 76,4% 42,5% 77,2% 42,0% 77,7% 41,7%

% With children 39,1% 48,8% 47,9% 50,0% 47,8% 50,0%

Number of children 0,798 1,138 0,977 1,181 0,962 1,162

Education level 3,526 1,516 3,723 1,477 3,810 1,446

% Own dwelling 75,1% 43,3% 77,8% 41,5% 78,4% 41,2%

% Self-employed 4,9% 21,5% 6,6% 24,8% 7,0% 25,6% Number of safe choices 3,517 1,762 3,500 1,738 3,475 1,737

Absolute risk aversion 0,019 0,000 0,018 0,001 0,018 0,001

Relative risk aversion 0,579 0,011 0,576 0,012 0,568 0,013

Pay-off condition Real

Included individuals All Only employed Only known job status

N 1043

771

721 Mean Std. Err. Mean Std. Err. Mean Std. Err.

% Male 49,6% 50,0% 52,3% 50,0% 53,5% 49,9%

Age 49,384 16,820 44,128 14,640 44,148 13,786

% Married 59,9% 49,0% 56,4% 49,6% 57,6% 49,5%

% Partnered 75,6% 42,9% 77,0% 42,1% 77,8% 41,6%

% With children 40,7% 49,2% 49,7% 50,0% 49,8% 50,0%

Number of children 0,830 1,158 1,013 1,191 1,008 1,187

Education level 3,472 1,542 3,669 1,510 3,772 1,474

% Own dwelling 73,3% 44,3% 76,7% 42,3% 77,4% 41,9%

% Self-employed 5,6% 22,9% 7,5% 26,4% 8,0% 27,2% Number of safe choices 3,340 1,797 3,272 1,786 3,247 1,773

Absolute risk aversion 0,027 0,001 0,025 0,001 0,025 0,001

Relative risk aversion 0,525 0,018 0,505 0,020 0,497 0,208

i

ii

Pay-off condition Hypothetical

Included individuals All Only employed Only known Jobstatus

N 1608

1187

1111 Mean Std. Err. Mean Std. Err. Mean Std. Err.

% Male 48,3% 50,0% 49,1% 50,0% 49,9% 50,0%

Age 48,753 17,228 43,181 14,702 43,485 13,847

% Married 59,3% 49,1% 55,0% 49,8% 56,3% 49,6%

% Partnered 76,9% 42,2% 77,3% 41,9% 77,6% 41,7%

% With children 38,0% 48,6% 46,8% 49,9% 46,4% 49,9%

Number of children 0,777 1,125 0,954 1,174 0,932 1,146

Education level 3,562 1,498 3,758 1,455 3,834 1,427

% Own dwelling 76,2% 42,6% 78,6% 41,0% 79,0% 40,7%

% Self-employed 4,4% 20,6% 6,0% 23,7% 6,4% 24,5% Number of safe choices 3,632 1,730 3,648 1,691 3,623 1,698

Absolute risk aversion 0,014 0,001 0,013 0,001 0,013 0,001

Relative risk aversion 0,615 0,014 0,622 0,015 0,614 0,016