abstract · 2019-09-03 · differences in learning and knowledge generation and diffusion. even...
TRANSCRIPT
Paper to be presented at the DRUID Academy Conference 2018at University of Southern Denmark, Odense, Denmark
January 17-19, 2018
Network participation and innovation: evidence from MSMEs in Vietnamesemanufacturing sector
Elisa CalzaUNU-MERITUNU-MERIT
AbstractBy identifying knowledge flows as a main driver ofinnovation, firm-level empirical literature explains firmheterogeneity in innovation performance as the result ofdifferences in learning and knowledge generation anddiffusion. Even within clusters, where geographicproximity is believed to play a primary role (Asheimand Gertel, 2005), firms are not homogeneous and theystill behave differently, which may ultimately bereflected in how and how much they innovate(Boschma and Wal, 2007; Giuliani, 2007). In thisrespect, other dimensions of proximity (besidesgeographic) may act as channels for knowledgediffusion and innovation. Among these, social orrelational proximity (Boschma, 2005) represented by(social, business, family) networks embeddedness(Gebreeyesus and Mohnen, 2013) is attractingincreasing interest. Most of the existing empirical analyses relatingnetwork and innovation performance are based on theexperience of advanced economies, thus they do notconsider the specific conditions of entrepreneurialactivity in a developing country. At the same time, theliterature on networks in developing economies focusesmainly on their effects on other dimensions of firmperformance (e.g. value added) (Fafchamps and Minten2002; Barr, 2000; Howard, 2016), or on transactioncosts and information (Fafchamps, 2000). Hence, theempirical evidence on the impact of networks on
firm-level innovation performance in developingeconomies is still scarce (Gebreeyesus and Mohnen,2013). This work aims at addressing this research gap bybringing together these two strands of the literature: itinvestigates the determinants of technologicalinnovation outputs (product and process) amongVietnamese SMEs in the manufacturing sector byexploring the role of participation in networks,addressing in particular whether the size of a firm'snetwork impacts upon its innovation performance. Wefurther examine the importance of the ‘quality’ ofcontacts and the role of geographic proximity in drivinginnovative performance. Underlying the proposed analysis is the theoreticalargument that the inclusion and the degree ofembeddedness in networks can play a role in explaininginnovation heterogeneity. Different types of networkare likely to convey different information, and adifferent degree of participation in networks impliesthat firms are linked to different sets of players, whichare in turn carrying different sets of knowledge,representing potentially different opportunities as wellas constraints. Using 5 waves (2007-2015) of a panel survey onVietnamese MSMEs in the manufacturing sector (1,123firms per period in the balanced panel), a probabilitymodel for technological innovation (product and/or
process) is implemented. The main independentvariable accounting for network effects is the number ofcontacts. We also include other network-related factors(e.g. diversity of contacts, being part of a businessassociation, geographical distance from suppliers andcustomers), as well as controls (firm and entrepreneurcharacteristics, sector, province and time periods). Toaccount for possible endogeneity of network, wesimultaneously estimate the innovation probabilityequation (implementing a correlated random effectsprobit based on Mundlak (1978)) and a reduced formequation for network (implemented ascontrol-function). Initial results show that firms with a larger network aremore likely to innovate. This result is robust to differentestimation methods and specifications. We also findthat the number of contacts in different sectors, amongpoliticians and among women is positively correlatedwith innovation. In terms of ‘quality’ of network, beingpart of a business association is highly positivelycorrelated with innovation, while we do not find suchevidence for having a more diverse network. Ourresults do not provide any evidence of a curvilineareffect of the size of network on innovation in our data.
ReferencesAsheim, B. and M.S. Gertler (2005). ‘The Geographyof Innovation: Regional Innovation Systems’. In J.Fagerberg, D. Mowery, and R. Nelson (Eds.), TheOxford Handbook of Innovation, 291-317, Oxford:Oxford University Press. Barr, a. (2000). ‘Social Capital and TechnicalInformation Flows in the Ghanaian ManufacturingSector’. Oxford Economic Papers, 52(3): 539-59.Boschma, R.A. (2005). ‘Proximity and Innovation: ACritical Assessment’. Regional Studies, 39(1):61-74.Boschma, R. A. and L.J. Wal (2007). ‘KnowledgeNetworks and Innovative Performance in an IndustrialDistrict: the Case of Footwear District in the SouthItaly’. Industry and Innovation, 14(2): 177-99.Fafchamps, M. (2000). ‘Ethnicity and Credit in AfricanManufacturing’. Journal of Development Economics,61: 205-35.Fafchmps, M. and B. Minten (2002). ‘Returns to SocialNetwork Capital Among Traders’. Oxford EconomicPapers, 54(2): 173-206.Gebreeyesus, M. and P. Mohnen (2013). ‘InnovationPerformance and Embeddedness in Networks: Evidencefrom the Ethiopian Footwear Cluster’. WorldDevelopment, 41: 302-316Giuliani, E. (2007). ‘The Selective Nature ofKnowledge Networks in Clusters: Evidence from theWine Industry’. Journal of Economic Geography,7:139-68.
Howard, E. (2016). ‘Social Networks, GeographicProximity, and Firm Performance in Viet Nam’.WIDER Working Paper 2017/69. Helsinki:UNU-WIDER.Tybout, J.R. (2000). ‘Manufacturing Firms inDeveloping Countries: How Well Do They Do, andWhy?’. Journal of Economic Literature, 38(1): 11-44.
Network participation and innovation: evidence fromVietnamese MSEMs in manufacturing sector
Elisa Calza∗
December 7, 2017
DRAFT - DO NOT CIRCULATE, DO NOT CITE
Abstract
JEL Classifications:Keywords: Entrepreneurship, Innovation
Emerging economies, Viet NamSocial networks
∗UNU-MERIT/MGSoG, Maastricht, The Netherlands, Contact: [email protected]@merit.unu.edu
1 Introduction
The heterogeneity of firm-level innovation performance is a widely investigated topic in thediscipline of economics of innovation. This interest can be understood when considering thepositive effects of technological innovation on firm performance and productivity, effects thathave been deeply studied and well recognized in the empirical literature (Crepon et al.Crepon et al., 19981998;Mairesse and MohnenMairesse and Mohnen, 20102010; HallHall, 20112011). Thus, understanding what makes firms more likelyto innovate - and thus, potentially, to grow or foster their productivity - is a relevant issue,especially for developing and emerging countries.The analysis of the determinants of innovation for firms operating in a developing or emergingeconomies has considered a wider set of proxies for channels of knowledge diffusion. In thisrespect, geographic proximity has been studied as playing a primary role on innovation activitieswithin firm clusters. However, even when geographically close, firms are not homogeneous andthey still behave differently, which may ultimately be reflected in how and how much theyinnovate. Other dimensions of proximity may act as channels for knowledge diffusion andinnovation. Among these, social networks and social interactions have been gaining increasingattention as a channel for information and knowledge diffusion. However, most of existingempirical studies on developing and emerging economies explore the relationship between socialnetworks and firm productivity or profit performance, while the empirical evidence on socialnetworks and firm-level innovation performance is still scarce, and limited to the experience ofclusters in more advanced economies.This paper aims at contributing to address this gap in the literature by investigating the sourcesof the heterogeneity of innovation performance from the perspective of knowledge diffusion andtransfer among entrepreneurial actors operating in a developing and emerging economy. Wefollow the argument that the inclusion in networks allows firms to get in contact with a newand different set players, who are carrying different typed of information respect to what theyown, and this may represent an important input for innovation activity, especially among microand small firms operating in a developing and emerging economy. Using a rich panel databaseof a MSMEs survey conducted in Vietnamese manufacturing sector between 2007 and 2015,we perform an empirical analysis of the determinants of technological innovation (product andprocess) among Vietnamese SMEs in the manufacturing sector, focusing on the role playedby the participation into networks. The term network is here used as synonymous for ‘socialnetwork’ as it has been defined by Nichter and GoldmarkNichter and Goldmark (20092009): ‘micro-level relationshipsbetween agents in an economy...to refer to relationships between individuals’. These couldinclude, but not be restricted to, business relations, as well as family, neighborhood, or villagerelationships. We find evidence of a positive and significant effect of network size on theprobability of engaging in innovation activities. By testing different specifications, our resultsalso show that also participating in a more diverse network, as well as being member of ‘higherquality’ networks, has a further positive effect on innovation.The determinants of innovation performance and the role played by network have not beenempirically investigated for the case of Vietnamese MSMEs. Thus, our work also contributes tobetter understand the innovation process for micro, small, and medium enterprises in the specificcontext of the Vietnamese manufacturing sector. We believe that this study is particularlyimportant for the country, potentially providing some insights for the design of entrepreneurialpolicies aiming at fostering the performance of MSMEs. The Vietnamese government isnowadays keen on implementing policy actions in this direction, aiming at the strengthening ofdomestic technological capabilities to avoid the risk of getting stuck in a middle income trap.The paper is structured as follows. Section 22 provides a brief review of the relevant literature,based in which we formulate some hypotheses to be tested in the empirical analysis. Section 33describes the used database, presenting also the description and some summary statics of the
1
used variables. Section 44 discuss the model and explains the implemented estimation strategy.Section 55 comments the obtained results, followed by a brief discussion and some concludingremarks in Section 66.
2 Literature and hypotheses
2.1 Literature background
Absorptive capacity and firm capabilities have long been recognized as the grounds of learningand innovation performance in firms (Nelson and WinterNelson and Winter, 19821982; Cohen and LevinthalCohen and Levinthal, 19901990;LallLall, 19921992). They represent also an important source of firm heterogeneity: firms areheterogeneous in what they know, how they learn, how they use their knowledge, whichultimately may reflect in how and how much they innovate.With the increased availability of micro-data, a well established empirical literature hasinvestigated the determinants of innovation activities and outputs, trying to match thetheoretical importance of capabilities and absorptive capacity with the search of adequateindicators. In this respect, the studies related to the CDM model (Crepon et al.Crepon et al., 19981998)contributed largely to a better understanding of the determinants of technological innovation,assigning a pivotal role to RD investments in accounting for advanced technological capabilities(together with other ‘conventional’ controls to account for more general firm-level capabilities,such as firm size and age) (Mairesse and MohnenMairesse and Mohnen, 20102010). However this approach was developedon the basis of the experience of firms in advanced economies, and it has been criticized dueto its emphasis on RD activities, which makes it not so adequate for the analysis of innovationin small enterprises and in firms in developing countries. In a developing economy productiveactors tend not to work on frontier technology and rarely rely on conducting RD to push forwardinnovation (Hall et al.Hall et al., 20092009), being their learning more related to imitation, adaptation andabsorption of technologies developed somewhere else (Chudnovsky et al.Chudnovsky et al., 20062006).In order to better account for the heterogeneity in innovation performance in a developingcontext, the empirical analysis of firm-level innovation has also emphasized other factorsassociated with firm-level absorptive capacity, capabilities, and learning, such as the humancapital of the firm proxied by manager education, experience or/and employee training(Goedhuys et al.Goedhuys et al., 20142014; GebreeyesusGebreeyesus, 20112011; Gebreeyesus and MohnenGebreeyesus and Mohnen, 20132013), or the role ofknowledge diffusion and knowledge spillovers. Innovation is based on the interactions andknowledge flows between market and economic entities (Asheim and GertlerAsheim and Gertler, 20052005), andknowledge spillovers are likely to play a particularity important role in a developing countrycontext, where firms are operating far from the best practice frontier (Howard et al.Howard et al., 20152015;HowardHoward, 20172017), and where transaction costs and information asymmetries are more diffused(TyboutTybout, 20002000).The research into knowledge spillover and knowledge transfer across economic entities stemsfrom the tradition of studies on industrial agglomeration and clusters. These studies emphazisethe importance of geographic proximity on firm competitive and innovaiton performancewith respect to isolated firms, mainly thanks to a ‘cluster effect’ (Asheim and GertlerAsheim and Gertler, 20052005)associated with collective efficiency and the transmission of tacit knowledge. However, therelative importance of geographical agglomeration has started to get questioned. This isnot exclusively due to the appearance of digital technologies and internet, which may havemade physical distance less relevant and whose impact on how knowledge and informationare transmitted is still debated, but mainly due to the fact that geographical proximity perse seems not to be able to explain the intra-cluster heterogeneity in learning, knowledge
2
generation and diffusion, and innovation. Even within clusters, firms still behave and performdifferently, thus are heterogeneous despite their geographical proximity (Giuliani and BellGiuliani and Bell, 20052005;Boschma and Ter WalBoschma and Ter Wal, 20072007; ?).Other dimensions of proximity need to be considered in order to properly understand firmlearning and innovation performance, first of all social or relational proximity (BoschmaBoschma, 20052005;Boschma and Ter WalBoschma and Ter Wal, 20072007). The inclusion and the degree of embeddedness in networks canplay a role in explaining innovation heterogeneity within the same clusters: different typesof network are likely to carry different information content that may affect firm performancedifferently, and uneven participation in networks means that firms are connected in differentiatedways that link them to different sets of players and present them with sharply differentopportunities and constraints.This does not imply that geographical proximity is not relevant; instead, that it should not beassessed in isolation but in relation other dimensions of proximity. Gebreeyesus and MohnenGebreeyesus and Mohnen(20132013) argue that ‘the effect of clusters on innovation may be (...) partially influenced by networkposition.’ In other words, the impact of geographical proximity (such as in clusters) may alsodepend on the degree and quality of embededness in networks, suggesting a sort of interactionbetween different types of proximities. In the same way, the effect of the embeddedness innetworks may also differ from firm to firm depending on their absorptive capacity: firms withan higher absorptive capacity may find it easier to establish links with external sources ofknowledge (Giuliani and BellGiuliani and Bell, 20052005) and are better able to exploit the knowledge from outside(Cohen and LevinthalCohen and Levinthal, 19901990).The empirical literature on entrepreneurship in developing countries has recognized theimportance of social network mainly in relation with firm performance and growth, or toovercome obstacles related to transaction costs, contract enforcement, and lack of regulation,which are typical in environments with pervasive market failures as in the case of developing andemerging economies. ‘Having an extensive social network is a valuable asset that can help anentrepreneur obtain access to information (e.g., leads about profitable business opportunities)as well as resources (e.g.,credit).’(Nichter and GoldmarkNichter and Goldmark, 20092009). FafchampsFafchamps (20002000) shows thepreference of Ghanaian firms for doing business with individuals they already know, and thatnetworks are crucial in determining access to credit. In another study Fafchamps and MintenFafchamps and Minten(20022002) find that larger networks are associated with higher value added for agricultural traders inMadagascar. This is similarly to what BarrBarr (20002000) finds for the case of small-scale manufacturersin Ghana, among those the ones with larger and more diverse sets of networks are moreproductive. Turning more specifically to Viet Nam, as far as we know HowardHoward (20172017) is theonly existing study focusing on the correlation between social networks, geographical proximityand firm performance (value added) using a panel of MSMEs in manufacturing over years 2011-2013-2015, finding that larger and ‘more quality’ social networks seem to be more positivelycorrelated with firm productivity level than geographical proximity.Most of the existing empirical studies on network and innovation focus instead on the experienceof firms in advance economies, whose clusters are characterized by RD and by the presenceof more advanced, more capable leading firms acting as ‘technological/knowledge gatekeeper’(Boschma and Ter WalBoschma and Ter Wal, 20072007). On the contrary in a developing context clusters are more likelyto be formed by small and micro locally owned firms in traditional sectors and that are active onthe local market, which tend to generate little new knowledge and technology and rely mainly onimitating and adapting existing one developed elsewhere (MarkusenMarkusen, 19961996). Thus, the existingempirical literature on network and innovation is of limited relevance for the analysis of thissame topic in a developing context.We are not aware of many empirical studies on network embededness and innovationperformance in developing countries, with the exception of the work of Gebreeyesus and MohnenGebreeyesus and Mohnen(20132013) that provides positive evidence of the effect of embeddedness in networks on firminnovative performance in the case of the Mercato footwear cluster in Ethiopia, finding that local
3
business relations are the major channels through which marketing and technical knowledge flowinto the analyzed footwear cluster, where trade is mainly conducted within the analyzed footweardistrict, and where linkages with firms outside the district - even through subcontracting - tendto be minimal, with most of the firms obtaining information from informal relations with theirlocal business partners (Gebreeyesus and MohnenGebreeyesus and Mohnen, 20132013). An empirical analysis of the effect ofsocial networks on firm innovation is still missing for Vietnamese manufacturing sector. Giventhe current interest of the Vietnamese government, this could provide new insights for the designand implementation of innovation and entrepreneurship policies for micro and small enterprises.
2.2 Hypotheses
Drawing from the presented literature, we formulate some hypothesis about the role ofparticipating into a social network - intended as the set of individuals the firm is regularlyin contact with and that is useful to perform the business operations - on the likelihood tointroduce a new process or an improved product11. We refer to Gebreeyesus and MohnenGebreeyesus and Mohnen (20132013)and explicitly formulate our main hypothesis about the role of networks, which we argue tobe ‘the main channel of knowledge flows’ for micro and small firms in developing countries, asfollows:
Hypothesis 1: The larger the network a firms in participating in, the larger is theprobability that the firm introduces and innovation.
To improve our understanding of the role of social networks for innovation, we furthercomplement the scope of the analysis of our first hypothesis with some additional hypothesesabout the ‘quality’ of the network. Different types of contacts may carry different sets ofknowledge and to convey different information. Hence, by contributing to not only to expandbut also to diversify the knowledge base of a firm, the participation in a more diversifiednetwork may positively affect the ability to innovate. This is expressed as follows:
Hypothesis 2: Firms with a more diversified network are more likely to get in contactwith different forms of knowledge, and thus become more likely to implement an innovation,controlling for the size of their network.
Another way to proxy for ‘network quality’ is to consider who is actually part of thenetwork, and whether some of the individuals a firm is connected to can actually be more usefuland helpful for firm business operations. In this respect, being member of a business associationhas been used in the literature to proxy for participation in ‘higher quality’ networks (Howard2016), following the argument that the size of the network may matter less in terms of firmperformance if the ‘quality of the contacts’ is larger. It follows that:
Hypothesis 3: Being part of a business association implies being connected with moreuseful contacts for firms’ business operation, thus it increases the probability that a firmengages in innovative activities than those are not in a business association.
1 See Table 11 for a more precise definition and description of the used variables.
4
3 Data and summary statistics
3.1 Data
The data used in the present study come from the Small and Medium Scale ManufacturingEnterprise (SME) survey conducted since 2005 every second year among Vietnamese enterprisesoperating in the manufacturing sector22. The survey was conducted in 10 provinces: Ho ChiMinh City (HCMC), Hanoi, Hai Phong, Long An, Ha Tay, Quang Nam, Phu Tho, Nghe An,Khanh Hoa, and Lam Dong. The random sample was stratified by ownership type to include:household establishments, private enterprises, collectives or cooperatives, and limited liabilityand joint stock companies.The current sample is a direct continuation of the sample obtained in 2005: apart from theenterprises interviewed in 2005 that still operate, the sample contains enterprises added toreplace those that in the meantime have stopped operating. It originally included only firmsactive in manufacturing sectors and with less than 300 employees; however, due to changes ofsector over time, it may be possible that some firms still appear in the list of manufacturingfirms even though they in fact moved to services, or that some firms have grown larger than themaximum threshold of 300 employees. Firms that operate in agriculture or mining, and withthe participation of foreign or state capital are excluded from the dataset.The questionnaire includes a rich set of information on enterprise characteristics and practices,such as number and structure of workforce, technology and innovation, owner characteristics,together with unique information on firm’s social network defined as the number of contacts:in the same sector, in a different sector, among bank officials, politicians and civil servants,women, as well as the number of costumers and suppliers.For the presented empirical analysis we use a database obtained by merging the five waves of thesurvey collected between 2007 and 201533, which (after cleaning)44 corresponds to an unbalancedsample of 13,163 firms that have appeared at least in one of the survey waves between 2007and 2015. Among these, we consider the 1,123 firms that participated in all consecutive surveywaves since 2007, and that thus constitute the balanced sample.
3.2 Descriptives
Table 11 reports the definition of the main variables used in the empirical analysis. Table 22presents the basic summary statistics of the main variables used (mean and standard deviation)over different sample compositions (per year, total unbalanced and balanced).The Innovation variable includes process innovation and product improvement. It presents a
2 The survey has been conducted in collaboration between the Central Institute for EconomicManagement (CIEM) of the Ministry of Planning and Investment of Vietnam (MPI), the Instituteof Labor Science and Social Affairs (ILSSA) of the Ministry of Labor, Invalids and Social Affairsof Vietnam (MOLISA), the Development Economics Research Group (DERG) of the University ofCopenhagen and the United Nations University World Institute for Development Economics Research(UNU-WIDER). For more information about the surveys and data, see CIEM and ILSSACIEM and ILSSA (20162016).
3 We were planning to use all six waves available of the MSMEs survey between 2005 and 2015 (with990 firms in the balanced sample, for a total of 5940 observations across six periods). We decidedto drop the first wave because some variables (especially related to innovation and network) present‘far-from-(within)average‘ figures. See Appendix for more details on the construction of some variables.
4 See Appendix A for more detailed on the cleaning process performed on the data to obtain the variablesNetwork and Employment.
5
declining pattern after 2011, dropping from 48 per cent in 2007 to 15 per cent in 201555. Theaverage values for network size are more persistent over time: the average number of contactsacross time falls between a 33 and 39 individual contacts, with an average of 36 in the balancedpanel 66. More than 1/3 of firms have a diversified network - thus containing at least onecontact from the same sector, from a different sector, from politicians or civil servants, or frombank officials -, while around 11-10 per cent of firms are member of a business association.Looking at general firm characteristics, the average firm size (based on employment) declinesover time from an average of almost 15 (2007) to 12 (2015) regular employees. Given the smallaverage firm size, and considering that the large majority of firms in the considered samplecan be classified as micro and small firms77, it is thus not surprising that between 80 and 77per cent of firms fall in the category of household enterprises. The average firm age raises overtime from 14 to 20 years. The observed increase in the average firm age over time is mainlydue to the fact that we are considering firms in the balanced panel; however, the last columnsin Table 22 show that the average firm age does not change much between the balanced andthe unbalanced sample. About manager or owner (the respondent) characteristics, the shareof male respondents decreases from almost 70 to 60 per cent between 2007 and 2015, while theaverage age is between 44.7 and 46.5, being larger in the last waves (again, this may be due tothe panel structure of the sample)88.Looking at the variables used as covariates in the network equation (first stage), it is interestingto note the rapid increase in Internet access during the last decade, which reached 36 per centof considered firms in 2015. The access to infrastructure, such as road, rail or port, is largeand has been also increasing over the considered time period. An increasing number of firmsdeclare to conduct individual price negotiation with their costumers, while the share of firmsthat use personal contacts as main criteria to select suppliers is not constant over time andoscillates between 26 and 43 per cent.
5 We do not have enough elements to explain the change in innovation outcomes. We can, however,hypothesize two factors: the international 2008 crisis, and a data collection issue (e.g. a change in howthe questions related to innovation are asked.
6 Looking at the composition of the contacts categories that form this variable, it can be added thatin all considered waves the number of ‘contacts in different sectors‘ represent the largest share of thetotal number of contacts, with an average value between 19 and 23 individuals. This average valueis more than double than the average of number of contacts in the category ‘contacts in the samesector‘, which remains between 7 and 9. The average number of contacts among bank officials andpoliticians is in general lower (between 1 and 1.45 per cent for bank officials, and between 1.5 and 1.9for politicians), while the contacts in the residual category ‘others’ increase other time (from 3.3 to6.2) but it still lower than ‘contacts in same sector’.
7 Following the World Bank definition of micro, small, and medium enterprises based on the number ofemployees
8 It has to be noticed that, however, there is no guarantee that the respondent is always the same person.By looking at the data, some firms seem not to have the same manager across the whole time period,and thus individual-related factors cannot be fully considered as time-invariant variables.
6
Tab
le1:
Var
iab
les
defi
nit
ion
s
Var
iab
leD
escr
ipti
on
Inn
ovat
ion
=1
iffi
rmh
asp
erfo
rmed
ap
rod
uct
imp
rove
men
tor
ap
roce
ssin
nov
ati
on
sin
cela
stsu
rvey
.N
etw
ork
Tot
alnu
mb
erof
regu
lar
conta
cts
(at
leas
tev
ery
3m
onth
s)am
ong:
bu
sin
ess
peo
ple
inth
esa
me
sect
or,
bu
sin
ess
peo
ple
ina
diff
eren
tse
ctor
,ban
koffi
cial
s(f
orm
alan
din
form
alcr
edit
ors)
,p
oli
tici
an
san
dci
vil
serv
ants
,oth
ers)
(in
inver
sehyp
erb
olic
sin
etr
ansf
orm
atio
n).
Em
plo
ym
ent
(log
)E
mp
loym
ent,
mea
sure
dby
nu
mb
erof
full
-tim
ew
orke
rsp
lus
the
nu
mb
erof
part
-tim
ew
ork
ers
wei
gh
ed0.6
(in
log.)
.F
irm
age(
log)
Age
offi
rm(i
nlo
g).
Man
ager
/ow
ner
age
(log
)A
geof
the
man
ager
/ow
ner
(in
log)
.M
anag
er/o
wn
erge
nd
er=
1if
the
man
ager
/ow
ner
ism
ale.
Con
tact
sd
iver
sity
=1
iffi
rmh
asat
leas
ton
eco
nta
ctin
ever
yco
nta
ctca
tego
ry(b
usi
nes
sp
eople
inth
esa
me
sect
or,
bu
sin
ess
peo
ple
ina
diff
eren
tse
ctor
,ban
koffi
cial
s,p
olit
icia
ns)
.B
usi
nes
sas
soci
atio
n=
1if
firm
isa
mem
ber
ofa
bu
sin
ess
asso
ciat
ion
.O
wn
ersh
ipty
pe:
Hou
seh
old
=1
iffi
rmis
anh
ouse
hol
dor
pri
vate
sole
pro
pri
etor
ship
ente
rpri
se.
Par
tner
ship
/col
lect
ive/
coop
erat
ive
=1
iffi
rmis
ap
artn
ersh
ip,
coop
erat
ive
orco
llec
tive
ente
rpri
se.
LS
C=
1if
firm
isa
lim
ited
liab
ilit
yco
mp
any
ora
join
tst
ock
com
pan
y.S
ecto
rB
inar
yva
riab
les
corr
esp
ond
ing
toth
e8
sect
ors
(ob
tain
edfr
om4-
dig
itIS
ICR
ev.4
):F
ood
,b
ever
ages
an
dto
bacc
o;
Ap
par
el,
texti
lean
dle
ath
er;
Wood
and
furn
itu
re;
Pap
eran
dp
ulp
;C
hem
ical
an
dru
bb
er;
Non
-met
all
icm
iner
al
pro
duct
s;E
lect
ron
ics,
mot
orve
hic
les
and
other
tran
spor
teq
uip
men
t,b
asic
an
dfa
bri
cate
dm
etals
;O
ther
s(r
ecycl
ing,
etc.
).P
rovin
ceB
inar
yva
riab
les
corr
esp
ond
ing
to10
pro
vin
ces:
Ha
Noi
,H
CM
C,
Hai
Ph
on
g,
Ha
Tay
,L
on
gA
n,
Phu
Th
o,
Qu
an
gN
am
,N
ghe
An
,K
han
hH
oa,
Lam
Don
g.S
up
pid
=1
iffi
rms
iden
tifi
edm
ain
lyth
rou
ghp
erso
nal
conta
cts
orb
usi
nes
sass
oci
ati
ons.
Pri
cen
eg=
1if
firm
has
ind
ivid
ual
pri
cen
egot
iati
onw
ith
cost
um
ers.
Inte
rnet
=1
iffi
rmh
asac
cess
toIn
tern
et.
Infr
astr
uct
ure
=1
iffi
rms
has
acce
ssto
infr
astr
uct
ure
such
asro
ad,
rail
,or
por
t.
Sou
rce:
auth
ors’
elab
orat
ion
bas
edon
CIE
Man
dIL
SS
AC
IEM
and
ILS
SA
(2016
2016).
7
Tab
le2:
Su
mm
ary
stat
isti
cs
2007
2009
2011
2013
2015
Bala
nce
dU
nbala
nce
dA
v.
SD
Av.
SD
Av.
SD
Av.
SD
Av.
SD
Av.
SD
Av.
SD
Inn
ovat
ion
0.48
0.50
0.45
0.50
0.41
0.49
0.22
0.42
0.1
50.3
50.3
40.4
70.3
30.4
7N
etw
ork
34.7
660
.93
36.8
048
.81
33.1
851
.63
36.2
040
.07
38.5
949.3
935.8
850.6
336.1
750.2
5N
etw
ork
(hyp
)3.
830.
824.
020.
723.
840.
764.
010.
694.0
10.7
93.9
40.7
63.9
50.7
7C
onta
cts
div
ersi
ty0.
280.
450.
390.
490.
330.
470.
370.
480.3
60.4
80.3
50.4
80.3
40.4
7B
usi
nes
sA
ssoci
atio
n0.
110.
310.
110.
310.
080.
270.
100.
290.0
90.2
90.1
00.2
90.0
90.2
8E
mp
loym
ent
14.7
527
.30
14.4
328
.37
14.0
328
.80
12.3
824
.45
12.1
625.6
613.5
526.9
814.4
228.7
0E
mp
loym
ent
(log
)1.
911.
121.
871.
121.
811.
121.
761.
081.6
91.1
01.8
11.1
11.8
61.1
3F
irm
age
14.2
610
.75
16.3
411
.87
15.0
09.
0817
.96
9.31
19.9
49.2
916.7
010.3
214.6
910.3
9F
irm
age
(log
)2.
420.
692.
570.
672.
550.
572.
770.
492.9
00.4
32.6
40.6
02.4
60.6
9M
anag
er/o
wn
erag
e45
.15
10.2
246
.52
10.2
647
.30
10.3
748
.53
10.6
949.6
310.9
247.4
210.6
145.8
610.6
9M
anag
er/o
wn
erag
e(l
og)
3.78
0.23
3.81
0.23
3.83
0.23
3.86
0.23
3.8
80.2
33.8
30.2
33.8
00.2
4M
anag
er/o
wn
erge
nder
0.69
0.46
0.67
0.47
0.64
0.48
0.62
0.49
0.6
10.4
90.6
40.4
80.6
30.4
8H
ouse
hol
dan
dpri
vate
sole
pro
pri
etor
ship
0.82
0.39
0.80
0.40
0.79
0.41
0.78
0.42
0.7
70.4
20.7
90.4
10.7
30.4
5P
artn
ersh
ip/C
olle
ctiv
e/C
oop
erat
ive
0.03
0.18
0.03
0.16
0.03
0.16
0.03
0.16
0.0
30.1
70.0
30.1
70.0
30.1
7L
SC
0.15
0.36
0.17
0.38
0.18
0.39
0.19
0.40
0.2
10.4
00.1
80.3
80.2
50.4
3S
up
pid
0.31
0.46
0.26
0.44
0.39
0.49
0.43
0.50
0.3
60.4
80.3
50.4
80.3
50.4
8P
rice
neg
0.13
0.34
0.18
0.38
0.13
0.34
0.17
0.37
0.2
10.4
10.1
60.3
70.1
60.3
6In
tern
et0.
170.
380.
230.
420.
310.
460.
270.
450.3
60.4
80.2
70.4
40.3
10.4
6In
fras
tru
ctu
re0.
820.
390.
830.
380.
820.
380.
870.
330.8
80.3
30.8
40.3
60.8
50.3
6
Ob
serv
atio
ns
1123
5614
13163
Sou
rce:
auth
ors’
elab
orat
ion
bas
edon
CIE
Man
dIL
SS
AC
IEM
and
ILS
SA
(2016
2016).
8
Finally, to preliminary assess the existence of differences in terms of network size for firms thatengage in innovation activity, we compare the distributions of the used network variable acrossinnovative and non-innovative firms. As showed by Figure 11, the distribution of the networkvariables for innovative firms (continuous line) is shifted to the right of the distribution ofnetwork for non-innovative firms (dotted line), thus confirming that innovative firms do tendto have larger networks. This allow us to point out the presence of a possible ‘structuraldifference’ between innovative and non-innovative firms from in terms of their participationinto networks.
Figure 1: Distribution of network size: innovative and non-innovative firms
0.2
.4.6
.8kd
ensi
ty h
_net
_tot
2
0 2 4 6 8x
kdensity h_net_tot2 kdensity h_net_tot2
4 Model and empirical strategy
We formulate the binary choice model for having implemented a technological innovation sincethe last survey as:
INN∗jt = βXjt + εjt (1)
where INN∗jt
is the latent variable underlying the dichotomous response of having implementeda technological innovation, which depends on the vector of explanatory variables Xjt thatcontains measures for network Njt , a set of time-varying (exogenous) variables (Wjt) accountingfor firm and respondent characteristics, and time-invariant controls (Zjt), such as sector andprovince dummies. The relevant empirical literature proposes different measures for network; forexample, the linkages within business network, such as the contacts with suppliers and buyers,have been often used. However, a broader definition of network beyond trade based relationshipsis consistent with the argument that a diversity of contacts may channel a variety of informationand experiences, favoring innovation activity. Hence, following Gebreeyesus and MohnenGebreeyesus and Mohnen (20132013)and Howard2016, we measure ‘the intensity of network’ as the number of individuals the firmshas a regular contact with, defined as the sum of contacts among business people in the same
9
sector, in a different sector, bank officials, politicians and civil servants, and others99 - thus,irrespectively or beyond to the connections through the exchange of goods.We only observe the innovation binary outcome as:
INNprobitjt
=
{1 if INN∗
jt= βXjt + εjt > 0
0 otherwise(2)
where the error term εjt is assumed to be normally distributed (εjt ∼ N(0, 1)). However, itmay contain time-invariant unobservable characteristics (cjt). Endogeneity issues surge whenthese unobservable individual specific effects are correlated with the observed covariates (Xjt)(e.g. if unobservable managerial ability is correlated with other determinants of innovation,such as the number of business contacts). The availability of panel data allows us to addressthis source of endogeneity. With small T the implementation of fixed effect (FE) probit byadding individual dummies would, however, produce inconsistent estimations of the parametersof interest (β) due to the incidental parameters problem (WooldridgeWooldridge, 20102010). This could beavoided by applying a conditional maximum likelihood estimation (CMLE), but this methodcan be implemented for nonlinear models only in special cases, and relies on strong conditionalindependence assumptions (WooldridgeWooldridge, 20102010). A good compromise for the estimation ofparameters in nonlinear panel data models is represented by the correlated random effect (CRE)model (MundlakMundlak, 19781978). This method allows to specify the unobserved effect (cjt) as function ofthe vector of the averages across time periods of the time-variant variables contained in Xjt . Inpractice, it allows for correlation between the unobserved effect and the explanatory variables byadding the within group means of the time-variant covariates in the binary equation 22, using thesignificance of their coefficients as a test for the existence of this correlation (with the implicitassumption that the time-independent variables are uncorrelated with the unobserved effect cjt)(WooldridgeWooldridge, 20102010).By implementing the CRE model, the expression for the latent variable in Equation 11) can bereplaced by:
INN∗jt = β1Njt + β2Wjt + β3Zjt + ξ1Wjt + ξ2Njt + εjt (3)
The CRE model could be estimated using MLE if the assumption of serial independence holds;in fact, MLE returns inconsistent results if this assumption fails. Serial independence maybe a strong assumption in case of repeated observations, where serial correlation is common.To relax this strong assumption, we can estimate the CRE model with a pooled probitmethod, but this would lead to more inefficient estimations. A possible solution to recoverefficiency and relax independence is to use ‘generalized estimating equations’ (GEE), whichis more efficient and as robust as pooled probit when dependence within clusters is assumed(if independence is assumed, the GEE is equivalent to the pooled probit and less efficientthan MLE), by allowing to formulate alternative assumptions about the correlation structure(WooldridgeWooldridge, 20102010). We follow GebreeyesusGebreeyesus (20152015) and estimate the CRE as GEE probit withan ‘exchangeable’ correlation structure (Papke and WooldridgePapke and Wooldridge, 20082008).Another concern comes from the fact that the network variable could be correlated with theerror term. In this case endogeneity raises from simultaneity or from reverse causality betweena regressor and the dependent variable (e.g. here, if firms tend to gain a larger network dueto the fact that they are also more innovative). To avoid biased and inconsistent parameterestimations, we could look for a relevant instrument and implement an IV estimation to correctthis endogeneity problem. As pointed out by Gebreeyesus and MohnenGebreeyesus and Mohnen (20132013), finding a goodinstrument is often challenging, and may also lead to less precise estimations. An alternativeapproach is to implement a control function (CF) method (GebreeyesusGebreeyesus, 20152015). The CF consists
9 The questionnaire asks ‘How many people do you currently (present) have regular contact with?(Contact at least once every 3 months, which you find useful for your business operations)’.
10
of a two-step procedure: first, a reduced form equation is estimated (using OLS) for theendogenous explanatory variable, which in our case is the measure for network; second, theresiduals of the reduced form equation are used as additional regressor in the main structuralequation (here, the probability equation for innovation), together with the original endogenousvariable. This sequential estimation assumes causality to run from network to innovation, andno eventual feedback loops from innovation to network are considered. In practice, we followGebreeyesusGebreeyesus (20152015) and Papke and WooldridgePapke and Wooldridge (20082008) in addressing both endogeneity sourcesof individual unobserved hetereogeneity and of reverse causality by combining the CRE and theCF approaches in the presence of a continuous endogenous explanatory variable1010. The reducedform equation for network and the structural equation for innovation can then be formulatedas follows:
Njt = γ1Mjt + γ2Wjt + γ3Zjt + η1Mjt + η2Wjt + υ1jt (4)
INN∗jt = β1Njt + β2Wjt + β3Zjt + ξ1Wjt + υ̂1jt + υ2jt (5)
where in reduced form Equation 44 Mjt denotes a set of variables that are exogenous to themodel (thus, uncorrelated with the error term in the structural equation, and that can helpidentification), which in our case includes: a dummy that takes the value of 1 if the firmidentifies suppliers mainly via personal contact (Howard 2016), a dummy for having individualprice negotiation with costumer (Gebreeyesus and MohnenGebreeyesus and Mohnen, 20132013), and two dummies related tothe availability of access to information (Internet and Infrastructure)1111. In Equation 55 υ̂1jt isthe residual from the reduced form equation, while υ2jt is the new idiosyncratic error term1212.In both equations, variables with bar are group means of time-varying variables to allow themto be correlated with the firm unobserved heterogeneous characteristics.
5 Results
Table 33 shows the estimation results (probit coefficients) for the innovation equation forthe balanced sample1313. The results of the pooled probit and of the CRE regression model(estimated with MLE and GEE) are reported as benchmark in columns (1), (2) and (3).These methodologies give qualitatively similar results, providing a positive and significantcoefficient for the network variable; the sign and significance of the other covariates is alsorather consistent across the models. However, it has to be noticed that the magnitude ofthe network coefficient decreases in the CRE or Mundlak-augmented regression models. Thisshows that, when individual heterogeneity is properly accounted for, the effect of network issmaller. This is not surprising: the significance of the average of the network variable revealsthe presence of significant correlation between the main independent variable and individualunobservable effects.Columns (4) and (5) present the results of the estimations of the complete CRE-CF model(with MLE and GEE with boostrapped standard errors). We again find evidence of a positive
10 As noted by GebreeyesusGebreeyesus (20152015), because of the two-step procedure, we apply bootstrapped standarderrors (standard 50 replications) when estimating the second stage (or structural equation forinnovation).
11 As test for the exogeneity of these variables, we added these variables to the specification of theprobability equation for innovation and they do not turn out to be significant.
12 Note that now we remove from the structural equation the group mean of the endogenous variable(Njt).
13 The results of the first stage equation are reported in Appendix B.
11
and significant effect of network on innovation, being the coefficient obtained with the GEEmethods slightly smaller but of higher significance level. The coefficient of the predictedresidual from the first stage is negative and statistically significant, confirming the endogeneityof the network variable to the model. Thus, the result of both columns (4) and (5) allow us toconfirm our first hypothesis: the larger the network, the more likely is that the firm will engagein innovation activities. The effect of network obtained when endogeneity is accounted for isnow much larger than in the original models.About the other covariates, we effect of firm size (as number of regular employees) on thelikelihood to have introduced an innovation is positive, while we do not find evidence of asignificant effect of firm age once endogeneity is properly accounted for (the coefficient of agewas significant in the first models). Interestingly, household enterprises are more likely tointroduce an improved product or a new process than cooperatives and joint stock companiesonce we control for the size of network. Individual characteristics of the entrepreneur (age andgender) seem not to matter for innovation, on average and everything else being equal.Columns (6) and (7) in Table 33 show the results for the CRE-CF model (GEE with boostrap)when other network-related variables are added to the original specification for the innovationequation (the first stage equation remains the same). To test Hypothesis 2, we add a dummyvariable for network diversity (column (6)). We find that hving a more diversified networkpositively affects the likelihood to innovate; at the same time, the coefficient of the mainnetwork variable is still positive and significant, even if of a smaller magnitude with respect towhat obtained with the CRE-CF where diversity of network is not accounted for (column (5)).Column (7) shows the estimations when a dummy variable for being member of a businessassociation is also included, consistently with Hypothesis 3. Being a member of a businessassociation has a large positive coefficient on innovation. Despite this large effect, the effect ofnetwork size is still significant and positive, even if now it presents a smaller magnitude and itis significant at 10 per cent.
12
Tab
le3:
Inn
ovat
ion
and
net
wor
k:
mai
nes
tim
atio
nre
sult
s
Dep
enden
tvariable:innovation
(1)
(2)
(3)
(4)
(5)
(6)
(7)
Pooled
CRE
probit
CRE
probit
CRE-C
FCRE-C
FCRE-C
FCRE-C
Fprobit
(MLE)
(GEE)
(MLE)
(GEE)
(GEE)
(GEE)
Network
0.114***
0.072**
0.069**
0.678**
0.652***
0.578*
0.512*
(0.026)
(0.032)
(0.030)
(0.302)
(0.250)
(0.333)
(0.280)
Employmen
t(log)
0.323***
0.233***
0.225***
0.142**
0.137**
0.140*
0.145***
(0.025)
(0.051)
(0.050)
(0.066)
(0.056)
(0.077)
(0.050)
Firm
age(log)
-0.082**
-0.080
-0.076
-0.057
-0.054
-0.056
-0.062
(0.039)
(0.058)
(0.052)
(0.059)
(0.052)
(0.046)
(0.059)
Manager/owner
age(log)
-0.425***
-0.268*
-0.257*
-0.214
-0.205
-0.197
-0.206
(0.091)
(0.153)
(0.141)
(0.156)
(0.156)
(0.160)
(0.154)
Manager/owner
gen
der
0.110**
0.109*
0.104*
0.102
0.098
0.104
0.101
(0.045)
(0.066)
(0.063)
(0.066)
(0.065)
(0.066)
(0.069)
Partnersh
ip/Collective/
Cooperative
-0.325**
-0.357***
-0.340**
-0.373***
-0.353**
-0.364***
-0.523***
(0.150)
(0.133)
(0.151)
(0.134)
(0.141)
(0.124)
(0.154)
LSC
-0.098
-0.149**
-0.140*
-0.227**
-0.216***
-0.215**
-0.214**
(0.069)
(0.074)
(0.073)
(0.089)
(0.074)
(0.104)
(0.084)
Employmen
t(log)(avg.)
0.110*
0.102*
0.127**
0.119**
0.077
0.030
(0.059)
(0.057)
(0.059)
(0.058)
(0.064)
(0.044)
Firm
age(log)(avg.)
0.008
0.009
-0.004
-0.002
0.009
0.002
(0.080)
(0.077)
(0.080)
(0.074)
(0.074)
(0.078)
Manager/owner
age(log)(avg.)
-0.253
-0.239
-0.279
-0.264
-0.234
-0.273
(0.198)
(0.188)
(0.198)
(0.224)
(0.191)
(0.204)
Manager/owner
gen
der
(avg.)
0.011
0.008
0.007
0.005
0.008
0.021
(0.091)
(0.088)
(0.091)
(0.097)
(0.079)
(0.087)
Network
(avg.)
0.152**
0.148**
(0.065)
(0.062)
Contactsdiversity
0.149***
0.141***
(0.045)
(0.052)
Contactsdiversity
(avg.)
0.283**
0.291***
(0.110)
(0.095)
BusinessAssociation
0.254***
(0.089)
BusinessAssociation(avg.)
0.278*
(0.152)
Residuals
(network)
-0.575*
-0.553**
-0.506
-0.443
(0.303)
(0.251)
(0.335)
(0.278)
Constant
0.362
0.192
0.165
-1.648
-1.602
-1.515
-0.958
(0.363)
(0.530)
(0.503)
(1.352)
(1.148)
(1.498)
(1.305)
Sector
Yes
Yes
Yes
Yes
Yes
Yes
Yes
Province
Yes
Yes
Yes
Yes
Yes
Yes
Yes
Tim
eYes
Yes
Yes
Yes
Yes
Yes
Yes
lnsig2u,co
nstant
-2.326***
-2.324***
(0.254)
(0.255)
Obs.
5551
5551
5551
5518
5518
5518
5518
Firms
1123
1123
1123
1123
1123
1123
1123
dfm
27
32
32
32
29
33
34
Sou
rce:
auth
ors’
elab
orat
ion
bas
edon
CIE
Man
dIL
SS
AC
IEM
and
ILS
SA
(2016
2016).
13
6 Discussion and concluding remarks
Using 5 waves (2007-2015) of a panel survey on Vietnamese MSMEs in the manufacturingsector, we investigate the effect of network participation on firms’ innovation activity. Proxingnetwork with the number of regular contacts, we find evidence that the size of networks has apositive and significant effect on the probability of a firm to engage in innovation activities.This result is consistent across different methods accounting for different source of endogeneity,such as the implementation of CRE (MundlakMundlak, 19781978) together with the control function (CF)method.We also find evidence that the ‘quality’ of the network also matters for innovation. In factour findings about the effect of network are robust to the addition of other network-relatedvariables, such as the dummies for being member of a business association and having amore diverse network of contacts. Both these variables are found to be important in affectinginnovation, but their addition to the model does not wash away the effect of the size of network,which continues to be positive and significant even in more sophisticated models.Besides expanding our understanding of the determinants of technological innovation outputs(product and process) among Vietnamese SMEs by providing empirical evidence in supportof the argument that the participation in networks plays a role in explaining innovationheterogeneity, this work also contributes to address the existing gap in the empirical literatureon firm in developing countries, which has typically overlooked the analysis relationship betweennetwork and innovation in favor of the study of network and firm performance. Furthermore,this works provides some initial insight to the re-thinking of policies aimed at fosteringinnovation among micro and small firms operating in developing or emerging economies. Amore specific analysis on the relevance of network connections versus geographical proximitywould be necessary, as well as a deeper investigation on how and which features of networksare more relevant for firm-level innovation activities.
14
References
Asheim, B. T. and Gertler, M. S. (2005). The geography of innovation: regional innovationsystems, pages 291–317. Oxford University Press., Oxford.
Barr, A. (2000). Social capital and technical information flows in the ghanaian manufacturingsector. Oxford Economic Papers, 52(3):539–559.
Boschma, R. (2005). Proximity and innovation: a critical assessment. Regional studies, 39(1):61–74.
Boschma, R. A. and Ter Wal, A. L. (2007). Knowledge networks and innovative performancein an industrial district: the case of a footwear district in the south of italy. Industry andInnovation, 14(2):177–199.
Chudnovsky, D., Lpez, A., and Pupato, G. (2006). Innovation and productivity in developingcountries: A study of argentine manufacturing firms behavior (19922001). Research Policy,35(2):266–288.
CIEM, D. and ILSSA (2016). Characteristics of the vietnamese business environment. evidencefrom a sme survey in 2013. Technical report, CIEM.
Cohen, W. M. and Levinthal, D. A. (1990). Absorptive capacity: A new perspective on learningand innovation. Administrative Science Quarterly, 35(1):128–152.
Crepon, B., Duguet, E., and Mairessec, J. (1998). Research, innovation and productivity:An econometric analysis at the firm level. Economics of Innovation and New Technology,7(2):115–158.
Fafchamps, M. (2000). Ethnicity and credit in african manufacturing. Journal of Developmenteconomics, 61(1):205–235.
Fafchamps, M. and Minten, B. (2002). Returns to social network capital among traders. Oxfordeconomic papers, 54(2):173–206.
Gebreeyesus, M. (2011). Innovation and microenterprises growth in Ethiopia, pages 122–145.Oxford University Press.
Gebreeyesus, M. (2015). Firm adoption of international standards: evidence from the ethiopianfloriculture sector. Agricultural Economics, 46(S1):139–155.
Gebreeyesus, M. and Mohnen, P. (2013). Innovation performance and embeddedness innetworks: Evidence from the ethiopian footwear cluster. World Development, 41:302–316.
Giuliani, E. and Bell, M. (2005). The micro-determinants of meso-level learning and innovation:evidence from a chilean wine cluster. Research policy, 34(1):47–68.
Goedhuys, M., Janz, N., and Mohnen, P. (2014). Knowledge-based productivity in low-techindustries: evidence from firms in developing countries. Industrial and Corporate Change,23(1):1–23.
Hall, B. H. (2011). Innovation and productivity. Technical report, National bureau of economicresearch.
Hall, B. H., Lotti, F., and Mairesse, J. (2009). Innovation and productivity in smes: empiricalevidence for italy. Small Business Economics, 33(1):13–33.
15
Howard, E. (2017). Social networks, geographic proximity, and firm performance in viet nam.Technical report, World Institute for Development Economic Research (UNU-WIDER).
Howard, E., Newman, C., and Tarp, F. (2015). Measuring industry coagglomeration andidentifying the driving forces. Journal of Economic Geography, 16(5):1055–1078.
Lall, S. (1992). Technological capabilities and industrialization. World Development, 20(2):165–186.
Mairesse, J. and Mohnen, P. (2010). Chapter 26 - Using Innovation Surveys for EconometricAnalysis, volume Volume 2, pages 1129–1155. North-Holland.
Markusen, A. (1996). Sticky places in slippery space: a typology of industrial districts. Economicgeography, 72(3):293–313.
Mundlak, Y. (1978). On the pooling of time series and cross section data. Econometrica: journalof the Econometric Society, pages 69–85.
Nelson, R. R. and Winter, s. (1982). An evolutionary theory of economic change. Cambridge,MA: Harvard University Press.
Nichter, S. and Goldmark, L. (2009). Small firm growth in developing countries. WorldDevelopment, 37(9):1453–1464.
Papke, L. E. and Wooldridge, J. M. (2008). Panel data methods for fractional response variableswith an application to test pass rates. Journal of Econometrics, 145(1):121–133.
Tybout, J. R. (2000). Manufacturing firms in developing countries: How well do they do, andwhy? Journal of Economic Literature, 38(1):11–44.
Wooldridge, J. M. (2010). Correlated random effects models with unbalanced panels.
16
7 Appendix
7.1 Appendix A
Notes about the construction of variables
Network has been obtained as the total number of contacts in different contact categories(business people in the same sector, business people in a different sector, bank officials (formaland informal creditors), politicians and civil servants, others). The variable has been cleanedby eliminating extremely large values (e.g. we kept values around 1000 as maximum reasonablevalue in largest contact category, such as business people in the same or in a different sector).There is only one observation with network size corresponding to zero, which is left in thesample. The loss of observations due to cleaning is minimal, since this implied the replacementwith missing values of just 1 or 2 observations for every wave (even less, considering that mostof the ‘atypical‘ values were not in the balanced panel). The values of the network variable aretransformed with a inverse hyperbolic sine transformation.
Employment is calculated as the number of full-time regular employees, adding thepart-time regular employees with a weight of 0.6. We are not considering any form of casualemployment. In most periods this regular employment represents the 90 per cent of totalemployment (with the exception of 2009, where it was only 80 per cent due to a sharp decreaseof regular employment in that year, while a more stable patter for the casual one). To obtaina more homogeneous sample of micro and small entrepreneurs, we eliminate (replace withmissing) the observations with less than one regular employment (around 500 observationsin the whole unbalanced panel).The value of the regular employment for the single-employeevenues will turn to zero when we take the logarithm of regular employment1414. (The samehappens with the log of total employment).For this same purpose, we also eliminate large firms (with more than 300 regular employees)when these are: a) present only in the unbalanced panel (thus, they do not appear in everyperiod); b) present in the balanced panel and their large size is clearly the result of a MA (e.g.the firm went up from 20 to 400 employees in the consecutive period). The firms that are in theconsidered balanced panel and those growth into a large dimension is due to an entrepreneurialgrowth process were left in the dataset, these corresponding to only two firms. Moreover,we replace atypical values and extremely large (e.g. for revenues and number of contacts,especially, which could be entry errors) with missing, and we replace revenues those value iszero with one, in order to be able to convert into log values without loosing the correspondingindividual.
14 An alternative is to replace the value of 1 with 1.1, not to obtain a zero with the log.
17
7.2 Appendix B
Network equation: first stage
Table 44 shows the results of the reduced form for the network equation, whose estimatedresiduals are then used as an additional regressor in the the structural equation for innovation.
Table 4: Network equation: first stage
Dependent variable: network (1) (2) (3)Pooled CRE probit CRE probitprobit (MLE) (GEE)
Supplid 0.059*** 0.043** 0.043**(0.020) (0.022) (0.022)
Priceneg 0.002 0.059** 0.059**(0.027) (0.029) (0.029)
Internet 0.122*** 0.103*** 0.103***(0.035) (0.038) (0.038)
Infrastructure 0.128*** 0.122*** 0.122***(0.030) (0.039) (0.039)
Employment (log) 0.137*** 0.068** 0.068**(0.016) (0.028) (0.028)
Firm age (log -0.032 0.030 0.030(0.022) (0.032) (0.032)
Manager/owner age (log) -0.078 0.023 0.024(0.049) (0.071) (0.071)
Manager/owner gender 0.015 0.036 0.036(0.022) (0.030) (0.030)
Partnership/Collective/Cooperative -0.027 -0.032 -0.032(0.069) (0.070) (0.070)
LSC 0.130*** 0.096** 0.097**(0.045) (0.047) (0.047)
Suppid (avg.) 0.082 0.083(0.056) (0.056)
Priceneg (avg.) -0.284*** -0.284***(0.077) (0.077)
Internet (avg.) 0.024 0.024(0.069) (0.069)
Infrastructure (avg.) -0.009 -0.010(0.063) (0.063)
Employment (avg.) 0.085** 0.085***(0.033) (0.033)
Firm age (avg.) -0.090** -0.090**(0.042) (0.042)
Manager/owner age (avg.) -0.115 -0.115(0.094) (0.094)
Manager/owner gender (avg.) -0.041 -0.041(0.045) (0.044)
Constant 4.104*** 4.258*** 4.258***(0.189) (0.239) (0.238)
Sector Yes Yes YesProvince Yes Yes YesTime Yes Yes Yes
Obs. 5518 5518 5518Firms 1123 1123df m 30 38 38r2 a 0.13
Source: authors’elaboration based on CIEM and ILSSACIEM and ILSSA (20162016).
18