demographic segmentation of long distance behavior : data ......facultyworkingpapers...

56

Upload: others

Post on 07-Jul-2020

2 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Demographic segmentation of long distance behavior : data ......FacultyWorkingPapers DEMOGRAPHICSEGMENTATIONOFLONG DISTANCEBEHAVIOR:DATAANALYSIS &INDUCTIVEMODELBUILDING JagdishN.Sheth
Page 2: Demographic segmentation of long distance behavior : data ......FacultyWorkingPapers DEMOGRAPHICSEGMENTATIONOFLONG DISTANCEBEHAVIOR:DATAANALYSIS &INDUCTIVEMODELBUILDING JagdishN.Sheth

BOOKS!A',-1^'^^

Page 3: Demographic segmentation of long distance behavior : data ......FacultyWorkingPapers DEMOGRAPHICSEGMENTATIONOFLONG DISTANCEBEHAVIOR:DATAANALYSIS &INDUCTIVEMODELBUILDING JagdishN.Sheth

Digitized by the Internet Archive

in 2011 with funding from

University of Illinois Urbana-Champaign

http://www.archive.org/details/demographicsegme79shet

Page 4: Demographic segmentation of long distance behavior : data ......FacultyWorkingPapers DEMOGRAPHICSEGMENTATIONOFLONG DISTANCEBEHAVIOR:DATAANALYSIS &INDUCTIVEMODELBUILDING JagdishN.Sheth
Page 5: Demographic segmentation of long distance behavior : data ......FacultyWorkingPapers DEMOGRAPHICSEGMENTATIONOFLONG DISTANCEBEHAVIOR:DATAANALYSIS &INDUCTIVEMODELBUILDING JagdishN.Sheth

Faculty Working Papers

DEMOGRAPHIC SEGMENTATION OF LONG

DISTANCE BEHAVIOR: DATA ANALYSIS& INDUCTIVE MODEL BUILDING

Jagdish N. Shethand

A, Marvin Roscoe, Jr.

#79

College of Commerce and Business Administration

University of Illinois at Urbana-Champaign

Page 6: Demographic segmentation of long distance behavior : data ......FacultyWorkingPapers DEMOGRAPHICSEGMENTATIONOFLONG DISTANCEBEHAVIOR:DATAANALYSIS &INDUCTIVEMODELBUILDING JagdishN.Sheth
Page 7: Demographic segmentation of long distance behavior : data ......FacultyWorkingPapers DEMOGRAPHICSEGMENTATIONOFLONG DISTANCEBEHAVIOR:DATAANALYSIS &INDUCTIVEMODELBUILDING JagdishN.Sheth

FACULTY WORKING PAPERS

College of Commerce and Business Administration

University of Illinois at Urbana -Champaign

December 13, 1972

DEMOGRAPHIC SEGMENTATION OF LONGDISTANCE BEHAVIOR: DATA ANALYSIS

& INDUCTIVE MODEL BUILDING

Jagdish N. Shethand

A, Marvin Roscoe, Jr.

#79

Page 8: Demographic segmentation of long distance behavior : data ......FacultyWorkingPapers DEMOGRAPHICSEGMENTATIONOFLONG DISTANCEBEHAVIOR:DATAANALYSIS &INDUCTIVEMODELBUILDING JagdishN.Sheth
Page 9: Demographic segmentation of long distance behavior : data ......FacultyWorkingPapers DEMOGRAPHICSEGMENTATIONOFLONG DISTANCEBEHAVIOR:DATAANALYSIS &INDUCTIVEMODELBUILDING JagdishN.Sheth

DEMOGRAPHIC SEfflffiNTATION OF LONG DISTANCE BEHAVIOR:

DATA ANALYSIS & INDUCTIVE MODEL BUILDINGl

A, Msxvin Roscoe, Jr.

2

American Telftphonf* & Telegi-aph Company

and

Jagdish N. ShethUniversity of Illinois

The objective of the study vas to explore the appropriateness of severalstatistical techniques in developing predictive models of consujiaers' longdistance telephone expenditux-e based on the analysis of socioeconomic anddemographic characteristics. Specifically, the paper examines the relativeefficacy of stepwise multiple regression, monotonic A.ID (Automatic InteractionDetector) and free AID in analyzing large scale data.

Description of MRTS Data Bank.

Most consumer behavior research to date has been ad hoc, fragmentary,and exploratory. Only irecently have large corporations begun to generate con-tinoxis and systematic infonnation about the market place as part of theirmarketing information systems. The Bell System Companies provide comm\ini cationservices in the hB continentaJ. states and the District of Columbia to lOUmillion telephones, 83/^ of the total telephones in the United States. Theneed to understand this er-onaous mai-ket is self-evident, not only from atraditional marketing view but also from the social and economic consider-ations inherent in the laarageTnent of a regulated utility. To help meet this

need, a latrge scale Market Research Information System (MRIS) has beenestablished, consisting oi a national longitudinal panel of some 60,000customers representing both the business and residence markets.

MRIS panel members vere selected by multistage stratified sampling pro-cedures from the customer files of each .f the one hiindred accounting officesof the Bell System where customer billing i.J performed. A panel of 600 customers,evenly divided between business and residence customers , was selected fromeach of these accounting offices. Cttrreutlir, the MRIS data bejik contains morethan 126 million card image record^s, and is growing at the rate of 3 1/2million records each month. The I'lRIS panel excludes certain types of accounts,such as the U.S. Government, which are handled sepeirately from a communicationsview point, and certain specialized types of communications services such as

private line and data services

,

For each panel member, the MRIS data bank stores the following information:

1. A basic equipment record consisting of service and eqxiipment data,

such as the number and type of telephone lines , number of extensionphones and other vertical (optional; services including Princess*and Trimline**^ phones, Touch-Tone* service and additional Directorylistings . These data are updated whenever psuiel menibers change

their service or equipment

.

•/Registered trademark of the Bell System.

Page 10: Demographic segmentation of long distance behavior : data ......FacultyWorkingPapers DEMOGRAPHICSEGMENTATIONOFLONG DISTANCEBEHAVIOR:DATAANALYSIS &INDUCTIVEMODELBUILDING JagdishN.Sheth
Page 11: Demographic segmentation of long distance behavior : data ......FacultyWorkingPapers DEMOGRAPHICSEGMENTATIONOFLONG DISTANCEBEHAVIOR:DATAANALYSIS &INDUCTIVEMODELBUILDING JagdishN.Sheth

2, A billing amount record listing charges for local service,additional, message units (where applicable), a sioniKiary oflong distance billing, taxes and other charf^es or credits

as ^hoTTO on tne customer's bill. This record is expandedevery month.

3. A long dl^-tance record listiiTig liljing details for each meseagefound on the custcicer ' £ bil^J-ng. Ktaieneiit, sv-cb as the date andtime of the call, type of cell (direct dial or operator handled),length of coTivei'sabioii and amount of charge. This record is alsoexpended each month.

h. A demographic record cont^^intn^ a socioeconomic and demographic pro-fil"^ of the residence custoner's bov.sfchold unit. These data have beenobtained from a raaii questionnaire, and include age, sex, educationand occupation of the head o? hcuEieholdj familj^ size and composition,its mobility characteristics tuid family income. The residence profileis updated every three yeax's , ejid consideration is presently beinggiven to collectin'^ additional information on the residence customer'sfundamental value system as veil .xs his generalized a titudes towardthe telephone as a means of communication.

To be able to ccmprehend this enormous amount of information at themicrolevel of the individual customer, basic research is underway to developanalytic strategies in the interest of building predictive micro and macromodels of Duyer behavior for the telephone industry.

TJ-iis paper represents one of our projects designed to develop an xmder-standing of the loiig distance telephone behavior of the residence customer. Tlie

relationship of socioeconomic and demographic factors to long distance behavioris especiallj'- import,aiat to iusure that the rate structures filed by theTelephone Ccmpaiiies and approved by ths regulatory agencies are equitable tothe varioxis socioeconomic castomer segments in the cotintryS.

In order to ex:aiine the char2,ctaristics of the data and the associated pro-blems in their analysis, the stuciy wt;s limitsd to the 793 panel customersfrom two RorthQEstern states . 'fhe two groups of customers were chosen on thebasis of comparability of long distancp expeiidi.ture ,. providing a scinple sizethat woiild not unduly favor ons psxtic^xlar analytic technique- The focus ofthis paper is, hovevei-, data analysiri pncl not inference, although decisionand predictive modelit i-we being bullr, based en the find.lngs from this type ofdata analysis utilizing larger xa.t more generalised samples of t-ie population.

Table 1 lists the fourteen so^ieconoioic and demographic variables and thedependent variable, long distance expenditure-, with their meanu and standarddeviations. To avoid the effects of neasoaality said holidays, the long distancebehavior variable was based cii the monthly- average of a year's history for eachresidence customer, expressed in dollars and cents. Tlie dollar signs have beenomitted for the sake of simplij.city. Among the fourteen socioeconomic saddemographic variables, two index "-ariables have been included j the socioeconomicstatus (SES) index and the Life Cycle index. The 3ES in-^ex in a score developedfrom a composite of th« education and occupation of the head of household endfamily income level using the procedures of the U.S. Sureau of the Csnsus (1963).The Life Cycle index is determined from the age and maritpJL status of the headof household and fanily coapositionj" following the proc^dtyres used by the SurveyResearch Center at the University of Michigan (Lansing & Kish, 195T).

Page 12: Demographic segmentation of long distance behavior : data ......FacultyWorkingPapers DEMOGRAPHICSEGMENTATIONOFLONG DISTANCEBEHAVIOR:DATAANALYSIS &INDUCTIVEMODELBUILDING JagdishN.Sheth
Page 13: Demographic segmentation of long distance behavior : data ......FacultyWorkingPapers DEMOGRAPHICSEGMENTATIONOFLONG DISTANCEBEHAVIOR:DATAANALYSIS &INDUCTIVEMODELBUILDING JagdishN.Sheth

TAJBLE 1

List of Variables

StandardNumber Description Mean Deviation

1 Socio-Econoniic Status 6.385 2.0792 Own/Rent 1.267 0.I+U3

3 Type of Residence 1.50^ 0,803k No. of Floors 1.90a 0.7905 No. of Rooms 5.971*

,I.7U7

6 Length of Residence U.165 1.662

7 Ho. of Moves (in past 5 yrs.) i.Uo 0.7888 Sex of H. H. 1.187 0.3909 Age of H. H. 5.009 1.383

10 Occupation of H. H. 6.095 2.32211 Education of H. R. 1^.276 1.78912 Family Income it. 511 1.80113 Family- Size 3.279 1.629ll* Life Cycle l^.76U 1.58515 Long Distance

Expenditure (average month) 7.219 10.376

Problems of Data .tealysis and Alternative Statistical Approaches

In Figure 1, long distance expenditure is plotted for each socioeconomicand demographic variable. The variables have been grouped in four categoriesderived from a prior factor -analysis of these data. Examining the plotsclearly points out the followinfr dpta problems t;,T)icaJ. of most siirvey research(Morgan & Sonquist. 1963; Carman, 1967', Sonquist^ 1970):

1. All of the deraosraphic variables are discrete rather thsun

continuoos , a3.though r;ary ot' theni c.o have ruccessive classintervals containing larg» aunibeis of observations.

2. The variables have a irdxture of scales cons^isting of nominaland interval-scaled dats-.

3. The relationship of long distance e:cpendl.ture with many ofthe demographic variables is not linear.

h. In some cases, the relationship is not even monotonic.

5. The demographic variables may be related to long distance tele-phone behavior in an interactive manner rather thsua in a simpleadditive mamier.

6. The demographic variables tend to be correlated with one another,which may be a serious problem when using regression analysis(Blalock, 1963). Table 2 s-ummarizes some of the highlymsiltlcolliaear variables.

Page 14: Demographic segmentation of long distance behavior : data ......FacultyWorkingPapers DEMOGRAPHICSEGMENTATIONOFLONG DISTANCEBEHAVIOR:DATAANALYSIS &INDUCTIVEMODELBUILDING JagdishN.Sheth
Page 15: Demographic segmentation of long distance behavior : data ......FacultyWorkingPapers DEMOGRAPHICSEGMENTATIONOFLONG DISTANCEBEHAVIOR:DATAANALYSIS &INDUCTIVEMODELBUILDING JagdishN.Sheth

Figure 1

Average Long Distance ExpendlBy Demographic Category

HOUSIIMO/FAMILY —1S.0O

Type of Residence No. of FloorsISOOj

52.50 f-

LeRgllt of R^-std©Bce Mo. of Moves in L^st 5 Years

r NumlHr ot Tmh in Euh CsMgcryAll Cmtomtr Avsragi -^-i—•—

Page 16: Demographic segmentation of long distance behavior : data ......FacultyWorkingPapers DEMOGRAPHICSEGMENTATIONOFLONG DISTANCEBEHAVIOR:DATAANALYSIS &INDUCTIVEMODELBUILDING JagdishN.Sheth
Page 17: Demographic segmentation of long distance behavior : data ......FacultyWorkingPapers DEMOGRAPHICSEGMENTATIONOFLONG DISTANCEBEHAVIOR:DATAANALYSIS &INDUCTIVEMODELBUILDING JagdishN.Sheth

Figurel (oontU)

Average Long Distance ExpenditureBy Demographic Category

SOCIO-ECOIMOMICFamily Sncome Education (Head of Household)

iSO -

Lta thin 3,000 5,003 7,500 10,003 S.OOO >0,000 3Q,C"Q Sim* 6.S. Sonre H.S. Soms Cotlc^ Grad Mtstfui PhO31090 «,998 7.433 S,9Sf 14,993 13,^93 29,893 at im<< ^ S. dtu' H.S. Crtd Collq* Grid School

60 87 83 137 232 516 M 44 1 43 73 124 233 142 83 43 25 18t

l&OOOccypatfon (Head of Household)

M.QOSocio - Economic Status'*'

Uborw

14S

12.50

10.00

7.50

5.00

2.5C

S«vit» Craftsman Clwicsl fA«raB«r Piofe«;onil 10-13 20-29 30-39 «0-^9 50-59 60-63 7C-79 80-69 90-99

''Jofkar Fofoman S(ln Cropfiitcfr Tochpiiai

112 97 ^2S 197 113? 15 33 43 E? 74 144 152 133 139

1

i-E_l^ ns r^\^f^t F=kaBr* (JSa «^ Yi '^si0t;tnKSii

ftga (Head of Household)(4X0

Life Cycle15.00

-12.50 - t^SO -

10.00- -

10.09-

7.50 ~'^ •«* ««« Mw7'^''^NiiaiiM^i|i|Wf1f'*'-—''— .^i^.^

(T""^7.50 = jg^fit^iTlTzr!::rsr3 PT'^Bik^^ —r; 7.22

5.00 - ^^^ 5.00 - ^y^""^ ^^2,50 - — 2.3Q

y^1 1 1 1 a i 1 1 1 1

Un

2

der 25-34 35-44 45-54 55-6i!

5

D 10S 171 197 152

65 &

148 t

Vol

Sin

inj Young Youngql3 Mirritd Family

1 215

Youngfemily

T««n899f

25

Older

Fsmily

141

Older

Coupla

274

Olt

Sin

8.

er

gle

Jfi BiMd upon U.S. Bur»«i of C«na», t^athodology

wd Sco^w o1 Sociooconornfe Stttut,

Working PtjMr Ho. 15. 1983

I Num'^tr of Cuet in Each CltfCOTY

All CuttonHT Av«ng* —^— —

Page 18: Demographic segmentation of long distance behavior : data ......FacultyWorkingPapers DEMOGRAPHICSEGMENTATIONOFLONG DISTANCEBEHAVIOR:DATAANALYSIS &INDUCTIVEMODELBUILDING JagdishN.Sheth
Page 19: Demographic segmentation of long distance behavior : data ......FacultyWorkingPapers DEMOGRAPHICSEGMENTATIONOFLONG DISTANCEBEHAVIOR:DATAANALYSIS &INDUCTIVEMODELBUILDING JagdishN.Sheth

TABLE 2

Mxiliicollineai'ity i'iinong Demographic Variables

Variable

1. SES Score

2. Own/Rent

.3. Type of Residence

k. No. of Floors

5. Length of Residence

6. /-ge of H. H.

7. Occupation of H. H.

6. Education of H. H.

Correli,tion

0.51

Variable

0.-^5

0.T60.78

Occupation of H. H.

Education of H. H.

Incoffie of H. H.

0.69-0.56

Type of ResidenceNo. of Rooms

0.5^ No. of Rooms

0.52 No. of Rooms

-0.710.50

No. of MovesAge of H. H.

0.81 Life Cycle

0.51 Family Income

Family Income

Starting with a large body of empiricaJ. data and li-^tle or no prior theory

with which to develop functional relationships, it is difficult to constructa multivariate model without first investigating the effects of these dataproblems as they relate to ana-iysis by standard st8,tistical methods. In fact,

the authors believe that the blind u-5e of a statistical method maj'- produce great

harm from mlsintex-pretatloa of the data., and could, even result in throwing out

important data as irrelevant or useless for a marketing probl'.em.

Our objective was, therefore, to first e

which take into consideration, to ^rvjrji.np de

multiple regression, AID vith the nonotonicables, and >ill,> without the mono-ionlc reatricalternative strategies because each r'ispcnds

characteristics. Such a combinati:n should,of forming an \;.nsupportable advsace bypotheson the actual sti*ucture ijnder obse^-vation

xplore s'everal analytic strategiesgreefi , these data problems. Stepwiserestriction on the predictor vari-ticn vere chosen as the threesomewhat differently^ to these datatherefore, both avoid the problemis and give the analyst perspective

Miiltip.'.e regression is a robuit method, rich in both data analysis andinference. In addition, regression analysis, with spme variations, can takeinto accoxont the problem of mixed scsQes and class interval data. For example,by vising dummy variables it is possible to include a* number of nomineillyscaled demographic descriptors such as ownership of residence and sex of thehead of household. Finally, stepwise multiple regression considers theproblem of multicollinearity by developing partial correlations at each step,

Page 20: Demographic segmentation of long distance behavior : data ......FacultyWorkingPapers DEMOGRAPHICSEGMENTATIONOFLONG DISTANCEBEHAVIOR:DATAANALYSIS &INDUCTIVEMODELBUILDING JagdishN.Sheth

1/v!

Page 21: Demographic segmentation of long distance behavior : data ......FacultyWorkingPapers DEMOGRAPHICSEGMENTATIONOFLONG DISTANCEBEHAVIOR:DATAANALYSIS &INDUCTIVEMODELBUILDING JagdishN.Sheth

thereby eliminating variables that axe Jaighly correlated vith variables already-included in the regression equation. 5 However, since the regression equationis a linear additive model, it is not capable of effectively handling theproblems of nonlinear, nojimonotonic ani interactive relationships^.

The objective in AID analysis is to paitition the total sample into anoptimal set of nonoverlapping subgroups , ieveloped from the profiles of thepredictor variables, vhose categories explain more of the variation in thedependent variable than do any other set of subgroups. This objective isachieved by a sequential partitioning of the total sample into two subgroupsbased on the split of a single praciictor variable vhich produces the largestratio of between sum of squares to total suia of squares. This process isrepeated on each of the subgroups until some minimum level of explainedvariance is encountered or a minimum sample size is reached in the subgroup.Thus, one-way analysis of variance is explicitly included in the analysis.

Because the splitting of groups is sequential, the AID analysis is a step-wise procedure similar to the stepwise regression method, and so minimizesthe problem of multicollinearity . However, the optimal split at each step isnot based on a predictor's contribution to reducing the error variance in thetotal sample but the variance in the subgroup.

Several researchers have used the AID technique in marketing, either fordeveloping segments (Assael, 1970) or for model building (Carman, 1967;Armstrong & Andress, 1970)- The technique itself is described by Morgan andSonqxiist (1963), Sonquist and Morgan (196^0, and Sonqiiist (19T0). The AIDprogram is capable of handling both the categorical and class interval predictorvariables, regardless of whether their relationship is lineeir, nonlinear ornonmonotonic with respect to the criterion variable. Of course, the criterion,or dependent, variable may be continuous and in the case of long distanceexpenditure it is. FinaJLly, and most importantly, this procedure is capableof handling both the additive and the interactive relationships of a set ofpredictors with the criterion variable.

Two types of AID analyses were lised to r^eparately examine the nonmonotonicand the interactive effects of the relationships between long distanceexpenditure and the demographic variablcb. Ttie first, monotonic AID analysis,preserves the ordinal3.ty of the predictor variable vhen it is chosen &s acandidate to split the sample. Therefore, the two new subgroups are definedas above and below the boundai^y of a category interval of the predictorvariable. For example, given the eight categories of income, there are onlyseven (K-l) comparisons possible b:/ splitting the group at each of the adjacentcategories. By definition, therefore, monotonic AID analysis is capable ofhandling nonlinear relationships as long as they are monotonic ororder-preserving

.

The second procediure, free AID analysis, allows the split on a predictorvariable withput regard to the order of the categories of that variable. Thus,there is a much larger number of combinations of the predictor variable categorieswhich me^y be examined to split the sample. This removes the nonlinear restrictionand allows for the analysis of a nonmonotonic relationship if one exists betweenthe predictor and the criterion variable.

Page 22: Demographic segmentation of long distance behavior : data ......FacultyWorkingPapers DEMOGRAPHICSEGMENTATIONOFLONG DISTANCEBEHAVIOR:DATAANALYSIS &INDUCTIVEMODELBUILDING JagdishN.Sheth
Page 23: Demographic segmentation of long distance behavior : data ......FacultyWorkingPapers DEMOGRAPHICSEGMENTATIONOFLONG DISTANCEBEHAVIOR:DATAANALYSIS &INDUCTIVEMODELBUILDING JagdishN.Sheth

Ccaaparative Data Analysis and Results

The stepwise linear regression analysis was performed using the UCLABiomedical computer program BMD 02R (Dixon, 1971). To avoid highly collinearvariables and randcan effects, an F value of 3.65, comparable to 0.05 level

of significance, was set for a predictor variable to enter into the equation.

The results of the stepwise regress:' on analysis are summarized in Table 3.

The multiple R was 0.36, resulting in 12.68 percent of the variance in long

distance expenditure being explained by four, of the demographic predictors.

These four significant predictors and their associated explained variances sure

(l) family income, 9.86^ (2) number of rooms, 0.75>^ (3) length of residence, 0.98^and {h) life cycle of the family 1.09%. Tlie relationship is positive with

income, number of rooms and life cycle, but negative with length of residence.In short, the greater the income, the more rooms in the residence unit, the

later the stage of the life cycle, and the more recent the move of a

residence customer, the greater the average long distance expenditure willbe. It is interesting to note that life cycle as an index variable performedbetter than its component demographic variables but the SES index did notperform better than income.

TABLE 3

Stepwise Linear Regression Analysis

Vai'iables in Equation

Step Variable Beta Coef

.

StandardError

' MultipleR RSQ

F Ratioto Enter

12

3U

(ConstantIncomeNo. of RoomsLength of ResidenceLife Cycle

0.0001)0.26990.1335

-0.1*^13

0.1k?2i;

i

0.03870.03930.03910.0389.

0.31U0 0.09860.3257 0.10610.3»+0l+ 0.11590.3561 0,1268

86.506.628.7U9.88

Analysis of Variance

Source of.Variation

Degrees ofFreedom

Sum of

SquaresMeanSquare F Ratio

TotalRegressionResidual

792k

788

35,261*

10,8127U,U52

2,703.05.gk.UQ

28.61*

Percent Variance Explained 12.68•Significant at the 0.01 level.

Page 24: Demographic segmentation of long distance behavior : data ......FacultyWorkingPapers DEMOGRAPHICSEGMENTATIONOFLONG DISTANCEBEHAVIOR:DATAANALYSIS &INDUCTIVEMODELBUILDING JagdishN.Sheth
Page 25: Demographic segmentation of long distance behavior : data ......FacultyWorkingPapers DEMOGRAPHICSEGMENTATIONOFLONG DISTANCEBEHAVIOR:DATAANALYSIS &INDUCTIVEMODELBUILDING JagdishN.Sheth

i,

i

a ^O W SJ* \a)

tUtO MI>J MM MM M|V »-'

^? ^^ ^^ vv ^^ t

v^'ji v»jv v*>-J VAiVJ-PH' VJllT' VjJH ^J>VJ1 O-^- M-F- ^;s «g )}iit

M o vO Ott

^-r

t-"

O MM 'ipj

M -J

VJi JN VJ M H"

If

a" <3

>i Id

(-» M Ui ~J

oo\^ oo\j> v^-j oo^ 05o^a> .t^M Moa o-o oovj> -c^- h So u> •Xt H ^ -J

0^00 00*^ 00 H M nOMU) UiOO oo-p- Noo <3 M ->1 O s M -t^ M O --3

-J M \-n *^ -v3\J(T> UJM Ovv VajM vp*-

t»Vjj vDO\»jOO O IT- vO-J -t^-O

0<1 vpVi CO-t~ vD.t^vji-t^ OvU VJi-0 O^nO

O 'J*M -f^

OVO M 00 C CO mo 00 Ui

S"!

^^ 5J^^ Si^ ^5 ^^ ^ «+ 5.

V o.

V^M VJ<^M-t^\jJ\jJ>Ji>X>(-'*^0^*-^M\vJ>Jil-'-f- « ^i

M -OvO M SOM \0 H O H ~J -«3

O 00

O OO ui

-J M M -JMO 03 00

^

o

11 w

O H. >M M> >-, W tCTOn M (t o en . (t

O M1-J M pj

1 Cn I O(B (B • i->

H- aV^ (B

< 1n fl!

11/ 1

I O -f* I-

:8c^

M

^fet-" I-' ^> H'

\j» o f,i,o o

1 1 VJI 1 - t* 1 1

«s+ V5^

en a ^^

J Oaa- og&(10 ». <wM »r>

a- aO CTi

PA§*?o

8 oMM O M

OvD Ml

-J

+ o

VIAso «»

o3

oo

p H

crto (-•

o™

03 IBCf (B

ie

^ ss

M

s

^

MO CD oM

J-^

^

o

o

M O

M'J3

PI

H» ID

13 5p« (B

c<B

03

i?

Q

Page 26: Demographic segmentation of long distance behavior : data ......FacultyWorkingPapers DEMOGRAPHICSEGMENTATIONOFLONG DISTANCEBEHAVIOR:DATAANALYSIS &INDUCTIVEMODELBUILDING JagdishN.Sheth

r

Page 27: Demographic segmentation of long distance behavior : data ......FacultyWorkingPapers DEMOGRAPHICSEGMENTATIONOFLONG DISTANCEBEHAVIOR:DATAANALYSIS &INDUCTIVEMODELBUILDING JagdishN.Sheth

Several questions are implicit in these findings. First, the b\ilk of

the variance explained is concentrated in income (77.7$) with relatively little

contribution from the other demographic variables. Secondly, the total amount

of explained variance is relatively lower than might be expected7. And third,

most other demographic variables fail to exhibit any relationship on a partial

correlation basis. There are obviously two answers. First, no relationship may

exist with these other variables , so that long distance expenditure is determined

by other factors not in the equation. However, and secondly, it is possible

that the linear additive model built iato the regression suppresses any non-

linear or interactive relationship between the demographic variables and long

distance telephone behavior. Unless the latter explanation is riiled out,

good data may be discarded due to inappropriate analytic methods.

The monotonic AID analysis was the second method used. This procedure allows

for monotonic, nonlinear and interactive relationships between the predictorvariables and the criterion variable as noted. To avoid unstable results and

to meet sampling error requirements , the AID analysis was ba^ed on the additional

constraints of a minimxam sample size of 30 in each final subgroup, emd a minimumpercent variance explained equal to or greater than 0.6 percent at each step.

The statistical results are summarized in Tables h and 5. The explainedvariance was increased from 12.68 percent, using regression analysis, to l6.0U

percent lasing AID and allowing monotonic and interactive relationships. Table k

shows that the SES score, the education and age of the head of household and

number of moves have entered into the analysis. The additional explanatorypower comes from (l) the inclusion of these vsLriables and (2) the increases

in the predictive power of the variables as against the regression equation. The

best examples of increased predictive power are the SES score, which was 1.3^^,and the number of rooms which increased from 0.75^ to 2.69^ variance explained.

These values are the summation of individual percent variance explained in

Table k. This increased predictive power can be explained by the fact that bothof these variables have a step function with the long distance expenditure as

seen from the plots in Figure 1. A similar step function in length of residencealso slightly increases its predictive power.

Monton-

TABLE 5

.c AID Aiiaiyis

Analysis of Variance

Source ofVariation

Degrees ofFreedom

Sum ofSquares

MeanSquare F Ratio

TotalBetweenWithin

79215

777

85,261i

13,6lU71,590

911.6192. lU

9.89*

Percent variance explained l6.0iv

•Significant at the 0.01 level

Page 28: Demographic segmentation of long distance behavior : data ......FacultyWorkingPapers DEMOGRAPHICSEGMENTATIONOFLONG DISTANCEBEHAVIOR:DATAANALYSIS &INDUCTIVEMODELBUILDING JagdishN.Sheth
Page 29: Demographic segmentation of long distance behavior : data ......FacultyWorkingPapers DEMOGRAPHICSEGMENTATIONOFLONG DISTANCEBEHAVIOR:DATAANALYSIS &INDUCTIVEMODELBUILDING JagdishN.Sheth

'1.

A™™

s

2->

1A...

3

X

!

Page 30: Demographic segmentation of long distance behavior : data ......FacultyWorkingPapers DEMOGRAPHICSEGMENTATIONOFLONG DISTANCEBEHAVIOR:DATAANALYSIS &INDUCTIVEMODELBUILDING JagdishN.Sheth
Page 31: Demographic segmentation of long distance behavior : data ......FacultyWorkingPapers DEMOGRAPHICSEGMENTATIONOFLONG DISTANCEBEHAVIOR:DATAANALYSIS &INDUCTIVEMODELBUILDING JagdishN.Sheth

On the other hand, the predictive power of income and life cycle decreasedslightly in monotonic AID. This is largely due to the small number of cases

at the iipper end of the income scale and at the lower end of the life cycle

index. These small cell sis;es do not permit further subdivision due to therestriction of the minimum, size in the finst]. groups formed.

A careful examination of the group splits which result in large betweensum of squares reveals that they are cLrsctly a function of the trtaacation inthe monotonic re].atioriship of the predictor variable with the criterionvariable. Thus, the greater the rise of the step in the function between thepredictor and the criterion vitriable, the greater the relative predictivepower that variable possesses, oxirprisingly, the interaction rnnong thesignificant demographic variables is not as great as expected. "Hiis is shownin the tree diagram of Figure 2 by the relativelj.^ good sjomaetry of splits \rtiere

the predictive variables appear on both branches of the split. Strongerinteraction between the demographic and the socioeconomic variables had beenanticipated; however, this interaction does not seem to be present in the data.

Finally, a very important benefit of monotonic AID analysis comes fromthe fact that it matches the managerial problem definition and decision-makingprocess. Typically, marketing management is interested in market differentiationand discrimination for better marketing effectiveness. Furthermore, the marketdifferentiation is based on segmenting customers who are pres^jmed to havedifferent wants and desires. The demographic variables are considered themost common casual factors in bringing these differences to light, especially bythe regulatory and other governmental agencies in the utility industry. TheAID results have been represented as a diagram in Figxire 2 which is bothmeaningful and communicable to management. For example, it indicates thatcustomers with income above $15,000, with residence units consisting of eightor more rooms , and with less than ten years of residence at their present -

location have the highest average of long distance calling and expenditure(group six). On the other hand, customers with less than $10,000 Income andan extremely low SES score manifest the lowest amount of average monthlyexpenditure (group sixteen). Both of these extreme groups, as well as theother segments, are meaningful and relate to management's prior experiences anddecisions . In view of the fact that half the problem in successful marketingreseeirch is its effective communication, AID seems to be an advantageousanalytical strategy.

The third analytic technique is free AID anadysis where the nonmono-tonicity of the i^redictor-criterion relationship is taken into account inaddition to the nonlinearity and interactive aspects. The same stoppingcriteria were utilized here (minimum subgroup size greater then or equalto 30 and 0.6 percent variance explained at each step). The statisticalresults are summarized in Tables 6 and 7 and Figure 3.

Free AID ana;jLysis increases the predictive power of the demographicvariables from l6.0^ percent to 20.23 percent when compared to monotonic AIDanalysis. In the process, it includes occupation and type of residence variables;however, the bulk of the increased predictive power in this analysis comes fromthe demographic variable age of head of household, which has the greatest non-monotonic relationship with long distance expenditure. Somewhat smaller increasesin the predictive power of nujcber of rooms, education, and life cycle are alsodue to their nonmonotonic relationship with long distance expenditure.

Page 32: Demographic segmentation of long distance behavior : data ......FacultyWorkingPapers DEMOGRAPHICSEGMENTATIONOFLONG DISTANCEBEHAVIOR:DATAANALYSIS &INDUCTIVEMODELBUILDING JagdishN.Sheth

vf.,'--' ; ;'.i

a.f.'i' «!aiJ

Page 33: Demographic segmentation of long distance behavior : data ......FacultyWorkingPapers DEMOGRAPHICSEGMENTATIONOFLONG DISTANCEBEHAVIOR:DATAANALYSIS &INDUCTIVEMODELBUILDING JagdishN.Sheth

^

1

8

I

ii^

1

r

1 1

I--

-J

1 1 i y 1 1

to

t 1

VO

>

ma2.3

'4t^ V^ ^« n ^f? PJK •:;&;^£* ** V*i W ^1 \0 ro "^

°'' VJ.*» Vo M »-

CO

«S*-*

a^ to «y iisi ^)^ yj 00 -f- VJ O W sa S8 Vdg; :SS ^S 2Js

s?

11

oj a-

52:

VO CO 03 *«

'<! vnP- VJ

-J

« CO

V« t>J OO VJi -JVO OJXjO 00 -0 SS 00 -t^ <X -N ca ^ 00 >J> KS V» -J sII

fe>c! O^V*>0> -J

vO W *?£ S o U2 C'-*** hi <hoo«- ws ^8 fcs y§ y

^a

2,Hro w »-* w "w -^ w-t^ •*^ Va) •ji ^ -JVJ yp t-^ -i- < P KP ro «p "iK 03

H to >-o 2^

0^ \j^ e VJ-IS- Co VJ-

VJ> v«SSVJ vn U ;2S

nvooa en-a vj.

KJ

^

o O ^ "^

s?s:o U K.

i"!"

3) O o o _ < 1^ 2 M» O f-j s: > -% i-D ^-1 1--^ o a O ft IS H >^, o M -^ ^ *- n K

J-.'XI

IIiniO':! fp rt>

•^fe o C« o Hft x*i = rtj 2?^ <iU

1% ;x o 1^' I&o m a: »(1) H.

(3. (6

B o i^ ?:s-n O . o

3M ?sr ^sr CI o .=•§

Q

|-»'-0

PO H H'V>) AJ ^ <

f^ J^» ro H- ^jt ^J NJ i^ 1 M M VO « H*

S 2.

4^ <> h-'00 0» rv •,i?r ?? Va) »

-0- »«0 O^ VJ1 H' M-3 CV v^g

VO-

-r^s?

sO- o \£/- •-JV c ON. 2 PIo 1 O^ \jt -J -IN O'' NO ^ -c a fvO CQ

S

^CDo »

g 8 g 1:-0 s CXI (-» ^ oaO

o

Si o

k < «8 o o o o o o O o O O O O o H VJ M ~i

» 1- c5

A> t-' .f^ 8jT- 0^ VJ> •c- b^ 4- t>

3 5 ore o rf-

VU rj w ^J> ro \j^ ro •^ IVJ -0 -wl h) 0- VJ •£> •^ -0

<T\i H* VO K> PJ l-» t-* v-J t\.> V>J H- to ro vn TO ro 00

^ B ^ g; g ))'. ^ j2 8; « fe s s s

o

a-

Page 34: Demographic segmentation of long distance behavior : data ......FacultyWorkingPapers DEMOGRAPHICSEGMENTATIONOFLONG DISTANCEBEHAVIOR:DATAANALYSIS &INDUCTIVEMODELBUILDING JagdishN.Sheth
Page 35: Demographic segmentation of long distance behavior : data ......FacultyWorkingPapers DEMOGRAPHICSEGMENTATIONOFLONG DISTANCEBEHAVIOR:DATAANALYSIS &INDUCTIVEMODELBUILDING JagdishN.Sheth

TABLE 7

Free AID Analysis

Analysis of Variance

Source ofVariation

Degrees ofFreedom

Sum ofSquares

MeanSquare F Ratio

TotalBetweenWithin

79218

85 ,261+

17,2U668,018

958.0787.88

10.90»

Percent variance explained 20.23•Significant at the 0.01 level

Siirprisingly, the predictive power of both the SES index and length of

residence decreased in the free AID analysis. This is attributable to the

extreme skewness of the two predictor variables

.

The free AID analysis reveals some subtle differences among customersegments which are hidden in the monotonic AID analysis. For example, theseventh group which has the highest average monthly bill, consists ofcustomers who have greater than $15,000 income, both small and large

residence units (three rooms and eight or more rooms) and who are bothrelatively young and relatively old (between 25 and 3^ years and between55 and 6k years). This was not fully revealed either in monotonic AID orin stepwise regression.

TABLE 8

Comparative Analysis of Predictor Variables

Percent Vsu:5 ance Explained

Stepwise Monotoni c FreeRegression AID AID

Family Income 9.86 9.30 9.30No. of Rooms 0.75 2.69 2.91Length of Residence 0.98 1.1*2 0.61Life Cycle 1.09 0.39 0.72SES Score — 1.3U 0.57Age of H. H. ~^ 0.27 1*.02

No. of Moves — 0.17 __

Education of H. H. — 0.1i6 0.87Occupation of H. H. ~. «- 0.71Type of Residence —

~

— 0.52

12.68 I6.6li 20.23

Page 36: Demographic segmentation of long distance behavior : data ......FacultyWorkingPapers DEMOGRAPHICSEGMENTATIONOFLONG DISTANCEBEHAVIOR:DATAANALYSIS &INDUCTIVEMODELBUILDING JagdishN.Sheth
Page 37: Demographic segmentation of long distance behavior : data ......FacultyWorkingPapers DEMOGRAPHICSEGMENTATIONOFLONG DISTANCEBEHAVIOR:DATAANALYSIS &INDUCTIVEMODELBUILDING JagdishN.Sheth

T^

i;j

«iL^ \\V .^rl

.^i,

i: -f*

i ^

y\i I

I,

8

Page 38: Demographic segmentation of long distance behavior : data ......FacultyWorkingPapers DEMOGRAPHICSEGMENTATIONOFLONG DISTANCEBEHAVIOR:DATAANALYSIS &INDUCTIVEMODELBUILDING JagdishN.Sheth
Page 39: Demographic segmentation of long distance behavior : data ......FacultyWorkingPapers DEMOGRAPHICSEGMENTATIONOFLONG DISTANCEBEHAVIOR:DATAANALYSIS &INDUCTIVEMODELBUILDING JagdishN.Sheth

In viev of the fact tnat the deinoferaphic variables which explain the

variance in long distexice axpt-iditv-re differ for the three saalytic methods.

Table 8 hau been prepared to sunaniirize their relative eontributioa toward

explaining variance in the criterion vturiable. Income, which was by far the

best predictor, does not do fs vfflJ, in the AID analyses , primarily due to the

problem of sample siiie in the ejclreme colls, The other three demographicvariables - nuraber of room;. , SEE scnre and length of residence - did very wellin laonotonic AID due to trifo factors: (l) they have a step function I'elation-

ship with tJie criteiion variable, and (i) the distributions are badly skewed.Finally, age of head of household does best in free AID prirasurily due to its

nonmonotonic relationship T^rith the criterion variable.

Having removed the aonltne&r and nonraonotoni c constraints and permittedinteractive relationships in the analysis, the explained variance has increasedfrom 12.68 to 20.32 peioent. However, the iinexplalned variance still remainsquite large. An additional advantage of the AID analysis is the ability toinvestigate the variance structixre by customer segment. Monotonic AID and theFree AID analyses both developed three branches defined by low, medimum and highincome categories. These represent hk%^ 2S% and 27/S of the sample size. Table 9surrnnarizes the effects for each branch.

TABLE 9

Summary of Variance(in Percentages)

IncomeCategory

Variancej

in each Modtonic AIDbrar.ch Explained Unexplained

XLess than $10,000$10,000 to $15,000More than $15,000

J3.1t8

18. V:

I

„i.)0.^9 j

1.361.28i<-.10

12 .12

1T.4S5i-.39

Free AIDExplained Unexplained

1,961.857.12

11.5216.8851. 3T

Note: 9"303f is explained 'c^f xhe three i icome categories.

Fin&liy, noting •t;l'ie.'.. the buli: of the unexpleir.ed variance is in the highincome branch n, seai-ch of Table i- "mfl 6 showt. that final group 6 in themonotonic AID aaa3.;;,-sif. has 36-J- percent of the- totcl vs,rianca, and group 7 infree AID analysis h-^^s 30-' percent These groups represent the tail of theskewed long di.stence expenditure distribution »nd should be evaluated furtherwith a l£irgsr cample size in order to determine if the socioeconomic anddemographic variables are capable of fiirther explaining the variance in thesecustraaer segments. Until this is completed, an ultimate judgement on theefficaqr of these variables in explaining long distance expenditure can notbe made. However, in the final anal^i-sis, it appears that approximately 30percent of the v.iriance can be explained by these predictor variables, whichis in line with the initial expectations when the project was undertaken.

Page 40: Demographic segmentation of long distance behavior : data ......FacultyWorkingPapers DEMOGRAPHICSEGMENTATIONOFLONG DISTANCEBEHAVIOR:DATAANALYSIS &INDUCTIVEMODELBUILDING JagdishN.Sheth

•'V M

Page 41: Demographic segmentation of long distance behavior : data ......FacultyWorkingPapers DEMOGRAPHICSEGMENTATIONOFLONG DISTANCEBEHAVIOR:DATAANALYSIS &INDUCTIVEMODELBUILDING JagdishN.Sheth

An Approach To'.'&rd Eripiric£>l Model ISuildlng

CcJjnparing linear regression and AID analysis, it is difficxilt to say whichtechnique is better. Each offers certain ac).v.aita€e3 that the other does not,and each has inherent probilems. MD in mich 'iiore flexible in data analysis€Bid data handling becf-use it req.-.d. res the Sjjiallest set of esstmptions . At thesame time, it is extremei.y ccnspatiolc vi h th« managerial ^•lefwpoint of themarket place. Therefore, it h3^ the advantage of better ccanmunieating themarket research results. Finallry 3 the te-lhrique brings into bold relief therelationships among the v&riables » wh Ich ..eads the researcher to think of theinteractive effects of a set of predictor variables. This is very likely tobroaden his inductive theorizing process. On the other hand, AID is lackingin inferential capability- It is nuch easier to di.splay the data than to buildpredictive empirical models with AID. This is because AID is largely baaed onanalysis of vajriance principles and therefore requires prior experimentalor matrix designs to enable an/ inferences to be drawn from the analysis. Asecond disadvantage with AID is less parsimony in data analysis and model building;considerable computation and search is irlierent in the technique and the branchingprocess often is fairly complex snA lengthy, which reduces its usefulness froma pragmatic control standpoint. Finally, as was demonstrated in this paper,AID requires large sets of observations which must be fairly well-behaved in theirdistributions over the predictor and the criterion variables. In other words

,

skewness presents a serious problem in AID analysis.

Linear regression is veiy powerful in developing parametric models,and provides a mechanism for establishing point and interval estimates forpredictive piirposos. On the other hand, it presxmes the data to be linear,error free and additively related to the criterion variable.

In view of the fact that in a regulated Industry there is a need to buildpowerful predictisre models, a systematic approach is necessary to developinductive models of telephone behavior based on large scale empirical data.The data analysis reported in this paper, together with the following proceduralsteps are recommended for inductive model building.

8

1. Given a large scale data base, p rform ax. initial AID analysis with as

many predictor varie'vles 6S are available or cte be handled by thecomputer. The AID anaD^^jsis will bring into bold relief the nature ofthe relationships among the variables resulting from a minimum set ofrestrictions with respect to the aampio size of subgroups, splitreducibility criterion Riid the priorit;v ordering and coding aspects ofthe predictor ve.riab3es. In short, a free AID paalysis is recommendedin this initial phar e.

2. From the initial AID analyris, predictor variables should be selectedfor future tmalysis b.ased on their explanatory power. Kie predictorvariables should then t'i factor analyzed to estimate the degree ofintercorrelations ejnong them.

3. Choose a set of orthogonal predictor variables from, the factoranalysis results selecting the variable with the highest factorloading. The problems of error in measixrement should also, beconsidered. For exanple it will generally be more advantageousto choose the age of thf" husband rather than the wife if both areloaded equeOJy on e factor because of the possibility of response

Page 42: Demographic segmentation of long distance behavior : data ......FacultyWorkingPapers DEMOGRAPHICSEGMENTATIONOFLONG DISTANCEBEHAVIOR:DATAANALYSIS &INDUCTIVEMODELBUILDING JagdishN.Sheth
Page 43: Demographic segmentation of long distance behavior : data ......FacultyWorkingPapers DEMOGRAPHICSEGMENTATIONOFLONG DISTANCEBEHAVIOR:DATAANALYSIS &INDUCTIVEMODELBUILDING JagdishN.Sheth

error in the latter variable. Similarly, education voiild be prefered

over income, ^t the same time, the researcher must watch for the

possibility of creating an index variable, especially when several

predictor variables contribute to an equal, but smaller, extent toward

the eigenvalue of the factor. Such an index, by definition, wovild be

a linear additive index.

U. Utilizing the selected orthogorfa predictor variables, the researcher

should perform a monotonic AID analysis. The restriction of monotoniclty

is more appropriate for managerial decision making since it will enable

the reseai'cher to develop models of a set of predictor variables which

are split above or below a certadn level.

5. Based on the monotonic AID analysis the predictor variables should be

defined in terms of broad categories where a split occurred. For income

in our data, this is likely to be below $10,000, between $10,000 and

$15,000, and above $15,000. In the seme way, a set of interactive

predictor variables must be defined; for example, income above $15,600 and

eight or more rooms.

6. If the interactive effects are not substantied, as evidenced by the

monotonic AID analysis, the simplest procedure would be to create a

successive interval scale for each predictor variable based on AID

categorization. This may result in a dichotomous scale or a discrete

interval scale.

At this stage, a discriminant or regression model should be built in which

the redefined variables developed from the prior analysis are the predictors

and the phenomenon under investigation is the criterion variable. If the^

criterion phenomenon is dichotomous or classi factory, the discriminant

model will be appropriate; however, a regression model shovild be used if

the criterion variable is continuous and well behaved. The regression or

discriminant model will then estimate a set of optimal weights for

predictive and inferential purposes.

It is, however, possible that the interest is in building a model which

takes into account each category of a predictor variable separately.

This if3 possible by ijorivertinR the regression or discriminant problem

to a dumny variate analysis problem.

7. If there are strong interactions among the orthogonal predictor

variables as evidenced from the monotonic AID analysis, it will be

necessary to develop inde:: variables based on the pattern of

interactions . This should be relatively easy in view of the fact

that the logical combinations are likely to be greatly reduced when

the stage of performing a monotonic AID analysis is reached. Ihe

predictive model can be built from these index variables utilizing

regression or discriminant analysis.

To stmmiarize, several conclusions can be drawn from these efforts at inductive

model building based on large scale data banks. First, it is extremely important

to examine the quality of the data and the nature of the relationships among the

variables. Without this critical examination, the researcher is likely to

fall prey to a statistical or mathematiceil model popular at the time. Most of

the recent model building in mairketin^ has been baised on management science techniques

Page 44: Demographic segmentation of long distance behavior : data ......FacultyWorkingPapers DEMOGRAPHICSEGMENTATIONOFLONG DISTANCEBEHAVIOR:DATAANALYSIS &INDUCTIVEMODELBUILDING JagdishN.Sheth
Page 45: Demographic segmentation of long distance behavior : data ......FacultyWorkingPapers DEMOGRAPHICSEGMENTATIONOFLONG DISTANCEBEHAVIOR:DATAANALYSIS &INDUCTIVEMODELBUILDING JagdishN.Sheth

which clearly attests to this problem. Second, it ia very unlikely that a singlestatistical model such as stepwise regression, AID or discriainont analysis willbe sufficient. The authors strcng!;,v sviggest that a variety of statistical toolsare sequentially necessary at various stagfis of irtductive laodel building.Finally, it is unlikely that deaogrr&phic fe.ctors alone will enable the re-searcher to build highlj- predictive models. The demographic factors, however,seem highly useful in segmenting the tot^: population into subpoptilations whichmay be the logical independent marketing se^Jients reqijiring separate models.

Footnotes

1. TOiis study is part of ongoing empirical research on the telephone behaviorof both residence and bxisiness ciatoaeis of the Bell System and wasprepared under the auspices of the llai'kat Research Section of the ;

American Telephone and Telegraph Company in Nev York.

The authors wish to express their appreciation to Mr. N. J, Mammana, Directorof Marketing Research for his support of the study and to Welling Howellwho prepared and assembled the data and performed the computer analysis.

2. A Marvin Roscoe, Jr. is a Marketing Supervisor at A.T.& T. where heis responsible for developing analytic methodology for the Market ResearchInformation System. Previously he was with the Bell Telephone Companyof Pennsylvania and the Long Lines Department of A.T.& T. in varioussales and meirketing positions. He has a B.S.E.E. from RensselaerPolytechnic Institute and a M.B.A. from the University of Pittsburgh.

Jagdish N. Sheth is presently Professor of Business and Research Professorat the University of Illinois. Prior to that, he was on the faculty ofColumbia University and M.I.T. He received his Ph.D. at the University ofPittsburgh. He has also been visiting professor at the Indian Institute ofManagement, Calcutta, and Visiting Lecturer at the International MarketingInstitute, Harvard University. Dr. Sheth is coauthor (with John A. Howard)of The Theory of Buyer Behavior , author of How A(^vertising Works and isa frequent contributor to business and scientific journals, especially inthe area of marketing.

3. The authors are well eware of the controversy with regard to the usefulnessof demographic factors in predicting consumer behavior (Yankelovlch, 196U;Fran3:, 1963; Bass, Tigext, \ Londale, 19681. Ho-ffevor^ given the regulatednature of the xrtility industry, it is necessary to understand and to beable to predict the impact of corporate strp.tegies on different socio-economic segments of the populattca.

U. The term expenditure properl^r connotes the aspects of consumer buying be-havior. Increasing the consumer's If^vel of long distance expend!t\ire is e

very important consideration to tae telephone industry. Since the dis-tribution channel is always available and there ere frequent periods ofavailable capacity, increased calling during these periods can have veryobvious ecoDomi c implications to society.

Page 46: Demographic segmentation of long distance behavior : data ......FacultyWorkingPapers DEMOGRAPHICSEGMENTATIONOFLONG DISTANCEBEHAVIOR:DATAANALYSIS &INDUCTIVEMODELBUILDING JagdishN.Sheth
Page 47: Demographic segmentation of long distance behavior : data ......FacultyWorkingPapers DEMOGRAPHICSEGMENTATIONOFLONG DISTANCEBEHAVIOR:DATAANALYSIS &INDUCTIVEMODELBUILDING JagdishN.Sheth

5. There are several other methods for handling the multicollinearity problem,

such as examination of the simple correlations or factor structure of the

correlation matrix. In fact, due to the order bias built into stepwise

regression, other methods should be used to reduce the collineaxity pro-

blem.

6. Regression theory can handle nonlinesir and interactive relationships,

but these need to be developed a priori based on some theory of Judgement.Without theory to suggest rational approaches , the number of nonlineartransformations and the combination of interactions are too many for

regression analysis to solve efficiently.

7. By setting a low F value (O.Ol) all of the lU demographic variables were

permitted to enter in the final step of the regression analysis. Thepredictive power increased to lU.25 percent. These results were replicatedusing the UCLA BMD 03R Multiple Regression Program. Unfortunately, theadditional explanatory power has the inherent problems of instabilityand multicollinearity.

8. The procedure is somewhat different from the two-stage AID-MCA linkage

suggested by Sonquist (1970).

References

Armstrong, J. S. & Andress, J. G. Exploratory analysis of marketing data:

trees vs. regression. Joxzrnal of Marketinp; Research, Nov., 1970,VII , U87-U9?.

Assael, E. Segmenting markets by group purchasing behavior: an applicationof the AID technique. Joux-nal. of Marketing Research , May, 1970, VII ,

153-158.

Blalock, H. M. Jr. Correlater?. independent variables: the problem ofmulticollinearity. Social Forces. I963, ]i2, 37^1-380.

Bass, F. M., Tigert, D. J., & LonadelCj R. T. Market segmentation: groupversus individual behavior. Jouma,! of Marketin/? Research, Aiig.,

1968, V, ?5!4-270.

Carman, J. M. Multiple classification analysis without assumption of intervalmeasurement, linearity, or additivity: a comparison of techniques.Proceedings of the Social Statistics Section, American StatisticalAasociation , Dec, 1967. 260-270.

Dixon, J. W. (Ed). W!D Biomedical Computer Programs , Los Angeles: Universityof Ceaifomia Press, 1971.

Draper, N. R. & Smith, H. Applied Regression Analysis , New York: Wiley, 1966,

Page 48: Demographic segmentation of long distance behavior : data ......FacultyWorkingPapers DEMOGRAPHICSEGMENTATIONOFLONG DISTANCEBEHAVIOR:DATAANALYSIS &INDUCTIVEMODELBUILDING JagdishN.Sheth

;• .•'.|>-;

Page 49: Demographic segmentation of long distance behavior : data ......FacultyWorkingPapers DEMOGRAPHICSEGMENTATIONOFLONG DISTANCEBEHAVIOR:DATAANALYSIS &INDUCTIVEMODELBUILDING JagdishN.Sheth

Frank, R. E. Market segmentation research: findings and implications. InF. Bass et al. (Eds.), Application of the Sciences in Marketing ManagementNew York: Wiley, 1968.

Frank, R. E. , Massey, W. F,, & Wind, Y. Market Segmentation . Englewood Cliffs:Prentice Hall, 1972.

Johnston, J. Econometric Methods , New York: McGraw-Hill, I962.

Lansing, J. B. & Kisk, L. Family life cycle as an independent variable.American Sociological Review , Oct., 1957, 22,, 512-519.

Lansing,. J. B. & Morgan, J. N. Consumer finances over the life cycle. InL. Clsirk (Ed.), Thie Life Cycle and Consumer Behavior . New York: New YorkUniversity Press, 1955, 36-51. .t

Morgan, J. W. & Sonquist, J. A. Problems in the analysis of survey data anda proposal. Journal of the American Statistical Association , June,1963, 58, iH5-U35.

~

Sonquist, J. A. Multivariate Model Building , Survey Research Center,Ann Arbor: Institute for Social Research, University of Michigan, 1970.

Sonquist, J. A. & Morgan, J. N. The Detection of Interaction Effects ,

Survey Research Center, Monograph No. 35, Ann Arbor: Institute forSocial Research, University of Michigan, 196k.

Staelin, R. A note on dection of interaction. Public Opinion Quarterly ,

Fall, 1970, 3^!., it08-l4ll.

Staelin, R. Another look at AID. Journal of Advertising Research . October,1971, II No. 5 . 23-28.

U. S. Bureau of the Census. Methodology and Scores of Socioeconomic Statiis .

Working Paper No. 15, Washington, D.C. , I963.

Yankelovich, D. New criteria for market segmentation. Harvard Business Review ,

March-April, I96U, XLII , 83-90.

Page 50: Demographic segmentation of long distance behavior : data ......FacultyWorkingPapers DEMOGRAPHICSEGMENTATIONOFLONG DISTANCEBEHAVIOR:DATAANALYSIS &INDUCTIVEMODELBUILDING JagdishN.Sheth
Page 51: Demographic segmentation of long distance behavior : data ......FacultyWorkingPapers DEMOGRAPHICSEGMENTATIONOFLONG DISTANCEBEHAVIOR:DATAANALYSIS &INDUCTIVEMODELBUILDING JagdishN.Sheth
Page 52: Demographic segmentation of long distance behavior : data ......FacultyWorkingPapers DEMOGRAPHICSEGMENTATIONOFLONG DISTANCEBEHAVIOR:DATAANALYSIS &INDUCTIVEMODELBUILDING JagdishN.Sheth
Page 53: Demographic segmentation of long distance behavior : data ......FacultyWorkingPapers DEMOGRAPHICSEGMENTATIONOFLONG DISTANCEBEHAVIOR:DATAANALYSIS &INDUCTIVEMODELBUILDING JagdishN.Sheth
Page 54: Demographic segmentation of long distance behavior : data ......FacultyWorkingPapers DEMOGRAPHICSEGMENTATIONOFLONG DISTANCEBEHAVIOR:DATAANALYSIS &INDUCTIVEMODELBUILDING JagdishN.Sheth
Page 55: Demographic segmentation of long distance behavior : data ......FacultyWorkingPapers DEMOGRAPHICSEGMENTATIONOFLONG DISTANCEBEHAVIOR:DATAANALYSIS &INDUCTIVEMODELBUILDING JagdishN.Sheth
Page 56: Demographic segmentation of long distance behavior : data ......FacultyWorkingPapers DEMOGRAPHICSEGMENTATIONOFLONG DISTANCEBEHAVIOR:DATAANALYSIS &INDUCTIVEMODELBUILDING JagdishN.Sheth