mb0034 solved assignments

Upload: nigist-woldeselassie

Post on 03-Apr-2018

220 views

Category:

Documents


0 download

TRANSCRIPT

  • 7/28/2019 Mb0034 Solved Assignments

    1/52

    Q 1. Give examples of specific situations that would call for the following types of

    research, explaining why a) Exploratory research b) Descriptive research c) Diagnostic

    research d) Evaluation research.

    Ans.: Research may be classified crudely according to its major intent or the methods.According to the intent, research may be classified as:

    Basic (aka fundamental or pure) research is driven by a scientist's curiosity or interest in a

    scientific question. The main motivation is to expand man's knowledge, not to create or invent

    something. There is no obvious commercial value to the discoveries that result from basic

    research.

    For example, basic science investigations probe for answers to questions such as:

    How did the universe begin?

    What are protons, neutrons, and electrons composed of?

    How do slime molds reproduce?

    What is the specific genetic code of the fruit fly?

    Most scientists believe that a basic, fundamental understanding of all branches of science is

    needed in order for progress to take place. In other words, basic research lays down the

    foundation for the applied science that follows. If basic work is done first, then applied spin-offs

    often eventually result from this research. As Dr. George Smoot of LBNL says, "People cannot

    foresee the future well enough to predict what's going to develop from basic research. If we only

    did applied research, we would still be making better spears."

    Applied research is designed to solve practical problems of the modern world, rather than to

    acquire knowledge for knowledge's sake. One might say that the goal of the applied scientist is

    to improve the human condition.

    For example, applied researchers may investigate ways to:

    Improve agricultural crop production

    1

  • 7/28/2019 Mb0034 Solved Assignments

    2/52

    Treat or cure a specific disease

    Improve the energy efficiency of homes, offices, or modes of transportation

    Some scientists feel that the time has come for a shift in emphasis away from purely basic

    research and toward applied science. This trend, they feel, is necessitated by the problems

    resulting from global overpopulation, pollution, and the overuse of the earth's natural resources.Exploratory research provides insights into and comprehension of an issue or situation. It

    should draw definitive conclusions only with extreme caution. Exploratory research is a type of

    research conducted because a problem has not been clearly defined. Exploratory research

    helps determine the best research design, data collection method and selection of subjects.

    Given its fundamental nature, exploratory research often concludes that a perceived problem

    does not actually exist.

    Exploratory research often relies on secondary research such as reviewing available literature

    and/or data, or qualitative approaches such as informal discussions with consumers, employees,

    management or competitors, and more formal approaches through in-depth interviews, focus

    groups, projective methods, case studies or pilot studies. The Internet allows for research

    methods that are more interactive in nature: E.g., RSS feeds efficiently supply researchers with

    up-to-date information; major search engine search results may be sent by email to researchers

    by services such as Google Alerts; comprehensive search results are tracked over lengthy

    periods of time by services such as Google Trends; and Web sites may be created to attract

    worldwide feedback on any subject.

    The results of exploratory research are not usually useful for decision-making by themselves, but

    they can provide significant insight into a given situation. Although the results of qualitative

    research can give some indication as to the "why", "how" and "when" something occurs, it

    cannot tell us "how often" or "how many."

    Exploratory research is not typically generalizable to the population at large.

    A defining characteristic of causal research is the random assignment of participants to the

    conditions of the experiment; e.g., an Experimental and a Control Condition... Such assignmentresults in the groups being comparable at the beginning of the experiment. Any difference

    between the groups at the end of the experiment is attributable to the manipulated variable.

    Observational research typically looks for difference among "in-tact" defined groups. A common

    example compares smokers and non-smokers with regard to health problems. Causal

    conclusions can't be drawn from such a study because of other possible differences between the

    2

  • 7/28/2019 Mb0034 Solved Assignments

    3/52

    groups; e.g., smokers may drink more alcohol than non-smokers. Other unknown differences

    could exist as well. Hence, we may see a relation between smoking and health but a conclusion

    that smoking is a cause would not be warranted in this situation. (Cp)

    Descriptive research, also known as statistical research, describes data and characteristics

    about the population or phenomenon being studied. Descriptive research answers the questions

    who, what, where, when and how.Although the data description is factual, accurate and systematic, the research cannot describe

    what caused a situation. Thus, descriptive research cannot be used to create a causal

    relationship, where one variable affects another. In other words, descriptive research can be

    said to have a low requirement for internal validity.

    The description is used for frequencies, averages and other statistical calculations. Often the

    best approach, prior to writing descriptive research, is to conduct a survey investigation.

    Qualitative research often has the aim of description and researchers may follow-up with

    examinations of why the observations exist and what the implications of the findings are.

    In short descriptive research deals with everything that can be counted and studied. But there

    are always restrictions to that. Your research must have an impact to the life of the people

    around you. For example, finding the most frequent disease that affects the children of a town.

    The reader of the research will know what to do to prevent that disease thus; more people will

    live a healthy life.

    Diagnostic study: it is similar to descriptive study but with different focus. It is directed towards

    discovering what is happening and what can be done about. It aims at identifying the causes ofa problem and the possible solutions for it. It may also be concerned with discovering and

    testing whether certain variables are associated. This type of research requires prior knowledge

    of the problem, its thorough formulation, clear-cut definition of the given population, adequate

    methods for collecting accurate information, precise measurement of variables, statistical

    analysis and test of significance.

    Evaluation Studies: it is a type of applied research. It is made for assessing the effectiveness of

    social or economic programmes implemented or for assessing the impact of development of the

    project area. It is thus directed to assess or appraise the quality and quantity of an activity and

    its performance and to specify its attributes and conditions required for its success. It is

    concerned with causal relationships and is more actively guided by hypothesis. It is concerned

    also with change over time.

    3

  • 7/28/2019 Mb0034 Solved Assignments

    4/52

    Action research is a reflective process of progressive problem solving led by individuals

    working with others in teams or as part of a "community of practice" to improve the way they

    address issues and solve problems. Action research can also be undertaken by larger

    organizations or institutions, assisted or guided by professional researchers, with the aim of

    improving their strategies, practices, and knowledge of the environments within which they

    practice. As designers and stakeholders, researchers work with others to propose a new courseof action to help their community improve its work practices (Center for Collaborative Action

    Research). Kurt Lewin, then a professor at MIT, first coined the term action research in about

    1944, and it appears in his 1946 paper Action Research and Minority Problems. In that paper,

    he described action research as a comparative research on the conditions and effects of

    various forms of social action and research leading to social action that uses a spiral of steps,

    each of which is composed of a circle of planning, action, and fact-finding about the result of the

    action.

    Action research is an interactive inquiry process that balances problem solving actions

    implemented in a collaborative context with data-driven collaborative analysis or research to

    understand underlying causes enabling future predictions about personal and organizational

    change (Reason & Bradbury, 2001). After six decades of action research development, many

    methodologies have evolved that adjust the balance to focus more on the actions taken or more

    on the research that results from the reflective understanding of the actions. This tension exists

    between

    those that are more driven by the researchers agenda to those more driven byparticipants;

    Those that are motivated primarily by instrumental goal attainment to those

    motivated primarily by the aim of personal, organizational, or societal transformation;

    and

    1st-, to 2nd-, to 3rd-person research, that is, my research on my own action, aimed

    primarily at personal change; our research on our group (family/team), aimed primarily

    at improving the group; and scholarly research aimed primarily at theoretical

    generalization and/or large scale change.

    Action research challenges traditional social science, by moving beyond reflective knowledge

    created by outside experts sampling variables to an active moment-to-moment theorizing, data

    collecting, and inquiring occurring in the midst of emergent structure. Knowledge is always

    gained through action and for action. From this starting point, to question the validity of social

    4

  • 7/28/2019 Mb0034 Solved Assignments

    5/52

    knowledge is to question, not how to develop a reflective science about action, but how to

    develop genuinely well-informed action how to conduct an action science (Tolbert 2001).

    Q 2.In the context of hypothesis testing, briefly explain the difference between a) Null and

    alternative hypothesis b) Type 1 and type 2 error c) Two tailed and one tailed test d)

    Parametric and non-parametric tests.

    Ans.: Some basic concepts in the context of testing of hypotheses are explained below -

    11) Null Hypotheses and Alternative Hypotheses: In the context of statistical analysis,

    we often talk about null and alternative hypotheses. If we are to compare the

    superiority of method A with that of method B and we proceed on the assumption that

    both methods are equally good, then this assumption is termed as a null hypothesis.

    On the other hand, if we think that method A is superior, then it is known as an

    alternative hypothesis.

    These are symbolically represented as:

    Null hypothesis = H0 and Alternative hypothesis = Ha

    Suppose we want to test the hypothesis that the population mean is equal to the hypothesized

    mean ( H0) = 100. Then we would say that the null hypothesis is that the population mean is

    equal to the hypothesized mean 100 and symbolically we can express it as: H0: = H0=100

    If our sample results do not support this null hypothesis, we should conclude that something else

    is true. What we conclude rejecting the null hypothesis is known as an alternative hypothesis. Ifwe accept H0, then we are rejecting Ha and if we reject H0, then we are accepting Ha. For H0:

    = H0=100, we may consider three possible alternative hypotheses as follows:

    Alternative

    Hypotheses

    To be read as follows

    Ha: H0 (The alternative hypothesis is that the population mean is not equal to

    100 i.e., it may be more or less 100)

    Ha: > H0 (The alternative hypothesis is that the population mean is greater than

    100)Ha: < H0 (The alternative hypothesis is that the population mean is less than 100)

    5

  • 7/28/2019 Mb0034 Solved Assignments

    6/52

    The null hypotheses and the alternative hypotheses are chosen before the sample is drawn (the

    researcher must avoid the error of deriving hypotheses from the data he collects and testing the

    hypotheses from the same data). In the choice of null hypothesis, the following considerations

    are usually kept in view:

    1a. The alternative hypothesis is usually the one, which is to be proved, and the null

    hypothesis is the one that is to be disproved. Thus a null hypothesis represents thehypothesis we are trying to reject, while the alternative hypothesis represents all

    other possibilities.

    2b. If the rejection of a certain hypothesis when it is actually true involves great risk, it

    is taken as null hypothesis, because then the probability of rejecting it when it is

    true is (the level of significance) which is chosen very small.

    3c. The null hypothesis should always be a specific hypothesis i.e., it should not state

    an approximate value.

    Generally, in hypothesis testing, we proceed on the basis of the null hypothesis, keeping the

    alternative hypothesis in view. Why so? The answer is that on the assumption that the null

    hypothesis is true, one can assign the probabilities to different possible sample results, but this

    cannot be done if we proceed with alternative hypotheses. Hence the use of null hypotheses (at

    times also known as statistical hypotheses) is quite frequent.

    12) The Level of Significance: This is a very important concept in the context of

    hypothesis testing. It is always some percentage (usually 5%), which should be

    chosen with great care, thought and reason. In case we take the significance levelat 5%, then this implies that H0 will be rejected when the sampling result (i.e.,

    observed evidence) has a less than 0.05 probability of occurring if H0 is true. In

    other words, the 5% level of significance means that the researcher is willing to

    take as much as 5% risk rejecting the null hypothesis when it (H0) happens to be

    true. Thus the significance level is the maximum value of the probability of rejecting

    H0 when it is true and is usually determined in advance before testing the

    hypothesis.

    23) Decision Rule or Test of Hypotheses: Given a hypothesis Ha and an alternative

    hypothesis H0, we make a rule, which is known as a decision rule, according to

    which we accept H0 (i.e., reject Ha) or reject H0 (i.e., accept Ha). For instance, if

    H0 is that a certain lot is good (there are very few defective items in it), against Ha,

    that the lot is not good (there are many defective items in it), then we must decide

    6

  • 7/28/2019 Mb0034 Solved Assignments

    7/52

    the number of items to be tested and the criterion for accepting or rejecting the

    hypothesis. We might test 10 items in the lot and plan our decision saying that if

    there are none or only 1 defective item among the 10, we will accept H0; otherwise

    we will reject H0 (or accept Ha). This sort of basis is known as a decision rule.

    34) Type I & II Errors: In the context of testing of hypotheses, there are basically two

    types of errors that we can make. We may reject H0 when H0 is true and we mayaccept H0 when it is not true. The former is known as Type I and the latter is

    known as Type II. In other words, Type I error means rejection of hypotheses,

    which should have been accepted, and Type II error means accepting of

    hypotheses, which should have been rejected. Type I error is denoted by (alpha),

    also called as level of significance of test; and Type II error is denoted by (beta).

    Decision

    Accept H0 Reject H0

    H0 (true) Correct decision Type I error (

    error)Ho (false) Type II error ( error) Correct decision

    The probability of Type I error is usually determined in advance and is understood as the level of

    significance of testing the hypotheses. If type I error is fixed at 5%, it means there are about 5

    chances in 100 that we will reject H0 when H0 is true. We can control type I error just by fixing it

    at a lower level. For instance, if we fix it at 1%, we will say that the maximum probability of

    committing type I error would only be 0.01.

    But with a fixed sample size n, when we try to reduce type I error, the probability of committing

    type II error increases. Both types of errors cannot be reduced simultaneously, since there is a

    trade-off in business situations. Decision makers decide the appropriate level of type I error by

    examining the costs of penalties attached to both types of errors. If type I error involves time and

    trouble of reworking a batch of chemicals that should have been accepted, whereas type II errormeans taking a chance that an entire group of users of this chemicals compound will be

    poisoned, then in such a situation one should prefer a type I error to a type II error. As a result,

    one must set a very high level for type I error in ones testing techniques of a given hypothesis.

    Hence, in testing of hypotheses, one must make all possible efforts to strike an adequate

    balance between Type I & Type II error.

    7

  • 7/28/2019 Mb0034 Solved Assignments

    8/52

    15) Two Tailed Test & One Tailed Test: In the context of hypothesis testing, these two terms

    are quite important and must be clearly understood. A two-tailed test rejects the null hypothesis

    if, say, the sample mean is significantly higher or lower than the hypothesized value of the mean

    of the population. Such a test is inappropriate when we have H0: = H0 and Ha: H0 which

    may > H0 or

  • 7/28/2019 Mb0034 Solved Assignments

    9/52

    Q 3. Explain the difference between a causal relationship and correlation, with an

    example of each. What are the possible reasons for a correlation between two variables?

    Ans.: Correlation: The correlation is knowing what the consumer wants, and providing it.

    Marketing research looks at trends in sales and studies all of the variables, i.e. price, color,

    availability, and styles, and the best way to give the customer what he or she wants. If you cangive the customer what they want, they will buy, and let friends and family know where they got

    it. Making them happy makes the money.

    Casual relationship Marketing was first defined as a form ofmarketing developed from direct

    response marketing campaigns, which emphasizes customer retention and satisfaction, rather

    than a dominant focus on sales transactions.

    As a practice, Relationship Marketing differs from other forms of marketing in that it recognizes

    the long term value of customer relationships and extends communication beyond intrusive

    advertising and sales promotional messages.

    With the growth of the internet and mobile platforms, Relationship Marketing has continued to

    evolve and move forward as technology opens more collaborative and social communication

    channels. This includes tools for managing relationships with customers that goes beyond

    simple demographic and customer service data. Relationship Marketing extends to include

    Inbound Marketing efforts (a combination of search optimization and Strategic Content), PR,Social Media and Application Development.

    Just like Customer relationship management(CRM), Relationship Marketing is a broadly

    recognized, widely-implemented strategy for managing and nurturing a companys interactions

    with clients and sales prospects. It also involves using technology to, organize, synchronize

    business processes (principally sales and marketing activities) and most importantly, automate

    those marketing and communication activities on concrete marketing sequences that could run

    in autopilot (also known as marketing sequences). The overall goals are to find, attract, and win

    new clients, nurture and retain those the company already has, entice former clients back into

    the fold, and reduce the costs of marketing and client service. [1] Once simply a label for a

    category of software tools, today, it generally denotes a company-wide business strategy

    embracing all client-facing departments and even beyond. When an implementation is effective,

    9

    http://en.wikipedia.org/wiki/Marketinghttp://en.wikipedia.org/wiki/Marketing
  • 7/28/2019 Mb0034 Solved Assignments

    10/52

    people, processes, and technology work in synergy to increase profitability, and reduce

    operational costs

    Reasons for a correlation between two variables: Chance association, (the relationship is

    due to chance) or causative association (one variable causes the other).

    The information given by a correlation coefficient is not enough to define the dependence

    structure between random variables. The correlation coefficient completely defines the

    dependence structure only in very particular cases, for example when the distribution is a

    multivariate normal distribution. (See diagram above.) In the case of elliptic distributions it

    characterizes the (hyper-)ellipses of equal density, however, it does not completely characterize

    the dependence structure (for example, a multivariate t-distribution's degrees of freedom

    determine the level of tail dependence).

    Distance correlation and Brownian covariance / Brownian correlation [8][9] were introduced to

    address the deficiency of Pearson's correlation that it can be zero for dependent random

    variables; zero distance correlation and zero Brownian correlation imply independence.

    The correlation ratio is able to detect almost any functional dependency, or the entropy-based

    mutual information/total correlation which is capable of detecting even more general

    dependencies. The latter are sometimes referred to as multi-moment correlation measures, in

    comparison to those that consider only 2nd moment (pairwise or quadratic) dependence.

    The polychoric correlation is another correlation applied to ordinal data that aims to estimate the

    correlation between theorised latent variables.

    One way to capture a more complete view of dependence structure is to consider a copula

    between them.

    Q 4. Briefly explain any two factors that affect the choice of a sampling technique. What

    are the characteristics of a good sample?

    Ans.: The difference between non-probability and probability sampling is that non-probability

    sampling does not involve random selection and probability sampling does. Does that mean that

    non-probability samples aren't representative of the population? Not necessarily. But it does

    mean that non-probability samples cannot depend upon the rationale of probability theory. At

    10

    http://en.wikipedia.org/wiki/Multivariate_normal_distributionhttp://en.wikipedia.org/wiki/Distance_correlationhttp://en.wikipedia.org/wiki/Brownian_covariancehttp://en.wikipedia.org/wiki/#cite_note-7http://en.wikipedia.org/wiki/#cite_note-8http://en.wikipedia.org/wiki/Correlation_ratiohttp://en.wikipedia.org/wiki/Information_entropyhttp://en.wikipedia.org/wiki/Mutual_informationhttp://en.wikipedia.org/wiki/Total_correlationhttp://en.wikipedia.org/wiki/Polychoric_correlationhttp://en.wikipedia.org/wiki/Copula_(statistics)http://en.wikipedia.org/wiki/Multivariate_normal_distributionhttp://en.wikipedia.org/wiki/Distance_correlationhttp://en.wikipedia.org/wiki/Brownian_covariancehttp://en.wikipedia.org/wiki/#cite_note-7http://en.wikipedia.org/wiki/#cite_note-8http://en.wikipedia.org/wiki/Correlation_ratiohttp://en.wikipedia.org/wiki/Information_entropyhttp://en.wikipedia.org/wiki/Mutual_informationhttp://en.wikipedia.org/wiki/Total_correlationhttp://en.wikipedia.org/wiki/Polychoric_correlationhttp://en.wikipedia.org/wiki/Copula_(statistics)
  • 7/28/2019 Mb0034 Solved Assignments

    11/52

    least with a probabilistic sample, we know the odds or probability that we have represented the

    population well. We are able to estimate confidence intervals for the statistic. With non-

    probability samples, we may or may not represent the population well, and it will often be hard

    for us to know how well we've done so. In general, researchers prefer probabilistic or random

    sampling methods over non probabilistic ones, and consider them to be more accurate and

    rigorous. However, in applied social research there may be circumstances where it is notfeasible, practical or theoretically sensible to do random sampling. Here, we consider a wide

    range of non-probabilistic alternatives.

    We can divide non-probability sampling methods into two broad types:

    Accidentalorpurposive.

    Most sampling methods are purposive in nature because we usually approach the sampling

    problem with a specific plan in mind. The most important distinctions among these types of

    sampling methods are the ones between the different types of purposive sampling approaches.

    Accidental, Haphazard or Convenience Sampling

    One of the most common methods of sampling goes under the various titles listed here. I would

    include in this category the traditional "man on the street" (of course, now it's probably the

    "person on the street") interviews conducted frequently by television news programs to get a

    quick (although non representative) reading of public opinion. I would also argue that the typical

    use of college students in much psychological research is primarily a matter of convenience.

    (You don't really believe that psychologists use college students because they believe they're

    representative of the population at large, do you?). In clinical practice, we might use clients who

    are available to us as our sample. In many research contexts, we sample simply by asking for

    volunteers. Clearly, the problem with all of these types of samples is that we have no evidence

    that they are representative of the populations we're interested in generalizing to -- and in many

    cases we would clearly suspect that they are not.

    Purposive Sampling

    In purposive sampling, we sample with apurpose in mind. We usually would have one or more

    specific predefined groups we are seeking. For instance, have you ever run into people in a mall

    or on the street who are carrying a clipboard and who are stopping various people and asking if

    they could interview them? Most likely they are conducting a purposive sample (and most likely

    11

  • 7/28/2019 Mb0034 Solved Assignments

    12/52

    they are engaged in market research). They might be looking for Caucasian females between

    30-40 years old. They size up the people passing by and anyone who looks to be in that

    category they stop to ask if they will participate. One of the first things they're likely to do is verify

    that the respondent does in fact meet the criteria for being in the sample. Purposive sampling

    can be very useful for situations where you need to reach a targeted sample quickly and where

    sampling for proportionality is not the primary concern. With a purposive sample, you are likelyto get the opinions of your target population, but you are also likely to overweight subgroups in

    your population that are more readily accessible.

    All of the methods that follow can be considered subcategories of purposive sampling methods.

    We might sample for specific groups or types of people as in modal instance, expert, or quota

    sampling. We might sample for diversity as in heterogeneity sampling. Or, we might capitalize on

    informal social networks to identify specific respondents who are hard to locate otherwise, as in

    snowball sampling. In all of these methods we know what we want -- we are sampling with a

    purpose.

    Modal Instance Sampling

    In statistics, the mode is the most frequently occurring value in a distribution. In sampling, when

    we do a modal instance sample, we are sampling the most frequent case, or the "typical" case.

    In a lot of informal public opinion polls, for instance, they interview a "typical" voter. There are a

    number of problems with this sampling approach. First, how do we know what the "typical" or

    "modal" case is? We could say that the modal voter is a person who is of average age,educational level, and income in the population. But, it's not clear that using the averages of

    these is the fairest (consider the skewed distribution of income, for instance). And, how do you

    know that those three variables -- age, education, income -- are the only or even the most

    relevant for classifying the typical voter? What if religion or ethnicity is an important

    discriminator? Clearly, modal instance sampling is only sensible for informal sampling contexts.

    Expert Sampling

    Expert sampling involves the assembling of a sample of persons with known or demonstrable

    experience and expertise in some area. Often, we convene such a sample under the auspices of

    a "panel of experts." There are actually two reasons you might do expert sampling. First,

    because it would be the best way to elicit the views of persons who have specific expertise. In

    this case, expert sampling is essentially just a specific sub case of purposive sampling. But the

    12

  • 7/28/2019 Mb0034 Solved Assignments

    13/52

    other reason you might use expert sampling is to provide evidence for the validity of another

    sampling approach you've chosen. For instance, let's say you do modal instance sampling and

    are concerned that the criteria you used for defining the modal instance are subject to criticism.

    You might convene an expert panel consisting of persons with acknowledged experience and

    insight into that field or topic and ask them to examine your modal definitions and comment on

    their appropriateness and validity. The advantage of doing this is that you aren't out on your owntrying to defend your decisions -- you have some acknowledged experts to back you. The

    disadvantage is that even the experts can be, and often are, wrong.

    Quota Sampling

    In quota sampling, you select people non-randomly according to some fixed quota. There are

    two types of quota sampling:proportionaland non proportional. In proportional quota

    sampling you want to represent the major characteristics of the population by sampling a

    proportional amount of each. For instance, if you know the population has 40% women and 60%

    men, and that you want a total sample size of 100, you will continue sampling until you get those

    percentages and then you will stop. So, if you've already got the 40 women for your sample, but

    not the sixty men, you will continue to sample men but even if legitimate women respondents

    come along, you will not sample them because you have already "met your quota." The problem

    here (as in much purposive sampling) is that you have to decide the specific characteristics on

    which you will base the quota. Will it be by gender, age, education race, religion, etc.?

    Non-proportional quota sampling is a bit less restrictive. In this method, you specify theminimum number of sampled units you want in each category. Here, you're not concerned with

    having numbers that match the proportions in the population. Instead, you simply want to have

    enough to assure that you will be able to talk about even small groups in the population. This

    method is the non-probabilistic analogue of stratified random sampling in that it is typically used

    to assure that smaller groups are adequately represented in your sample.

    Heterogeneity Sampling

    We sample for heterogeneity when we want to include all opinions or views, and we aren't

    concerned about representing these views proportionately. Another term for this is sampling for

    diversity. In many brainstorming or nominal group processes (including concept mapping), we

    would use some form of heterogeneity sampling because our primary interest is in getting broad

    spectrum of ideas, not identifying the "average" or "modal instance" ones. In effect, what we

    13

  • 7/28/2019 Mb0034 Solved Assignments

    14/52

    would like to be sampling is not people, but ideas. We imagine that there is a universe of all

    possible ideas relevant to some topic and that we want to sample this population, not the

    population of people who have the ideas. Clearly, in order to get all of the ideas, and especially

    the "outlier" or unusual ones, we have to include a broad and diverse range of participants.

    Heterogeneity sampling is, in this sense, almost the opposite of modal instance sampling.

    Snowball Sampling

    In snowball sampling, you begin by identifying someone who meets the criteria for inclusion in

    your study. You then ask them to recommend others who they may know who also meet the

    criteria. Although this method would hardly lead to representative samples, there are times when

    it may be the best method available. Snowball sampling is especially useful when you are trying

    to reach populations that are inaccessible or hard to find. For instance, if you are studying the

    homeless, you are not likely to be able to find good lists of homeless people within a specific

    geographical area. However, if you go to that area and identify one or two, you may find that

    they know very well whom the other homeless people in their vicinity are and how you can find

    them.

    Characteristics of good Sample: The decision process is a complicated one. The researcher

    has to first identify the limiting factor or factors and must judiciously balance the conflicting

    factors. The various criteria governing the choice of the sampling technique are:

    11. Purpose of the Survey: What does the researcher aim at? If he intends to generalize

    the findings based on the sample survey to the population, then an appropriateprobability sampling method must be selected. The choice of a particular type of

    probability sampling depends on the geographical area of the survey and the size and

    the nature of the population under study.

    22.Measurability: The application of statistical inference theory requires computation of

    the sampling error from the sample itself. Only probability samples allow such

    computation. Hence, where the research objective requires statistical inference, the

    sample should be drawn by applying simple random sampling method or stratified

    random sampling method, depending on whether the population is homogenous or

    heterogeneous.

    33.Degree of Precision: Should the results of the survey be very precise, or could even

    rough results serve the purpose? The desired level of precision is one of the criteria

    for sampling method selection. Where a high degree of precision of results is desired,

    14

  • 7/28/2019 Mb0034 Solved Assignments

    15/52

    probability sampling should be used. Where even crude results would serve the

    purpose (E.g., marketing surveys, readership surveys etc), any convenient non-

    random sampling like quota sampling would be enough.

    44. Information about Population: How much information is available about the

    population to be studied? Where no list of population and no information about its

    nature are available, it is difficult to apply a probability sampling method. Then anexploratory study with non-probability sampling may be done to gain a better idea of

    the population. After gaining sufficient knowledge about the population through the

    exploratory study, an appropriate probability sampling design may be adopted.

    55. The Nature of the Population: In terms of the variables to be studied, is the

    population homogenous or heterogeneous? In the case of a homogenous population,

    even simple random sampling will give a representative sample. If the population is

    heterogeneous, stratified random sampling is appropriate.

    66. Geographical Area of the Study and the Size of the Population : If the area

    covered by a survey is very large and the size of the population is quite large, multi-

    stage cluster sampling would be appropriate. But if the area and the size of the

    population are small, single stage probability sampling methods could be used.

    77. Financial Resources: If the available finance is limited, it may become necessary to

    choose a less costly sampling plan like multistage cluster sampling, or even quota

    sampling as a compromise. However, if the objectives of the study and the desired

    level of precision cannot be attained within the stipulated budget, there is noalternative but to give up the proposed survey. Where the finance is not a constraint, a

    researcher can choose the most appropriate method of sampling that fits the research

    objective and the nature of population.

    88. Time Limitation: The time limit within which the research project should be

    completed restricts the choice of a sampling method. Then, as a compromise, it may

    become necessary to choose less time consuming methods like simple random

    sampling, instead of stratified sampling/sampling with probability proportional to size;

    or multi-stage cluster sampling, instead of single-stage sampling of elements. Of

    course, the precision has to be sacrificed to some extent.

    99. Economy: It should be another criterion in choosing the sampling method. It means

    achieving the desired level of precision at minimum cost. A sample is economical if the

    precision per unit cost is high, or the cost per unit of variance is low. The above criteria

    15

  • 7/28/2019 Mb0034 Solved Assignments

    16/52

    frequently conflict with each other and the researcher must balance and blend them to

    obtain a good sampling plan. The chosen plan thus represents an adaptation of the

    sampling theory to the available facilities and resources. That is, it represents a

    compromise between idealism and feasibility. One should use simple workable

    methods, instead of unduly elaborate and complicated techniques.

    Q 5. Select any topic for research and explain how you will use both secondary and

    primary sources to gather the required information.

    Ans.: Primary Sources of Data

    Primary sources are original sources from which the researcher directly collects data that has

    not been previously collected, e.g., collection of data directly by the researcher on brand

    awareness, brand preference, and brand loyalty and other aspects of consumer behavior, from a

    sample of consumers by interviewing them. Primary data is first hand information collected

    through various methods such as surveys, experiments and observation, for the purposes of the

    project immediately at hand.

    The advantages of primary data are

    1 It is unique to a particular research study

    2 It is recent information, unlike published information that is already available

    The disadvantages are

    1 It is expensive to collect, compared to gathering information from available

    sources

    2 Data collection is a time consuming process

    3 It requires trained interviewers and investigators

    2 Secondary Sources of Data

    These are sources containing data, which has been collected and compiled for another purpose.

    Secondary sources may be internal sources, such as annual reports, financial statements, sales

    reports, inventory records, minutes of meetings and other information that is available within the

    firm, in the form of a marketing information system. They may also be external sources, such as

    government agencies (e.g. census reports, reports of government departments), published

    16

  • 7/28/2019 Mb0034 Solved Assignments

    17/52

    sources (annual reports of currency and finance published by the Reserve Bank of India,

    publications of international organizations such as the UN, World Bank and International

    Monetary Fund, trade and financial journals, etc.), trade associations (e.g. Chambers of

    Commerce) and commercial services (outside suppliers of information).

    Methods of Data Collection:

    The researcher directly collects primary data from its original sources. In this case, theresearcher can collect the required data precisely according to his research needs and he can

    collect them when he wants and in the form that he needs it. But the collection of primary data is

    costly and time consuming. Yet, for several types of social science research, required data is not

    available from secondary sources and it has to be directly gathered from the primary sources.

    Primary data has to be gathered in cases where the available data is inappropriate, inadequate

    or obsolete. It includes: socio economic surveys, social anthropological studies of rural

    communities and tribal communities, sociological studies of social problems and social

    institutions, marketing research, leadership studies, opinion polls, attitudinal surveys, radio

    listening and T.V. viewing surveys, knowledge-awareness practice (KAP) studies, farm

    management studies, business management studies etc.

    There are various methods of primary data collection, including surveys, audits and panels,

    observation and experiments.

    1 Survey Research

    A survey is a fact-finding study. It is a method of research involving collection of data directly

    from a population or a sample at a particular time. A survey has certain characteristics:1 It is always conducted in a natural setting. It is a field study.

    2 It seeks responses directly from the respondents.

    3 It can cover a very large population.

    4 It may include an extensive study or an intensive study

    5 It covers a definite geographical area.

    A survey involves the following steps -

    1 Selection of a problem and its formulation

    2 Preparation of the research design

    3 Operation concepts and construction of measuring indexes and scales

    4 Sampling

    5 Construction of tools for data collection

    17

  • 7/28/2019 Mb0034 Solved Assignments

    18/52

    6 Field work and collection of data

    7 Processing of data and tabulation

    8Analysis of data

    9 Reporting

    There are four basic survey methods, which include:1 Personal interview

    2 Telephone interview

    3 Mail survey and

    4 Fax survey

    Personal Interview

    Personal interviewing is one of the prominent methods of data collection. It may be defined as a

    two-way systematic conversation between an investigator and an informant, initiated for

    obtaining information relevant to a specific study. It involves not only conversation, but also

    learning from the respondents gestures, facial expressions and pauses, and his environment.

    Interviewing may be used either as a main method or as a supplementary one in studies of

    persons. Interviewing is the only suitable method for gathering information from illiterate or less

    educated respondents. It is useful for collecting a wide range of data, from factual demographic

    data to highly personal and intimate information relating to a persons opinions, attitudes, values,

    beliefs, experiences and future intentions. Interviewing is appropriate when qualitative

    information is required, or probing is necessary to draw out the respondent fully. Where the areacovered for the survey is compact, or when a sufficient number of qualified interviewers are

    available, personal interview is feasible.

    Interview is often superior to other data-gathering methods. People are usually more willing to

    talk than to write. Once rapport is established, even confidential information may be obtained. It

    permits probing into the context and reasons for answers to questions.

    Interview can add flesh to statistical information. It enables the investigator to grasp the

    behavioral context of the data furnished by the respondents. It permits the investigator to seek

    clarifications and brings to the forefront those questions, which for some reason or the other the

    respondents do not want to answer. Interviewing as a method of data collection has certain

    characteristics. They are:

    18

  • 7/28/2019 Mb0034 Solved Assignments

    19/52

    1. The participants the interviewer and the respondent are strangers;

    hence, the investigator has to get himself/herself introduced to the

    respondent in an appropriate manner.

    2. The relationship between the participants is a transitory one. It has a fixed

    beginning and termination points. The interview proper is a fleeting,

    momentary experience for them.3. The interview is not a mere casual conversational exchange, but a

    conversation with a specific purpose, viz., obtaining information relevant to a

    study.

    4. The interview is a mode of obtaining verbal answers to questions put

    verbally.

    5. The interaction between the interviewer and the respondent need not

    necessarily be on a face-to-face basis, because the interview can also be

    conducted over the telephone.

    6. Although the interview is usually a conversation between two persons, it

    need not be limited to a single respondent. It can also be conducted with a

    group of persons, such as family members, or a group of children, or a

    group of customers, depending on the requirements of the study.

    7. The interview is an interactive process. The interaction between the

    interviewer and the respondent depends upon how they perceive each

    other.8. The respondent reacts to the interviewers appearance, behavior, gestures,

    facial expression and intonation, his perception of the thrust of the questions

    and his own personal needs. As far as possible, the interviewer should try to

    be closer to the social-economic level of the respondents.

    9. The investigator records information furnished by the respondent in the

    interview. This poses a problem of seeing that recording does not interfere

    with the tempo of conversation.

    10.Interviewing is not a standardized process like that of a chemical technician;

    it is rather a flexible, psychological process.

    3 Telephone Interviewing Telephone interviewing is a non-personal method of data collection.

    It may be used as a major method or as a supplementary method. It will be useful in the

    following situations:

    19

  • 7/28/2019 Mb0034 Solved Assignments

    20/52

    11.When the universe is composed of those persons whose names are listed in

    telephone directories, e.g. business houses, business executives, doctors

    and other professionals.

    12.When the study requires responses to five or six simple questions, e.g. a

    radio or television program survey.

    13.When the survey must be conducted in a very short period of time, providedthe units of study are listed in the telephone directory.

    14.When the subject is interesting or important to respondents, e.g. a survey

    relating to trade conducted by a trade association or a chamber of

    commerce, a survey relating to a profession conducted by the concerned

    professional association.

    15.When the respondents are widely scattered and when there are many call

    backs to make.

    4 Group InterviewsA group interview may be defined as a method of collecting primary data in

    which a number of individuals with a common interest interact with each other. In a personal

    interview, the flow of information is multi dimensional. The group may consist of about six to

    eight individuals with a common interest. The interviewer acts as the discussion leader. Free

    discussion is encouraged on some aspect of the subject under study. The discussion leader

    stimulates the group members to interact with each other. The desired information may be

    obtained through self-administered questionnaire or interview, with the discussion serving as a

    guide to ensure consideration of the areas of concern. In particular, the interviewers look forevidence of common elements of attitudes, beliefs, intentions and opinions among individuals in

    the group. At the same time, he must be aware that a single comment by a member can provide

    important insight. Samples for group interviews can be obtained through schools, clubs and

    other organized groups.

    5 Mail Survey The mail survey is another method of collecting primary data. This method

    involves sending questionnaires to the respondents with a request to complete them and return

    them by post. This can be used in the case of educated respondents only. The mail

    questionnaires should be simple so that the respondents can easily understand the questions

    and answer them. It should preferably contain mostly closed-ended and multiple choice

    questions, so that it could be completed within a few minutes. The distinctive feature of the mail

    survey is that the questionnaire is self-administered by the respondents themselves and the

    responses are recorded by them and not by the investigator, as in the case of personal interview

    20

  • 7/28/2019 Mb0034 Solved Assignments

    21/52

    method. It does not involve face-to-face conversation between the investigator and the

    respondent. Communication is carried out only in writing and this requires more cooperation

    from the respondents than verbal communication. The researcher should prepare a mailing list

    of the selected respondents, by collecting the addresses from the telephone directory of the

    association or organization to which they belong. The following procedures should be followed -

    a covering letter should accompany a copy of the questionnaire. It must explain to the

    respondent the purpose of the study and the importance of his cooperation to the success of the

    project. Anonymity must be assured. The sponsors identity may be revealed. However,

    when such information may bias the result, it is not desirable to reveal it. In this case, a

    disguised organization name may be used. A self-addressed stamped envelope should be

    enclosed in the covering letter.

    1After a few days from the date of mailing the questionnaires to the respondents, the

    researcher can expect the return of completed ones from them. The progress in return may be

    watched and at the appropriate stage, follow-up efforts can be made.

    The response rate in mail surveys is generally very low in developing countries like India.

    Certain techniques have to be adopted to increase the response rate. They are:

    11. Quality printing: The questionnaire may be neatly printed on quality light colored paper,

    so as to attract the attention of the respondent.

    22. Covering letter: The covering letter should be couched in a pleasant style, so as to

    attract and hold the interest of the respondent. It must anticipate objections and answerthem briefly. It is desirable to address the respondent by name.

    33. Advance information:Advance information can be provided to potential respondents by

    a telephone call, or advance notice in the newsletter of the concerned organization, or by

    a letter. Such preliminary contact with potential respondents is more successful than

    follow-up efforts.

    44. Incentives: Money, stamps for collection and other incentives are also used to induce

    respondents to complete and return the mail questionnaire.

    55. Follow-up-contacts: In the case of respondents belonging to an organization, they may

    be approached through someone in that organization known as the researcher.

    66. Larger sample size:A larger sample may be drawn than the estimated sample size. For

    example, if the required sample size is 1000, a sample of 1500 may be drawn. This may

    help the researcher to secure an effective sample size closer to the required size.

    21

  • 7/28/2019 Mb0034 Solved Assignments

    22/52

    7

    8Q 6. Case Study: You are engaged to carry out a market survey on behalf of a leading

    Newspaper that is keen to increase its circulation in Bangalore City, in order to

    ascertain reader habits and interests. Develop a title for the study; define the

    research problem and the objectives or questions to be answered by the study.

    Ans.:Title: Newspaper reading choices

    Research problem: A research problem is the situation that causes the researcher to feel

    apprehensive, confused and ill at ease. It is the demarcation of a problem area within a certain

    context involving the WHO or WHAT, the WHERE, the WHEN and the WHY of the problem

    situation.

    There are many problem situations that may give rise to research. Three sources usually

    contribute to problem identification. Own experience or the experience of others may be a

    source of problem supply. A second source could be scientific literature. You may read about

    certain findings and notice that a certain field was not covered. This could lead to a research

    problem. Theories could be a third source. Shortcomings in theories could be researched.

    Research can thus be aimed at clarifying or substantiating an existing theory, at clarifying

    contradictory findings, at correcting a faulty methodology, at correcting the inadequate or

    unsuitable use of statistical techniques, at reconciling conflicting opinions, or at solving existing

    practical problems

    Types of questions to be asked :For more than 35 years, the news about newspapers and

    young readers has been mostly bad for the newspaper industry. Long before any competition

    from cable television or Nintendo, American newspaper publishers were worrying about

    declining readership among the young.

    As early as 1960, at least 20 years prior to Music Television (MTV) or the Internet, media

    research scholars1 began to focus their studies on young adult readers' decreasing interest in

    newspaper content. The concern over a declining youth market preceded and perhaps

    foreshadowed today's fretting over market penetration. Even where circulation has grown or

    stayed stable, there is rising concern over penetration, defined as the percentage of occupied

    22

  • 7/28/2019 Mb0034 Solved Assignments

    23/52

    households in a geographic market that are served by a newspaper.2 Simply put, population

    growth is occurring more rapidly than newspaper readership in most communities.

    This study looks at trends in newspaper readership among the 18-to-34 age group and

    examines some of the choices young adults make when reading newspapers.

    One of the underlying concerns behind the decline in youth newspaper reading is the question of

    how young people view the newspaper. A number of studies explored how young readers

    evaluate and use newspaper content.

    Comparing reader content preferences over a 10-year period, Gerald Stone and Timothy

    Boudreau found differences between readers ages 18-34 and those 35-plus.16 Younger readers

    showed increased interest in national news, weather, sports, and classified advertisements over

    the decade between 1984 and 1994, while older readers ranked weather, editorials, and food

    advertisements higher. Interest in international news and letters to the editor was less among

    younger readers, while older readers showed less interest in reports of births, obituaries, and

    marriages.

    David Atkin explored the influence of telecommunication technology on newspaper readership

    among students in undergraduate media courses.17 He reported that computer-related

    technologies, including electronic mail and computer networks, were unrelated to newspaper

    readership. The study found that newspaper subscribers preferred print formats over electronic.

    In a study of younger, school-age children, Brian Brooks and James Kropp found that electronic

    newspapers could persuade children to become news consumers, but that young readers would

    choose an electronic newspaper over a printed one.18

    In an exploration of leisure reading among college students, Leo Jeffres and Atkin assessed

    dimensions of interest in newspapers, magazines, and books,19 exploring the influence of

    media use, non-media leisure, and academic major on newspaper content preferences. The

    study discovered that overall newspaper readership was positively related to students' focus onentertainment, job / travel information, and public affairs. However, the students' preference for

    reading as a leisure-time activity was related only to a public affairs focus. Content preferences

    for newspapers and other print media were related. The researchers found no significant

    differences in readership among various academic majors, or by gender, though there was a

    23

  • 7/28/2019 Mb0034 Solved Assignments

    24/52

    slight correlation between age and the public affairs readership index, with older readers more

    interested in news about public affairs.

    Methodology

    Sample

    Participants in this study (N=267) were students enrolled in 100- and 200-level English courses

    at a midwestern public university. Courses that comprise the framework for this sample were

    selected because they could fulfill basic studies requirements for all majors. A basic studies

    course is one that is listed within the core curriculum required for all students. The researcher

    obtained permission from seven professors to distribute questionnaires in the eight classes

    during regularly scheduled class periods. The students' participation was voluntary; two students

    declined. The goal of this sampling procedure was to reach a cross-section of students

    representing various fields of study. In all, 53 majors were represented.

    Of the 267 students who participated in the study, 65 (24.3 percent) were male and 177 (66.3

    percent) were female. A total of 25 participants chose not to divulge their genders. Ages ranged

    from 17 to 56, with a mean age of 23.6 years. This mean does not include the 32 respondents

    who declined to give their ages. A total of 157 participants (58.8 percent) said they were of the

    Caucasian race, 59 (22.1 percent) African American, 10 (3.8 percent) Asian, five (1.9 percent)

    African/Native American, two (.8 percent) Hispanic, two (.8 percent) Native American, and one

    (.4 percent) Arabic. Most (214) of the students were enrolled full time, whereas a few (28) were

    part-time students. The class rank breakdown was: freshmen, 45 (16.9 percent); sophomores,

    15 (5.6 percent); juniors, 33 (12.4 percent); seniors, 133 (49.8 percent); and graduate students,

    16 (6 percent).

    Procedure

    After two pre-tests and revisions, questionnaires were distributed and collected by the

    investigator. In each of the eight classes, the researcher introduced herself to the students as a

    journalism professor who was conducting a study on students' use of newspapers and other

    media. Each questionnaire included a cover letter with the researcher's name, address, and

    phone number. The researcher provided pencils and was available to answer questions if

    anyone needed further assistance. The average time spent on the questionnaires was 20

    24

  • 7/28/2019 Mb0034 Solved Assignments

    25/52

    minutes, with some individual students taking as long as an hour. Approximately six students

    asked to take the questionnaires home to finish. They returned the questionnaires to the

    researcher's mailbox within a couple of day.

    Assignment Set- 2

    Q 1.Discuss the relative advantages and disadvantages of the different methods of

    distributing questionnaires to the respondents of a study.

    Ans.: There are some alternative methods of distributing questionnaires to the respondents.

    They are:

    1) Personal delivery,

    2) Attaching the questionnaire to a product,

    3) Advertising the questionnaire in a newspaper or magazine, and

    4) News-stand inserts.

    Personal delivery: The researcher or his assistant may deliver the questionnaires to the

    potential respondents, with a request to complete them at their convenience. After a day or two,

    the completed questionnaires can be collected from them. Often referred to as the self-

    administered questionnaire method, it combines the advantages of the personal interview and

    the mail survey. Alternatively, the questionnaires may be delivered in person and therespondents may return the completed questionnaires through mail.

    Attaching questionnaire to a product:A firm test marketing a product may attach a

    questionnaire to a product and request the buyer to complete it and mail it back to the firm. A gift

    or a discount coupon usually rewards the respondent.

    Advertising the questionnaire: The questionnaire with the instructions for completion may be

    advertised on a page of a magazine or in a section of newspapers. The potential respondent

    completes it, tears it out and mails it to the advertiser. For example, the committee of Banks

    Customer Services used this method for collecting information from the customers of

    commercial banks in India. This method may be useful for large-scale studies on topics of

    common interest. Newsstand inserts: This method involves inserting the covering letter,

    questionnaire and self addressed reply-paid envelope into a random sample of newsstand

    copies of a newspaper or magazine.

    25

  • 7/28/2019 Mb0034 Solved Assignments

    26/52

    Advantages and Disadvantages:

    The advantages of Questionnaire are:

    this method facilitates collection of more accurate data for longitudinal studies than any other

    method, because under this method, the event or action is reported soon after its occurrence.

    this method makes it possible to have before and after designs made for field based studies.

    For example, the effect of public relations or advertising campaigns or welfare measures can bemeasured by collecting data before, during and after the campaign.

    the panel method offers a good way of studying trends in events, behavior or attitudes. For

    example, a panel enables a market researcher to study how brand preferences change from

    month to month; it enables an economics researcher to study how employment, income and

    expenditure of agricultural laborers change from month to month; a political scientist can study

    the shifts in inclinations of voters and the causative influential factors during an election. It is also

    possible to find out how the constituency of the various economic and social strata of society

    changes through time and so on.

    A panel study also provides evidence on the causal relationship between variables. For

    example, a cross sectional study of employees may show an association between their attitude

    to their jobs and their positions in the organization, but it does not indicate as to which comes

    first - favorable attitude or promotion. A panel study can provide data for finding an answer to

    this question.

    It facilities depth interviewing, because panel members become well acquainted with the field

    workers and will be willing to allow probing interviews.

    The major limitations or problems of Questionnaire method are:

    this method is very expensive. The selection of panel members, the payment of premiums,

    periodic training of investigators and supervisors, and the costs involved in replacing dropouts,

    all add to the expenditure.

    it is often difficult to set up a representative panel and to keep it representative. Many persons

    may be unwilling to participate in a panel study. In the course of the study, there may be

    frequent dropouts. Persons with similar characteristics may replace the dropouts. However,

    there is no guarantee that the emerging panel would be representative.

    A real danger with the panel method is panel conditioning i.e., the risk that repeated

    interviews may sensitize the panel members and they become untypical, as a result of being on

    the panel. For example, the members of a panel study of political opinions may try to appear

    26

  • 7/28/2019 Mb0034 Solved Assignments

    27/52

    consistent in the views they express on consecutive occasions. In such cases, the panel

    becomes untypical of the population it was selected to represent. One possible safeguard to

    panel conditioning is to give members of a panel only a limited panel life and then to replace

    them with persons taken randomly from a reserve list.

    the quality of reporting may tend to decline, due to decreasing interest, after a panel has been

    in operation for some time. Cheating by panel members or investigators may be a problem insome cases.

    Q 2. In processing data, what is the difference between measures of central tendency and

    measures of dispersion? What is the most important measure of central tendency and

    dispersion?

    Ans.: Measures of Central tendency:

    Arithmetic Mean

    The arithmetic mean is the most common measure of central tendency. It simply the sum of the

    numbers divided by the number of numbers. The symbol m is used for the mean of a population.

    The symbol M is used for the mean of a sample. The formula for m is shown below: m=

    X

    NWhere X is the sum of all the numbers in the numbers in the sample and N is the number of

    numbers in the sample. As an example, the mean of the numbers 1+2+3+6+8=

    20

    5=4 regardless of whether the numbers constitute the entire population or just a sample from

    the population.

    The table, Number of touchdown passes, shows the number of touchdown (TD) passes thrown

    by each of the 31 teams in the National Football League in the 2000 season. The mean number

    of touchdown passes thrown is 20.4516 as shown below. m=X

    N=

    634

    31=20.4516

    27

    http://cnx.org/content/m11061/latest/#table1http://cnx.org/content/m11061/latest/#table1
  • 7/28/2019 Mb0034 Solved Assignments

    28/52

    37 33 33 32 29 28 28 2322 22 22 21 21 21 20 2019 19 18 18 18 18 16 1514 14 14 12 12 9 6Table 1: Number of touchdown passes

    Although the arithmetic mean is not the only "mean" (there is also a geometric mean), it is by far

    the most commonly used. Therefore, if the term "mean" is used without specifying whether it is

    the arithmetic mean, the geometric mean, or some other mean, it is assumed to refer to the

    arithmetic mean.

    Median

    The median is also a frequently used measure of central tendency. The median is the midpoint

    of a distribution: the same number of scores is above the median as below it. For the data in the

    table, Number of touchdown passes, there are 31 scores. The 16th highest score (which equals

    20) is the median because there are 15 scores below the 16th score and 15 scores above the

    16th score. The median can also be thought of as the 50th percentile.Let's return to the made up example of the quiz on which you made a three discussed previously

    in the module Introduction to Central Tendency and shown in Table 2.

    Student Dataset 1 Dataset 2 Dataset 3

    You 3 3 3John's 3 4 2Maria's 3 4 2Shareecia's 3 4 2Luther's 3 5 1Table 2: Three possible datasets for the 5-point make-up

    quizFor Dataset 1, the median is three, the same as your score. For Dataset 2, the median is 4.

    Therefore, your score is below the median. This means you are in the lower half of the class.

    Finally for Dataset 3, the median is 2. For this dataset, your score is above the median and

    therefore in the upper half of the distribution.

    Computation of the Median: When there is an odd number of numbers, the median is simply the

    middle number. For example, the median of 2, 4, and 7 is 4. When there is an even number of

    numbers, the median is the mean of the two middle numbers. Thus, the median of the numbers2, 4, 7, 12 is

    4+7

    2=5.5.

    Mode

    28

    http://cnx.org/content/m11061/latest/#table1http://cnx.org/content/m10805/latest/http://cnx.org/content/m10942/latest/http://cnx.org/content/m11061/latest/#table2http://cnx.org/content/m11061/latest/#table1http://cnx.org/content/m10805/latest/http://cnx.org/content/m10942/latest/http://cnx.org/content/m11061/latest/#table2
  • 7/28/2019 Mb0034 Solved Assignments

    29/52

    The mode is the most frequently occurring value. For the data in the table, Number of

    touchdown passes, the mode is 18 since more teams (4) had 18 touchdown passes than any

    other number of touchdown passes. With continuous data such as response time measured to

    many decimals, the frequency of each value is one since no two scores will be exactly the same

    (see discussion ofcontinuous variables). Therefore the mode of continuous data is normally

    computed from a grouped frequency distribution. The Grouped frequency distributiontableshows a grouped frequency distribution for the target response time data. Since the interval with

    the highest frequency is 600-700, the mode is the middle of that interval (650).

    Range Frequency

    500-600 3600-700 6700-800 5800-900 5900-1000 01000-1100 1Table 3: Grouped frequency

    distribution

    Measures of Dispersion: A measure of statistical dispersion is a real numberthat is zero if all

    the data are identical, and increases as the data becomes more diverse. It cannot be less than

    zero.

    Most measures of dispersion have the same scale as the quantity being measured. In otherwords, if the measurements have units, such as metres or seconds, the measure of dispersion

    has the same units. Such measures of dispersion include:

    Standard deviation

    Interquartile range

    Range

    Mean difference

    Median absolute deviation

    Average absolute deviation (or simply called average deviation)

    Distance standard deviation

    These are frequently used (together with scale factors) as estimators ofscale parameters, in

    which capacity they are called estimates of scale.

    29

    http://cnx.org/content/m11061/latest/#table1http://cnx.org/content/m11061/latest/#table1http://cnx.org/content/m10868/latest/http://cnx.org/content/m11061/latest/#table3http://cnx.org/content/m11061/latest/#table3http://en.wikipedia.org/wiki/Real_numberhttp://en.wikipedia.org/wiki/Units_of_measurementhttp://en.wikipedia.org/wiki/Standard_deviationhttp://en.wikipedia.org/wiki/Interquartile_rangehttp://en.wikipedia.org/wiki/Range_(statistics)http://en.wikipedia.org/wiki/Mean_differencehttp://en.wikipedia.org/wiki/Median_absolute_deviationhttp://en.wikipedia.org/wiki/Average_absolute_deviationhttp://en.wikipedia.org/wiki/Distance_standard_deviationhttp://en.wikipedia.org/wiki/Scale_factorhttp://en.wikipedia.org/wiki/Estimatorhttp://en.wikipedia.org/wiki/Scale_parameterhttp://cnx.org/content/m11061/latest/#table1http://cnx.org/content/m11061/latest/#table1http://cnx.org/content/m10868/latest/http://cnx.org/content/m11061/latest/#table3http://en.wikipedia.org/wiki/Real_numberhttp://en.wikipedia.org/wiki/Units_of_measurementhttp://en.wikipedia.org/wiki/Standard_deviationhttp://en.wikipedia.org/wiki/Interquartile_rangehttp://en.wikipedia.org/wiki/Range_(statistics)http://en.wikipedia.org/wiki/Mean_differencehttp://en.wikipedia.org/wiki/Median_absolute_deviationhttp://en.wikipedia.org/wiki/Average_absolute_deviationhttp://en.wikipedia.org/wiki/Distance_standard_deviationhttp://en.wikipedia.org/wiki/Scale_factorhttp://en.wikipedia.org/wiki/Estimatorhttp://en.wikipedia.org/wiki/Scale_parameter
  • 7/28/2019 Mb0034 Solved Assignments

    30/52

    All the above measures of statistical dispersion have the useful property that they are location-

    invariant, as well as linear in scale. So if a random variableXhas a dispersion ofSX then a

    linear transformationY= aX+ b forreala and b should have dispersion SY= |a|SX.

    Other measures of dispersion are dimensionless (scale-free). In other words, they have no

    units even if the variable itself has units. These include:

    Coefficient of variation

    Quartile coefficient of dispersion

    Relative mean difference, equal to twice the Gini coefficient

    There are other measures of dispersion:

    Variance (the square of the standard deviation) location-invariant but not linear in

    scale.

    Variance-to-mean ratio mostly used forcount data when the term coefficient of

    dispersion is used and when this ratio is dimensionless, as count data are themselves

    dimensionless: otherwise this is not scale-free.

    Some measures of dispersion have specialized purposes, among them theAllan variance and

    the Hadamard variance.

    Forcategorical variables, it is less common to measure dispersion by a single number. See

    qualitative variation. One measure that does so is the discrete entropy.

    Sources of statistical dispersion

    In the physical sciences, such variability may result only from random measurement errors:

    instrument measurements are often not perfectly precise, i.e., reproducible. One may assume

    that the quantity being measured is unchanging and stable, and that the variation between

    measurements is due to observational error.

    In the biological sciences, this assumption is false: the variation observed might be intrinsicto

    the phenomenon: distinct members of a population differ greatly. This is also seen in the arena

    of manufactured products; even there, the meticulous scientist finds variation.The simple model

    30

    http://en.wikipedia.org/wiki/Random_variablehttp://en.wikipedia.org/wiki/Linear_transformationhttp://en.wikipedia.org/wiki/Real_numberhttp://en.wikipedia.org/wiki/Dimensionlesshttp://en.wikipedia.org/wiki/Coefficient_of_variationhttp://en.wikipedia.org/wiki/Quartile_coefficient_of_dispersionhttp://en.wikipedia.org/wiki/Relative_mean_differencehttp://en.wikipedia.org/wiki/Gini_coefficienthttp://en.wikipedia.org/wiki/Variancehttp://en.wikipedia.org/wiki/Variance-to-mean_ratiohttp://en.wikipedia.org/wiki/Count_datahttp://en.wikipedia.org/wiki/Coefficient_of_dispersionhttp://en.wikipedia.org/wiki/Coefficient_of_dispersionhttp://en.wikipedia.org/wiki/Dimensionlesshttp://en.wikipedia.org/wiki/Allan_variancehttp://en.wikipedia.org/w/index.php?title=Hadamard_variance&action=edit&redlink=1http://en.wikipedia.org/wiki/Categorical_variablehttp://en.wikipedia.org/wiki/Qualitative_variationhttp://en.wikipedia.org/wiki/Information_entropyhttp://en.wikipedia.org/wiki/Accuracy_and_precisionhttp://en.wikipedia.org/wiki/Observational_errorhttp://en.wikipedia.org/wiki/Random_variablehttp://en.wikipedia.org/wiki/Linear_transformationhttp://en.wikipedia.org/wiki/Real_numberhttp://en.wikipedia.org/wiki/Dimensionlesshttp://en.wikipedia.org/wiki/Coefficient_of_variationhttp://en.wikipedia.org/wiki/Quartile_coefficient_of_dispersionhttp://en.wikipedia.org/wiki/Relative_mean_differencehttp://en.wikipedia.org/wiki/Gini_coefficienthttp://en.wikipedia.org/wiki/Variancehttp://en.wikipedia.org/wiki/Variance-to-mean_ratiohttp://en.wikipedia.org/wiki/Count_datahttp://en.wikipedia.org/wiki/Coefficient_of_dispersionhttp://en.wikipedia.org/wiki/Coefficient_of_dispersionhttp://en.wikipedia.org/wiki/Dimensionlesshttp://en.wikipedia.org/wiki/Allan_variancehttp://en.wikipedia.org/w/index.php?title=Hadamard_variance&action=edit&redlink=1http://en.wikipedia.org/wiki/Categorical_variablehttp://en.wikipedia.org/wiki/Qualitative_variationhttp://en.wikipedia.org/wiki/Information_entropyhttp://en.wikipedia.org/wiki/Accuracy_and_precisionhttp://en.wikipedia.org/wiki/Observational_error
  • 7/28/2019 Mb0034 Solved Assignments

    31/52

    of a stable quantity is preferred when it is tenable. Each phenomenon must be examined to see

    if it warrants such a simplification.

    Q 3. What are the characteristics of a good research design? Explain how the research

    design for exploratory studies is different from the research design for descriptive and

    diagnostic studies.

    Ans.: Good research design:Much contemporary social research is devoted to examining

    whether a program, treatment, or manipulation causes some outcome or result. For example, we

    might wish to know whether a new educational program causes subsequent achievement score

    gains, whether a special work release program for prisoners causes lower recidivism rates,

    whether a novel drug causes a reduction in symptoms, and so on. Cook and Campbell (1979)

    argue that three conditions must be met before we can infer that such a cause-effect relation

    exists:

    1. Covariation. Changes in the presumed cause must be related to changes in the

    presumed effect. Thus, if we introduce, remove, or change the level of a treatment or

    program, we should observe some change in the outcome measures.

    2. Temporal Precedence. The presumed cause must occur prior to the presumed effect.

    3. No Plausible Alternative Explanations. The presumed cause must be the only

    reasonable explanation for changes in the outcome measures. If there are other factors,

    which could be responsible for changes in the outcome measures, we cannot beconfident that the presumed cause-effect relationship is correct.

    In most social research the third condition is the most difficult to meet. Any number of factors

    other than the treatment or program could cause changes in outcome measures. Campbell and

    Stanley (1966) and later, Cook and Campbell (1979) list a number of common plausible

    alternative explanations (or, threats to internal validity). For example, it may be that some

    historical event which occurs at the same time that the program or treatment is instituted was

    responsible for the change in the outcome measures; or, changes in record keeping or

    measurement systems which occur at the same time as the program might be falsely attributed

    to the program. The reader is referred to standard research methods texts for more detailed

    discussions of threats to validity.

    31

  • 7/28/2019 Mb0034 Solved Assignments

    32/52

    This paper is primarily heuristic in purpose. Standard social science methodology textbooks

    (Cook and Campbell 1979; Judd and Kenny, 1981) typically present an array of research

    designs and the alternative explanations, which these designs rule out or minimize. This tends to

    foster a "cookbook" approach to research design - an emphasis on the selection of an available

    design rather than on the construction of an appropriate research strategy. While standard

    designs may sometimes fit real-life situations, it will often be necessary to "tailor" a researchdesign to minimize specific threats to validity. Furthermore, even if standard textbook designs

    are used, an understanding of the logic of design construction in general will improve the

    comprehension of these standard approaches. This paper takes a structural approach to

    research design. While this is by no means the only strategy for constructing research designs, it

    helps to clarify some of the basic principles of design logic.

    Minimizing Threats to Validity

    Good research designs minimize the plausible alternative explanations for the hypothesized

    cause-effect relationship. But such explanations may be ruled out or minimized in a number of

    ways other than by design. The discussion, which follows, outlines five ways to minimize threats

    to validity, one of which is by research design:

    1. By Argument. The most straightforward way to rule out a potential threat to validity is to

    simply argue that the threat in question is not a reasonable one. Such an argument may

    be made eithera prioriora posteriori, although the former will usually be more convincingthan the latter. For example, depending on the situation, one might argue that an

    instrumentation threat is not likely because the same test is used for pre and post test

    measurements and did not involve observers who might improve, or other such factors. In

    most cases, ruling out a potential threat to validity by argument alone will be weaker than

    the other approaches listed below. As a result, the most plausible threats in a study

    should not, except in unusual cases, be ruled out by argument only.

    2. By Measurement or Observation. In some cases it will be possible to rule out a threat

    by measuring it and demonstrating that either it does not occur at all or occurs so

    minimally as to not be a strong alternative explanation for the cause-effect relationship.

    Consider, for example, a study of the effects of an advertising campaign on subsequent

    sales of a particular product. In such a study, history (i.e., the occurrence of other events

    which might lead to an increased desire to purchase the product) would be a plausible

    32

  • 7/28/2019 Mb0034 Solved Assignments

    33/52

    alternative explanation. For example, a change in the local economy, the removal of a

    competing product from the market, or similar events could cause an increase in product

    sales. One might attempt to minimize such threats by measuring local economic

    indicators and the availability and sales of competing products. If there is no change in

    these measures coincident with the onset of the advertising campaign, these threats

    would be considerably minimized. Similarly, if one is studying the effects of specialmathematics training on math achievement scores of children, it might be useful to

    observe everyday classroom behavior in order to verify that students were not receiving

    any additional math training to that provided in the study.

    3. By Design. Here, the major emphasis is on ruling out alternative explanations by adding

    treatment or control groups, waves of measurement, and the like. This topic will be

    discussed in more detail below.

    4. By Analysis. There are a number of ways to rule out alternative explanations using

    statistical analysis. One interesting example is provided by Jurs and Glass (1971). They

    suggest that one could study the plausibility of an attrition or mortality threat by

    conducting a two-way analysis of variance. One factor in this study would be the original

    treatment group designations (i.e., program vs. comparison group), while the other factor

    would be attrition (i.e., dropout vs. non-dropout group). The dependent measure could be

    the pretest or other available pre-program measures. A main effect on the attrition factor

    would be indicative of a threat to external validity or generalizability, while an interaction

    between group and attrition factors would point to a possible threat to internal validity.Where both effects occur, it is reasonable to infer that there is a threat to both internal and

    external validity.

    The plausibility of alternative explanations might also be minimized using covariance

    analysis. For example, in a study of the effects of "workfare" programs on social welfare

    caseloads, one plausible alternative explanation might be the status of local economic

    conditions. Here, it might be possible to construct a measure of economic conditions and

    include that measure as a covariate in the statistical analysis. One must be careful when

    using covariance adjustments of this type -- "perfect" covariates do not exist in most

    social research and the use of imperfect covariates will not completely adjust for potential

    alternative explanations. Nevertheless causal assertions are likely to be strengthened by

    demonstrating that treatment effects occur even after adjusting on a number of good

    covariates.

    33

  • 7/28/2019 Mb0034 Solved Assignments

    34/52

    5. By Preventive Action. When potential threats are anticipated some type of preventive

    action can often rule them out. For example, if the program is a desirable one, it is likely

    that the comparison group would feel jealous or demoralized. Several actions can be

    taken to minimize the effects of these attitudes including offering the program to the

    comparison group upon completion of the study or using program and comparison groups

    which have little opportunity for contact and communication. In addition, auditing methodsand quality control can be used to track potential experimental dropouts or to insure the

    standardization of measurement.

    The five categories listed above should not be considered mutually exclusive. The inclusion of

    measurements designed to minimize threats to validity will obviously be related to the design

    structure and is likely to be a factor in the analysis. A good research plan should, where

    possible. make use of multiple methods for reducing threats. In general, reducing a particular

    threat by design or preventive action will probably be stronger than by using one of the other

    three approaches. The choice of which strategy to use for any particular threat is complex and

    depends at least on the cost of the strategy and on the potential seriousness of the threat.

    Design Construction

    Basic Design Elements. Most research designs can be constructed from four basic elements:

    1. Time. A causal relationship, by its very nature, implies that some time has elapsed

    between the occurrence of the cause and the consequent effect. While for some

    phenomena the elapsed time might be measured in microseconds and therefore might be

    unnoticeable to a casual observer, we normally assume that the cause and effect in social

    science arenas do not occur simultaneously, In design notation we indicate this temporal

    element horizontally - whatever symbol is used to indicate the presumed cause would be

    placed to the left of the symbol indicating measurement of the effect. Thus, as we read

    from left to right in design notation we are reading across time. Complex designs might

    involve a lengthy sequence of observations and programs or treatments across time.

    2. Program(s) or Treatment(s). The presumed cause may be a program or treatment

    under the explicit control of the researcher or the occurrence of some natural event or

    program not explicitly controlled. In design notation we usually depict a presumed cause

    with the symbol "X". When multiple programs or treatments are being studied using the

    same design, we can keep the programs distinct by using subscripts such as "X 1" or "X2".

    34

  • 7/28/2019 Mb0034 Solved Assignments

    35/52