1. Business Research MethodsBusiness Research- Data Analysis
2. ANOVA – One-way Multivariate Analysis – Factor Analysis, Discriminant Analysis, Cluster Analysis, Conjoint Analysis Statistical Analysis of Business Research: Bivariate Analysis (Chi- square) Data Analysis : Editing, Coding, Classification, Tabulation, Analysis and Interpretation Topics to be covered & Two-way classification
3. When the data is processed by doing statistical analysis and some conclusion can be drawn from it, it is known as information. Any raw facts or figures is known as data.Difference between Data andInformation
4. Steps in Processing of Data
Questionnaire checking, Editing, Coding, Tabulation, Data Cleaning ,Statistically adjusting the data Selecting a Data Analysis Strategy
5. Questionnaire checking – The initial step in questionnaire checking involves a check of all questionnaires for completeness and interviewing quality. A questionnaire returned from the field may be unacceptable for several reasons:1. Part of the questionnaire may be incomplete.2. The pattern of responses may indicate that the respondent did not understand or follow the instructions.3. The responses show little variance.4. The questionnaire is answered by someone who does not qualify for participation.5. The returned questionnaire is physically incomplete, one or more pages are missing.
6. Editing – Review of the questionnaires with the objective of increasing accuracy and precision. It consists of screening questionnaires to identify illegible, incomplete, inconsistent or ambiguous responses. This can be done in two stages:a) Field Editing – Objective of field editing is to make sure that proper procedure is followed in selecting the respondent, interview them and record their responses. The main problems faced in field editing are:1. Inappropriate Respondents – Instead of house owners, tenant is interviewed.2. Incomplete interviews, 3. Improper understanding, 4. Lack of consistency, 5. Legibility, 6, Fictitious interview – Questionnaires are filled by interviewer himself without conducting the interview.b) Office Editing – It is more thorough than field editing. Problems of consistency, rapport with respondents are some of the issues which get highlighted during office editing.
7. Example of Inconsistency:A respondent indicated that he doesn’t drink coffee, but when questioned about his favorite brand, he replied ‘BRU’.Treatment of Unsatisfactory ResponsesReturning to the field – Questionnaires with unsatisfactory responses may be returned to the field, where the interviewers recontact the respondents.Assigning missing value – Editor may assign missing values to unsatisfactory responses. This approach may be desirable if 1) the number of respondents with unsatisfactory responses is small, 2) the proportion of unsatisfactory responses for each of these respondents is small, or 3) the variables with unsatisfactory responses are not the key variables.Discarding unsatisfactory respondents – This is possible only when proportion of unsatisfactory respondents is small or the sample size is large.
8. Coding – Coding refers to those activities which helps in transforming edited questionnaires into a form that is ready for analysis. Coding speeds up the tabulation while editing eliminates errors. Coding involves assigning numbers or other symbols to answers so that the responses can be grouped into limited number of classes or categories. The code includes an indication of the column and data record it will occupy. For eg. Sex of respondents may be coded as 1for males and 2 for females.Questions Answers Codes 1. Do you own Yes 1 a vehicle? No 2 2. What is your Salaried S occupation? Business B Retired R
9. Tabulation – Refers to counting the number of cases that fall into various categories. The results are summarized in the form of statistical tables. The raw data is divided into groups and sub-groups. The counting and placing of data in a particular group and sub-group are done. The tabulation involves:1. Sorting and counting.2. Summarising of data.Tabulation may be of two types:1. Simple tabulation – In simple tabulation, a single variable is counted.2. Cross tabulation – Includes two or more variables, which are treated simultaneously.Tabulation can be done entirely by hand, or by machine, or by both hand and machine.
10. Sorting and counting of data: Sorting can be done as follows: Income (Rs) Tally Marks Frequencies 1000 IIII 4 1500 II 2 2000 III 3Format of a Blank table Table No. Sub heading indicates the rowTITLE – Number of children per family title or the rowHead Note – Unit of measurement headings. Caption indicates what Caption each column is Sub- Total meant for. Headin Body Body of the table g gives full Foot note information of
11. Kinds of Tabulation1. Simple or one-way tabulation – The multiple choice questions which allow only one answer may use on-way tabulation or univariate. The questions are predetermined and consist of counting the number of responses falling into a particular category and calculate the percentage.ExampleTable 14.1: Study of number of children in a family No. of children Family Percentage 0 10 5 1 30 15 2 70 35
12. 2. Cross Tabulation or Two-way Tabulation – This is known as Bivariate Tabulation.The data may include two or more variables.Eg. Popularity of a health drink among families having different incomes. Table 14.3: Use of Health Drink Income No. of 1 2 No. of per children families month per family (0) 1000 10 5 8 23 1001- 5 0 8 13 2000 2001- 20 10 12 42 3000
13. Data cleaning – Includes consistency checks and treatment of missing responses. Although preliminary consistency checks have been made during editing, the checks at this stage are more thorough and extensive, because they are made by computer.Consistency checks – Identify data that are out of range, logically inconsistent or have extreme values. For eg. A respondent may indicate that she charges long distance calls to a calling card, although she does not have one.
14. Treatment of missing responses – Missing responses represent values of a variable that are unknown, either because respondents provided ambiguous answers or their answers were not properly recorded.1. Substitute a Neutral Value – A neutral value, typically the mean to the variable, is substituted for the missing responses.2. Substitute an Imputed Response – The respondent’s pattern of responses to other questions are used to impute or calculate a suitable response to the missing questions.3. Casewise Deletion – Cases or respondents with any missing responses are discarded from the analysis.4. Pairwise deletion – Instead of discarding all cases with any missing values, the researcher uses only the cases or respondents with complete responses for each calculation. As a result, different calculations in an analysis may be based on different sample sizes.
15. Selecting a Data Analysis Strategy – The selection of a data analysis strategy should be based on the earlier steps of the marketing research process, known characteristics of the data, properties of statistical techniques and the background and philosophy of the researcher. Statistically Adjusting the Data – If any correction needs to be done for the statistical analysis, the data is adjusted accordingly.
16. Univariate TechniquesUnivariate techniques are appropriate when there is a single measurement of each element in the sample, or there are several measurements of each element but each variable is analyzed in isolation.Multivariate Techniques – Suitable for analyzing data when there are two or more measurements of each element and the variables are analyzed simultaneously. Concerned with the simultaneous relationships among two or more phenomena.Multivariate techniques differ from univariate techniques in that they shift the focus away from the levels (averages) and distributions (variances) of the phenomena, concentrating instead upon the degree of relationships among these phenomena.
17. In interdependence techniques, the variables are not classified as dependent or independent, rather the whole set of interdependent relationships is examined. Dependence Techniques are appropriate when one or more variables can be identified as dependent variables and the remaining as independent variables. Multivariate Techniques Dependence Interdependence Techniques Techniques •Discriminant •Factor Analysis Analysis •Cluster Analysis •Conjoint Analysis Bivariate Technique – Chi-square
18. It has only one parameter, namely degrees of freedom. Degrees of freedom refers to the number of classes to which a value can be assigned freely without exceeding the limitation placed. It is particularly useful in tests involving nominal data. The chi distribution is positive but an asymmetrical distribution. Through the test, we are able to determine the extent of difference between the theory or expected value and the observed or the actual value. χ2 = ∑ (O-E)2 EWhere, O = Observed frequencies E = Expected frequencies The chi-square test is represented by the symbol χ2 and owes its origin to greek letter chi. This test was first used by Karl Pearson and is one of the most widely used test today.Chi-square Test
19. Chi-square as a test of independence – We can establish if two or more attributes are associated or independent. Eg. A doctor may be interested in knowing if the new BCG vaccine is effective in controlling the target diseases or not. If there is any relationship between divorce rates and working wives.We start with the null hypothesis that there is no association between the attributes specified .Acceptance criteria – Calculated value is less than the table value, null hypothesis is accepted.Rejection Criteria – Calculated value is greater than the table value, null hypothesis is rejected. Applications of chi-square:
20. As a test of homogeneity – Used to find out if two or more randomly selected independent samples have been drawn from the same population or not. As a test of goodness of fit – It can be used to determine how well a theoretical distribution (poisson or normal) fits on the observed data or how appropriately a theoretical distribution fits empirical distribution.
21. Data given should be in original and absolute form and not in relative form like proportions, percentages etc. The observation used in the study should be collected on the random basis so that there is no element of bias. The number of items in each group should be reasonable. The total number of items should be large enough to guarantee a modem of similarity between the theoretically correct distribution and the sampling distribution being studied.Conditions for chi-square Test
22. It is easy to calculate and interpret. Additive property of chi-square test that allows the researcher to add the results of independent but related samples. It is a distribution free test and hence can be used in any type of population distribution. It can be used without making any assumption about the form of parent distribution or its parameters, since it is based on observed frequencies and not on parameters like mean and standard deviation.Merits of chi-square test
23. It is not as reliable as parametric tests. It is used for testing hypothesis but is not useful for estimation. It has some limitations associated with it.Demerits of chi-square
24. Discriminant AnalysisIn this analysis two or more groups are compared. Discriminant Analysis is a technique for analyzing data when the dependent variable is categorical and the independent variables are interval in nature. For eg. The dependent variable may be the choice of a brand of personal computer and the independent variables may be ratings of attributes of PCs on a seven-point Likert scale.Eg. Where discriminant analysis is used:1. Those who buy our brand and those who buy competitor’s brand.2. Heavy user, medium user and light user of the product.
25. DevelopmentThe value of dependent variable is calculated by using the data of independent variable. Z = b1x1+b2x2+b3x3+…………Where, Z = Discriminant score b1 = Discriminant weight for variable x1 = Independent variableThe objectives of discriminant analysis: Evaluation of the accuracy of classification. Classification of cases to one of the groups based on the values of the independent variable. Determination of which independent variables contribute to most of the intergroup differences. Examination of whether significant differences exist among the groups, in terms of independent variable.of discriminant functions, or linear combinations of independent variables, which will best discriminate between the categories of the dependent variable.
26. Example of Discriminant VariableA study of 294 consumers was undertaken to determine the correlates of rebate proneness, or the characteristics of consumers who respond favorably to rebate promotions.The independent variables were four factors related to household shopping attitudes and behaviors, and selected demographic characteristics (sex, age and income). The dependent variable was the respondents degree of rebate proneness, of which three levels were identified. Respondents who reported no rebate-triggered purchases during the past 12 months were classified as nonusers, those who reported one or two such purchases as light users and those with more than two purchases, frequent users of rebates.Two findngs emerged: Rebate sensitive customers associate less effort with fulfilling the requirements of the rebate purchase, And consumers who are aware of the regular prices of products are more likely to respond
27. Do heavy, medium, and light users of soft drinks differ in terms of their consumption of frozen foods? In terms of demographic characteristics, how do consumers who exhibit store loyalty differ from those who do not? When the dependent variable has two categories, the technique is known as two-group discriminant analysis. When three or more categories are involved, the technique is referred to as multiple discriminant analysis.Example of Discriminant analysis:
28. Conjoint AnalysisConjoint Analysis attempts to determine the relative importance consumers attach to salient attributes and the utilities they attach to the levels of attributes.This information is derived from consumer’s evaluation of brands, or brand profiles composed of these attributes and their levels. The respondents are presented with stimuli that consist of combinations of attribute levels. They are asked to evaluate these attributes in terms of their desirability.In a situation where the company would like to know the most desirable attributes or their combination for a new product or service, the use of conjoint analysis is most appropriate.
29. Eg. Of Conjoint Analysis for a laptop:Weight (3kg or 5 kg), Battery life (2 hrs or 4 hrs), Brand name ( Lenovo or Dell)Rank order the combination of the characteristics, i.e. 3kg, 2hrs, Lenovo 5kg, 4 hrs, Dell A comparison between the utility of a price level of Rs. 400 versus Rs 500, a delivery period of 1 week versus 2 weeks or an after-sales response of 24 hours versus 48 hours. An airline would like to know, which is the most desirable combination of attributes to a frequent traveller: a) Punctuality, b) Air fare, c)Quality of food served on the flight, and d) Hospitality and empathy shown.Example
30. Segmenting the market based on similarity of preferences for attribute levels. Determining the composition of the most preferred brand. Estimating market share of brands that differ n attribute levels. Determining the relative importance of attributes in the consumer choice process. Estimation of worth for the attribute chosen.Conjoint Analysis has been used in marketing for a variety of purposes: Collection of data. Identification of relevant products or service attributes.Three steps in Conjoint Analysis
31. Factor AnalysisFactor Analysis is a general name denoting a class of procedures primarily used for data reduction and summarization.In marketing research, there may be a large number of variables, most of which are correlated and which must be reduced to a manageable level.Relationships among sets of many interrelated variables are examined and represented in terms of a few underlying factors.
32. Example of Factor AnalysisRespondents in a survey were asked to rate the importance of 15 bank attributes. A five point scale ranging from not important to very important was employed. These data were analyzed via principal component analysis.A four factor solution resulted, with the factors being labeled as traditional services, convenience, visibility and competence. Traditional services included interest rates on loans, easy to read monthly statements and obtainability of loans. Convenience was comprised of convenient branch location, convenient ATM locations, speed of service, and convenient banking hours. The visibility factor included recommendations from friends and relatives and attractiveness of the physical structure.
33. Common factor analysis – If the researcher wants to analyse the components of the main factor, common factor analysis is used.Eg. Common Factor – Inconvenience inside a car.The components may be:1. Leg Room2. Seat arrangement3. Entering the rear seat4. Door locking mechanism. Principal Component Analysis – When the objective is to summarise information from a large set of variables into fewer factors, principal component factor analysis is used.Two most commonly employed factor analysis procedures:
34. Principal Component Factor AnalysisCustomer feedback about a two wheeler manufactured by a company.Identified six variables:1. Fuel efficiency2. Durability of life3. Comfort4. Spare parts availability5. Breakdown frequency6. PriceGrouping of 1,2,4,5 into Factor 1(Technical Factor)6 into Factor 2 ( Price Factor)3 into Factor 3 (Personal Factor)
35. To identify a smaller set of salient variables from a larger set for use in subsequent multivariate analysis. To identify a new, smaller set of uncorrelated variables to replace the original set of correlated variables. To identify underlying dimensions or factors that explain the correlations among a set of variables.Factor analysis is used in the following circumstances:
36. In pricing studies, to identify the characteristics of price sensitive consumers. In advertising studies, to understand the media consumption habits of the target market. In product research, factor analysis can be employed to determine the brand attributes that influence consumer choice. It can be used in market segmentation for identifying the underlying variables on which to group the customers.Applications of FactorAnalysis
37. To identify specific customer customer segment for the company’s brand. To classify persons or objects into small number of clusters or group.Cluster AnalysisCluster Analysis is a class of techniques used to classify objects or cases into relatively homogeneous groups called clusters. Objects in each cluster tend to be similar to each other and dissimilar to objects in the other clusters.Cluster Analysis is used:
38. Reducing data- Data reduction tool to develop clusters or subgroups of data. Selecting test markets Identifying new product opportunities – By clustering brands and products, competitive sets within the markets can be determined. A firm can examine its current offerings compared to those of its competitors to identify potential new product opportunities. Understanding buyer behaviors Segmenting may be clustered on the basis of benefits sought from the purchase of a product.Applications of ClusterAnalysis the market: For eg. Consumers
39. Example of Cluster AnalysisIn a study examining decision making pattern among international vacationers, 200 respondents were clustered into six clusters. Namely respondents preferring vacations for psychological purpose, educational purpose, social, relaxational , physiological and aesthetic purpose. The behavior of vacationers was studied and appropriate promotional strategies were designed for each of the clusters.
No comments:
Post a Comment
Note: only a member of this blog may post a comment.