0% found this document useful (0 votes)

14 views55 pages

Data Preperation and SPSS Intro

The document discusses the process of data preparation and descriptive statistics for marketing research. It outlines the steps for data preparation including questionnaire checking, editing, coding, transcribing, and cleaning. Descriptive statistics are then used to summarize the data, including measures of central tendency like the mean, median, and mode as well as measures of dispersion like range, variance, and standard deviation.

Uploaded by

seif.yazhord

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

14 views55 pages

Data Preperation and SPSS Intro

Uploaded by

seif.yazhord

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 55

Marketing Research

Data Preparation &

Descriptive Statistics
Overview

 Data Preparation Process

 Basic Descriptive Statistics

 Measure of Dispersion
Data Preparation Process

Preliminary Plan for Data Analysis

1. Questionnaire Checking
2. Editing
3. Coding
4. Transcribing
5. Data Cleaning

Data Analysis Strategy

Step 1: Questionnaire Checking

 What are the reasons for a questionnaire returned from the

field to be unacceptable?

Parts of the questionnaire may be incomplete.

The pattern of responses may indicate that the respondent did not
understand or follow the instructions .

The responses show little variance.

One or more pages are missing.

The questionnaire is received after the pre-established cut-off date.

The questionnaire is answered by someone who does not qualify for

participation.
Step 2: Editing

 A review of the questionnaires with the objective of

increasing accuracy & precision

 Treatment of Unsatisfactory Responses

 Return to the field
 Assign missing data
 Discard unsatisfactory responses
Step 3: Coding

 The assignment of a code to represent a specific

response to a specific question along with the data
record and column position that code will occupy

 A Codebook contains coding instructions and the

necessary information about variables in the dataset

 Steps:
1. Transforming responses to each question into a set of
meaningful categories
2. Assigning numerical codes to the categories
3. Creating a data set suitable for computer analysis
1. Transforming Responses into Meaningful
Categories

 A structured question is pre-categorized

 Responses to a non-structured or open-ended

questions to be grouped into a meaningful and
manageable set of categories
 Other? please specify
2. Assigning Numerical Codes

 Assign appropriate numerical codes to responses

that are not already in quantified form

 To assign numerical codes, the researcher should

facilitate computer manipulation and analysis of
responses
Example

 A questionnaire was collected for a fast food chain

research with 200 completed responses for the
following questions
 Rate you preference to eat in a familiar restaurant (1= Weak
Preference, 7= Strong Preference
 Rate the restaurant in terms of
 Quality of food (1= Poor , 7 = Excellent)
 Quantity of food (1= Poor , 7 = Excellent)
 Value For Money (1= Poor , 7 = Excellent)
 Service Quality (1= Poor , 7 = Excellent)
 Please indicate your household income
Less than 20,000 EGP (=1) 20,000 EGP- 34,999 EGP (=2)
35,000 EGP- 49,999 EGP (=3) 50,000 EGP- 74,999 EGP (=4)
75,000 EGP- 99,999 EGP (=5) 100,000 EGP or More (=6)
Codebook ( In Appendix)

Column Variable Variable Question Coding

Number Number Name Number Instructions
1 1 ID 1 to 200 as coded
2 2 Preference 1 Input the number circled.
1=Weak Preference
7=Strong Preference

3 3 Quality 2 Input the number circled.

1=Poor
7=Excellent

4 4 Quantity 3 Input the number circled.

1=Poor
7=Excellent

5 5 Value 4 Input the number circled.

1=Poor
7=Excellent

6 6 Service 5 Input the number circled.

1=Poor
7=Excellent
Codebook Excerpt (Cont.)

Column Variable Variable Question Coding

Number Number Name Number Instructions
7 7 Gender 6 Input the number
selected
1=Female
2=Male

8 8 Income 7 Input the number

circled.
1 = Less than $20,000
2 = $20,000 to 34,999
3 = $35,000 to 49,999
4 = $50,000 to 74,999
5 = $75,000 to 99,999
6 = $100,00 or more
Coding Multiple Response

 Which of the following countries have you visited during the

past 12 months? (Mark all that apply)
________Canada
________England
________France
________Germany
________Japan
________Mexico

 How to code it?

Coding Multiple Response

 Which of the following countries have you visited during the

past 12 months? (Mark all that apply)
________Canada
________England
________France
________Germany
________Japan
________Mexico

 How to code it: Need 6 variables, each relating to a specific

country and having two possible values (Ex: 1= “Yes” and 0 =
“No”)
Rank Order Question

 Please rank the following fast-food restaurants by placing

a 1 beside the restaurant you think is best overall, a 2
beside the restaurant you think is second best, and so on.
__________Burger King
__________McDonald's
__________Wendy's
__________Hardy’s

 How to code it?

Rank Order Question

 Please rank the following fast-food restaurants by placing

a 1 beside the restaurant you think is best overall, a 2
beside the restaurant you think is second best, and so on.
__________Burger King
__________McDonald's
__________Wendy's
__________Hardy’s

 How to code it?

This question requires as many variables (and columns) as
there are objects to be ranked
3. Creating a Data Set

 Organized collection of data records

 Each sample unit within the data set is called a Case or

Observation

 Structure of a Data Set

 The number of observations = n
 The total number of variables embedded in the
questionnaire is m, then
 Data set = n x m matrix of numbers
Structure of a Data Sheet

Respondent 1’s response

to variable 1.
Structure of a Data Sheet
Step Four: Transcribing

 Transcribing: is transferring the coded data from the

questionnaires into the computers.

 This step is unecceasry in most of the cases because

data are entered directly into the computer.
Step Five: Data Cleaning

 Consistency Checks
 A part of the data cleaning process that identifies data that are
out of range or logically inconsistent, or that have extreme
values
 Treatment of Missing Responses
 A respondent's refusal to answer a question
 An interviewer's failure to ask a question or record an answer
or a "don't know" that does not seem legitimate
 Requires sound questionnaire design & tight control over
fieldwork
Data Preparation & Cleaning your project

 Change to codes (based on your codebook)

 If requires more variables (columns then create it)
 Open ended: Other…
Descriptive statistics

• Summarizes/describes the characteristics of a

data set.
• Consists of two basic categories of measures:
1) Measures of central tendency: describe the center
of a data set.
2) Measures of dispersion: describe the
variability/spread of data within the set.
Measures of Central Tendency

 Mode: Most frequently category chosen

 Median: 50th percentile response

 Mean: Simple average of the various numbers

Example

Choices Code #
 A sample of 100
Students
students has been
drawn to measure their Hate it 1 30
perceptions of AUC’s
online instruction.
Don’t like 2 25
 Calculate the mode, it
median and mean based
on the results in the Neutral 3 25
following table.
 How do you evaluate the
results? Like it 4 15

Love it 5 5
Measures of Dispersion

 Describe how the responses are clustered around

the mean or a central value.

 Measures:
 Range: The difference between the largest and smallest
response value
 Variance: The mean squared deviation from the mean
(Normal distribution assumption). The variance can never be
negative.
 Standard Deviation: The square root of the variance. It
measures of dispersion around the mean
 The Coefficient of Variation: The ratio of the standard
deviation to the mean expressed as a percentage, and is a
unitless measure of relative variability
Measures of Central Tendency &
Dispersion

MEASUREMENT MEASURES MEASURES OF

LEVEL OF DATA OF CENTRAL DIESPERSION
PERTAINING TO TENDENCY
VARIABLE

Nominal MODE NO MEASURE

Ordinal MEDIAN RANGE

Interval MEAN STANDARD DEVIATION

Ratio MEAN STANDARD DEVIATION

Why Averages May be Misleading

 Researchers tested a new sauce product & found:

 Mean rating of the taste test was close to the middle of the
scale, which had "very mild" and "very hot" as its bipolar
adjectives
 Researcher’s conclusion
 Consumers need neither really hot nor really mild sauce
 Deeper examination revealed
 The existence of a large proportion of consumers who
wanted the sauce to be mild and an equally large proportion
who wanted it to be hot
 Morale of the story:
 A clear understanding of the distribution of responses can
help a researcher avoid erroneous inferences
SPSS
SPSS interface

Data View
• The place to enter data
• Columns: Variables
• Rows: Records

Variable View
• The place to enter variables
• List of all variables
• Characteristics of all variables
30
Data View on SPSS

Variable Name (Set in

Variable View)

Data Value
Respondent
Number
(Called case
number)
Data View
Variable View on SPSS
Possible Scale
Variable Values used
Name
Variable Question Values for
Type Statement missing

Variable View
Data View on SPSS
Exercise: Creating variables in SPSS

 Open SPSS

 Create a Nominal Variable Gender

 Create a variable named Gender
 Put 1 For males and 2 for females

 Create a second variable called age group

 Put <18 as 1
 Put 18-25 as 2
 Put 26-35 as 3
 Put >35 as 4

 Create a continuous variable called Salary

Importing Data from Excel

 Select File Open Data

 Choose Excel as file type
 Select the file you want to import
 Then click Open

42
Open Excel files in SPSS

43
Variable Names
appeared on the
column header as
written in Excel
Frequency Distribution

 A mathematical distribution with the objective of obtaining

a count of the number of responses associated with values
of one variable and to express these counts in
percentage terms

 One-way tabulation is a table showing the distribution of

data pertaining to categories of a single variable
Frequency Distribution on SPSS

46
Frequency Distribution in SPSS

Analyze>Descriptive Statistics>Frequencies

Step 1: Choose the

Type of Analysis
Frequency Distribution in SPSS

Step 2 : Select
the Variable for
which you want
to compute
frequencies
and press “ ok”

Note: You can also

choose to display a
Histogram
(Frequency
Distribution Chart)
Frequency Distribution in SPSS

Step 3 :
Analyze the
Output
Frequency Table
Measure of Central Tendencies

Step 2 : Select
the Variable for
which you want to
compute and click
on “Statistics”
Measures of Central Tendency in SPSS

Step 3 : Select
the analysis you
want to compute
Measures of Dispersion in SPSS

Step 3 : Select
the analysis you
want to
compute

Coefficient of Variation - Definition, Formula, Interpretation, Examples & FAQs
No ratings yet
Coefficient of Variation - Definition, Formula, Interpretation, Examples & FAQs
19 pages
Risk Army Guidance
No ratings yet
Risk Army Guidance
74 pages
MPC-006
No ratings yet
MPC-006
99 pages
Data Preparation and Analysis 3
No ratings yet
Data Preparation and Analysis 3
182 pages
SPSS Session
No ratings yet
SPSS Session
133 pages
Lesson 11:: Expected Value of Random Variables
No ratings yet
Lesson 11:: Expected Value of Random Variables
20 pages
6-classification-in-geography
No ratings yet
6-classification-in-geography
23 pages
Business Statistics and Computing Complete Ppts (1)
No ratings yet
Business Statistics and Computing Complete Ppts (1)
213 pages
Chapter 7 -Data analysis process (Full- updated)
No ratings yet
Chapter 7 -Data analysis process (Full- updated)
77 pages
Ch. 9 Montgomery RGM
No ratings yet
Ch. 9 Montgomery RGM
66 pages
Univariate Bivariate & Multivariate Analysis of Data
No ratings yet
Univariate Bivariate & Multivariate Analysis of Data
24 pages
Chap13 - Quantitative Data Analysis - Revised - Jan2021
No ratings yet
Chap13 - Quantitative Data Analysis - Revised - Jan2021
54 pages
02 Stats Revision
No ratings yet
02 Stats Revision
46 pages
Unit 3 Descriptive Statistics
No ratings yet
Unit 3 Descriptive Statistics
46 pages
Topic 3 Data Processing_bus 221(0)
No ratings yet
Topic 3 Data Processing_bus 221(0)
130 pages
MCQ - Stactistics Management
No ratings yet
MCQ - Stactistics Management
75 pages
Session 1
No ratings yet
Session 1
51 pages
Statistical-Analysis-1
No ratings yet
Statistical-Analysis-1
35 pages
Notes On Data Processing, Analysis, Presentation
No ratings yet
Notes On Data Processing, Analysis, Presentation
63 pages
Chap13 Quantitative Data Analysis Revised Jan2021
No ratings yet
Chap13 Quantitative Data Analysis Revised Jan2021
54 pages
Intro SRM
No ratings yet
Intro SRM
73 pages
AIL-report
No ratings yet
AIL-report
43 pages
CH01 - Introduction To Statistics 2
No ratings yet
CH01 - Introduction To Statistics 2
52 pages
Sample Final Paper For LBOLYTC
No ratings yet
Sample Final Paper For LBOLYTC
39 pages
aMMW (1)
No ratings yet
aMMW (1)
42 pages
3 Bpa MV Regression Reference Guide May2012 Final
No ratings yet
3 Bpa MV Regression Reference Guide May2012 Final
58 pages
Marketing Research Print
No ratings yet
Marketing Research Print
71 pages
Manual Basic LAboratory
100% (3)
Manual Basic LAboratory
245 pages
Topic 5 Data Preparation Initial Exploration - Slides - Updated
No ratings yet
Topic 5 Data Preparation Initial Exploration - Slides - Updated
51 pages
MMW 0607
No ratings yet
MMW 0607
29 pages
Research Methodology: Result and Analysis (Part 1)
No ratings yet
Research Methodology: Result and Analysis (Part 1)
65 pages
Qunt Data Coding & Analysis
No ratings yet
Qunt Data Coding & Analysis
104 pages
Week 9 Data Analysis Using SPSS 33
0% (1)
Week 9 Data Analysis Using SPSS 33
82 pages
Statistics and Probability TQ Q3 A4 Final
100% (1)
Statistics and Probability TQ Q3 A4 Final
10 pages
Fifth Semester Syllabus
No ratings yet
Fifth Semester Syllabus
25 pages
Descriptive Analytics
No ratings yet
Descriptive Analytics
42 pages
Data Analysis
No ratings yet
Data Analysis
30 pages
Lesson 5 (Descriptive Statistics Part 1)_Oct 2024
No ratings yet
Lesson 5 (Descriptive Statistics Part 1)_Oct 2024
72 pages
Data Management
No ratings yet
Data Management
48 pages
Descriptive Statistics
No ratings yet
Descriptive Statistics
26 pages
STATS & HD REVIEWER PRELIMS
No ratings yet
STATS & HD REVIEWER PRELIMS
15 pages
6.Research Methodology-BBA S1M6
No ratings yet
6.Research Methodology-BBA S1M6
64 pages
Notes Stats
No ratings yet
Notes Stats
21 pages
Data Analysis Basics:: Variables and Distribution
No ratings yet
Data Analysis Basics:: Variables and Distribution
29 pages
614 Descriptive Statistcs
No ratings yet
614 Descriptive Statistcs
56 pages
CH 8 Data Analysis
No ratings yet
CH 8 Data Analysis
34 pages
Estadístic A Descriptiv A: Dr. Lázaro Bustio Martínez Otoño 2023
No ratings yet
Estadístic A Descriptiv A: Dr. Lázaro Bustio Martínez Otoño 2023
42 pages
Data Analysis and Interpretation
No ratings yet
Data Analysis and Interpretation
33 pages
Ramiro Et Al (2017) - Reprodução de Três Espécies de Gymnophthalmidae - Calyptommatus - Nothobachia - Procellosaurinus
No ratings yet
Ramiro Et Al (2017) - Reprodução de Três Espécies de Gymnophthalmidae - Calyptommatus - Nothobachia - Procellosaurinus
14 pages
Chapter3 Lesson1
No ratings yet
Chapter3 Lesson1
27 pages
10 Data Preparation
No ratings yet
10 Data Preparation
42 pages
Statistics For Data Science
100% (1)
Statistics For Data Science
27 pages
chapter 13
No ratings yet
chapter 13
71 pages
Article Review 1 Eng
No ratings yet
Article Review 1 Eng
30 pages
Standard Costing and Variance Analysis
No ratings yet
Standard Costing and Variance Analysis
22 pages
Lecture 8 Data Analysis
No ratings yet
Lecture 8 Data Analysis
30 pages
Lecture2
No ratings yet
Lecture2
33 pages
365 Data Science Axs
No ratings yet
365 Data Science Axs
103 pages
Quantitative Data Analysis
No ratings yet
Quantitative Data Analysis
22 pages
RM-EBBA-class-8-CH0-11-Quatitative-analysis
No ratings yet
RM-EBBA-class-8-CH0-11-Quatitative-analysis
37 pages
What Are Your Results?: Jeffrey Barnes
No ratings yet
What Are Your Results?: Jeffrey Barnes
17 pages
SQC
No ratings yet
SQC
53 pages
L11 - 12-Quantitative Analysis-2 - Page
No ratings yet
L11 - 12-Quantitative Analysis-2 - Page
9 pages
11 Eleventh - Class - Q 0Q - Research - JAVERIANA
No ratings yet
11 Eleventh - Class - Q 0Q - Research - JAVERIANA
15 pages
Descriptive Statistics
No ratings yet
Descriptive Statistics
13 pages
Police Stress
No ratings yet
Police Stress
18 pages
Introduction To STATISTICS-new
No ratings yet
Introduction To STATISTICS-new
44 pages
Testing Testing One Two Three by Les Hayduk
No ratings yet
Testing Testing One Two Three by Les Hayduk
10 pages
Data Analysis
100% (2)
Data Analysis
87 pages
Material Labour Overhead Cost Variance
No ratings yet
Material Labour Overhead Cost Variance
15 pages
Plackett RL, Burman, JP. (1946) The Design of
No ratings yet
Plackett RL, Burman, JP. (1946) The Design of
17 pages
MTL390
No ratings yet
MTL390
8 pages
Applied Stats Final Exam
No ratings yet
Applied Stats Final Exam
12 pages
Anjali BRMbasic Data Analysis
No ratings yet
Anjali BRMbasic Data Analysis
19 pages
Hasil Uji Normalitas: Case Processing Summary
No ratings yet
Hasil Uji Normalitas: Case Processing Summary
6 pages
Unit - 8 Data Analysis
No ratings yet
Unit - 8 Data Analysis
6 pages
Data Analysis Topics Discussed Getting Data Ready For Analysis 1) - Editing Data (Definition)
No ratings yet
Data Analysis Topics Discussed Getting Data Ready For Analysis 1) - Editing Data (Definition)
8 pages
Engineering Metrology - Unit 14 - Week 12
No ratings yet
Engineering Metrology - Unit 14 - Week 12
4 pages
3is Interpretation of Quantitative Data
No ratings yet
3is Interpretation of Quantitative Data
13 pages
MMW-FINALS-REVIEWER - Etc
No ratings yet
MMW-FINALS-REVIEWER - Etc
4 pages
Quantitative Data Analysis
100% (2)
Quantitative Data Analysis
27 pages
assessment-Module-1
No ratings yet
assessment-Module-1
3 pages
Math Review
No ratings yet
Math Review
4 pages
Lesson 6 Dependent and Independent T Tests
No ratings yet
Lesson 6 Dependent and Independent T Tests
7 pages
6 Data Analysis
No ratings yet
6 Data Analysis
24 pages
Statistics and Data Analytics Cheat Sheets
100% (1)
Statistics and Data Analytics Cheat Sheets
2 pages
Business Analytics (MIS171) Summary Notes
No ratings yet
Business Analytics (MIS171) Summary Notes
6 pages
Flexible Budgeting 7-8-2018
No ratings yet
Flexible Budgeting 7-8-2018
7 pages
Research Methods Session 11 Data Preparation and Preliminary Data Analysis (Compatibility Mode)
No ratings yet
Research Methods Session 11 Data Preparation and Preliminary Data Analysis (Compatibility Mode)
9 pages
Ged Basics in Mathematics
From Everand
Ged Basics in Mathematics
Henry Varela
5/5 (1)

Uploaded by

Uploaded by

Marketing Research

Data Preparation &

 Data Preparation Process

 Basic Descriptive Statistics

Preliminary Plan for Data Analysis

Data Analysis Strategy

 What are the reasons for a questionnaire returned from the

Parts of the questionnaire may be incomplete.

The responses show little variance.

One or more pages are missing.

The questionnaire is received after the pre-established cut-off date.

The questionnaire is answered by someone who does not qualify for

 A review of the questionnaires with the objective of

 Treatment of Unsatisfactory Responses

 The assignment of a code to represent a specific

 A Codebook contains coding instructions and the

 A structured question is pre-categorized

 Responses to a non-structured or open-ended

 Assign appropriate numerical codes to responses

 To assign numerical codes, the researcher should

 A questionnaire was collected for a fast food chain

Column Variable Variable Question Coding

3 3 Quality 2 Input the number circled.

4 4 Quantity 3 Input the number circled.

5 5 Value 4 Input the number circled.

6 6 Service 5 Input the number circled.

Column Variable Variable Question Coding

8 8 Income 7 Input the number

 Which of the following countries have you visited during the

 How to code it?

 Which of the following countries have you visited during the

 How to code it: Need 6 variables, each relating to a specific

 Please rank the following fast-food restaurants by placing

 How to code it?

 Please rank the following fast-food restaurants by placing

 How to code it?

 Organized collection of data records

 Each sample unit within the data set is called a Case or

 Structure of a Data Set

Respondent 1’s response

 Transcribing: is transferring the coded data from the

 This step is unecceasry in most of the cases because

 Change to codes (based on your codebook)

• Summarizes/describes the characteristics of a

 Mode: Most frequently category chosen

 Median: 50th percentile response

 Mean: Simple average of the various numbers

 Describe how the responses are clustered around

MEASUREMENT MEASURES MEASURES OF

Nominal MODE NO MEASURE

Ordinal MEDIAN RANGE

Interval MEAN STANDARD DEVIATION

Ratio MEAN STANDARD DEVIATION

 Researchers tested a new sauce product & found:

 Researchers tested a new sauce product & found:

Variable Name (Set in

 Create a Nominal Variable Gender

 Create a second variable called age group

 Create a continuous variable called Salary

 Select File Open Data

 A mathematical distribution with the objective of obtaining

 One-way tabulation is a table showing the distribution of

Step 1: Choose the

Note: You can also

You might also like