0% found this document useful (0 votes)
32 views37 pages

BRM Practical File: SPSS Guide

5

Uploaded by

Ankit Bisht
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
32 views37 pages

BRM Practical File: SPSS Guide

5

Uploaded by

Ankit Bisht
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd

BRM PRACTICAL FILE

GURU GOBIND SINGH INDRAPRASTHA UNIVERSITY NEW DELHI

DELHI SCHOOL OF PROFESSIONAL STUDIES AND RESEARCH

BUSINESS RESEARCH METHODOLOGY LAB (BBA 213)

PRACTICAL FILE

SUBMITTED TO: SUBMITTED BY:

MR. AKSHAY CHAUHAN AKSHAT GAUTAM

ASST. PROFESSOR BBA 3A

03012501723
BRM PRACTICAL FILE

INDEX
S NO. PARTICULARS PAGE
NO.
1 SPSS INTRODUCTION 1
2 DATA ENTRY AND VARIABLES 2-3
3 FREQUENCY TABLE 4-8
4 GRAPH/HISTOGRAM 9-12
5 MEAN 13-14
6 NORMALITY TEST 15-21
7 ONE SAMPLE T-TEST 22-23
8 INDEPENDENT SAMPLE T-TEST 24-26
9 PAIRED SAMPLE T-TEST 27-29
10 DATA SET 30
11 QUESTIONNAIRE 31-36
BRM PRACTICAL FILE

INTRODUCTION:SPSS

SPSS is a Windows based program that can

be used to perform data entry and

analysis and to create tables and graphs.

SPSS is capable of handling large amounts

of data and can perform all of the

analyses covered in the text and much

more. SPSS is commonly used in the Social

Sciences and in the business world, so

familiarity with this program should

serve you well in the future. SPSS is

updated often. This document was written

around an earlier version, but the

differences should not cause any problems.

QUES-1) Data entry and Variables’ Description


BRM PRACTICAL FILE

DATA ENTRY

Data Entry in SPSS is the most important task involved in any analysis. Data may exist in any form; it may be written
on a piece of paper or it may be typed in a computer in raw data form. Before doing data entry in SPSS, one should start
SPSS. It is very easy to start SPSS from the start menu by just clicking on the “SPSS” icon. As soon as SPSS opens, a
window will appear which is called the “data viewer window.” In SPSS, data viewer column value is known as the
record measure or the variable and row to identify the case (or subject). If the data size is small, then the data entry in
SPSS can be done manually in the data viewer window. Whenever a data size is large, then the data entry in SPSS is not
possible manually. For eg: in the following figure we have entered the data of sleeping pattern, marital status, gender,
educational qualifications and many more variables of the respondents .

WHILE ENTERING THE DATA WE MUST ENSURE THE FOLLOWING:

• No spaces
• Either camel case or underscore.
• Cannot begin with special characters.
• Cannot begin with a number ; must begin with alphabets(variable).
• Each variable must be unique and most importantly maximum of 64 characters can be used.

VARIABLE DESCRIPTION

Understanding SPSS variable types and formats allows you to get things done fast and reliably. SPSS has 2 variable
types:

• Numeric variables contain only numbers and are suitable for numeric calculations such as addition and
multiplication.
• String variables may contain letters, numbers and other characters. You can't do calculations on string variables
-even if they contain only numbers.

There are no other variable types in SPSS than string and numeric. However, numeric variables have several different
formats that are often confused with variable types.
BRM PRACTICAL FILE
BRM PRACTICAL FILE

QUES-2) What is a frequency table?

In statistics, a frequency distribution is a list, table (i.e.: frequency table) or graph (i.e.: bar plot or histogram) that
displays the frequency of various outcomes in a sample.[1] Each entry in the table contains the frequency or count of the
occurrences of values within a particular group or interval.

STEPS TO CREATE A FREQUENCY TABLE?

STEPS

1. Insert the data into the data view .

2. Enter the variables in variable view, insert all the values and select the appropriate scale according to the
variable.
BRM PRACTICAL FILE

3. Go to analyze Tab.
4. Select descriptive statistics and then frequencies.
BRM PRACTICAL FILE

5. All the descriptive statistics and all the charts selected for the required variables would appear.
BRM PRACTICAL FILE
BRM PRACTICAL FILE
BRM PRACTICAL FILE

QUES-3) Graph / Histogram

A histogram is a chart that shows frequencies for intervals of values of a metric variable. Such intervals as known as
“bins” and they all have the same widths.

STEPS TO CREATE AN HISTOGRAM

1. Go to graph tab. Select Chart builder.

2. Go to gallery. Then Choose histogram. Select simple Histogram (drag it to gallery window).Ensure display
variable name .
3. Drag the variable to the X-axis and then from element properties.
BRM PRACTICAL FILE

4. From element properties:


• Choose histogram percentage.
• From set parameters choose Custom value for anchor=0.
• Then bin sizes go to custom interval width=1 and number of interval=3.
5. Click on continue.
BRM PRACTICAL FILE

6. Click on Apply. Then OK.


BRM PRACTICAL FILE

Required graph / histogram would appear.


BRM PRACTICAL FILE
BRM PRACTICAL FILE

QUES-4) What is mean?

In mathematics and statistics, the arithmetic mean or arithmetic average, or simply just the mean or the average (when
the context is clear), is the sum of a collection of numbers divided by the count of numbers in the collection.

CALCULATION OF MEAN UNDER SPSS

STEPS:

1. Enter the data into data view. Enter the variables for which you want to calculate means in variable view and
apply appropriate scales according to your variables.
2. Go to analyze and select compare means and then select means.
BRM PRACTICAL FILE

3. After selecting means new dialogue box would appear. From here put height in inches in Dependent list and
gender of respondent in independent list. Then click OK.

Result would appear.


BRM PRACTICAL FILE

QUES-5) Normality Test

SPSS Statistics outputs many table and graphs with this procedure. One of the reasons for this is that the Explore...
command is not used solely for the testing of normality, but in describing data in many different ways. When testing for
normality, we are mainly interested in the Tests of Normality table and the Normal Q-Q Plots, our numerical and
graphical methods to test for the normality of data, respectively.

We require dependent variable to be approximately normally distributed following numerical and visual outputs must be
investigated:

• Z-values for Skewness and Kurtosis (somewhere in the span of -1.96 to +1.96)

• Shapiro-Wilk Test p-value (should be above 0.05)

In SPSS, p-value is denoted by 'sig'.

• Null Hypothesis is rejected if p<0.05


BRM PRACTICAL FILE

• Histograms, Normal Q-Q Plots and Box Plots (should be symmetrical and should visually indicate that our data are
approximately normally distributed)

Main Aspects-

• Data must be approximately normally distributed, if not perfectly.

• Check each category of Independent Variable.

• The skewness and Kurtosis measures should be as close to zero as possible in SPSS.

• A small departure from zero is no problem as long as the measures are not too large compared to their standard errors.

• Divide the measure by its standard errors manually (with the help of calculator).

Z-Value = Measured Value/Standard Error

Shapiro-Wilk Test of Normality

The Shapiro-Wilk Test is more appropriate for small sample sizes (< 50 samples), but can also handle sample sizes as
large as 2000. For this reason, we will use the Shapiro-Wilk test as our numerical means of assessing normality.

Normal Q-Q Plots

In order to determine normality graphically, we can use the output of a normal Q-Q Plot. If the data are normally
distributed, the data points will be close to the diagonal line. If the data points stray from the line in an obvious non-
linear fashion, the data are not normally distributed.

STEPS TO EXPLORE

1. Click Analyze > Descriptive Statistics > Explore.


BRM PRACTICAL FILE

2. Add variables Height to the Dependent List and gender to the factor list box.
BRM PRACTICAL FILE

3. Click Plots. Check histogram in Descriptive. Check the box next to Normality plots with tests. Click Continue.

4. Click OK.
BRM PRACTICAL FILE

Result
BRM PRACTICAL FILE
BRM PRACTICAL FILE
BRM PRACTICAL FILE
BRM PRACTICAL FILE
BRM PRACTICAL FILE

Ques-6) One sample T Test

The One Sample t Test examines whether the mean of a population is statistically different from a known or
hypothesized value. The One Sample t Test is a parametric [Link] test is also known as Single Sample t Test, the
variable used in this test is known as Test [Link] a One Sample t Test, the test variable's mean is compared against a
"test value", which is a known or hypothesized value of the mean in the population.

STEPS OF ONE SAMPLE T TEST

1. Analyze -> Compare Means -> One-Sample T Test.

2. Drag and drop the variable you want to test against the population mean into the Test Variable(s) box. Specify
your population mean in the Test Value box>OK.
BRM PRACTICAL FILE

3. Your result will appear in the SPSS output viewer.


BRM PRACTICAL FILE

QUES-7) Independent Samples t-test

Testing – Whether the population mean for one category is significantly different from the population mean for another
category, where in the two samples drawn are independent of each other.

Two independent samples to be measured at a single point in time and since, the groups are independent of each other;
we can appropriately apply independent samples t-test.

Variables: Gender and Height w.r.t. the reference data set

Categorical (Nominal Scale)

Variable – Gender (Males – 1 & Females – 2)

Height (Scale Variable)

HO: “Is there a significant difference in the height of males and females”?

STEPS OF INDEPENDENT T- TEST

1. Analyze >Compare Means >Independent samples t-test.


BRM PRACTICAL FILE

2. Height – Test Variable Box; Gender - Grouping Variable.

3. Define Groups: Group 1 – 1; Group 2 – 2 >Continue >OK.


BRM PRACTICAL FILE

4. Output Window: Group Statistics table – Descriptive statistics table


Independent Samples test table – Inferential statistics table
We need to focus on the middle part of the inferential statistics’ table including the following:
T calculated value
P-value (sig.)
Degrees of freedom (df)
Equal variances assumed row for Height variable
Three ways to determine the statistical significance of the difference between the heights across the categories
(males and females):
BRM PRACTICAL FILE

1. T calculated Value > T (critical value); Reject HO


2. P-Value < 0.05; Reject HO
3. If the confidence interval doesn’t include 0; Reject HO

QUES-8) Paired Samples t-test OR Within-Subjects design/Repeated Measures Design/Paired Samples Design

A single sample to be measured at different points in time (just like a before and after design) and since, the groups are
dependent of each other; we can appropriately apply paired samples t-test.

Paired Samples t-test - Comparing two related means about population

Variables: Gender and Height & Weight w.r.t. the reference data set

Categorical (Nominal Scale)

Variable – Gender (Males – 1 & Females – 2)

Height (Scale Variable) & Weight (Scale Variable)

STEPS OF RAIRED-SAMPLE T TEST

1. Analyze>Compare Means > Paired samples t-test.


BRM PRACTICAL FILE

2. Paired Variables Box: Height – Variable 1 & Weight – Variable 2 > OK.
BRM PRACTICAL FILE

3. If the confidence interval doesn’t include 0; Reject HO.

What exactly is a confidence interval?


BRM PRACTICAL FILE

A confidence interval is a range in which the mean difference is likely to fall 95% (α = 5%) of the times. Alternatively,
if a similar test is repeated, say, 100 times, 95% of the time the mean difference would fall within the desired range.

The confidence interval is much more certain, but less precise too. However, with good measurements and low
variability, we can get both the mean and the confidence interval as accurate as possible.

Positive and Negative t values (t calculated) are interpreted in a similar manner.


BRM PRACTICAL FILE

QUESTIONNAIRE AND DATA SET :-


BRM PRACTICAL FILE

Common questions

Powered by AI

An independent samples t-test compares the means of two independent groups to see if they are statistically different, suitable when the samples have no effect on each other, such as comparing male and female heights. Conversely, a paired samples t-test is used for dependent groups, often the same group but at different times, such as before and after interventions. The choice depends on the study design: use independent samples t-test for separate groups and paired samples t-test for related observations .

A one-sample t-test in SPSS is conducted through Analyze > Compare Means > One-Sample T Test. The variable under test is dragged into the test variable box, and the population mean is entered as the "test value". This test aims to determine if the sample mean is significantly different from this specified test value. The "test value" is typically a known or hypothesized mean for the population, serving as a benchmark for the comparison .

The 'Explore' command in SPSS is used to generate various tables and plots that provide descriptive statistics and test for normality. It allows the user to evaluate the shape, spread, and central tendency of data through numerous outputs, including histograms and Q-Q plots, which are essential for determining the distributional properties of data before choosing appropriate statistical tests. This command is integral for ensuring data meets analytical assumptions .

Histograms and Q-Q plots complement each other by providing both numerical and visual evaluations of data normality. Histograms display the frequency distribution of the data, allowing for a quick visual inspection of skewness and deviations from normality. Q-Q plots offer a more detailed look by plotting observed data against expected values under normal distribution; deviations from the diagonal line indicate departures from normality. Together, they provide a robust assessment by confirming findings across different visualization forms .

Creating a histogram in SPSS involves selecting the chart builder from the graph tab, choosing the histogram option from the gallery, and dragging the desired variable onto the X-axis. The bin sizes can be customized, and options like histogram percentage can be set through element properties. This graphical representation is significant as it provides a visual impression of the distribution of a dataset, allowing for the assessment of normality, skewness, and the presence of outliers, which are key in deciding subsequent data analysis steps .

The construction of a frequency table facilitates understanding of data distribution by organizing data into categories that show the number of occurrences of individual values or ranges of values. This visualization helps in identifying patterns, such as central tendency and dispersion, within the data set. By summarizing large sets of data into easily interpretable intervals, frequency tables enhance clarity and assist in detecting trends and deviations from expected outcomes .

In SPSS, the independent samples t-test procedure involves Analyze > Compare Means > Independent Samples T-Test, designating the test and grouping variables. Hypothesis testing assesses whether the means of two independent groups are equal, with rejection of the null hypothesis prompted if the t-calculated value exceeds the critical value, the p-value is less than 0.05, or the confidence interval does not include zero. These outcomes indicate a significant difference between group means .

Data entry and variable description in SPSS are critical as they form the foundation for any analysis. Accurate data entry ensures that the dataset reflects the true characteristics of the variables under study, thereby maintaining data integrity. Proper variable description allows for the correct categorization of data types as either string or numeric, which is essential because calculations can only be performed on numeric variables. Misclassification of variables can lead to errors in analysis, such as incorrect statistical results or the inability to perform certain analyses .

Confidence intervals provide a range in which the true mean difference is expected to fall, offering a measure of the precision and reliability of the estimate. If a confidence interval does not contain zero, it implies that there is a statistically significant difference between the groups being compared, as zero would indicate no difference in means. This helps in making more informed decisions about rejecting the null hypothesis .

To conduct a normality test in SPSS, one uses the Analyze > Descriptive Statistics > Explore function, adding variables to the dependent list and checking the appropriate boxes for histograms and normality plots. The key criteria include checking the Shapiro-Wilk test p-value (should be > 0.05) and ensuring Z-values for skewness and kurtosis fall within -1.96 to +1.96. Normality tests are crucial because many statistical tests assume normal distribution of the data. Verifying this assumption ensures the validity of inferences drawn from these tests .

You might also like