Monday, 3 September 2012


Today it was the introductory session of Business Analytics in which we were exposed to the basic concepts of the package SPSS. There are different other tools in the market like SAS, Minitab but we are using SPSS as it is user friendly and it is widely accepted.
The first learning was about understanding the concept of variables:

Classification of variable:-

1.      Category variable- must have finite number of values example- District, State, Months etc
2.      Continuous- infinite values example- marks, age, etc. This is further classified into
a.      Continuous- infinite values including fractions eg-
b.      Discrete- infinite values with whole numbers eg- age

In the SPSS home page we saw variables and it explains the relationship of each and every variable in the data view of the data editor with that of the variable view of the same data editor. Each variable attributes are explained using its NAME ,TYPE, WIDTH, DECIMALS, LABEL, VALUE, MISSING.

Sometimes the respondents doesn’t answer/fill a particular category because of various reasons, this leads to missing numbers. This can happen for reasons like privacy, respondent didn’t understand the Qs or Qs not applicable for him. These missing numbers is necessary because it helps us in correcting our mistake if any in framing the Qs which might have lead to no response from the respondent.
The next concept was about understanding Measure which can be:

1.                 Nominal: We have the freedom to assign any number to a variable. Eg. If suppose we consider Male and female then we can assign 1 to Male and  2 to Female or vice versa.
2.                    Ordinal: shows order like
No School –       1     4
High School-      2     3
Graduation-        3     2
Post Graduate-  4     1
The first order is correct but not the second
3.                   Scale:  Where we can spot the exact difference in the range of ordinal scale. Ex-

If we consider 2 registration number of cars

WB02k   0125
WB02k   0534

Looking at the last 4 digits we can figure out that the first number had an early registration than the other one.

Classification of techniques:
1.                 1st level analysis: frequencies, cross-tabs, OLAP cubes
                               2nd level analysis: multi-variate techniques

2.                 Univariate – use/ show only one variable at a time. Pie chart (various values of the same variable)                
              A pie chart can be bivariate if one bring thickness of the pie into the picture.
                              Bivariate – correlation, t-test
                              Multivariate – bubble diagram

PS:  We have to use all variables as numeric variable as far as possible.

Understanding all the concepts we started our first assignment for proving a hypothesis for a given set of data.

Descriptive Statistics in SPSS
·           Frequencies:        It is used primarily for nominal and ordinal data. Tells the number of times the variable has occurred.
         
           Cross Tabulation: It is a process by which two or more data variables are tabulated, and displays the relationship in a tabular form. Generates information about  bi-variate relationships. It is not suitable for continuous variables that assume multiple values. In such a case, these continuous variables are re-categorised into category variables.

By
Suman Boro 
14169


                








No comments:

Post a Comment