

Study with the several resources on Docsity
Earn points by helping other students or get them with a premium plan
Prepare for your exams
Study with the several resources on Docsity
Earn points to download
Earn points by helping other students or get them with a premium plan
Quantitative Business Analysis. General information on Statistics. Analytics: discovery and communication of meaningful patterns in data.
Typology: Lecture notes
1 / 3
This page cannot be seen from the preview
Don't miss anything!


Analytics: discovery and communication of meaningful patterns in data. It involves statistics, computer programming and operations research. Statistics: branch of maths that makes numbers ( data ) informative.
Descriptive statistics are the methods that help collect, summarise, present and analyse data. Inferential statistics are methods that use the data collected from a small group to draw conclusions about a larger group. 4 important uses of statistics o Visualise and summarize data (e.g. tables, charts. Descriptive method) o Reach conclusions about a large group based on data collected from small group (Inferential). o Make reliable predictions based on statistical models (Inferential) o Improve business processes.
After defining goal and variables, data must be collected. Data can be a primary or secondary source. Primary source: the analyst is the data collector. o Organisations and individuals that collect and publish data use their data as a primary source (e.g. ABS) and let others use their data as a secondary source. o E.g. data from a political survey , experiment and directly observed data. Secondary source: data for analysis has been collected by someone else. o Company database, internet information, textbook, data made on stock markets.
Population: all the items or individuals (elements) which you want to reach conclusions. Sample: portion of a population selected for analysis. The process of selecting the sample is sampling. o If sample is to be informative of total population, sampling must be done carefully, impartially and objectively. o Random sampling : selecting individuals in a totally random fashion to prevent bias. Can be used to draw conclusions about larger population. Characteristics of an individual/element is a variable and this affects results. Variables can be: o Categorical (qualitative): variables can be placed into categories like yes/no or true/false. Nominal scale: classifies data into categories without ranking. E.g. type of email provider. Ordinal scale: classifies values into distinct categories with ranking. E.g. rating service. o Numerical (quantitative): variables have values that represent actual quantities. Discrete: values that arise from counting process (integer values). E.g. No. of TV channels. Continuous: arise from a measuring process and can be assigned any value within a given interval. Can have decimal point values E.g. room temperature/height of child. Numerical variables are measured on an interval or ratio scale.
Ratio: ordered scale in which the difference between the measurements involves a true 0 point. Interval: ordered scale in which the difference between the measurements is a meaningful quantity but does NOT involve a true 0 point.
Data may be organised in different structures depending on whether: You are summarising variables or looking at relationship between variables. The variables are categorical or numerical. Ask yourself “Is there enough data points to merit a graph?” If not, use a table. E.g. a graph is unnecessary when people present a few numbers in a bar chart. Summary Table Presents tallies of frequencies or percentages for each category. Helps to see the differences among categories.
Contingency Table Allows you to study patterns between the responses of 2 or more categorical variables. Cross tabulated form that tallies the responses of the categorical variables together.
Creating charts for visualizing data enhances the discovery of patterns and relationships. It also depends on type of variable and number of variable. Numerical Categorical One variable Two variables
Histogram, box-plot Scatter plot, time series plot, percentage polygon
Bar chart Side-by-side bar chart. Bar Chart Compares different categories by using individual bars to represent tallies. Unlike histogram, bar chart separates bars between each category. Can be horizontal or vertical.
Side by Side Bar Chart Two or more bars to represent tallies. Shows joint responses from 2 categorical variables. 3D bar charts are confusing - hard to determine exact values. Pie Chart Uses parts of a circle to represent tallies. Lets you visualise the portion of the entire pie in each category. Very few instances where pie chart is more informative than a simple bar chart.
You organize numerical data by creating ordered arrays or distributions. Method you choose depends on amount of data that you have and what you see to discover about your variables. Ordered Array Arranges values of numerical variable in rank order, from smallest to largest. Helps get a better sense of range and any outliers. If data set contains a large number of values, ordered array is difficult.
Frequency Distribution Summary table where data is assigned to numerically ordered categories, called classes.