Prof. Visconti IPLE 2022-2023 first trimester

DATA ANALYSIS – SUMMARY

BASICS

Data analysis is the process of inspecting, cleansing, transforming and modelling data with the goal of discovering useful information, informing conclusions and supporting decision-making. Data are empirical material organized into a form that can be analyzed. Primary data analysis: you collect and categorize (coding process) your own data through interviews, official documents, experiments, surveys, etc. Secondary data analysis: you use data sets that have been gathered by others and subsequently deposited in databases (existing archived collections of data). Units of analysis are the objects or subjects to which the properties investigated pertain. A variable is an empirical measurement of a characteristic. Key features of a variable: a name and at least two values (otherwise it would be a constant). A variable may vary in two ways: over time, on the same cases; or among cases, at the same time.
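The difference between a variable and a constant, and the two kinds of variation, can be sketched in a few lines of Python. This is only an illustration: the toy data, the column names (`case`, `year`, `group`, `score`) and the helper `is_variable` are invented here and do not come from the course material.

```python
# Invented toy data: each row is one unit of analysis (a case) observed in one year.
rows = [
    {"case": "A", "year": 2021, "group": "X", "score": 10},
    {"case": "A", "year": 2022, "group": "X", "score": 12},
    {"case": "B", "year": 2021, "group": "X", "score": 15},
    {"case": "B", "year": 2022, "group": "X", "score": 15},
]


def distinct_values(rows, name):
    """Return the set of values that the column `name` takes in `rows`."""
    return {row[name] for row in rows}


def is_variable(rows, name):
    """A column is a variable if it takes at least two values, otherwise a constant."""
    return len(distinct_values(rows, name)) >= 2


# Variation among cases, at the same time: compare the cases within one year.
in_2021 = [r for r in rows if r["year"] == 2021]
# Variation over time, on the same case: follow one case across the years.
case_a = [r for r in rows if r["case"] == "A"]
case_b = [r for r in rows if r["case"] == "B"]

print(is_variable(rows, "score"))     # True: score takes the values 10, 12 and 15
print(is_variable(rows, "group"))     # False: group is always "X", so it is a constant
print(is_variable(in_2021, "score"))  # True: the score differs among cases in 2021
print(is_variable(case_a, "score"))   # True: the score of case A changes over time
print(is_variable(case_b, "score"))   # False: the score of case B stays at 15
```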

There are three levels of measurement of variables: nominal, ordinal and interval. A nominal variable has two or more categories with no intrinsic ordering (discrete and non-orderable). An ordinal variable is similar to a nominal variable, but its categories have a clear ordering (discrete and orderable). An interval variable is similar to an ordinal variable, except that the intervals between its values are equally spaced.
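One practical consequence of the level of measurement is which summary makes sense for a variable. The sketch below is an illustration under assumptions: the data and the variable names are invented, and the pairing of mode, median and mean with nominal, ordinal and interval variables is the usual rule of thumb rather than something stated in these notes.

```python
from statistics import mean, median_low, mode

# Invented example data, one list per level of measurement.
eye_colour = ["brown", "blue", "brown", "green"]        # nominal
education = ["low", "high", "medium", "low", "medium"]  # ordinal
temperature_c = [18.0, 21.0, 24.0]                      # interval

# Nominal: categories can only be counted, so the most frequent one is reported.
nominal_summary = mode(eye_colour)

# Ordinal: categories can be ranked. The order is written out explicitly,
# each value is replaced by its rank, and the middle rank is taken.
order = ["low", "medium", "high"]
ranks = [order.index(level) for level in education]
ordinal_summary = order[median_low(ranks)]

# Interval: distances between values are meaningful, so an average can be computed.
interval_summary = mean(temperature_c)

print(nominal_summary)   # brown
print(ordinal_summary)   # medium
print(interval_summary)  # 21.0
```

Taking the mean of the ordinal ranks would treat the step from "low" to "medium" as equal to the step from "medium" to "high", which is exactly the equal-spacing property that only interval variables have.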

A spreadsheet is an interactive computer application for the organization, analysis and storage of data in tabular form. It consists of a table of cells arranged into rows and columns and referred to by their X and Y locations. A cell is a box for holding data; a single cell is usually referenced by its column and row. A worksheet is a grid of cells containing either raw data, called values, or formulas. Values are raw data (general numbers, text, dates). Alternatively, the content of a cell can be based on a formula, which might perform a calculation. A formula is an equation that performs calculations, such as addition, subtraction, multiplication and division, on values in a worksheet. To enter a formula in a cell you always need to start with the equal sign (=). Functions are built-in operations that can be used in formulas, such as arithmetic functions (for example, sums and averages), trigonometric functions, statistical functions, etc. Charts are graphical displays of data.
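The distinction between values and formulas can be mimicked with a very small Python model of a worksheet. This is a rough sketch, not how any real spreadsheet program works: the dictionary `sheet`, the function names `cell_value` and `_evaluate`, and the cell contents are all hypothetical. It assumes formulas use only numbers, cell references and the four operations named above, and it does not handle built-in functions or circular references.

```python
import ast
import operator
import re

# Hypothetical worksheet: cell reference -> raw value or formula text.
sheet = {"A1": 10, "A2": 20, "A3": "=A1+A2", "B1": "=A3*2-A1/5", "C1": "total"}

# The four arithmetic operations a formula may use in this sketch.
OPS = {ast.Add: operator.add, ast.Sub: operator.sub,
       ast.Mult: operator.mul, ast.Div: operator.truediv}


def cell_value(sheet, ref):
    """Return what cell `ref` displays, computing it if the cell holds a formula."""
    content = sheet[ref]
    if isinstance(content, str) and content.startswith("="):
        # A formula always starts with "=": parse and evaluate what follows it.
        return _evaluate(ast.parse(content[1:], mode="eval").body, sheet)
    return content  # a raw value (number, text, date) is shown as it is


def _evaluate(node, sheet):
    """Evaluate a parsed formula made of numbers, cell references and + - * /."""
    if isinstance(node, ast.BinOp) and type(node.op) in OPS:
        left = _evaluate(node.left, sheet)
        right = _evaluate(node.right, sheet)
        return OPS[type(node.op)](left, right)
    if isinstance(node, ast.Constant) and isinstance(node.value, (int, float)):
        return node.value
    if isinstance(node, ast.Name) and re.fullmatch(r"[A-Z]+[0-9]+", node.id):
        return cell_value(sheet, node.id)  # look up the referenced cell
    raise ValueError("unsupported formula element")


print(cell_value(sheet, "A1"))  # 10: a raw value
print(cell_value(sheet, "A3"))  # 30: the formula =A1+A2
print(cell_value(sheet, "B1"))  # 58.0: the formula =A3*2-A1/5
print(cell_value(sheet, "C1"))  # total: text is also a raw value
```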

A cell reference identifies a cell’s location in the worksheet, based on its column letter and row number, such as A1 (column A, row 1) or E4 (column E, row 4). There are three types of cell references:

• Relative: both the row and the column references are relative (for example D4). If you copy-paste the content of a cell with a relative reference into another cell, the reference changes according to the distance, in rows and columns, between the first and the second cell.

• Absolute: both the row and the column references are fixed using the $ (dollar) sign (for example $C$3). If you copy-paste the content of a cell with an absolute reference, the reference does not change.

• Mixed: either the row or the column reference is absolute (for example $A1 to fix the column or A$1 to fix the row). If you copy-paste the content of a cell with a mixed reference, only the part that is not fixed changes.
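The copy-paste behaviour of the three reference types can be reproduced with a short Python function. This is a minimal sketch for illustration: `shift_reference` and its helpers are hypothetical names, not part of Excel or of any library, and the code does not check whether a shift would move a reference off the edge of the sheet.

```python
import re

# Optional "$", column letters, optional "$", row number (D4, $C$3, $A1, A$1).
REF = re.compile(r"(\$?)([A-Z]+)(\$?)([0-9]+)")


def column_to_number(letters):
    """Convert column letters to a 1-based number: A -> 1, Z -> 26, AA -> 27."""
    number = 0
    for ch in letters:
        number = number * 26 + (ord(ch) - ord("A") + 1)
    return number


def number_to_column(number):
    """Convert a 1-based column number back to letters: 1 -> A, 27 -> AA."""
    letters = ""
    while number > 0:
        number, remainder = divmod(number - 1, 26)
        letters = chr(ord("A") + remainder) + letters
    return letters


def shift_reference(ref, rows, cols):
    """Return `ref` as it reads after being pasted `rows` down and `cols` to the right."""
    match = REF.fullmatch(ref)
    if match is None:
        raise ValueError(f"not a cell reference: {ref!r}")
    col_fixed, col, row_fixed, row = match.groups()
    if not col_fixed:  # relative column: it moves with the paste
        col = number_to_column(column_to_number(col) + cols)
    if not row_fixed:  # relative row: it moves with the paste
        row = str(int(row) + rows)
    return f"{col_fixed}{col}{row_fixed}{row}"


# Paste one row down and two columns to the right of the original cell.
for ref in ["D4", "$C$3", "$A1", "A$1"]:
    print(ref, "->", shift_reference(ref, rows=1, cols=2))
# D4 -> F5      (relative: both parts move)
# $C$3 -> $C$3  (absolute: nothing moves)
# $A1 -> $A2    (mixed: column fixed, row moves)
# A$1 -> C$1    (mixed: row fixed, column moves)
```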

HOW TO STRUCTURE A DATASET

DATA MANAGEMENT WITH EXCEL

HOW TO DESCRIBE DATA?

BIVARIATE ANALYSIS

BIVARIATE ANALYSIS: CATEGORICAL DATA

BIVARIATE ANALYSIS: CONTINUOUS DATA

BIVARIATE ANALYSIS: MIXED DATA