




Study with the several resources on Docsity
Earn points by helping other students or get them with a premium plan
Prepare for your exams
Study with the several resources on Docsity
Earn points to download
Earn points by helping other students or get them with a premium plan
Practical notes on various data analysis techniques, including working with numerical and categorical data, creating histograms and scatterplots, using pivot tables, and applying functions. Additionally, it covers data cleansing steps such as visual scanning, filtering, sorting, and using advanced filters to find duplicates.
Typology: Study notes
1 / 8
This page cannot be seen from the preview
Don't miss anything!





Monday, November 19, 2007 5:36 PM Numerical Data - data which can be used in arithmetic (^) Discrete - whole, counting numbers (^) Continuous - measured numbers, (ex: timeline) Categorical Data - data where answers are not numbers used to arithmetic Histogram - chart of frequency of one variable that is numerical and is best to be continuous Bar Chart - for categorical data Creating a Histogram with StatTools
Histogram Tips
(^) Only should be used for continuous, numerical data. PivotTable Create a PivotTable
Subtotal Function (^) Returns the subtotal of the numbers in a column in a list or database. (^) =SUBTOTAL(method,array) 1 = AVERAGE 2 = COUNT 3 = COUNTA 4 = MAX 5 = MIN 6 = PRODUCT 7 = STDEV 8 = STDEVP 9 = SUM 10 = VAR 11 = VARP Indirect Funtion =INDIRECT(Array) (^) The INDIRECT function returns a reference to a range. You can use this function to create a reference that won't change if row or columns are inserted in the worksheet. Or, use it to create a reference from letters and numbers in other cells. Text Functions CONCATENATE - Joins several text items into one text item FIND - Finds one text value within another (case-sensitive) LEFT - Returns the leftmost characters from a text value LEN - Returns the number of characters in a text string
Returns a value you specify if a formula evaluates to an error; otherwise, returns the result of the formula. Use the IFERROR function to trap and handle errors in a formula. =IFERROR(value,value_if_error) Data Cleansing Steps
Cleansing Steps Worksheet When cleansing data, you need to document everything you do to the data set on the Cleansing Steps worksheet. This sheet needs to contain 3 columns: Problem, # of Records, and Cleansing Step. Even records moved to Questionable records need to be documented.