Practical Notes: Data Analysis for Numerical & Categorical Data, Histograms, Scatterplots,, Study notes of Introduction to Business Management

Practical notes on various data analysis techniques, including working with numerical and categorical data, creating histograms and scatterplots, using pivot tables, and applying functions. Additionally, it covers data cleansing steps such as visual scanning, filtering, sorting, and using advanced filters to find duplicates.

Typology: Study notes

Pre 2010

Uploaded on 10/26/2009

joshcarpenter
joshcarpenter 🇺🇸

2 documents

1 / 8

Toggle sidebar

This page cannot be seen from the preview

Don't miss anything!

bg1
X201 Practical One Notes
Monday, November 19, 2007
5:36 PM
Numerical Data - data which can be used in arithmetic
Discrete - whole, counting numbers
Continuous - measured numbers, (ex: timeline)
Categorical Data - data where answers are not numbers used to arithmetic
Histogram - chart of frequency of one variable that is numerical and is best to be continuous
Bar Chart - for categorical data
Creating a Histogram with StatTools
1. Create a data set using data set manager
2. Under Summary Graphs, select Histogram
3. Select the continuous, numerical value you would like to graph
Creating a Histogram with ToolPak
1. Open the Data Analysis menu and choose Histogram
2. Enter the data range, the bin range and select where you want the output
pf3
pf4
pf5
pf8

Partial preview of the text

Download Practical Notes: Data Analysis for Numerical & Categorical Data, Histograms, Scatterplots, and more Study notes Introduction to Business Management in PDF only on Docsity!

X201 Practical One Notes

Monday, November 19, 2007 5:36 PM Numerical Data - data which can be used in arithmetic  (^) Discrete - whole, counting numbers  (^) Continuous - measured numbers, (ex: timeline) Categorical Data - data where answers are not numbers used to arithmetic Histogram - chart of frequency of one variable that is numerical and is best to be continuous Bar Chart - for categorical data Creating a Histogram with StatTools

  1. Create a data set using data set manager
  2. Under Summary Graphs, select Histogram
  3. Select the continuous, numerical value you would like to graph Creating a Histogram with ToolPak
  4. Open the Data Analysis menu and choose Histogram
  5. Enter the data range, the bin range and select where you want the output

Histogram Tips

  1. Number of bins should be 8-
  2. No Gaps between bins
  3. Equal bin ranges if comparing multiple histograms
  4. Delete Legend
  5. Change Coloring
  6. Change Title
  7. Bins in Equal Intervals
  8. Deleter "0" Bins Stacked - splitting the data by categorical data StatTools Summary Statistics
  9. Shape --- Skewness -1 to 1 = Symmetric Anything Else = Skewed
  10. Center --- Central Tendency Measures Symmetric = Mean Skewed = Median Categorical = Mode
  11. Spread --- Symmetric = Standard Deviation Skewed = Interquartile Range (IQR)

 (^) Only should be used for continuous, numerical data. PivotTable Create a PivotTable

  1. Select the data that will be going in to your Pivot Table.
  2. On the insert tab of the ribbon, click the pivot table button.
  3. Check your data range and chose where you want the pivot table to be placed. Drill Down in Pivot Table  (^) Double click the value you want to drill down Create a calculated field in the Pivot Table  (^) Click anywhere inside your pivot table  (^) Under the PivotTable Tools Options tab on the ribbon, click Formulas then Calculated field Click in your pivot table and on the pivot table tab, click formulas, calculated field  (^) Create a formula using your fields and click OK. It will then be added as a column just as any other field would be displayed in the PivotTable.

Subtotal Function  (^) Returns the subtotal of the numbers in a column in a list or database.  (^) =SUBTOTAL(method,array) 1 = AVERAGE 2 = COUNT 3 = COUNTA 4 = MAX 5 = MIN 6 = PRODUCT 7 = STDEV 8 = STDEVP 9 = SUM 10 = VAR 11 = VARP Indirect Funtion =INDIRECT(Array)  (^) The INDIRECT function returns a reference to a range. You can use this function to create a reference that won't change if row or columns are inserted in the worksheet. Or, use it to create a reference from letters and numbers in other cells. Text Functions CONCATENATE - Joins several text items into one text item FIND - Finds one text value within another (case-sensitive) LEFT - Returns the leftmost characters from a text value LEN - Returns the number of characters in a text string

IFERROR

Returns a value you specify if a formula evaluates to an error; otherwise, returns the result of the formula. Use the IFERROR function to trap and handle errors in a formula. =IFERROR(value,value_if_error) Data Cleansing Steps

  1. Visual Scanning
  2. Filter
  3. Sorting
  4. Pivot Table (Used to find spaces)
  5. Excel error message ( Colored triangle in corner of cell)
  6. Left/Right Aligned (Denotes if formatted as text or number)
  7. Conditional Formatting (Identify Duplicates
  8. Charts
  9. Functions Finding Duplicates
  10. Highlight column
  11. Click home on the ribbon
  12. Click Conditional Formatting
  13. Select Highlight Cells Rules
  14. Select Duplicate Values Use Advanced Filter to Find Duplicate Records
  15. Select data including column heading
  16. Click data tab on the ribbon
  17. Under Sort & Filter, select Advanced Filter
  18. Select Unique Records
  19. Look at list to see if there are duplicate records entered incorrectly Questionable Records Worksheet When questionable records are found within a data set, they need to be moved to a new worksheet called "Questionable Records." This worksheet needs to display the same column titles as the data worksheet. Go down a few rows and type what the problem is and then cut and paste the records that have the corresponding problem.

Cleansing Steps Worksheet When cleansing data, you need to document everything you do to the data set on the Cleansing Steps worksheet. This sheet needs to contain 3 columns: Problem, # of Records, and Cleansing Step. Even records moved to Questionable records need to be documented.