









Study with the several resources on Docsity
Earn points by helping other students or get them with a premium plan
Prepare for your exams
Study with the several resources on Docsity
Earn points to download
Earn points by helping other students or get them with a premium plan
Statistics reviewer, simple and detailed
Typology: Study notes
Uploaded on 04/22/2025
1 / 15
This page cannot be seen from the preview
Don't miss anything!










Statistics is a branch of mathematics that collects, organizes, analyzes, and interprets numerical data to make decisions. It allows businesses, governments, and individuals to make informed choices based on data.
Decision-makers use statistics to: ● Present and describe business data properly. ● Draw conclusions about a population using a sample. ● Make forecasts about business and economic activities. ● Improve business processes through data-driven decision-making.
● Variable: A characteristic that can take different values (e.g., height, weight, income). ● Data: The values collected from variables. ● Population: The entire group being studied. ● Sample: A subset of the population used for analysis. ● Parameter: A numerical measure that describes a characteristic of a population (e.g., population mean). ● Statistic: A numerical measure that describes a characteristic of a sample (e.g., sample mean).
● Survey (Questionnaire/Interview) – Directly asking respondents. ● Observation – Recording behaviors without interference. ● Experiment – Conducting controlled studies to determine cause-effect relationships.
● Registration – Collecting official records (e.g., birth certificates, company records).
MODULE 1B: BASIC STATISTICAL CONCEPTS
● Categorical (Qualitative) Variables – Represent categories (e.g., Gender: Male/Female). ● Numerical (Quantitative) Variables – Represent numbers (e.g., Height, Weight). ○ Discrete Variables: Countable values (e.g., number of children). ○ Continuous Variables: Measurable values (e.g., temperature).
● Frequency Tables: Organize data into classes with counts. ● Bar Graphs & Pie Charts: Display categorical data. ● Histograms & Line Graphs: Display numerical data.
MODULE 1C: DESCRIPTIVE STATISTICS
○ Affected by extreme values (outliers).
● Mean = Median = Mode. ● Total area under the curve = 1.
● Converts any normal distribution into a standard normal distribution (mean = 0, SD = 1).
● Z-scores above 0: Values above the mean. ● Z-scores below 0: Values below the mean.
MODULE 2A: CHARACTERISTICS OF A GOOD MEASUREMENT TOOL
● Measurement in research consists of assigning numbers to empirical events, objects, or properties in compliance with a set of rules. ● It involves a three-part process :
● Variables in research can be classified as: ○ Objects: Tangible items (e.g., people, cars, buildings). ○ Properties: Characteristics of objects (e.g., height, weight, attitude, intelligence). ● Constructs like satisfaction, leadership, and engagement cannot be measured directly but require observation of indicants.
● Validity: Measures what it intends to measure. ○ External Validity: Generalizability across populations, settings, and times. ○ Internal Validity: Accuracy in measuring the intended concept. ● Reliability: Produces consistent results over repeated trials. ○ Stability: Consistency over time. ○ Equivalence: Consistency among different observers. ○ Internal Consistency: Homogeneity among measurement items. ● Practicality: Measurement must be economical, convenient, and interpretable.
Factors influencing selection: ● Research objectives ● Response types ● Data properties ● Balanced vs. unbalanced scales ● Forced vs. unforced choices ● Number of scale points ● Rater errors
MODULE 2B: SURVEY RESEARCH
● Mail Surveys: Low response rates, high nonresponse bias. ● Telephone Surveys: Low response rates, difficult targeting. ● Interviews: High cost, high quality, useful for sensitive topics. ● Web Surveys: Fast, cost-effective, but susceptible to bias. ● Direct Observation: Unobtrusive, but may require informed consent.
● Consider staff expertise, budget, and precision. ● Ensure high-quality survey design. ● Conduct pilot tests before full deployment. ● Increase response rates by explaining purpose and offering incentives. ● Work with experts for better design and analysis.
● Use white space for readability. ● Provide clear instructions. ● Ensure anonymity. ● Organize questions logically. ● Use filters (e.g., "If no, skip to question 7"). ● Keep surveys as short as possible.
● Open-ended: "Describe your job goals."
● Type I Error (α) : Rejecting H₀ when it is true. ● Type II Error (β) : Failing to reject H₀ when it is false.
● Used when population standard deviation (σ) is known.
● Requires n ≥ 30 or normally distributed population.
● Formula:
σ 𝑛
Where:
○ X̄ = sample mean ○ μ = population mean (hypothesized value) ○ σ = population standard deviation ○ n = sample size
● Critical Value Approach: Compare z-computed to z-critical. ● P-Value Approach: If p-value < α , reject H₀.
Example:
● A researcher claims students score better than 515 in a test. ● Sample: n = 40 , X̄ = 540 , σ = 114. ● Compute z and compare to z-critical at α = 0.05.
● JASP is statistical software used for hypothesis testing. ● The output includes: ○ Test statistic (z-value) ○ P-value ○ Confidence intervals ○ Decision: Reject or fail to reject H ₀
Example JASP Output Interpretation:
● Computed z-value = 1. ● Critical z-value at α = 0.05 (one-tailed) = 1.
● P-value = 0. ● Decision: Fail to reject H ₀ (Not enough evidence to support the claim).
● Used when σ is unknown.
● Uses sample standard deviation (s).
● Sample size n < 30 must be from a normal distribution.
● Formula:
𝑠 𝑛
Where s = sample standard deviation.
● Uses Student’s t-distribution with df = n - 1.
● Critical values obtained from t-table.
Example:
● A researcher wants to test if the mean salary of a sample differs from $59,. ● Uses a t-test since population σ is unknown.
Situation Test to Use
σ is known, n ≥ 30 Z-Test
σ is unknown, n < 30 T-Test
σ is unknown, n ≥ 30 (CLT applies)
● One-tailed : Tests for direction (e.g., greater than, less than). ● Two-tailed : Tests for difference (e.g., not equal to).
Hypothesis Type of Test Rejection Region
H₁: μ > μ₀ Right-tailed z > z-critical
● Statistical tests help determine if there is enough evidence to support or reject a claim.
○ States there is no significant difference between the sample and the population. ○ Contains “=” , “≤” , or “≥” symbols. ○ Example: H ₀ : μ = 5,320 (The mean household electricity expense has not changed.)
○ Suggests a difference or effect exists. ○ Uses “≠” , “>” , or “<”. ○ Example: H ₁ : μ ≠ 5,320 (The mean electricity expense has changed.)
● Type I Error (α): Rejecting H₀ when it is true. ● Type II Error (β): Failing to reject H₀ when it is false
● Used when population standard deviation (σ) is unknown.
● Uses sample standard deviation (s).
● Requires Student’s t-distribution.
● Formula:
𝑠 𝑛
Where:
○ X̄ = sample mean ○ μ = population mean (hypothesized value) ○ s = sample standard deviation
○ n = sample size
● Critical Value Approach: Compare t-computed to t-critical (from t-table). ● P-Value Approach: If p-value < α , reject H₀.
Example:
● A researcher tests if household electricity expenses have changed from PhP 5,320. ● Sample: n = 35 , X̄ = 6,480 , s = 1,. ● Compute t and compare to t-critical at α = 0.05.
Properties of the Student’s T-Distribution
● Similarities to Z-Distribution: ○ Bell-shaped, symmetric. ○ Mean, median, and mode = 0. ● Differences: ○ Variance > 1. ○ Based on degrees of freedom (df = n - 1). ○ Approaches normal distribution as n increases.
JASP Output Interpretation
● JASP is statistical software for hypothesis testing. ● The output includes: ○ Test statistic (t-value) ○ P-value ○ Confidence intervals ○ Decision: Reject or fail to reject H ₀
Example JASP Output Interpretation:
● Computed t-value = 3. ● Degrees of freedom (df) = 6 ● P-value < 0. ● Decision: Reject H ₀ (There is significant evidence to support the claim).
One-Tailed vs. Two-Tailed Tests
● One-tailed : Tests for direction (e.g., greater than, less than). ● Two-tailed : Tests for difference (e.g., not equal to).
Hypothesis Type of Test Rejection Region
H₁: μ > μ₀ Right-tailed t > t-critical
H₁: μ < μ₀ Left-tailed t < - t-critical
● 59% of consumers purchase gifts for their fathers. ● 50.3% of businessmen own stocks and mutual funds. ● 55% of Filipinos buy generic products ● 40% of Filipino families eat out once a week.
2. Conditions for a Valid Z-Test for Proportions
Before using the z-test, the following conditions must be met:
The test statistic for a z-test for a population proportion is:
𝑧 = or
𝑝(1−𝑝) 𝑛
𝑧 =
Where:
● 𝑝= population proportion ● 𝑝= sample proportion ● 𝑛= sample size
4. Hypothesis Testing Approaches
A survey in 2018 showed that 41% of families ate dinner together every night. A recent study of 1, families found that 405 had dinner together every night. At a 0.05 significance level , has the proportion decreased?
● Given: ○ p=0.41, α=0. ○ 𝑝 = (^) 1,127^405 = 0. 3594 ● Compute z-va1lue:
○ 𝑧 =
0.41(1−0.41) 1,
=− 0. 3594
● Compare with Critical Value (−1.645) ○ Since − 0. 3594 <− 1. 654, reject (𝐻𝑜) ○ Conclusion: The proportion of families eating dinner together every night has decreased.
20 years ago, 76% of Americans preferred American cars. A new survey of 56 people found that 38 still preferred American cars. At 0.01 significance level , is this proportion different?
● Given: ○ p=0.76, α=0. ○ 𝑝 = 3856 = 0. 6786 ● Compute z-value:
○ 𝑧 =
0.76(1−0.76) 56
=− 1. 43
● Find p-value: ○ From the z-table, p = 0. ○ Since it's a two-tailed test , multiply by 2 → p = 0.