Docsity
Docsity

Prepare for your exams
Prepare for your exams

Study with the several resources on Docsity


Earn points to download
Earn points to download

Earn points by helping other students or get them with a premium plan


Guidelines and tips
Guidelines and tips

Quantitative Evaluation: Understanding Experimental Design and Statistical Analysis in HCI, Study notes of Computer Science

An in-depth exploration of quantitative evaluation methods in human-computer interaction (hci). The authors discuss the importance of experimental design, including the formulation of lucid and testable hypotheses, the manipulation of independent variables, and the measurement of dependent variables. The document also covers statistical analysis, significance levels, and the interpretation of results. Examples of correlation and regression are provided to illustrate the application of statistical methods in hci research.

Typology: Study notes

Pre 2010

Uploaded on 02/13/2009

koofers-user-ihg
koofers-user-ihg 🇺🇸

10 documents

1 / 12

Toggle sidebar

Related documents


Partial preview of the text

Download Quantitative Evaluation: Understanding Experimental Design and Statistical Analysis in HCI and more Study notes Computer Science in PDF only on Docsity! Evaluation-Quantitative 1 Ben Bederson / Saul Greenberg Quantitative Evaluation What is experimental design? What is an experimental hypothesis? How do I plan an experiment? Why are statistics used? What are the important statistical methods? Ben Bederson / Saul Greenberg Question: Which size grid is better? Evan Golub Evaluation-Quantitative 2 Ben Bederson / Saul Greenberg Question: Which menu placement system is better? Top of Window Top of Screen Evan Golub Ben Bederson / Saul Greenberg Quantitative methods 1. User performance data collection • data is collected on system use - frequency of request for on-line assistance what did people ask for help with? - frequency of use of different parts of the system why are parts of system unused? - number of errors and where they occurred why does an error occur repeatedly? - time it takes to complete some operation what tasks take longer than expected? • collects heaps of data in the hope that something interesting shows up • often difficult to sift through data unless specific aspects are targeted - as in list above Evaluation-Quantitative 5 Ben Bederson / Saul Greenberg The experimental method... c) Carefully choose the dependent variables that will be measured Dependent variables • variables dependent on the subject’s behaviour / reaction to the independent variable in toothpaste experiment • number of cavities • frequency of brushing in menu experiment • time to select an item • selection errors made • Subjective satisfaction as reported in a questionnaire Ben Bederson / Saul Greenberg The experimental method... d) Judiciously select and assign subjects to groups Ways of controlling subject variability • recognize classes and make them an independent variable • minimize unaccounted anomalies in subject group - superstars versus poor performers • use reasonable number of subjects and random assignment Novice Expert Evaluation-Quantitative 6 Ben Bederson / Saul Greenberg The experimental method... e) Control for biasing factors • unbiased instructions + experimental protocols - prepare ahead of time • double-blind experiments, ... Now you get to do the pop-up menus. I think you will really like them... I designed them myself! Ben Bederson / Saul Greenberg The experimental method... f) Apply statistical methods to data analysis • Confidence limits: the confidence that your conclusion is correct - “The hypothesis that mouse experience makes no difference is rejected at the .05 level” - “Expert mouse users can use pull-down menus 15% faster than novice mouse users, and that result is statistically significant” - means: a 95% chance that your statement is correct a 5% chance you are wrong g) Interpret your results • what you believe the results mean, and their implications Evaluation-Quantitative 7 Ben Bederson / Saul Greenberg Statistical Analysis Calculations that tell us • mathematical attributes about our data sets - mean, amount of variance, ... • how data sets relate to each other - whether we are “sampling” from the same or different distributions • the probability that our claims are correct - “statistical significance” Ben Bederson / Saul Greenberg Statistical significance vs Practical significance when n is large, even a trivial difference may be large enough to produce a statistically significant result • eg menu choice: mean selection time of menu a is 3 seconds; menu b is 3.05 seconds Statistical significance does not imply that the difference is important! • a matter of interpretation Evaluation-Quantitative 10 Ben Bederson / Saul Greenberg Choice of significance levels and two types of errors There is no difference between Pie menus and traditional pop-up menus • Type 1: extra work developing software and having people learn a new idiom for no benefit • Type 2: use a less efficient (but already familiar) menu • Case 1: Redesigning a traditional GUI interface - a Type 2 error is preferable to a Type 1 error • Case 2: Designing a digital mapping application where experts perform extremely frequent menu selections - a Type 1 error is preferable to a Type 2 error New Open Close Save NewOpen C lo se S av e Ben Bederson / Saul Greenberg Other Tests: Correlation Measures the extent to which two concepts are related • eg years of university training vs computer ownership per capita How? • obtain the two sets of measurements • calculate correlation coefficient - +1: positively correlated - 0: no correlation (no relation) - –1: negatively correlated Dangers • attributing causality - a correlation does not imply cause and effect - cause may be due to a third “hidden” variable related to both other variables - eg (above example) age, affluence • drawing strong conclusion from small numbers - unreliable with small groups - be wary of accepting anything more than the direction of correlation unless you have at least 40 subjects Evaluation-Quantitative 11 Ben Bederson / Saul Greenberg Sample Study: Cigarette Consumption Crude Male death rate for lung cancer in 1950 per capita consumption of cigarettes in 1930 in various countries. Ben Bederson / Saul Greenberg Correlation 5 6 4 5 6 7 4 4 5 6 3 5 5 7 4 4 5 7 6 7 6 6 7 7 6 8 7 9 condition 1 condition 2 3 4 5 6 7 8 9 10 2.5 3 3.5 4 4.5 5 5.5 6 6.5 7 7.5 Condition 1 r2 = .668 Evaluation-Quantitative 12 Ben Bederson / Saul Greenberg Other Tests: Regression Calculate a line of “best fit” use the value of one variable to predict the value of the other • e.g., 60% of people with 3 years of university own a computer 3 4 5 6 7 8 9 10 3 4 5 6 7 Condition 1 y = .988x + 1.132, r2 = .668 65 4 5 6 7 4 4 5 6 3 5 5 7 4 4 5 7 6 7 6 6 7 7 6 8 7 9 condition 1 condition 2 C o n d it io n 2 Ben Bederson / Saul Greenberg You know now Controlled experiments can provide clear convincing result on specific issues Creating testable hypotheses are critical to good experimental design Experimental design requires a great deal of planning Statistics inform us about • mathematical attributes about our data sets • how data sets relate to each other • the probability that our claims are correct There are many statistical methods that can be applied to different experimental designs - one example is the use of correlation and regression.