Understanding Quantitative and Categorical Data: A Key Concept in Data Visualization, Summaries of Design

Design

This document by Stephen Few explains the difference between quantitative and categorical data and their role in creating effective graphs. It covers the nature of quantitative information, the distinction between quantitative and categorical scales, and the appropriate use of lines and bars to encode data. Understanding these concepts is crucial for designing clear and meaningful data visualizations.

Typology: Summaries

2021/2022

Uploaded on 09/27/2022


Quantitative vs. Categorical Data: A Difference Worth Knowing

Stephen Few

April 2005

When you create a graph, you step through a series of choices, including which type of graph you should use and several aspects of its appearance. Most people walk through these choices as if they were sleepwalking, with only a vague sense at best of what works, of why one choice is better than another. Without guiding principles rooted in a clear understanding of graph design, choices are arbitrary and the resulting communication fails in a way that can be costly to the business. To communicate effectively using graphs, you must understand the nature of the data, graphing conventions and a bit about visual perception—not only what works and what doesn't, but why.

This month's column focuses on the nature of quantitative information. Graphs display quantitative information: numbers that measure performance, predict the future and identify opportunities. The nature of quantitative information varies in some fundamental ways that tie directly to some of the choices you must make when graphing that information.

Quantitative information consists not only of numbers, but also of data that identifies what the numbers mean. If I walked up to you, looked you in the eye, and said, "The answer is 24,901," you would probably be confused, and understandably suspicious that I had a few screws loose. By itself, a number means nothing. However, if I were to tell you that the circumference of the earth at the equator is 24,901 miles, that would mean something. To be complete and meaningful, quantitative information consists of both quantitative data (the numbers) and categorical data (the labels that tell us what the numbers measure). The graph in Figure 1 highlights this distinction by displaying the categorical data in black and the quantitative data in various other colors. (Note: The gray axes, excluding the red tick marks, are neither quantitative nor categorical data, and in fact are not data at all, but simply visual objects that support the graph by defining the plot area.)

Figure 1: Illustration of the difference between quantitative data (red) and categorical data (black).
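The pairing of a number with its labels can be sketched in code. This is only an illustration, not something from the original column: the `Measurement` class and its field names are hypothetical, and treating the unit as a label is an assumption made here.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Measurement:
    """One piece of quantitative information (hypothetical illustration)."""

    label: str    # categorical data: what the number measures
    value: float  # quantitative data: the number itself
    unit: str     # treated here as part of the labeling (an assumption)

    def describe(self) -> str:
        # Combine the labels with the number so the value becomes meaningful.
        return f"{self.label}: {self.value:,.0f} {self.unit}"


bare_number = 24901  # by itself, this says nothing about what was measured
equator = Measurement("Circumference of the earth at the equator", 24901, "miles")

print(equator.describe())
# prints: Circumference of the earth at the equator: 24,901 miles
```

The bare integer and the `Measurement` hold the same number; only the second carries the categorical data that tells a reader what it means.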

The graph in Figure 1 displays two scales: a quantitative scale along the vertical axis and a categorical scale along the horizontal axis. They differ in what they identify: quantitative values on the one hand and categorical items on the other. Most two-dimensional graphs have one quantitative scale and one categorical scale, although a familiar exception is the scatterplot, which has quantitative scales along both axes (see Figure 2). In a line graph, the categorical scale always appears on the horizontal axis. In a bar graph, the categorical scale can appear on either axis, with bars running horizontally or vertically. Data points (simple symbols such as dots, squares, triangles and so forth) are rarely used by themselves to encode values other than in scatterplots. Unlike bars and lines, data points can encode a quantitative value simultaneously along two scales.

Figure 2: A scatterplot is the only commonly used 2-D graph that lacks a categorical scale along one of its two axes.

Months of the year obviously have an intrinsic order, which leaves the question: "Do the items correspond to quantitative values?" In fact, they do. Units of time such as years, quarters, months, weeks, days and hours are measures of quantity, and the individual items in any given unit of measure (e.g., years) represent equal intervals. Actually, months aren't exactly equal and even years vary in size occasionally due to leap years, but they are close enough in size to constitute an interval scale.
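The "close enough" claim can be checked with a little arithmetic. The sketch below uses Python's standard `calendar` module; the helper name `month_lengths` is hypothetical, and the year 2005 is chosen only because it matches the column's date.

```python
import calendar


def month_lengths(year: int) -> list[int]:
    """Return the number of days in each month of the given year."""
    return [calendar.monthrange(year, month)[1] for month in range(1, 13)]


lengths = month_lengths(2005)
mean_length = sum(lengths) / len(lengths)

# Largest relative deviation of any month from the average month length.
max_deviation = max(abs(days - mean_length) for days in lengths) / mean_length

print(lengths)
print(f"largest deviation from the mean month: {max_deviation:.1%}")
# prints: largest deviation from the mean month: 7.9%

# A leap year is one day longer than a common year.
print(sum(month_lengths(2004)) - sum(month_lengths(2005)))
# prints: 1
```

In a non-leap year no month differs from the average length by more than about 8 percent, and a leap year adds one day in 365, which is the sense in which these units are nearly equal intervals.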

Categorical Scales and Graph Design

The primary graph design principle that applies to this distinction between nominal, ordinal and interval scales involves the use of lines to encode quantitative data. You should only use lines (as in a line graph) to encode data along an interval scale. In nominal and ordinal scales, the individual items are not related closely enough to be linked with lines, so you should use bars instead.

The strength of lines in a graph is their ability to reveal the trend of and patterns in the data. Lines suggest change from one item to the next, but change isn't happening if the items aren't closely related as sequential subdivisions of a continuous range of values. For instance, it is appropriate to use lines to display change from one day to the next or from one price range to the next, but not from one sales region to the next. (See Figure 5.)

Figure 5: Examples of inappropriate and appropriate uses of lines in a graph.

With interval scales, you are not forced in all cases to use lines; you can use bars as well. If you want to emphasize the overall shape of the data or changes from one item to the next, lines work best. If, however, you want to emphasize individual items, such as individual months, or to support discrete comparisons of multiple values at the same location along the interval scale, such as revenues and expenses for individual months, then bars work best.
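The guidance in this section can be condensed into a small decision rule. This is a minimal sketch, not the author's code: the `Scale` enum, the `choose_encoding` function, and the `emphasize_shape` flag are hypothetical names, and the example comments on each scale type are assumptions based on the usual meaning of the terms.

```python
from enum import Enum


class Scale(Enum):
    NOMINAL = "nominal"    # items with no intrinsic order, e.g. sales regions
    ORDINAL = "ordinal"    # ordered items that are not equal intervals
    INTERVAL = "interval"  # ordered, equal-sized subdivisions, e.g. months


def choose_encoding(scale: Scale, emphasize_shape: bool) -> str:
    """Pick "line" or "bar" for the categorical scale of a graph.

    emphasize_shape is True when the overall shape of the data or the
    change from one item to the next matters most, and False when
    individual items or discrete comparisons matter most.
    """
    if scale is not Scale.INTERVAL:
        # Lines imply change between adjacent items, which does not hold
        # for nominal or ordinal items, so bars are the only option.
        return "bar"
    return "line" if emphasize_shape else "bar"


print(choose_encoding(Scale.INTERVAL, emphasize_shape=True))   # prints: line
print(choose_encoding(Scale.INTERVAL, emphasize_shape=False))  # prints: bar
print(choose_encoding(Scale.NOMINAL, emphasize_shape=True))    # prints: bar
```

The scale type is checked first because it is a hard constraint, while the choice between lines and bars on an interval scale depends on what the graph is meant to emphasize.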

These concepts are relevant to many issues and principles of graph design, so keep them handy as you continue to read this column in the future to learn more about data visualization.

(This article was originally published in DM Review.)

About the Author

Stephen Few has worked for over 20 years as an IT innovator, consultant, and teacher. Today, as Principal of the consultancy Perceptual Edge, Stephen focuses on data visualization for analyzing and communicating quantitative business information. He provides training and consulting services, writes the monthly Visual Business Intelligence Newsletter, speaks frequently at conferences, and teaches in the MBA program at the University of California, Berkeley. He is the author of two books: Show Me the Numbers: Designing Tables and Graphs to Enlighten and Information Dashboard Design: The Effective Visual Communication of Data. You can learn more about Stephen’s work and access an entire library of articles at www.perceptualedge.com. Between articles, you can read Stephen’s thoughts on the industry in his blog.
