Comparison of Histogram, Scatterplot, and Boxplot for Data Visualization, Assignments of Mathematical Statistics

Insights into the use of histogram, scatterplot, and boxplot for data visualization. The author discusses the advantages and disadvantages of each graph type, with examples and explanations. Histograms are useful for comparing large datasets with consistent intervals, while scatterplots reveal correlations and relationships between variables. Boxplots show data distribution and statistical measures, making them ideal for comparing multiple datasets. However, each graph type has its limitations, such as difficulty in comparing data with multiple categories for histograms or the inability to handle categorical data for scatterplots.

Typology: Assignments

2020/2021

Uploaded on 11/22/2022

milky666666
milky666666 🇺🇸

5 documents

1 / 2

Toggle sidebar

This page cannot be seen from the preview

Don't miss anything!

bg1
Histogram allows us to compare large sample of data or data with large value ranges
easily. It also provides consistency due to the equal intervals. It is easy to transform
data from a table to a histogram. Some of the disentangles of histogram including: it will
be difficult to get the exact number of an independent variable if the y-axis is not about
frequency. Histogram is also hard for us to compare data with multiple categories. I
would not choose a different graph to represent the data in this graph I chosen from the
internet. It provided a clear visualization of each % of fat. The y-axis is frequency, so it
is easy to point out the exact number of inputs in each column. There is only one
category in this example, so it is easy to compare.
https://statisticsbyjim.com/basics/histograms/
Scatterplot can also show large quantities of data at the same time. It is visible
to see correlation and relationship between variables due to clustering effects.
However, scatterplot is not helpful for discretized data, and it is not helpful for
categorical data. Also, there is an overplotting problem when a lot of the data
stacked on top of each other. It is also difficult to tell the amount of inputs of
an independent variable on a scatterplot too.
this is a very bad example of scatterplot.
As you can see, it is very difficult to count the number of inputs for a single
independent variable. I would use a bar chart to present these data, and it can
show the relationship between grades better.
https://www.chrisstucchio.com/blog/2012/dont_use_scatterplots.html
pf2

Partial preview of the text

Download Comparison of Histogram, Scatterplot, and Boxplot for Data Visualization and more Assignments Mathematical Statistics in PDF only on Docsity!

Histogram allows us to compare large sample of data or data with large value ranges easily. It also provides consistency due to the equal intervals. It is easy to transform data from a table to a histogram. Some of the disentangles of histogram including: it will be difficult to get the exact number of an independent variable if the y-axis is not about frequency. Histogram is also hard for us to compare data with multiple categories. I would not choose a different graph to represent the data in this graph I chosen from the internet. It provided a clear visualization of each % of fat. The y-axis is frequency, so it is easy to point out the exact number of inputs in each column. There is only one category in this example, so it is easy to compare. https://statisticsbyjim.com/basics/histograms/ Scatterplot can also show large quantities of data at the same time. It is visible to see correlation and relationship between variables due to clustering effects. However, scatterplot is not helpful for discretized data, and it is not helpful for categorical data. Also, there is an overplotting problem when a lot of the data stacked on top of each other. It is also difficult to tell the amount of inputs of an independent variable on a scatterplot too. this is a very bad example of scatterplot. As you can see, it is very difficult to count the number of inputs for a single independent variable. I would use a bar chart to present these data, and it can show the relationship between grades better. https://www.chrisstucchio.com/blog/2012/dont_use_scatterplots.html

A boxplot can show data distribution, and we can visualize where 50% of the data fall. We can find statistical data easily, such as, the minimum, the maximum, the median. It is unaffected by outliers. It is good for comparison between multiple data sets. However, it does not show individual values and can be heavily skewed. The follow example is a good boxplot. We can see very clearly that marketing has more base salary than research from the graph. We can also tell that the median of marketing salary is higher, and the interquartile range of marketing salary is larger. https://nelsontouchconsulting.wordpress.com/2011/01/07/behold-the-box-plot/