Maya 2 Page To continue learning about data visualization, read sections 5 and 6 of the attachment.Also, watch this video.
Hans Rosling’s 200 Countries, 200 Years, 4 Minutes – the Joy of Stats (YouTube 4:47) (Links to an external site.)
Complete the following for this assignment. Task 11, 12, and 13 are located in the Seeing and Understanding document.
Select and complete either task 11 or 12.
Complete task 13
Re-play Rosling’s visualization at your own speed.
Explore the data for a single country by selecting it in the rightmost column.
Write a paragraph about the story of the world when the dynamic data includes all countries and another paragraph about how the country you watched fits or deviates from that story.
Give a careful description of the story told by the animation for all 200 countries and how the story of just a single country fits into the larger world picture.
Include a screenshot of the final year in that single-country story.
Use current APA formatting to cite your sources. Ursinus College
Digital Commons @ Ursinus College
Statistics and Probability
Transforming Instruction in Undergraduate
Mathematics via Primary Historical Sources
Seeing and Understanding Data
WW/Mathematics, Physical and Life Sciences, email@example.com
Follow this and additional works at: https://digitalcommons.ursinus.edu/triumphs_statistics
Part of the Curriculum and Instruction Commons, Educational Methods Commons, Higher
Education Commons, Science and Mathematics Education Commons, and the Statistics and
Click here to let us know how access to this document benefits you.
This Course Materials is brought to you for free and open access by the Transforming Instruction in Undergraduate Mathematics via Primary Historical
Sources (TRIUMPHS) at Digital Commons @ Ursinus College. It has been accepted for inclusion in Statistics and Probability by an authorized
administrator of Digital Commons @ Ursinus College. For more information, please contact firstname.lastname@example.org.
Wood, Beverly and Bolch, Charlotte, “Seeing and Understanding Data” (2018). Statistics and Probability. 2.
Seeing and Understanding Data
Beverly Wood∗ and Charlotte Bolch†
August 21, 2019
Visual displays of data are commonly used today in media reports online or in print. For example,
data visualizations are sometimes used as a marketing tool to convince people to purchase a certain
product, or they are displayed in articles or magazines as a way to graphically display data to
emphasize a certain point. In general, it is hard to imagine the majority of disciplines in science and
mathematics not using data visualizations. However, before standard data visualization techniques
were developed (and accepted by the community), mathematicians and scientists very rarely used
graphical displays or pictures to represent empirical data.
This project has four main parts. The first section introduces some of the earliest data visual-
izations, which were novel constructions in their time. Next, we consider works of Michael Florent
van Langren (1598–1675) and William Playfair (1759–1823) that contain the first known uses of
statistical representations of data, some of which are still used today. The third section focuses on
the work of Florence Nightingale (1820–1910) and Charles Joseph Minard (1781–1870) and their
ability to construct data displays that made an argument or told a story. Finally, in the last section
we consider works of Edward Tufte (1942– ) and Hans Rosling (1948–2017) that had a major impact
on the current field of data visualizations and how to best create them using computer software.
2 “Ancient” Visualization
One of the earliest data visualizations in printed form is a times-series graph (Figure 1) from the
late 10th or early 11th century that shows changes in the orbital positions of seven “planets”1 over
time and space. It appears in an appendix to a commentary on a work of Cicero (106–43 BCE) that
reviews the physics and astronomy of the day. Take a few minutes to look at the graph below, then
discuss with a partner the following questions.
∗Department of Mathematics, Physical & Life Sciences, Embry-Riddle Aeronautical University, Worldwide;
†School of Teaching and Learning, University of Florida, Gainesville, FL, 32601; email@example.com.
1At the time this graph was drawn, the term planets was used to describe heavenly bodies that seemed to “wander”
(moving relative to the background of stars) and could be seen with unaided eyes. Therefore, the sun and moon would
be called planets.
Figure 1. Time-series graph from the 10th or possibly 11th century [Unknown, 1010]
Task 1 (a) As you can see, the image has no labels on the x- or y-axis. What do you think the x-axis
and y-axis represent?
(b) This graph is novel because a coordinate system was used to plot the various changes in
orbits for the seven planets over time. Why do you think that a coordinate system was
needed for this graph?
Now read the following discussion of this graph, written by a twentieth-century mathematician
The graph apparently was meant to represent a plot of the inclinations of the planetary orbits2
as a function of the time. For this purpose the zone of the zodiac was represented on a plane
with a horizontal line divided into thirty parts as the time or longitudinal axis. The vertical
axis designates the width of the zodiac. The horizontal scale appears to have been chosen for
each planet individually for the periods cannot be reconciled. The accompanying text refers
only to the amplitudes. The curves are apparently not related in time. [Funkhouser, 1936]
Task 2 After reading Funkhouser’s discussion of the graph in Figure 1, does your interpretation of the
x- and y-axis change or stay the same? If you changed your interpretation, in what way?
3 Communicating Data Visually
In the mid-15th century, the invention of the moveable-type3 printing press by Johannes Gutenberg
had a large impact on the Renaissance, in that information was able to be spread throughout Eu-
ropean civilization quickly and accurately compared to news previously carried by word of mouth
or handwritten letters and manuscripts. With the moveable-type printing system, components of a
document that was composed of text were much more easily reproduced. However, the printing of
graphical displays rather than summary tables or lists of numbers was much more difficult because
the graphics did not have common components like the moveable type. Instead, a new graphical
display had to be created for each printing which was very expensive, limiting their use for several
2The inclination of a planetary orbit is the angle between the horizon and the astronomical body being observed.
The website https://stellarium.org/ can be used as a virtual planetarium that you can use to find “star inclinations
from anywhere on Earth.
3This is a system of printing documents on paper using moveable components that are individual characters,
numbers, and punctuation marks. Earlier versions of the moveable-type printing press made from wood, clay, porcelain,
or bronze materials were known in China and Korea. Gutenbergs durable alloy and mold technique made his press
3.1 Distance Graph by van Langren
In 1644, Michael Florent van Langren (1598–1675) created a graph showing determinations of the
distance (measured in degrees of longitude) from Toledo, Spain to Rome, Italy defined by various
people at different times. Statisticians and historians credit this graph as one of the first visual
representations of statistical data. Van Langren served as an astronomer to the royal house of Spain.
Although astronomers at that time were able to determine latitude from star inclinations, longitude
was much harder to determine precisely. Finding an accurate method for determining longitude was
of political and economical value to Spain and other European nations interested in navigation at
sea. Van Langren prepared his graph as part of a request to the Spanish court for financial support
of his own efforts to solve the longitude problem.
Figure 2. Graph of the distance from Toledo, Spain to Rome, Italy by Michael Florent van
Langren [van Langren, 1643]
Task 3 Take a few minutes to study the graph in Figure 2. What do you gather about the determination
of distance in longitude from Toledo to Rome based on the spread of data points? What do
you notice about the data points and their relation to each other?
Task 4 As a class, choose two cities near your college or university that you would like to estimate the
miles between. Then, have everyone in the class write their estimate between the two cities on
a piece of paper (rounding to the nearest tenth of a mile) along with their name. Have one or
two students write all the estimates on the board as students call out their guesses.
(a) In groups of three to four students, create a graph similar to van Langrens using the class
estimates for the distance from the chosen city A to chosen city B.
(b) Then combine pairs of the groups (total of six to eight students) and discuss each groups
graph. What are the similarities between the two graphs? Are there any differences?
What does the spread of data points look like on each graph?
3.2 Common Statistical Graphs by Playfair
Historians of statistics consider William Playfair4 (1759–1823) to have been the first developer of
many common statistical graphics used today, including the pie chart, the bar graph, and the statis-
tical line graph. Through these inventions he was able to create a universal common language that
was used from science to commerce as a way to understand and look at data. The bar graph below is
from his 1786 book entitled Commercial and Political Atlas. Playfair’s intention in this book was to
represent data about the import/export of many countries that were prominent in foreign commerce
at the time. The atlas did not have much success in England, but was very well received in France.
Playfair reported in regard to King Louis XVI of France that “As his majesty made Geography a
study, he at once understood the charts and was highly pleased. He said they spoke all languages
and were very clear and easily understood (as quoted in [Spence and Wainer, 2001, p. 110]). The
graph below is for the country of Scotland for one year.
Figure 3. Playfair Bar Graph [Playfair, 1786]
4William had two brothers with their own professional success putting their names down in history. John Playfair
(1748-1819) was a noted mathematician of the Scottish Enlightenment and an original Fellow of the Royal Society of
Edinburgh. James (1755-1794) was the architect for Melville Castle outside Edinburgh.
Task 5 Compare and contrast this bar graph with the bar graphs we use today in the newspaper and
other media. What do you see that is different between the bar graph in Figure 3 compared
to a bar graph today? Does how we interpret the bar graph in Figure 3 differ from how we
interpret bar graphs made today?
The following graph is a pie chart that was also created by William Playfair. Playfair included
this graph in his 1805 translation of D. F. Donnants 1802 French text, titled Statistical Account of
the United States of America.
Figure 4. Playfair Pie Chart [Playfair, 1805]
Task 6 Given that this pie chart was created in 1805, what do you think each slice of the pie chart
Task 7 A map of the United States in 1804 is shown below. Why do the proportions represented by
the slices of the pie chart in Figure 4 look different than how we would expect those of a pie
chart of the present day United States to look?
Figure 5. Map of the United States in 1804 [Golbez, 2006]
The following circle graph was created by William Playfair and was published in his book ti-
tled Statistical Breviary: Shewing, on a Principle Entirely New, the Resources of Every State and
Kingdom in Europe that was published in 1801.
Figure 6. Playfair Circle Graph [Playfair, 1801]
Task 8 (a) Figure 6 is a complicated graph. Take about five minutes to study the graph taking note
that the axis on the left side is on a different scale compared to the axis on the right side
of the graph.
(i) What does each circle represent?
(ii) Does the radius of each circle mean something?
(iii) What does the axis on the left-side represent?
(iv) What does the axis on the right-side represent?
(v) What do each of the lines on the left and right of each circle represent?
(vi) Why is there a line connecting the left and right line for each circle?
(b) Have you seen any recent data displayed in this fashion? If so, do you recall the subject
matter? If not, why do you think we dont use this type of graph anymore?
4 Making an Argument by Telling a Data Story
Playfairs work brought data to a much wider audience than just mathematicians used to working
with large collections of numbers or summary tables. It was all about describing social and economic
conditions without overwhelming the reader with lists of numbers. His ideas were overlooked by his
contemporaries, but they would be revisited in the 19th century with greater success.
4.1 Graphs by Florence Nightingale
The well-to-do parents of Florence Nightingale (1820–1910) knew there was something “wrong with
their youngest daughter, who preferred studying mathematics to dancing and had no interest in
accepting the several offers of marriage made to her. Victorian high society would condone no other
occupation for a young lady than keeping a husbands house (through the labor of servants) and
raising his children. Nightingale, however, longed to care for the sick and related her unusual desire
for public service to a religious calling. Nineteenth-century British society considered nursing a
degrading occupation, often associated with drunkenness, squalor, and promiscuity. After years of
denials from her mother, Nightingale took an important step toward fulfilling her dream by going to
the Deaconess School at Kaiserwerth in Germany to train as a nurse in 1851. She spent a year as
the superintendent at the Institute for the Care of Sick Gentlewomen, probably the only place she
could ever have worked with the consent of her ever-proper mother.
The outbreak of war in the Crimea presented an opportunity for Nightingale to combine her
unconventional desire for occupation, medical training, and social connections in a patriotic cause.
When the first reports from the Battle of Alma included descriptions of the disastrous state of the
field hospitals, Nightingale wrote a letter to the Minister of War (a social acquaintance) volunteering
her services as a professional nurse, which actually crossed in the mail with his request to her to
lead a party of nurses to the Crimea. What she and her nurses found upon their arrival at Scutari
exceeded even the most shocking newspaper reports. In addition to directing the nurses in her
charge regarding the nutrition and hygiene for the thousands of wounded soldiers, she organized the
chaotic administrative records and collected data that would change the British and, subsequently,
the United States Army hospitals.5
The Minister of War once again wrote to Nightingale after her return to Britain, soliciting her
opinions on the state of the Army hospitals. Her response was more than 500 pages long and included
a striking illustration of the data she had gathered at the city of Scutari (Figure 7).
5This data collection became the basis for her report to the Minister of War which earned her an honorary fellowship
in the American Statistical Association in 1874. She had already been the first woman elected as a fellow in the Royal
Statistical Society in 1859.
Figure 7. Florence Nightingale’s “coxcombs” [Nightingale, 1858]
The text in the bottom left of the illustration serves as a legend. Within each wedge of this odd
pie chart (also called a polar area diagram or rose diagram), the area is computed in polar coordinates
using identical common angles of 30◦. The blue represents deaths by preventable diseases, the red
represents deaths resulting from wounds, and the black represents deaths by all other causes. The
left circle displays the deaths of soldiers in the first year of the war and the right one shows the
deaths in the second year.
Task 9 (a) Compare the wedges for the same months in the two years, paying particular attention
to the proportion of each color within the wedge. Consult a timeline of the major battles
and discuss how they align with the size of wedges in either year.
(b) What data did Nightingale need in order to calculate the areas to draw and to shade?
She did not have a computer to calculate or draw this illustration for her. Speculate on
the tools she needed in order to draw these illustrations.
(c) Have you seen any recent data displayed in this fashion? If so, do you recall the subject
matter? If not, can you speculate on a modern issue that could be displayed like this?
4.2 Graph by Charles Minard
Arguably one of the most elegant displays of data ever produced is Charles Joseph Minards (1781–
1870) Figurative map of the successive losses in men of the French Army in the Russian campaign
1812–13 shown below.
Figure 8. Carte figurative des pertes successives en hommes de l’Armée Franaise dans la
campagne de Russie 1812–1813 by Charles Joseph Minard [Minard, 1869]
Minard was a civil engineer by training and trade during the Napoleonic era in France (1799–
1815), and retired from government service in 1851 as the Inspector General of Bridges and Roads.
In his retirement, he created a collection of visualizations that culminated in this astounding piece
drawn in 1869 and still admired today. Here is a link to a larger version of the graph: https:
Task 10 (a) Count and describe the variables that Minard included in this visualization.
(b) The combination of so many variables on a two-dimensional representation is part of this
displays long-lasting appeal. Compare your list of variables with those of classmates and
discuss why each variable was important to Minard in the telling of this data story of the
You probably cannot read the tiny print of the legend at the top of the graph but it makes no
mention whatsoever of Napoleon! Minards interest was in the soldiers who suffered in this misguided
march. In the graph’s legend, he also gave credit for the sources of his data and stated that the scale
on the bands is one millimeter for every 10,000 men.
5 Beyond Paper and Ink
Nightingale and Minard provided examples of beautiful data displays with their laboriously con-
structed visualizations that told a story almost without words. As the utility of statistics expanded
through various sciences over the course of the 19th and early 20th centuries, refinement of Playfairs
simpler constructions led to the almost ubiquitous use of data displays that have become standard
in both public and scholarly publications: bar graph, pie chart, histogram, line chart, and scatter
5.1 Tufte’s Principles of Graphical Excellence
In the preface to the second edition of The Visual Display of Quantitative Information, Edward
Tufte (1942– ) describes the genesis of the first edition as part of a seminar series with John Tukey
(1915–2000) at Princeton University. Tukeys interest in “exploratory data analysis” focused on easy-
to-construct (by hand after minimal arithmetic) displays of data to complement statistical analysis.
Two such displays are still commonplace: the boxplot and the stem-and-leaf plot.
In the 2001 edition of The Visual Display of Quantitative Information, Tufte also repeats the
nine qualities of graphical excellence from the groundbreaking 1983 edition [Tufte, 2001, p. 13]:
Excellence in statistical graphics consists of complex ideas communicated with clarity, preci-
sion, and efficiency. Graphical displays should
• show the data
• induce the viewer to think about the substance rather than about methodology, graphic
design, the technology of graphic production, or something else
• avoid distorting what the data have to say
• present many numbers in a small space
• make large data sets coherent
• encourage the eye to compare different pieces of data
• reveal the data at several levels of detail, from a broad overview to the fine structure
• serve a reasonably clear purpose: description, exploration, tabulation, or decoration
• be closely integrated with the statistical and verbal descriptions of a data set.
Graphics reveal data. Indeed graphics can be more precise and revealing than conventional
Task 11 Review each of the seven figures shown in the previous sections, choosing two about which
to write a paragraph comparing the data display with Tuftes list of graphical excellence
Another idea initiated by Tufte is “data-ink ratio.” Data-ink is the part of the graph that cannot
be erased without loss of data. For example, the Playfair Circle Graph in Figure 6 includes more
gridlines than …