Maya 2 Page To continue learning about data visualization, read sections 5 and 6 of the attachment.Also, watch this video.

Hans Rosling’s 200 Countries, 200 Years, 4 Minutes – the Joy of Stats (YouTube 4:47) (Links to an external site.)

Complete the following for this assignment. Task 11, 12, and 13 are located in the Seeing and Understanding document.

Select and complete either task 11 or 12.

Complete task 13

Re-play Rosling’s visualization at your own speed.

Explore the data for a single country by selecting it in the rightmost column.

Write a paragraph about the story of the world when the dynamic data includes all countries and another paragraph about how the country you watched fits or deviates from that story.

Give a careful description of the story told by the animation for all 200 countries and how the story of just a single country fits into the larger world picture.

Include a screenshot of the final year in that single-country story.

Use current APA formatting to cite your sources. Ursinus College

Digital Commons @ Ursinus College

Statistics and Probability

Transforming Instruction in Undergraduate

Mathematics via Primary Historical Sources

(TRIUMPHS)

Fall 2018

Seeing and Understanding Data

Beverly Wood

WW/Mathematics, Physical and Life Sciences, woodb14@erau.edu

Charlotte Bolch

cbolch@ufl.edu

Follow this and additional works at: https://digitalcommons.ursinus.edu/triumphs_statistics

Part of the Curriculum and Instruction Commons, Educational Methods Commons, Higher

Education Commons, Science and Mathematics Education Commons, and the Statistics and

Probability Commons

Click here to let us know how access to this document benefits you.

This Course Materials is brought to you for free and open access by the Transforming Instruction in Undergraduate Mathematics via Primary Historical

Sources (TRIUMPHS) at Digital Commons @ Ursinus College. It has been accepted for inclusion in Statistics and Probability by an authorized

administrator of Digital Commons @ Ursinus College. For more information, please contact aprock@ursinus.edu.

Recommended Citation

Wood, Beverly and Bolch, Charlotte, “Seeing and Understanding Data” (2018). Statistics and Probability. 2.

https://digitalcommons.ursinus.edu/triumphs_statistics/2

http://ursinus.edu?utm_source=digitalcommons.ursinus.edu%2Ftriumphs_statistics%2F2&utm_medium=PDF&utm_campaign=PDFCoverPages

http://ursinus.edu?utm_source=digitalcommons.ursinus.edu%2Ftriumphs_statistics%2F2&utm_medium=PDF&utm_campaign=PDFCoverPages

https://digitalcommons.ursinus.edu?utm_source=digitalcommons.ursinus.edu%2Ftriumphs_statistics%2F2&utm_medium=PDF&utm_campaign=PDFCoverPages

https://digitalcommons.ursinus.edu/triumphs_statistics?utm_source=digitalcommons.ursinus.edu%2Ftriumphs_statistics%2F2&utm_medium=PDF&utm_campaign=PDFCoverPages

https://digitalcommons.ursinus.edu/triumphs?utm_source=digitalcommons.ursinus.edu%2Ftriumphs_statistics%2F2&utm_medium=PDF&utm_campaign=PDFCoverPages

https://digitalcommons.ursinus.edu/triumphs?utm_source=digitalcommons.ursinus.edu%2Ftriumphs_statistics%2F2&utm_medium=PDF&utm_campaign=PDFCoverPages

https://digitalcommons.ursinus.edu/triumphs?utm_source=digitalcommons.ursinus.edu%2Ftriumphs_statistics%2F2&utm_medium=PDF&utm_campaign=PDFCoverPages

https://digitalcommons.ursinus.edu/triumphs_statistics?utm_source=digitalcommons.ursinus.edu%2Ftriumphs_statistics%2F2&utm_medium=PDF&utm_campaign=PDFCoverPages

http://network.bepress.com/hgg/discipline/786?utm_source=digitalcommons.ursinus.edu%2Ftriumphs_statistics%2F2&utm_medium=PDF&utm_campaign=PDFCoverPages

http://network.bepress.com/hgg/discipline/1227?utm_source=digitalcommons.ursinus.edu%2Ftriumphs_statistics%2F2&utm_medium=PDF&utm_campaign=PDFCoverPages

http://network.bepress.com/hgg/discipline/1245?utm_source=digitalcommons.ursinus.edu%2Ftriumphs_statistics%2F2&utm_medium=PDF&utm_campaign=PDFCoverPages

http://network.bepress.com/hgg/discipline/1245?utm_source=digitalcommons.ursinus.edu%2Ftriumphs_statistics%2F2&utm_medium=PDF&utm_campaign=PDFCoverPages

http://network.bepress.com/hgg/discipline/800?utm_source=digitalcommons.ursinus.edu%2Ftriumphs_statistics%2F2&utm_medium=PDF&utm_campaign=PDFCoverPages

http://network.bepress.com/hgg/discipline/208?utm_source=digitalcommons.ursinus.edu%2Ftriumphs_statistics%2F2&utm_medium=PDF&utm_campaign=PDFCoverPages

http://network.bepress.com/hgg/discipline/208?utm_source=digitalcommons.ursinus.edu%2Ftriumphs_statistics%2F2&utm_medium=PDF&utm_campaign=PDFCoverPages

https://ursinus.co1.qualtrics.com/jfe/form/SV_1RIyfqzdxsWfMQ5

https://digitalcommons.ursinus.edu/triumphs_statistics/2?utm_source=digitalcommons.ursinus.edu%2Ftriumphs_statistics%2F2&utm_medium=PDF&utm_campaign=PDFCoverPages

mailto:aprock@ursinus.edu

Seeing and Understanding Data

Beverly Wood∗ and Charlotte Bolch†

August 21, 2019

1 Introduction

Visual displays of data are commonly used today in media reports online or in print. For example,

data visualizations are sometimes used as a marketing tool to convince people to purchase a certain

product, or they are displayed in articles or magazines as a way to graphically display data to

emphasize a certain point. In general, it is hard to imagine the majority of disciplines in science and

mathematics not using data visualizations. However, before standard data visualization techniques

were developed (and accepted by the community), mathematicians and scientists very rarely used

graphical displays or pictures to represent empirical data.

This project has four main parts. The first section introduces some of the earliest data visual-

izations, which were novel constructions in their time. Next, we consider works of Michael Florent

van Langren (1598–1675) and William Playfair (1759–1823) that contain the first known uses of

statistical representations of data, some of which are still used today. The third section focuses on

the work of Florence Nightingale (1820–1910) and Charles Joseph Minard (1781–1870) and their

ability to construct data displays that made an argument or told a story. Finally, in the last section

we consider works of Edward Tufte (1942– ) and Hans Rosling (1948–2017) that had a major impact

on the current field of data visualizations and how to best create them using computer software.

2 “Ancient” Visualization

One of the earliest data visualizations in printed form is a times-series graph (Figure 1) from the

late 10th or early 11th century that shows changes in the orbital positions of seven “planets”1 over

time and space. It appears in an appendix to a commentary on a work of Cicero (106–43 BCE) that

reviews the physics and astronomy of the day. Take a few minutes to look at the graph below, then

discuss with a partner the following questions.

∗Department of Mathematics, Physical & Life Sciences, Embry-Riddle Aeronautical University, Worldwide;

Beverly.Wood@erau.edu.

†School of Teaching and Learning, University of Florida, Gainesville, FL, 32601; cbolch@ufl.edu.

1At the time this graph was drawn, the term planets was used to describe heavenly bodies that seemed to “wander”

(moving relative to the background of stars) and could be seen with unaided eyes. Therefore, the sun and moon would

be called planets.

1

∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞

Figure 1. Time-series graph from the 10th or possibly 11th century [Unknown, 1010]

∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞

Task 1 (a) As you can see, the image has no labels on the x- or y-axis. What do you think the x-axis

and y-axis represent?

(b) This graph is novel because a coordinate system was used to plot the various changes in

orbits for the seven planets over time. Why do you think that a coordinate system was

needed for this graph?

2

Now read the following discussion of this graph, written by a twentieth-century mathematician

and historian.

∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞

The graph apparently was meant to represent a plot of the inclinations of the planetary orbits2

as a function of the time. For this purpose the zone of the zodiac was represented on a plane

with a horizontal line divided into thirty parts as the time or longitudinal axis. The vertical

axis designates the width of the zodiac. The horizontal scale appears to have been chosen for

each planet individually for the periods cannot be reconciled. The accompanying text refers

only to the amplitudes. The curves are apparently not related in time. [Funkhouser, 1936]

∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞

Task 2 After reading Funkhouser’s discussion of the graph in Figure 1, does your interpretation of the

x- and y-axis change or stay the same? If you changed your interpretation, in what way?

3 Communicating Data Visually

In the mid-15th century, the invention of the moveable-type3 printing press by Johannes Gutenberg

had a large impact on the Renaissance, in that information was able to be spread throughout Eu-

ropean civilization quickly and accurately compared to news previously carried by word of mouth

or handwritten letters and manuscripts. With the moveable-type printing system, components of a

document that was composed of text were much more easily reproduced. However, the printing of

graphical displays rather than summary tables or lists of numbers was much more difficult because

the graphics did not have common components like the moveable type. Instead, a new graphical

display had to be created for each printing which was very expensive, limiting their use for several

more centuries.

2The inclination of a planetary orbit is the angle between the horizon and the astronomical body being observed.

The website https://stellarium.org/ can be used as a virtual planetarium that you can use to find “star inclinations

from anywhere on Earth.

3This is a system of printing documents on paper using moveable components that are individual characters,

numbers, and punctuation marks. Earlier versions of the moveable-type printing press made from wood, clay, porcelain,

or bronze materials were known in China and Korea. Gutenbergs durable alloy and mold technique made his press

commercially viable.

3

3.1 Distance Graph by van Langren

In 1644, Michael Florent van Langren (1598–1675) created a graph showing determinations of the

distance (measured in degrees of longitude) from Toledo, Spain to Rome, Italy defined by various

people at different times. Statisticians and historians credit this graph as one of the first visual

representations of statistical data. Van Langren served as an astronomer to the royal house of Spain.

Although astronomers at that time were able to determine latitude from star inclinations, longitude

was much harder to determine precisely. Finding an accurate method for determining longitude was

of political and economical value to Spain and other European nations interested in navigation at

sea. Van Langren prepared his graph as part of a request to the Spanish court for financial support

of his own efforts to solve the longitude problem.

∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞

Figure 2. Graph of the distance from Toledo, Spain to Rome, Italy by Michael Florent van

Langren [van Langren, 1643]

∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞

Task 3 Take a few minutes to study the graph in Figure 2. What do you gather about the determination

of distance in longitude from Toledo to Rome based on the spread of data points? What do

you notice about the data points and their relation to each other?

Task 4 As a class, choose two cities near your college or university that you would like to estimate the

miles between. Then, have everyone in the class write their estimate between the two cities on

a piece of paper (rounding to the nearest tenth of a mile) along with their name. Have one or

two students write all the estimates on the board as students call out their guesses.

(a) In groups of three to four students, create a graph similar to van Langrens using the class

estimates for the distance from the chosen city A to chosen city B.

(b) Then combine pairs of the groups (total of six to eight students) and discuss each groups

graph. What are the similarities between the two graphs? Are there any differences?

What does the spread of data points look like on each graph?

4

3.2 Common Statistical Graphs by Playfair

Historians of statistics consider William Playfair4 (1759–1823) to have been the first developer of

many common statistical graphics used today, including the pie chart, the bar graph, and the statis-

tical line graph. Through these inventions he was able to create a universal common language that

was used from science to commerce as a way to understand and look at data. The bar graph below is

from his 1786 book entitled Commercial and Political Atlas. Playfair’s intention in this book was to

represent data about the import/export of many countries that were prominent in foreign commerce

at the time. The atlas did not have much success in England, but was very well received in France.

Playfair reported in regard to King Louis XVI of France that “As his majesty made Geography a

study, he at once understood the charts and was highly pleased. He said they spoke all languages

and were very clear and easily understood (as quoted in [Spence and Wainer, 2001, p. 110]). The

graph below is for the country of Scotland for one year.

∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞

Figure 3. Playfair Bar Graph [Playfair, 1786]

∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞

4William had two brothers with their own professional success putting their names down in history. John Playfair

(1748-1819) was a noted mathematician of the Scottish Enlightenment and an original Fellow of the Royal Society of

Edinburgh. James (1755-1794) was the architect for Melville Castle outside Edinburgh.

5

Task 5 Compare and contrast this bar graph with the bar graphs we use today in the newspaper and

other media. What do you see that is different between the bar graph in Figure 3 compared

to a bar graph today? Does how we interpret the bar graph in Figure 3 differ from how we

interpret bar graphs made today?

The following graph is a pie chart that was also created by William Playfair. Playfair included

this graph in his 1805 translation of D. F. Donnants 1802 French text, titled Statistical Account of

the United States of America.

∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞

Figure 4. Playfair Pie Chart [Playfair, 1805]

∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞

6

Task 6 Given that this pie chart was created in 1805, what do you think each slice of the pie chart

represents?

Task 7 A map of the United States in 1804 is shown below. Why do the proportions represented by

the slices of the pie chart in Figure 4 look different than how we would expect those of a pie

chart of the present day United States to look?

∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞

Figure 5. Map of the United States in 1804 [Golbez, 2006]

∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞

7

The following circle graph was created by William Playfair and was published in his book ti-

tled Statistical Breviary: Shewing, on a Principle Entirely New, the Resources of Every State and

Kingdom in Europe that was published in 1801.

∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞

Figure 6. Playfair Circle Graph [Playfair, 1801]

∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞

Task 8 (a) Figure 6 is a complicated graph. Take about five minutes to study the graph taking note

that the axis on the left side is on a different scale compared to the axis on the right side

of the graph.

(i) What does each circle represent?

(ii) Does the radius of each circle mean something?

(iii) What does the axis on the left-side represent?

(iv) What does the axis on the right-side represent?

(v) What do each of the lines on the left and right of each circle represent?

(vi) Why is there a line connecting the left and right line for each circle?

(b) Have you seen any recent data displayed in this fashion? If so, do you recall the subject

matter? If not, why do you think we dont use this type of graph anymore?

8

4 Making an Argument by Telling a Data Story

Playfairs work brought data to a much wider audience than just mathematicians used to working

with large collections of numbers or summary tables. It was all about describing social and economic

conditions without overwhelming the reader with lists of numbers. His ideas were overlooked by his

contemporaries, but they would be revisited in the 19th century with greater success.

4.1 Graphs by Florence Nightingale

The well-to-do parents of Florence Nightingale (1820–1910) knew there was something “wrong with

their youngest daughter, who preferred studying mathematics to dancing and had no interest in

accepting the several offers of marriage made to her. Victorian high society would condone no other

occupation for a young lady than keeping a husbands house (through the labor of servants) and

raising his children. Nightingale, however, longed to care for the sick and related her unusual desire

for public service to a religious calling. Nineteenth-century British society considered nursing a

degrading occupation, often associated with drunkenness, squalor, and promiscuity. After years of

denials from her mother, Nightingale took an important step toward fulfilling her dream by going to

the Deaconess School at Kaiserwerth in Germany to train as a nurse in 1851. She spent a year as

the superintendent at the Institute for the Care of Sick Gentlewomen, probably the only place she

could ever have worked with the consent of her ever-proper mother.

The outbreak of war in the Crimea presented an opportunity for Nightingale to combine her

unconventional desire for occupation, medical training, and social connections in a patriotic cause.

When the first reports from the Battle of Alma included descriptions of the disastrous state of the

field hospitals, Nightingale wrote a letter to the Minister of War (a social acquaintance) volunteering

her services as a professional nurse, which actually crossed in the mail with his request to her to

lead a party of nurses to the Crimea. What she and her nurses found upon their arrival at Scutari

exceeded even the most shocking newspaper reports. In addition to directing the nurses in her

charge regarding the nutrition and hygiene for the thousands of wounded soldiers, she organized the

chaotic administrative records and collected data that would change the British and, subsequently,

the United States Army hospitals.5

The Minister of War once again wrote to Nightingale after her return to Britain, soliciting her

opinions on the state of the Army hospitals. Her response was more than 500 pages long and included

a striking illustration of the data she had gathered at the city of Scutari (Figure 7).

5This data collection became the basis for her report to the Minister of War which earned her an honorary fellowship

in the American Statistical Association in 1874. She had already been the first woman elected as a fellow in the Royal

Statistical Society in 1859.

9

∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞

Figure 7. Florence Nightingale’s “coxcombs” [Nightingale, 1858]

∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞

The text in the bottom left of the illustration serves as a legend. Within each wedge of this odd

pie chart (also called a polar area diagram or rose diagram), the area is computed in polar coordinates

using identical common angles of 30◦. The blue represents deaths by preventable diseases, the red

represents deaths resulting from wounds, and the black represents deaths by all other causes. The

left circle displays the deaths of soldiers in the first year of the war and the right one shows the

deaths in the second year.

Task 9 (a) Compare the wedges for the same months in the two years, paying particular attention

to the proportion of each color within the wedge. Consult a timeline of the major battles

and discuss how they align with the size of wedges in either year.

(b) What data did Nightingale need in order to calculate the areas to draw and to shade?

She did not have a computer to calculate or draw this illustration for her. Speculate on

the tools she needed in order to draw these illustrations.

(c) Have you seen any recent data displayed in this fashion? If so, do you recall the subject

matter? If not, can you speculate on a modern issue that could be displayed like this?

10

4.2 Graph by Charles Minard

Arguably one of the most elegant displays of data ever produced is Charles Joseph Minards (1781–

1870) Figurative map of the successive losses in men of the French Army in the Russian campaign

1812–13 shown below.

∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞

Figure 8. Carte figurative des pertes successives en hommes de l’Armée Franaise dans la

campagne de Russie 1812–1813 by Charles Joseph Minard [Minard, 1869]

∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞

Minard was a civil engineer by training and trade during the Napoleonic era in France (1799–

1815), and retired from government service in 1851 as the Inspector General of Bridges and Roads.

In his retirement, he created a collection of visualizations that culminated in this astounding piece

drawn in 1869 and still admired today. Here is a link to a larger version of the graph: https:

//upload.wikimedia.org/wikipedia/commons/e/e2/Minard_Update.png

Task 10 (a) Count and describe the variables that Minard included in this visualization.

(b) The combination of so many variables on a two-dimensional representation is part of this

displays long-lasting appeal. Compare your list of variables with those of classmates and

discuss why each variable was important to Minard in the telling of this data story of the

French Army.

You probably cannot read the tiny print of the legend at the top of the graph but it makes no

mention whatsoever of Napoleon! Minards interest was in the soldiers who suffered in this misguided

march. In the graph’s legend, he also gave credit for the sources of his data and stated that the scale

on the bands is one millimeter for every 10,000 men.

11

5 Beyond Paper and Ink

Nightingale and Minard provided examples of beautiful data displays with their laboriously con-

structed visualizations that told a story almost without words. As the utility of statistics expanded

through various sciences over the course of the 19th and early 20th centuries, refinement of Playfairs

simpler constructions led to the almost ubiquitous use of data displays that have become standard

in both public and scholarly publications: bar graph, pie chart, histogram, line chart, and scatter

plot.

5.1 Tufte’s Principles of Graphical Excellence

In the preface to the second edition of The Visual Display of Quantitative Information, Edward

Tufte (1942– ) describes the genesis of the first edition as part of a seminar series with John Tukey

(1915–2000) at Princeton University. Tukeys interest in “exploratory data analysis” focused on easy-

to-construct (by hand after minimal arithmetic) displays of data to complement statistical analysis.

Two such displays are still commonplace: the boxplot and the stem-and-leaf plot.

In the 2001 edition of The Visual Display of Quantitative Information, Tufte also repeats the

nine qualities of graphical excellence from the groundbreaking 1983 edition [Tufte, 2001, p. 13]:

∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞

Excellence in statistical graphics consists of complex ideas communicated with clarity, preci-

sion, and efficiency. Graphical displays should

• show the data

• induce the viewer to think about the substance rather than about methodology, graphic

design, the technology of graphic production, or something else

• avoid distorting what the data have to say

• present many numbers in a small space

• make large data sets coherent

• encourage the eye to compare different pieces of data

• reveal the data at several levels of detail, from a broad overview to the fine structure

• serve a reasonably clear purpose: description, exploration, tabulation, or decoration

• be closely integrated with the statistical and verbal descriptions of a data set.

Graphics reveal data. Indeed graphics can be more precise and revealing than conventional

statistical computations.

∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞∞

Task 11 Review each of the seven figures shown in the previous sections, choosing two about which

to write a paragraph comparing the data display with Tuftes list of graphical excellence

qualities.

12

Another idea initiated by Tufte is “data-ink ratio.” Data-ink is the part of the graph that cannot

be erased without loss of data. For example, the Playfair Circle Graph in Figure 6 includes more

gridlines than …