Assignment: Deceptive Visualization
As visualizations increasingly play a role in how the general public consumes news and information, it is important to realize how a visualization design may influence what a viewer concludes and remembers about the data. In this assignment, you will identify a dataset of interest and design two static visualizations to communicate the data. One design will be an earnest representation of the data, whereas the other will be a deceptive visualization that aims to mislead the viewer. For this second visualuzation, the design challenge is to attempt to mislead without using overtly obvious distortions or omissions.
Assignment Description
Your task is to design two static (single image) visualizations of your chosen dataset. One visualization should aim to effectively and earnestly communicate insights about the data, whereas the other visualization design should aim to mislead the viewer into drawing the wrong conclusions. You will also provide a short write-up (no more than 4 paragraphs) describing your design rationale for both visualizations.
For this assignment, we will consider an earnest visualization to be one where:
- The visualization is clear and interpretable by the general population
- The visual encodings are appropriate and effective for the intended task
- Data transformations are clearly and transparently communicated
- The underlying data source (and any potential bias) is clearly communicated
On the other hand, a deceptive visualization may exhibit the following properties:
- The visual representation is intentionally inappropriate or misleading
- Titles are skewed to intentionally influence the viewer’s perception
- The data has been transformed or filtered in an intentionally misleading way
- The existence or source of bias in the underlying data is unclear
For the earnest visualization, your goal is to be as clear and transparent as possible to help viewers answer your intended question. For the deceptive visualization, your goal is to trick the viewer (including the course staff!) into believing that the visualization is legitimate and earnest. Ideally, it should not be immediately obvious which visualization is trying to be deceptive. Subtle ineffective choices in the design should require conscientious reading to be identified.
For the deceptive visualization, misleading strategies are fine but outright lying is not. For example, sketchy, unreliable or untrustworthy input datasets are discouraged, but misleading omission, filtering, or transformation of trustworthy data records is fine. Deliberate lies in the title, axes, labels, or annotations is discouraged, but technically true/relevant but otherwise misleading text in the visualization is fine.
For both visualization designs, start by choosing a question you would like to answer. Design your visualization to answer that question either correctly (for the earnest visualization) or incorrectly (for the deceptive visualization). You may choose to address a different question with each visualization. Be sure to document the question as part of the visualization design (e.g., title, subtitle, or caption) and in your assignment write-up. We recommend titling each chart with your intended (earnest or deceptive) answer to your question.
Your write-up should contain the following information:
- The specific question each visualization aims to answer
- A description of your design rationale and important considerations for each visualization
Recommended Data Sources
To get up and running quickly with this assignment, we recommend using one of the following provided datasets or sources, but you are free to use any dataset of your choice. You must use the same dataset for both visualizations, but you may transform the data differently, use additional data variables, or choose to address a different question for each design. These datasets are intentionally chosen to cover politically charged topics for the simple reason that these are typically the types of data where deceptive visualizations may proliferate.
The World Bank Data, 1960-2018
The World Bank has tracked global human development by indicators such as climate change, economy, education, environment, gender equality, health, and science and technology since 1960. You can browse the data by indicators or by countries. Click on an indicator category or country to download the CSV file.
Greenhouse Gas Emissions, 1990-2019
The Organization for Economic Co-operation and Development (OECD) has compiled data for the emissions of all participating countries broken out by the pollutant (e.g., carbon monoxide, methane, etc.) and by different sources (e.g., energy, agriculture, etc.). While the linked interface can be somewhat hard to navigate, you are free to browse alternate themes from the panel on the left (such as education or health). You can download the data by selecting “Export” at the top of the chosen table.
Data: Greenhouse Gas Emissions
DEA Pain Pills Database
The Washington Post has published a significant portion of a database maintained by the Drug Enforcement Administration (DEA) that tracks every opioid from their manufacturer, through to distributors, and into pharmacies in towns and cities across the United States. Note that this is a very large dataset with many different facets, so you may want to focus on a particular area or set of attributes of interest.
Important Note: In order to download and use this data you may need to enter your email in order to access the Washington Post Data Access page
Data: Washington Post Data Access Page
Others
Here are some other possible sources to consider. You are also free to use data from a source different from those included here. If you have any questions on whether a dataset is appropriate, please ask the course staff ASAP!
- data.seattle.gov - City of Seattle Open Data
- data.wa.gov - State of Washington Open Data
- data.gov - U.S. Government Open Datasets
- U.S. Census Bureau - Census Datasets
- IPUMS.org - Integrated Census & Survey Data from around the World
- Federal Elections Commission - Campaign Finance & Expenditures
- Federal Aviation Administration - FAA Data & Research
- NOAA Daily Weather - Daily Historical Climatology Network Data
- yelp.com/dataset - Yelp Open Dataset
- fivethirtyeight.com - Data and Code behind the Stories and Interactives
- Buzzfeed News - Open-source data from BuzzFeed’s newsroom
- Kaggle Datasets - Datasets for Kaggle contests
- List of datasets useful for course projects - curated by Mike Freeman
Grading
We will determine scores by judging the soundness of your visualization designs, the duplicity of your deceptive visualization, and the quality of the write-up. Here are examples of aspects that may lead to point deductions:
- Obvious identification of the earnest and deceptive visualizations.
- Ineffective visual encodings for your stated goal.
- Missing indication of the main analysis question.
- Missing or incomplete design rationale in write-up.
We will reward entries that go above and beyond the assignment requirements to produce effective (and deceptive) graphics. Examples may include outstanding visual design, effective annotations and other narrative devices, exceptional creativity, or deceptive designs that require the write-up in order to properly identify the misleading design components.
Submission Details
This is an individual assignment. You may not work in groups. Submit your assignment by completing this page and publishing it to your GitLab repository. The rendered page should be viewable on your GitLab pages site.
Second, you must submit your write-up and two screenshot images of your visualizations to Gradescope. The images must be in either PNG or JPG format, and properly sized to be comfortably viewable on a screen without scrolling or zooming. In addition, the file names must end with the text -earnest
(for your earnest chart) or -deceptive
(for your deceptive chart).
For example, you might upload the files w2-earnest.png
and w2-deceptive.png
to Gradescope. Please be sure to spell earnest and deceptive exactly as indicated here.
We will use automated scripts to prepare your charts for peer review!
Acknowledgements
The design of this assignment is based on the experience and recommendations of Niklas Elmqvist and a similar assignment from Arvind Satyanarayan at MIT.
Earnest Visualization
Replace this text and put your earnest visualization here.
Deceptive Visualization
Replace this text and put your deceptive visualization here.
Write Up
Replace this text and put your write-up here. The write-up should be about two paragraphs, one for each chart.