Final Group Project
For the final project, you will work in groups of three or four to explore a
dataset with about 1000 to 4000
data points and turn your findings into a public history project, hosted
on your own project websites.
These datasets include, for example,
Cigarette advertisements
,
Opioid prescription data
,
Experience reports from people who take drugs
, or
radio show
transcripts.
Each project website needs to include:
- A narrative of about 2000 words that describes your research and
findings
- At least five visualizations that explore clearly defined questions
- An “About” page that describes the sources, processing, and presentation
that went into the project as well as your explanations for choices that
you make. The “About” page should also include short bios of the team
members and outline who work on what aspect of the project.
- An annotated bibliography of the literature and projects consulted
- A data critique that explains what information is included in your dataset
and what questions it can and cannot answer.
Sample Projects
For inspiration, have a look at these student projects:
Datasets
Here are datasets that you could consider working with:
-
Reddit Corpora: This is a great site
that gives you access to Reddit submissions in a very flexible way.
For example, you could get comments from
r/Drugs. Or you could get
a sample of comments mentioning Fentanyl from 2015 through 2020 and compare
them. Or you could get samples from different areas (r/Kentucky vs.
r/Boston) and see what patterns you can identify.
Mode of analysis: Primarily text, possibly geographic
-
Erowid: We have already seen
Visualizing Erowid. However,
the rich user reports aggregated by
Text Mining Altered States of Consciousness can yield many more insights.
Mode of analysis: Text
-
DEA Opioid Prescription Data
Multiple options. e.g. map out where drugs are distributed by pharmacies
in a county, potentially compare prescription rates to number of inhabitants
in a town. (In essence, a version of
this map. Geocoded pharmacy list available
here.
Mode of analysis: Mapping
-
Cigarette
Advertisments: This website collects thousands of cigarette
advertisements organized by themes. Could be used for an online exhibit
of a particular theme but also to generate, for example,
"average" advertisements by decade.
Mode of analysis: Images, online exhibits.
Acknowledgements
This class as a whole, but this final project in particular strongly
draws--with permission--on the final assignment of
Miriam Posner's fantastic
DH 101 class.