I’ve started learning Python, so I decided to apply some of my newly developing skills to this Tidy Tuesday from a few weeks ago. The data come from the Ask a Manager Survey, which includes earnings information from more than 24,000 self-selecting survey respondents. The respondents are non-random and skew heavily toward white women in professional jobs in the United States. While exploring the data, I found, unsurprisingly, that formal education and years of experience in a field seem to have a profound effect on compensation.
This week’s Tidy Tuesday deals with commercial fishing on the Great Lakes. While exploring the data, I was struck by a rapid increase followed by a rapid decline in commercial fish hauls. I was further struck by how much of this rise and fall occurred entirely due to one species (alewife) in one lake (Michigan). It turns out that alewife are an invasive species that were first found in Lake Michigan in 1949.
This week’s Tidy Tuesday deals with Mario Kart 64 world records. In my exploration of the data, I found that newly discovered shortcuts can lead to massive improvements in world record times. While the records without shortcuts tend to improve very gradually, records with shortcuts can show large, sudden improvements. Here’s a plot showing the biggest jumps: My source code and data exploration is available on GitHub.
It’s important to add alt text to images in order to make them accessible to users of screen readers. I compose this blog using the excellent blogdown package, which enables me to easily include code-generated plots. Yesterday, I decided to finally figure out how to add alt text to these plots. I’m sharing what I learned in order to help others in the R community make their visualizations more accessible and as a reminder to myself.
This week’s Tidy Tuesday includes data on broadband usage in the United States. I started out with some exploratory analysis of this data set. Using linear regression, I modeled the relationship of broadband usage in a county to broadband availability (per the FCC), the poverty rate, median household income, the percentage of Black residents, and the county’s rural or urban character. I found that each of these parameters had a statistically significant impact on broadband usage, but when constructing a multivariate model, I found that, of these factors, only broadband availability and median household income explained a significant amount of variance:
This week’s Tidy Tuesday uses data from Water Point Data Exchange, an organization which gathers water point data from various sources with the goal of improving water access for millions of people. This week I decided to take a different approach than usual and not use ggplot2. Instead, I used Leaflet to create an interactive map of water sources in Madagascar with information about each source visible as a popup.
For this Tidy Tuesday, I decided to try my hand at interactive visualization. This week’s data comes from the Urban Institute and includes all sorts of interesting and important demographic data on wealth and income distribution. For the purposes of this visualization, I focused purely on income distribution.
Inspired by Cameron Blevins' visualization (and using his data) I created this Tidy Tuesday entry, my first animated plot. Code available in my GitHub repository.
For this Tidy Tuesday entry, I’m looking at data from “Avatar: The Last Airbender”. I was curious if the central character of each episode (based on number of lines spoken) correlated with the episode’s IMDB rating. A first step was to determine the main character for each episode. If Aang speaks the most lines in nearly every episode then this exercise would be pretty pointless. It turns out that—unsurprisingly—Aang speaks more lines in more episodes than anyone else (21 episodes), followed by Sokka (17), Katara (10), Zuko (7), and Azula (2).
I have a new article out on the Rosa Luxemburg Stiftung’s website, As Goes Georgia: What Is at Stake in the Runoff Elections: We don’t know how these runoff elections will turn out, but we do know that whatever the outcome, it will determine the national political terrain at the beginning of the next presidential administration. In Georgia, win or lose, these elections are one more step in challenging Republican dominance and building real progressive power.