Publications by R on kieranhealy.org

Excess Deaths February Update

24.02.2021

The CDC continues to update its counts of deaths by cause for 2020 as data come in from the jurisdictions that report to it. The data are by now fairly complete, though there are still significant gaps in several states due to delayed reporting. North Carolina, in particular, has reported almost no deaths for the entire final quarter of 20...

4508 sym 110 img

Map, Walk, Pivot

04.05.2021

Recently I came across a question where someone was looking to take a bunch of CSV files, each of which contained numerical columns, and (a) get them into R, (b) calculate the mean and standard deviation of every column in every CSV file, and (c) calculate some overall summary like the mean of all the means and the mean of all the standard deviat...

8259 sym R (9277 sym/28 pcs) 2 img 14 tbl
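
The task described in the Map, Walk, Pivot excerpt (read several CSV files, summarize every column, then summarize the summaries) can be sketched in a few lines. The original post works in R; what follows is only a language-neutral illustration in standard-library Python, not the post's own code. The function names `summarize_csv` and `summarize_folder`, the demo file names, and the demo values are all hypothetical, and the sketch assumes every column is numeric and every file has at least two data rows.

```python
import csv
import statistics
import tempfile
from pathlib import Path


def summarize_csv(path):
    """Return {column: (mean, sd)} for every column of one CSV file.

    Assumes all columns are numeric and the file has at least two data rows.
    """
    with open(path, newline="") as handle:
        rows = list(csv.DictReader(handle))
    summary = {}
    for column in rows[0]:
        values = [float(row[column]) for row in rows]
        # statistics.stdev is the sample standard deviation (n - 1 denominator).
        summary[column] = (statistics.mean(values), statistics.stdev(values))
    return summary


def summarize_folder(folder):
    """Summarize each CSV in a folder, then average the per-column results."""
    per_file = {p.name: summarize_csv(p) for p in sorted(Path(folder).glob("*.csv"))}
    means = [m for cols in per_file.values() for m, _ in cols.values()]
    sds = [s for cols in per_file.values() for _, s in cols.values()]
    overall = {
        "mean_of_means": statistics.mean(means),
        "mean_of_sds": statistics.mean(sds),
    }
    return per_file, overall


# Hypothetical demo data: two small CSV files written to a temporary folder.
demo_dir = Path(tempfile.mkdtemp())
(demo_dir / "a.csv").write_text("x,y\n1,10\n2,20\n3,30\n")
(demo_dir / "b.csv").write_text("x,y\n4,0\n6,0\n8,0\n")

per_file, overall = summarize_folder(demo_dir)
print(per_file)
print(overall)
```

The design mirrors the three steps in the excerpt: one function handles a single file, a second maps it over every file in the folder, and the overall summary is computed from the collected per-column results rather than from the raw data.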

Covid Trajectories

03.09.2021

I updated the covdata package for the first time in a while, as I’ll be using it to teach in the near future. As a side-effect, I ended up taking a look at what the ongoing polarization or divergence of the COVID experience is like in different parts of the United States. Here I use county-level data to draw out some of the trends. The idea is ...

2466 sym 4 img

Excess Deaths in 2020

21.10.2021

Prompted by a guest visit to Mine Çetinkaya-Rundel’s Advanced Data Visualization class here at Duke, I’ve updated my US and state excess death graphs. Earlier posts (like this one from February) will update as well. I am interested in all-cause mortality in the United States for 2020. I look at each jurisdiction, ordered by how far off its 2...

5024 sym 112 img

The Polarization of Death

30.10.2021

I’m continuing to update the covdata package in anticipation of a Data Visualization for Social Science course I’ll teach next semester. I revisited the Partisan Trajectories graph, as it seems there’s more that could be done with it. More on that in the future, I hope. For now, here’s an updated version using the 2020 Presidential electi...

1796 sym 2 img

Comparing Distributions

19.12.2021

When we want to see how something varies across categories, the trellis or small multiple plot is a good friend. We repeatedly draw the same graph once for each category, lining them up in a way that makes them comparable. Here’s an example from my book, using the gapminder data, which provides a cross-national time series of GDP per capita for...

6926 sym R (9843 sym/26 pcs) 20 img 13 tbl

Clustering Pundits

15.02.2022

For the past few years, Jason Snell at Six Colors has conducted a survey of people who write about Apple. He asks a series of questions about the company and its products and presents a report of people’s answers. This year’s report has all the details for those interested. I’m a subscriber to Six Colors (it’s well worth it if you like t...

10112 sym R (851 sym/2 pcs) 10 img 1 tbl