Publications by Julia Silge
A Beginner’s Guide to Travis-CI for R
Have you seen all those attractive green badges on other people’s R packages and thought, “I want a lovely green badge!” (Embedded tweet by Julia Silge, @juliasilge, April 1, 2016: “Always a nice feeling when Travis manages to actually build. #runconf16”) OF COURSE YOU DO. Well, let’s give it a shot, because today I am going to att...
8886 sym R (245 sym/3 pcs) 2 img
Term Frequency and tf-idf Using Tidy Data Principles
At the end of last week, Dave Robinson and I released a new version of tidytext on CRAN, our R package for text mining using tidy data principles. You can check out my first blog post about tidytext to learn a bit about the philosophy of the package and see some of the ways to use it, or see the package on GitHub. In this new release (tidytext 0....
7490 sym R (9698 sym/20 pcs) 12 img
Fatal Police Shootings Across the U.S.
I have been full of grief and sadness and some anger in the wake of yet more videos going viral in the past couple days showing black men being killed by police officers. I am not an expert on what it means to be a person of color in the United States or what is or isn’t wrong with policing today here, but it sure feels like something is deeply...
2212 sym 2 img
Return of the NEISS Data
Almost six months ago (!) I wrote a blog post about the NEISS data set, a sample of accidents reported to emergency rooms in the U.S. that are related to consumer products. Ever since I did that exploration, I have been wanting to ask a bit of a different question from that sample of accidents. How do the accidents that people suffer depend on th...
3829 sym R (3849 sym/13 pcs) 4 img
Something Strange in the Neighborhood
Today I was so pleased to see a new data package hit CRAN, and how wonderful to see such accomplished women writing R packages. (Embedded tweet by Julia Silge, @juliasilge, August 5, 2016: “What a great new data package on CRAN! And always great to see more women authors in #rstats”) The ghostr package includes a da...
2988 sym R (3995 sym/10 pcs)
We Are Not Very Evenly Distributed
I saw this tweet making the rounds this past week. (Embedded tweet by Conrad Hackett, @conradhackett, August 8, 2016: “Half of all Americans live in the red counties, half live in the orange counties”) Interesting! I saw people using this map to make the argument that the Electoral College was super important, or a terrible idea, or any of...
3807 sym R (4461 sym/9 pcs) 6 img
Song Lyrics Across the United States
The inspiration for this post is a joint venture by both me and my husband, and its genesis lies more than 15 years in our past. One of the recurring conversations we have in our relationship (all long-term relationships have these, right?!) is about song lyrics and place names. I think the first time we ever had this conversation was in the late...
7874 sym R (9580 sym/26 pcs) 8 img
Singing the Bayesian Beginner Blues
Earlier this week, I published a post about song lyrics and how different U.S. states are mentioned at different rates, and at different rates relative to their populations. That was a very fun post to work on, but you can tell from that paragraph near the end that I am a little bothered by the uncertainty involved in calculating the rates by jus...
5444 sym R (6036 sym/13 pcs) 10 img
Tidy Text Mining with R
I am so pleased to announce that tidytext 0.1.2 is now available on CRAN. This release of tidytext, a package for text mining using tidy data principles by Dave Robinson and me, includes some bug fixes and performance improvements, as well as some new functionality. There is now a handy function for accessing the various lexicons in the sentiment...
3897 sym R (1258 sym/6 pcs) 6 img
Mapping Election Results in Utah
My adopted home state of Utah has been a weird place this election cycle. For the unfamiliar, Utah is extremely conservative when it comes to politics; it is one of the reddest of the red states and has backed the Republican candidate for president for the past many decades. In 2012, about 3/4 of the popular vote went to Mitt Romney (who is LDS, ...
6270 sym R (5394 sym/15 pcs) 4 img