Publications by Data Science Riot!
Calculate Inflation with the blscrapeR Package
The Consumer Price Index (CPI) is the main standard for tracking the inflation of the U.S. dollar. The various CPI measures are published monthly by the Bureau of Labor Statistics. For this walk-through, we will be using the blcsrapeR package to download our data from the BLS and perform the calculation. The blscrapeR package can be installed via...
2484 sym R (1115 sym/8 pcs)
Calculate Inflation with the blscrapeR Package
The Consumer Price Index (CPI) is the main standard for tracking the inflation of the U.S. dollar. The various CPI measures are published monthly by the Bureau of Labor Statistics. For this walk-through, we will be using the blcsrapeR package to download our data from the BLS and perform the calculation. The blscrapeR package can be installed via...
2484 sym R (1115 sym/8 pcs)
Shiny Server on Docker: CentOS 7 Edition
Docker is generally used for application development and deployment. While it is possible to develop and deploy Shiny applications in Docker containers, I have found it is much more useful to keep a Shiny Docker container that is a twin of my production server. This allows us to test new versions and new applications before putting them into prod...
2919 sym R (426 sym/11 pcs)
Shiny Server on Docker: CentOS 7 Edition
Docker is generally used for application development and deployment. While it is possible to develop and deploy Shiny applications in Docker containers, I have found it is much more useful to keep a Shiny Docker container that is a twin of my production server. This allows us to test new versions and new applications before putting them into prod...
2919 sym R (426 sym/11 pcs)
Creating a Baseball Database with baseballDBR
My original motivation to write the baseballDBR package for R was to provide a quick and easy way to have access to Sean Lahman’s Baseball Database. The Lahman package has been around for several years, and is a great resource, however it lacks consistant updates. Also, the CRAN repository has limits on how large data packages can be, and the L...
1381 sym R (668 sym/2 pcs) 2 img
Creating a Baseball Database with baseballDBR
My original motivation to write the baseballDBR package for R was to provide a quick and easy way to have access to Sean Lahman’s Baseball Database. The Lahman package has been around for several years, and is a great resource, however it lacks consistant updates. Also, the CRAN repository has limits on how large data packages can be, and the L...
1381 sym R (668 sym/2 pcs) 2 img
American vs. National League wOBA Vales.
Traditionally, statistics like wOBA (weighted on-base average) have been calculated using league averages. While building the baseballDBR package, I thought it would be interesting to group the American and National when making wOBA calculations. In theory, there should be parity across the two leagues, but that is not always the case. In order t...
1875 sym R (1951 sym/6 pcs) 4 img
American vs. National League wOBA Vales.
Traditionally, statistics like wOBA (weighted on-base average) have been calculated using league averages. While building the baseballDBR package, I thought it would be interesting to group the American and National when making wOBA calculations. In theory, there should be parity across the two leagues, but that is not always the case. In order t...
1885 sym R (1951 sym/6 pcs) 4 img
Tidyer BLS data with the blscarpeR package
The recent release of the blscrapeR package brings the “tidyverse” into the fold. Inspired by my recent collaboration with Kyle Walker on his excellent tidycensus package, blscrapeR has been optimized for use within the tidyverse as of the current version 3.0.0. New things you’ll notice right away include: All data now returned as tibbles....
2258 sym R (1433 sym/4 pcs) 4 img
Tidyer BLS data with the blscarpeR package
The recent release of the blscrapeR package brings the “tidyverse” into the fold. Inspired by my recent collaboration with Kyle Walker on his excellent tidycensus package, blscrapeR has been optimized for use within the tidyverse as of the current version 3.0.0. New things you’ll notice right away include: All data now returned as tibbles....
2258 sym R (1404 sym/4 pcs) 4 img