Publications by Louise E. Sinks
TidyTuesday Week 27: Historical Markers
Today’s TidyTuesday is about historical markers with the data coming from the Historical Marker Database. I’m going to add to the map that I made last week with information about Historic Districts in Arlington, VA. I’m going to make an interactive leaflet map with the new information added to the old map. Loading libraries. library(tidyvers...
2407 sym R (5740 sym/20 pcs)
Tidy Tuesday Revisited: Interactive Map of Arlington Historic Neighborhoods
This week’s TidyTuesday is about place names as recorded by the US Board on Geographic Names. The dataset has been cleaned to include only populated places. I ended up augmenting the dataset with information about Arlington Historic neighborhoods and current neighborhood boundaries. My post with code on this project is here. I wanted to create an...
2751 sym R (584 sym/5 pcs)
Tidy Tuesday: US Populated Places
Today’s TidyTuesday is about place names as recorded by the US Board on Geographic Names. The dataset has been cleaned to include only populated places. This week will involve more libraries than normal, since I am going to play with mapping. library(tidyverse) # who doesn't want to be tidy? library(ggthemes) # more themes for ggplot library(gt)...
8717 sym R (2816 sym/18 pcs) 4 tbl
TidyTuesday Week 25: UFO Sightings Redux
I haven’t been TidyTuesdaying because I’ve been learning Tableau. I’ll write more about that later; but it has been an interesting experience and provides different perspectives on data compared to what you might get from R. (I’m sure you could reproduce everything in Tableau in R and vice versa, but it is certainly easier to perform certai...
7846 sym R (4258 sym/39 pcs) 6 img 6 tbl
TidyTuesday Week 18: Portal Project
Today’s TidyTuesday is about the Portal Project, which is a long terms study on the ecology of Arizona. The study explores how ants, rodents, plants respond to climate in the desert near Portal, Az. A subset of the data is provided for this week’s TidyTuesday. Loading libraries. Not really using anything fancy today! library(tidyverse) library...
6433 sym R (4055 sym/30 pcs) 2 img
TidyTuesday Week 17: London Marathon
Today’s TidyTuesday is based on a dataset about the London Marathon. The data is via a package by Nicola Rennie and there is an accompanying tutorial about how to scrape data from the web. library(skimr) library(tidyverse) library(ggthemes) library(skimr) Load the data with the TidyTuesday package in the usual way. tuesdata <- tidytuesdayR::tt...
5192 sym R (4491 sym/34 pcs) 16 img 4 tbl
TidyTuesday Week 16: Neolithic Founder Crops
Today’s TidyTuesday relates to the use of plants in the Neolithic era and is based on a paper by Arranz-Otaegul and Roe. The authors have made their data and analysis available on GitHub. The methods for generating all the figures and tables are in an RMarkdown document with some explanatory text. Having just recently looked at the code and data ...
4907 sym R (2479 sym/7 pcs) 2 img 1 tbl
Credit Card Fraud: A Tidymodels Tutorial
1. Classification using tidymodels I will walk through a classification problem from importing the data, cleaning, exploring, fitting, choosing a model, and finalizing the model. I wanted to create a project that could serve as a template for other two-class classification problems. I also wanted to fully use the tidymodels framework, particularly ...
26467 sym R (25235 sym/100 pcs) 28 img 7 tbl
A Tidymodels Tutorial
As I’ve started working on more complicated machine learning projects, I’ve leaned into the tidymodels approach. Tidymodels is a highly modular approach, and I felt it reduced the number of errors, especially when evaluating many machine models and different preprocessing steps. (This is, in fact, a stated goal of the tidymodels ecosystem.) Thr...
21979 sym R (18027 sym/98 pcs) 6 img
30 Day Chart Challenge- Endangered Species
It is Day 4 of the #30DayChartChallenge. More info can be found at the challenge’s Github page. Today’s theme is history. But this is a subtheme of “comparisions”, so I’d like to avoid doing a simple time series. I decided to look at the endangered species list the US Fish and Wildlife Service maintains. They have a bunch of data spread o...
3912 sym R (4506 sym/18 pcs) 2 img 4 tbl