Publications by Kitada Smalley
DATA151: Guinea Pigs
data("ToothGrowth") ?ToothGrowth str(ToothGrowth) ## 'data.frame': 60 obs. of 3 variables: ## $ len : num 4.2 11.5 7.3 5.8 6.4 10 11.2 11.2 5.2 7 ... ## $ supp: Factor w/ 2 levels "OJ","VC": 2 2 2 2 2 2 2 2 2 2 ... ## $ dose: num 0.5 0.5 0.5 0.5 0.5 0.5 0.5 0.5 0.5 0.5 ... 1) What do the rows of this dataset represent? The observations ar...
600 sym R (1728 sym/14 pcs) 4 img
FiveThirtyEight: Voter Registration
A. Read the Article “Voter Registrations Are Way, Way Down During the Pandemic” (Jun 26, 2020) by Kaleigh Rogers and Nathaniel Rakich https://fivethirtyeight.com/features/voter-registrations-are-way-way-down-during-the-pandemic/ B. Discuss in Small Groups How are graphics used to tell the author’s story? What geometries are used? C. The Da...
696 sym R (1135 sym/6 pcs) 1 img
DATA151: Gotta Catch Em All
How to Start: Create a new R markdown document for the Midterm template Save new Rmd in your R Studio Cloud as “data151_mid2_YOURNAME” Change the author in the heading of the R markdown Change the date Pokemon For this assessment you will use a dataset about Pokemon. I wrangled these data from two different sources on Kaggle: https://www.kag...
4594 sym R (5826 sym/25 pcs) 4 img
DataViz: Scales Solutions
Adapted from a lab written by Prof Charlotte Wickham LAB SOLUTIONS: Exploring flight delays To experiment with some features of scales we are going to work with a heatmap of the proportion of flights that have a departure delay of more that 15 minutes (prop_over_15) at the George Bush Intercontinental Airport (IAH) by day of the week (DayOfWeek) ...
3930 sym R (2708 sym/26 pcs) 10 img
DataViz: Polish Solutions (2023)
Adapted from a lab written by Prof Charlotte Wickham The data These data come from the hflights package, but some summarization is done for you. library(tidyverse) iah <- read.csv("https://raw.githubusercontent.com/kitadasmalley/Teaching/main/DATA502/FA2023/R_Markdown/Week8/iah_flightSummary.csv") str(iah) ## 'data.frame': 156 obs. of 6 varia...
842 sym R (7671 sym/30 pcs) 13 img
DataViz: Scales Directions (2023)
Adapted from a lab written by Prof Charlotte Wickham LAB DIRECTIONS: Exploring flight delays To experiment with some features of scales we are going to work with a heatmap of the proportion of flights that have a departure delay of more that 15 minutes (prop_over_15) at the George Bush Intercontinental Airport (IAH) by day of the week (DayOfWeek)...
3911 sym R (1337 sym/16 pcs)
DATA151: Tidyr and Joins (Fall 2023)
Learning Objectives In this lesson students will gain more practice with the tidyverse with a particular focus on wrangling the tidyr package. This lesson covers: tidyr Package gather() spread() unite() separate() Joins: left_join, inner_join, right_join Before you start you will need to load the tidyverse library(tidyverse) PART I: tidyr Doe...
2350 sym R (16117 sym/69 pcs) 7 img
DataViz: Alternative to Pie Charts
Commuting Data The following data show commuting transportation methods for states on the West Coast of the United States. library(tidyverse) #rds is an special r dataframe commute <- readRDS("~/Desktop/DataViz Reading/Week 5 Data and Code/commute.rds") # a subset of states for class states <- c("or", "ak", "ca", "wa", "id", "nv") commute_nw <- f...
618 sym R (685 sym/3 pcs) 1 img
(UPDATED) DataViz: Basics
I. Basic Graphics We will be doing simplified reproductions of figures found in Storytelling with Data. When we learn more about ggplot we can personalize and polish our plots more. 1) Heatmapped Table To create a heatmapped table you might want to use the following packages: ztable or tidyverse. I had a lot of difficult with installing ztable. So...
1822 sym R (8358 sym/40 pcs) 14 img
DATA252:Hierarchical Clustering
Learning Objectives In this lesson students will … Implement the Hierarchical clustering algorithm Visualize the model Resources: The following example comes from: Machine Learning for Biostatistics https://bookdown.org/tpinto_home/Unsupervised-learning/k-means-clustering.html Hierarchical clustering Step 0: Load Data bdiag<- read.csv("https...
497 sym R (1786 sym/19 pcs) 4 img