Publications by Kitada Smalley
DataViz: Interactive Demos (Part2)
The following demos are modified from https://www.r-graph-gallery.com/interactive-charts.html. Part 1 For scatterplots, heatmaps, and area plots see: https://rpubs.com/hsmalley/dataViz_interact Part 2 IV. Chord diagrams with chorddiag #devtools::install_github("mattflor/chorddiag") library(chorddiag) # Create dummy data m <- matrix(c(11975, ...
285 sym R (1923 sym/2 pcs)
DataViz: Thanksgiving
Motivation Thanksgiving is just around the corner. Let’s celebrate the best way we know how, by analyzing some festive data! I hope you have fun making graphics! Happy Thanksgiving! Dataset/Article We are using the data from fivethirtyeight the Here’s What Your Part of America Eats on Thanksgiving article https://fivethirtyeight.com/features...
2760 sym R (8953 sym/14 pcs) 2 img
MATH138: Cereal MLR
Motivating Example: Breakfast Cereal Data is a sample of 30 breakfast cereals. cereal<-read.delim("https://www.lock5stat.com/datasets/Cereal.txt", header=TRUE) head(cereal) ## Name Company Serving Calories Fat Sodium Carbs Fiber Sugars ## 1 AppleJacks K 1.00 117 0.6 143 27 0....
290 sym R (3981 sym/22 pcs) 6 img
DataViz: Ensemble Graphics
Ensemble Graphics These examples modified from Chapter 12 Ensemble Graphics and Case Studies in Graphical Data Analysis with R,by Antony Unwin Motivating Example 1: Coffee #install.packages("pgmm") library(pgmm) data("coffee") head(coffee) ## Variety Country Water Bean Weight Extract Yield ph Value Free Acid ## 1 1 mexico 8.93999...
2390 sym R (18951 sym/83 pcs) 19 img
SLR Quakes
Earthquake Data The data set give the locations of 1000 seismic events of MB > 4.0. The events occurred in a cube near Fiji since 1964. lat = Latitude of event long = Longitude depth = Depth (km) mag = Richter magnitude stations = Number of statitions reporting Import the Data library(tidyverse) data(quakes) Step Zero: Relationships Between Va...
3391 sym R (3153 sym/41 pcs) 9 img
LMT: CheatSheet
1. Simple Linear Regression (SLR) Model Let \((x_1, Y_1), (x_2, Y_2), ..., (x_n, Y_n)\) be \(n\) bivariate data points observed from a simple linear model \[Y_i=\beta_0+\beta_1 x_i+\varepsilon_i, i=1, 2, ..., n\] 1.2 Normality Assumption \[\varepsilon_i \sim Normal(0, \sigma^2)\] Errors are in independent and identically distributed (iid) Norma...
14614 sym 2 tbl
Confidence vs Prediction Activity
How do we use a regression model for prediction? Once the model is fitted we can use it to make predictions on the value of the response variable \(Y\), for any given value of \(x=x_0\). For this example we will continue to use the {rocket} data. rocket<-read.csv("https://raw.githubusercontent.com/kitadasmalley/sp21_MATH376LMT/main/data/rocketPro...
1663 sym R (2262 sym/26 pcs) 1 img
LMT: Slope Inference
We will be using the Rocket dataset: A rocket motor is manufactured by bonding an igniter propellant and a sustainer propellant together inside a metal housing. The shear strength of the bond between the two types of propellants is an important quality characteristic. It is suspected that shear strength is related to the age in weeks of the batch...
2693 sym R (2818 sym/35 pcs) 1 img
Skills Solutions
PART 1: Data Wrangling library(tidyverse) 1) Make a new data set that has the average depth and price of the diamonds in the data set. avgPandD = diamonds %>% summarise(mean(price), mean(depth)) avgPandD ## # A tibble: 1 x 2 ## `mean(price)` `mean(depth)` ## <dbl> <dbl> ## 1 3933. 61.7 2) Add a new column to...
3513 sym R (6379 sym/47 pcs) 2 img
LMT: Variable Selection
Hald Cement Data Hald [1952] presented a dataset concerning the heat involved in calories per gram of cement (𝑌) as a function of the amount of each of the following four ingredients in the mixture: \(x_1\): tricalcium aluminate \(x_2\): tricalcium silicate \(x_3\): tetracalcium alumino ferrite \(x_4\): dicalcium silicate hald<-read.csv("htt...
1947 sym R (8171 sym/45 pcs)