Publications by Keeno Glanville
Cardata_Multiple Regression
Determine if the average price of a house in NY is decreasing or increasing compared to one year ago. Will involve turning the months/years into a long format. Load data urldat <- "https://raw.githubusercontent.com/kglan/MSDS/main/DATA605/Regression%20Analysis/CarPrice_Assignment.csv" zill<- read_csv(url(urldat)) ## Rows: 205 Columns: 26 ## �...
572 sym 8 img
DATA605 Regression Analysis
View dataset head(cars) ## speed dist ## 1 4 2 ## 2 4 10 ## 3 7 4 ## 4 7 22 ## 5 8 16 ## 6 9 10 colnames(cars) ## [1] "speed" "dist" Visualize Data plot(cars[,"speed"], cars[,"dist"], main="Speed vs Stopping Distance", xlab="Speed", ylab="Stopping Distance") Model Creation -17.579 + 3.932*speed model...
125 sym 5 img
Discussion11_RegressionAnalysis
Will be going with a dataset I had analyzed back in DATA607 before I learnt of regressions View Dataset on Gundata head(gundata_clean) ## # A tibble: 6 × 12 ## year Crime/…¹ Murde…² Robbe…³ Priso…⁴ %Blac…⁵ %Whit…⁶ Popul…⁷ Incom…⁸ Densi…⁹ ## <dbl> <dbl> <dbl> <dbl> <dbl> <dbl> <dbl> <dbl> ...
591 sym 5 img
DATA605 Week7 Imp. Distributions
1 We must find the difference between the two probabilities P(Y=y) = P(Y≤y) - P(Y≤y-1) = [(k-y+1)/k]^n - [(k-y)/k]^n Thus: P(Y=y) = [(k-y+1)^n - (k-y)^n]/k^n 2a cat("Prob not failing 8 yrs geometric=", (9/10)^8, "\n", "Expected=", 10, "\n", "SD",sqrt((1-1/10)/(1/10)^2) ) ## Prob not failing 8 yrs geometric= 0.4304672 ## Expected= ...
190 sym
Data605 Probability Distributions
I tried to do all by hand as much as possible by hand and show the proper R solution. However as the distributions changed so did the complexity. 1 (Bayesian). A new test for multinucleoside-resistant (MNR) human immunodeficiency virus type 1 (HIV-1) variants was recently developed. The test maintains 96% sensitivity, meaning that, for those w...
5493 sym
Linear Transformatons & Representation DATA605 HW4 Eigenshoes
Load Dependencies rm(list = ls()) library(jpeg) ## Warning: package 'jpeg' was built under R version 4.2.2 library(OpenImageR) ## Warning: package 'OpenImageR' was built under R version 4.2.2 library(Matrix) Check the dimensions of our first image. files <- "jpg/" img <- readJPEG(paste0(files,"/RC_2500x1200_2014_us_53446.jpg")) dim(img) ## [1]...
1112 sym R (3327 sym/22 pcs) 6 img
DATA605 Eigenvalues and Eigenvectors
Problem Set 1 1 What is the rank of the matrix A? A<- matrix(c(1,2,3,4,-1,0,1,3,0,1,-2,1,5,4,-2,-3), nrow = 4, ncol=4, byrow = TRUE) A ## [,1] [,2] [,3] [,4] ## [1,] 1 2 3 4 ## [2,] -1 0 1 3 ## [3,] 0 1 -2 1 ## [4,] 5 4 -2 -3 # The rank of the matrix is determined by the number of number of ind...
1091 sym
DATA605 Week 1, Vectors / Matrices / Systems of Equation
library("gifski") ## Warning: package 'gifski' was built under R version 4.2.2 library("gganimate") ## Warning: package 'gganimate' was built under R version 4.2.2 ## Loading required package: ggplot2 ## Warning: package 'ggplot2' was built under R version 4.2.2 Initials KG x=c(rep(-1,800),seq(0,-1,length.out=800), seq(0, -1, length.out = 800), s...
233 sym R (1739 sym/13 pcs) 81 img
DATA606 Lab1
The RStudio Interface The goal of this lab is to introduce you to R and RStudio, which you’ll be using throughout the course both to learn the statistical concepts discussed in the course and to analyze real data and come to informed conclusions. To clarify which is which: R is the name of the programming language itself and RStudio is a co...
17534 sym 7 img
DATA607 Assignment 1: Subsetting Data
Introduction I am opting to go with the FIFA data set. https://fivethirtyeight.com/features/how-to-break-fifa/ I chose this data set as it gives insightful info as to the participation of countries to the FIFA world cup viewership. This is insightful as it demonstrates the strength of viewership in each area and could be used to group FIFA re...
1509 sym