Publications by Chandler Brown

Data 101 Final Project

08.05.2023

The Dataset The dataset I chose for my final project is a dataset on almost 2 million car crashes that have occurred in New York City collected by NYC open data. (https://data.cityofnewyork.us/Public-Safety/Motor-Vehicle-Collisions-Crashes/h9gi-nx95) The categorical variable in this dataset are: -BOROUGH: the borough of NYC where the crash occur...

6368 sym R (8158 sym/32 pcs) 5 img

IC 11

25.04.2023

library(cluster) data("iris") iris_features <- iris[, c("Sepal.Length", "Sepal.Width", "Petal.Length", "Petal.Width")] k <- 3 kmeans_model <- kmeans(iris_features, centers = k) cluster_labels <- kmeans_model$cluster iris$Cluster <- cluster_labels print(iris) ## Sepal.Length Sepal.Width Petal.Length Petal.Width Species Cluster ## 1 ...

10 sym R (11880 sym/7 pcs)

Project 4

25.04.2023

The data set The data set I chose for this project consists of European Bike Store Sales from 2011-2016 collected by PrepInsta Technologies. The categorical data collected on each sale includes: -the date the sale occurred -the month the sale occurred in -the day the sale occurred -the year the sale occurred -the Age Group of the customer -the ...

3250 sym R (10999 sym/33 pcs) 3 img

IC 10

18.04.2023

Read in the CSV file library(tidyverse) ## ── Attaching packages ─────────────────────────────────────── tidyverse 1.3.2 ── ## ✔ ggplot2 3.4.0 ✔ purrr 1.0.1 ## ✔ tibble 3.2.1 ✔ dplyr 1.1.1 ## ✔ tidyr 1.3.0 ✔ stringr 1.5.0 ## ✔ ...

897 sym R (2651 sym/17 pcs) 2 img

Project 2

19.04.2023

Introduction The data I decided to work with in this project is a dataset of all games played in the MLB (Major League Baseball) from the years 1871-2016 collected by Retrosheet. This dataset contains a whopping 161 columns of data and 171,000 games recorded. This data contains stats about the teams that played in the games, split up by the vis...

6832 sym R (5212 sym/24 pcs) 4 img

GIS Tutorial

12.04.2023

Set various values needed, including names of files and FIPS codes for New Hampshire and South Carolina #nhdatafiile <- "NHD2016.xlsx" nhdatafilecsv <- "NHD2016.csv" usshapefile <- "cb_2014_us_county_5m/cb_2014_us_county_5m.shp" nhfipscode <- "33" scdatafile <- "SCGOP2016.csv" scfipscode <- "45" Load tmap, tmaptools, and leaflet packages int...

2120 sym R (9968 sym/85 pcs) 7 img

Project 3

10.04.2023

Loading in the data library(tidyverse) ## ── Attaching packages ─────────────────────────────────────── tidyverse 1.3.2 ── ## ✔ ggplot2 3.4.0 ✔ purrr 1.0.1 ## ✔ tibble 3.1.8 ✔ dplyr 1.1.0 ## ✔ tidyr 1.3.0 ✔ stringr 1.5.0 ## ✔ r...

2441 sym R (4542 sym/44 pcs) 6 img

Web Scraping Assignment

05.04.2023

#Loading the rvest package #install.packages("rtools") #install.packages("rvest") library(rvest) ## Warning: package 'rvest' was built under R version 4.2.3 #Specifying the url for desired website to be scraped url <- 'http://www.imdb.com/search/title?count=100&release_date=2016,2016&title_type=feature' #Reading the HTML code from the website web...

3334 sym R (10136 sym/99 pcs) 3 img

Quiz 3

03.04.2023

1. Create an R Markdown file, import the whaledata.csv file and assign it to a data frame called df. Review the data using any of the functions we have used to find the number of rows and columns, classes of the data columns and view a small subset of the data. df <- read.csv("whaledata.csv") dim(df) ## [1] 100 8 str(df) ## 'data.frame': 1...

2276 sym 1 img

Project Flags of the World

03.04.2023

Image by Gordon Johnson from Pixabay Data Import Download flag.csv and flag.names to your working directory. Make sure to set your working directory appropriately! Let’s look at some information about this file. Open flag.names in RStudio by double clicking it in the files pane in bottom left. Read through this file. Who is the donor of th...

4682 sym R (15398 sym/34 pcs) 1 img