Publications by Leo Yi & Christopher Bloome
Data624 Project 2
Group Members: Subhalaxmi Rout, Kenan Sooklall, Devin Teran, Christian Thieme, Leo Yi
Introduction: We have been given a dataset from a beverage manufacturing company that consists of 2,571 rows and 33 columns. The dataset contains information on different beverages and their chemical composition. The goal of this analysis is to use the 32 pr...
14256 sym R (12529 sym/21 pcs) 8 img 2 tbl
data622_final_draft
Libraries
library(stringr)
library(dplyr)
library(ggplot2)
library(tidyr)
library(gridExtra)
library(caret)
library(conflicted)
library(factoextra)
library(purrr)
library(scales)
# conflict_scout()
conflict_prefer('filter', 'dplyr')
Import
# CSV FILES DOWNLOADED FROM
# https://www.kaggle.com/zalando-research/fashionmnist
file_pat...
5561 sym R (22496 sym/23 pcs) 5 img
pave_tha
Import Data
To begin, let’s import the data and take a quick look at what we’re working with.
file_path <- 'C:/Downloads/pave.dev tha/data-analyst/data.parquet'
df <- read_parquet(file_path)
glimpse(df)
## Rows: 4,139
## Columns: 8
## $ transaction_id <chr> "tx_793434", "tx_2323260", "tx_792384", "tx_79~
## $ use...
8110 sym R (17758 sym/48 pcs) 11 img
Data622 HW3
Loan Approval Status
We will be working with a dataset of loan approval status information. The task is to develop models that predict loan approval status from the given feature variables. After a preliminary exploratory data analysis, we will fit Linear Discriminant, K-Nearest Neighbors, Decision Tree, and Random Forest models to a subset of the ...
1208 sym R (9575 sym/28 pcs) 4 img
Data622 HW4
Introduction
We’ll be working with a mental health dataset, applying exploratory data analysis, unsupervised clustering, principal component analysis, gradient boosting, and support vector machines.
Import Data
To begin, the following code loads the libraries and imports the data:
library(stringr)
library(tidyr)
library(dplyr...
5447 sym R (37373 sym/32 pcs) 5 img