Publications by Glen Dale Davis
Data 606 - Final Project - Research
Load Required Packages: Below, we load the packages required for data analysis and visualization. library(plyr) library(tidyverse) library(knitr) library(DT) library(RColorBrewer) Load Data: We load the data we collected from NYC OpenData regarding park-related 311 service requests and Parks Inspection Program ratings. We will be limiting o...
6061 sym R (14534 sym/24 pcs) 4 img
Data 622 - Homework 4 - Final Project
Packages library(caret) library(DataExplorer) library(e1071) library(ggbiplot) library(ggcorrplot) library(knitr) library(psych) library(randomForest) library(RColorBrewer) library(snakecase) library(tidyverse) Introduction We load a dataset of red wine quality. We will be using this dataset to compare the performance of a couple diffe...
8782 sym R (24905 sym/26 pcs) 6 img 12 tbl
Classification of Recipe Review Text According to Problems Identified by Reviewers
Abstract Home cooks of all skill levels frequent AllRecipes.com for meal inspiration, but the quality of user-submitted recipe content on the site varies widely. We have identified seven kinds of non-exclusive problems (related to missing/misleading steps, incorrect measurements, incorrect cooking/preparation time, incorrect temperature, a key/...
36482 sym R (49516 sym/1 pcs) 6 img 10 tbl
Data 622 - Homework 3
Packages library(caret) library(DataExplorer) library(e1071) library(ggcorrplot) library(knitr) library(MASS) select <- dplyr::select library(png) library(psych) library(RColorBrewer) library(tidyverse) Introduction We load the dataset of Web sites labeled either Phishing or Legitimate that we used in Homework 2. As a reminder, below ...
7505 sym R (9233 sym/17 pcs) 5 img 7 tbl
Data 622 - Homework 2
Packages library(caret) library(DataExplorer) library(ggcorrplot) library(knitr) library(MASS) select <- dplyr::select library(tidyverse) library(RColorBrewer) library(randomForest) library(rpart) library(rpart.plot) library(psych) Introduction We load a dataset of Web sites labeled either Phishing or Legitimate. Below are the first 1...
9056 sym R (10330 sym/21 pcs) 6 img 9 tbl
Data 622 - Homework 1
Packages library(caret) library(DataExplorer) library(ggraph) library(igraph) library(knitr) library(Matrix) library(naivebayes) library(naniar) library(RColorBrewer) library(scales) library(stopwords) library(tidytext) library(tidyverse) library(widyr) library(xgboost) Introduction We load two labeled text datasets of very different...
6733 sym R (18675 sym/37 pcs) 4 img 3 tbl
Data 622 - Homework1
Packages library(caret) library(DataExplorer) library(knitr) library(Matrix) library(naivebayes) library(naniar) library(RColorBrewer) library(scales) library(stopwords) library(tidytext) library(tidyverse) library(xgboost) Introduction We load two labeled text datasets of very different sizes, which we will use to train two different ...
5949 sym R (17586 sym/35 pcs) 2 img 3 tbl
Data 624 - Project 2
Packages: library(tidyverse) library(httr) library(readxl) library(DataExplorer) library(psych) library(knitr) library(snakecase) library(RColorBrewer) library(VIM) library(ggcorrplot) library(caret) library(randomForest) library(cowplot) library(car) library(MASS) select <- dplyr::select library(earth) library(rminer) library(wri...
16079 sym R (50473 sym/73 pcs) 22 img 21 tbl
Data 606 - Final Project - Research
Load Required Packages: Below, we load the packages required for data analysis and visualization. library(plyr) library(tidyverse) library(knitr) library(DT) library(RColorBrewer) Load Data: We load the data we collected from NYC OpenData regarding park-related 311 service requests and Parks Inspection Program ratings. We will be limiting o...
6061 sym R (14794 sym/24 pcs) 4 img
DATA621 - HW4
Homework 4 - Binary Logistic Regression & Multiple Linear Regression Introduction: We load an auto insurance company dataset containing 8,161 records. Each record represents a customer, and each record has two response variables: TARGET_FLAG and TARGET_AMT. Below is a short description of all the variables of interest in the data set, includi...
30394 sym R (67403 sym/16 pcs) 23 img 15 tbl