Publications by Glen Dale Davis

Data 606 - Final Project - Research

03.06.2024

Load Required Packages: Below, we load the packages required for data analysis and visualization. library(plyr) library(tidyverse) library(knitr) library(DT) library(RColorBrewer) Load Data: We load the data we collected from NYC OpenData regarding park-related 311 service requests and Parks Inspection Program ratings. We will be limiting o...

6061 sym R (14534 sym/24 pcs) 4 img

Data 622 - Homework 4 - Final Project

22.05.2024

Packages library(caret) library(DataExplorer) library(e1071) library(ggbiplot) library(ggcorrplot) library(knitr) library(psych) library(randomForest) library(RColorBrewer) library(snakecase) library(tidyverse) Introduction We load a dataset of red wine quality. We will be using this dataset to compare the performance of a couple diffe...

8782 sym R (24905 sym/26 pcs) 6 img 12 tbl

Classification of Recipe Review Text According to Problems Identified by Reviewers

15.05.2024

Abstract Home cooks of all skill levels frequent AllRecipes.com for meal inspiration, but the quality of user-submitted recipe content on the site varies widely. We have identified seven kinds of non-exclusive problems (related to missing/misleading steps, incorrect measurements, incorrect cooking/preparation time, incorrect temperature, a key/...

36482 sym R (49516 sym/1 pcs) 6 img 10 tbl

Data 622 - Homework 3

06.05.2024

Packages library(caret) library(DataExplorer) library(e1071) library(ggcorrplot) library(knitr) library(MASS) select <- dplyr::select library(png) library(psych) library(RColorBrewer) library(tidyverse) Introduction We load the dataset of Web sites labeled either Phishing or Legitimate that we used in Homework 2. As a reminder, below ...

7505 sym R (9233 sym/17 pcs) 5 img 7 tbl

Data 622 - Homework 2

31.03.2024

Packages library(caret) library(DataExplorer) library(ggcorrplot) library(knitr) library(MASS) select <- dplyr::select library(tidyverse) library(RColorBrewer) library(randomForest) library(rpart) library(rpart.plot) library(psych) Introduction We load a dataset of Web sites labeled either Phishing or Legitimate. Below are the first 1...

9056 sym R (10330 sym/21 pcs) 6 img 9 tbl

Data 622 - Homework 1

10.03.2024

Packages library(caret) library(DataExplorer) library(ggraph) library(igraph) library(knitr) library(Matrix) library(naivebayes) library(naniar) library(RColorBrewer) library(scales) library(stopwords) library(tidytext) library(tidyverse) library(widyr) library(xgboost) Introduction We load two labeled text datasets of very different...

6733 sym R (18675 sym/37 pcs) 4 img 3 tbl

Data 622 - Homework1

10.03.2024

Packages library(caret) library(DataExplorer) library(knitr) library(Matrix) library(naivebayes) library(naniar) library(RColorBrewer) library(scales) library(stopwords) library(tidytext) library(tidyverse) library(xgboost) Introduction We load two labeled text datasets of very different sizes, which we will use to train two different ...

5949 sym R (17586 sym/35 pcs) 2 img 3 tbl

Data 624 - Project 2

26.02.2024

Packages: library(tidyverse) library(httr) library(readxl) library(DataExplorer) library(psych) library(knitr) library(snakecase) library(RColorBrewer) library(VIM) library(ggcorrplot) library(caret) library(randomForest) library(cowplot) library(car) library(MASS) select <- dplyr::select library(earth) library(rminer) library(wri...

16079 sym R (50473 sym/73 pcs) 22 img 21 tbl

Data 606 - Final Project - Research

26.02.2024

Load Required Packages: Below, we load the packages required for data analysis and visualization. library(plyr) library(tidyverse) library(knitr) library(DT) library(RColorBrewer) Load Data: We load the data we collected from NYC OpenData regarding park-related 311 service requests and Parks Inspection Program ratings. We will be limiting o...

6061 sym R (14794 sym/24 pcs) 4 img

DATA621 - HW4

03.12.2023

Homework 4 - Binary Logistic Regression & Multiple Linear Regression Introduction: We load an auto insurance company dataset containing 8,161 records. Each record represents a customer, and each record has two response variables: TARGET_FLAG and TARGET_AMT. Below is a short description of all the variables of interest in the data set, includi...

30394 sym R (67403 sym/16 pcs) 23 img 15 tbl