Publications by Dr. Chelsey Hill

Deep Learning for Text with Covariates

29.10.2020

Introduction In real-world data analysis, you will want to analyze both text and non-text data to classify observations. To do this, you will use a multi-input neural network. Preliminary library(tm) # text mining library(caret) # classification library(keras) # deep learning library(tensorflow) # deep learning Cleaning...

2200 sym R (13195 sym/30 pcs) 2 img

Association Analysis

01.02.2021

Preliminary We will use the arules package in the lesson that follows. install.packages("arules") Next, we load the arules package for use. library(arules) In the lesson that follows, we use the Groceries data set from the arules package, which contains 1 month (30 days) of real-world point-of-sale transaction data from a typical local grocery ou...

5525 sym R (22182 sym/47 pcs) 3 img

Data Quality

18.01.2021

Preliminary We will use the DescTools and caret packages to address quality issues with our data. Both packages have very useful and comprehensive vignettes which can be accessed through the help documentation in RStudio. We load the packages for use in the R session using the library() function. library(DescTools) library(caret) In the lesson t...

6504 sym R (9309 sym/37 pcs) 1 img

Introduction to RStudio: Vectors & Dataframes

08.01.2021

Vectors Creating Vectors To create a vector, we concatenate (combine) values using the c() function. First, let’s create some objects (named variables). x <- 2 z <- 4 We can combine the x and z variables to create a vector named b. b <- c(x, z) b ## [1] 2 4 We can add the values 5 and 8 to the end of the b vector (overwriting the orig...

3486 sym R (4779 sym/73 pcs)

Introduction to RStudio: The Basics

08.01.2021

Introduction R is an integrated suite of open-source software facilities for data manipulation, calculation and graphical display. RStudio is an integrated development environment (IDE) that allows us to interact with R more easily. Below is an example of an RStudio session. Example RStudio Session RStudio has a user-friendly interface...

3560 sym R (824 sym/67 pcs) 1 img

Introduction to RStudio: Working Directory, Import & Export

08.01.2021

The Working Directory The working directory is the folder on your local computer that you would like to interact with (read and write to) during your RStudio session. The getwd() function can be used to obtain your current working directory. getwd() ## [1] "C:/Users/chh35/OneDrive - Drexel University/Teaching/Drexel/STAT 331/Course Content/Week 1...

3437 sym R (2590 sym/26 pcs)

Introduction to RStudio: Plotting & Packages

08.01.2021

Plotting: The Basics The plot() function can be used to create basic plots in R. Plots generated in RStudio will display in the ‘Plots’ panel. The panel can be enlarged or shrunk to change the plot size. The type of plot that is produced will depend on the type of data provided to the function. In the exa...

2505 sym R (1370 sym/14 pcs) 4 img

Data Preprocessing

18.01.2021

Preliminary We will use the DescTools and caret packages for preprocessing our data. If you do not already have the caret package installed, you will first install the package using the install.packages() function. Both packages have very useful and comprehensive vignettes which can be accessed through the help documentation in RStudio. install.p...

5648 sym R (2370 sym/25 pcs) 3 img

Data: Types, Exploration & Visualization

19.01.2021

Preliminary We will use the DescTools package to explore and work with our data. If you do not already have this package installed, you will first install it using the install.packages() function. The package has a very useful and comprehensive vignette which can be accessed through the help documentation in RStudio. install.packages(...

11787 sym R (13816 sym/60 pcs) 14 img

k-Means Cluster Analysis (kMC)

25.01.2021

Preliminary We will use the DescTools and caret packages, which were previously installed. In addition, we use the ggplot2 and factoextra packages for plotting, as well as the cluster and fpc packages. Next, we load all of the necessary libraries for use in the session. library(caret) library(DescTools) library(ggplot2) library(cluster) library(factoe...

3276 sym R (4467 sym/14 pcs) 2 img