Publications by Jack Wright
Document
Abstract This project originated when a research team led by Dr. Colette Daiute at the CUNY Graduate Center requested an upgrade to their Interactive Digital Narrative (IDN) Complexity Metric. The purpose of this project is to create a robust IDN Tree Complexity Metric that accurately describes the shape of an IDN Narrat...
32300 sym R (5382 sym/11 pcs) 22 img
Modeling Doc
Methodology Since the response variable is continuous, we will use a series of regression models as candidates. We will train a random forest and an XGBoost model because both have historically proven to be excellent at modeling continuous variables. Our EDA pointed us toward some potential clustering in the principal components, so we will train a radia...
1569 sym R (11051 sym/95 pcs) 2 img
Tree Model Homework
8.1, 8.2, 8.3, 8.7 8.1 Recreate simulated data. library(mlbench) library(tidymodels) cores<-parallel::detectCores() set.seed(200) simulated <- mlbench.friedman1(200, sd = 1) simulated <- cbind(simulated$x, simulated$y) simulated <- as.data.frame(simulated) colnames(simulated)[ncol(simulated)] <- "y" sim_split<-initial_split(simulated) sim_...
5407 sym R (17031 sym/50 pcs) 3 img
twitter modeling
Load Data library(tidyverse) twitter_sentiment<-read_csv('./data/twitter_training.csv', col_names = FALSE) #twitter_sentiment<-sample_n(twitter_sentiment, size = 10000) #add column names colnames(twitter_sentiment)<-c('tweet_id', 'entity','sentiment', 'content') Missing Data Let's check our missing data library(naniar) vis_miss(twitter_se...
2241 sym R (7820 sym/32 pcs) 8 img
regression homework
Regression Homework Jack Wright Exercises 6.1, 6.2 6.1 Spectroscopy is used to analyze the chemical makeup of a sample material. A Tecator Infratec Food and Feed Analyzer was used on 215 samples of meat across 100 frequencies. This was used to determine the percent content of water, fat, and protein of each sample. Establish a predictive relationship be...
3401 sym R (14201 sym/59 pcs) 7 img
data and analysis
Overview of the Data Collection Process The data for the Narrative Tree Complexity Metric was collected through a web app built with Flask, a Python web application framework. It is hosted on the Heroku cloud application platform and can be found here: https://ctree-postgres.herokuapp.com/ . A group of research participants were selected for their inv...
11449 sym R (2582 sym/6 pcs) 19 img