Publications by Josh Iden

DATA622 HW4

06.12.2023

DATA 622 HW4 Introduction Exploratory Data Analysis Data Prep Helper Functions Modeling SVM Results Josh Iden 2023-12-06 Introduction This assignment examines a Kaggle dataset combining data from the UCI Machine Learning Repository containing patient information and whether or n...

5905 sym 4 img 1 tbl

DATA698 Capstone EDA

05.12.2023

Capstone EDA The Data Exploratory Data Analysis Modeling Josh Iden 2023-12-05 The Data library(tidyverse) library(caret) library(DMwR2) library(e1071) library(ipred) library(xgboost) library(kableExtra) library(ggthemes) library(egg) library(forcats) library(summarytools) library(forecast...

3916 sym 5 img 14 tbl

DATA698 Capstone Slides

30.11.2023

DATA 698 Final Presentation Josh Iden Measuring “Star Power”: Predicting Movie Box Office Revenue Based on Directors’ and Leading Actors’ Recent Success Film Industry Statistics 42.5B USD to the US Economy (as of 2022) 2.5 million jobs (Zane, 2023) 36% of movies make profit (Lash, Zhao, 2016) Avg. tenure of executives: 1940s: 20 year...

3004 sym 14 img

DATA622 HW3

29.11.2023

DATA 622 HW3 Introduction The Data Data Prep Data Modeling Random Forest SVM Comparison Josh Iden 2023-11-29 Introduction This assignment compares the accuracy of the Support Vector Machines and Random Forests on a previously used dataset from https://excelbianalytics.com/wp/downl...

6436 sym 7 img 1 tbl

Capstone Modeling

20.11.2023

Capstone Modeling Loading Data Helper Functions Calculating Metrics Column IQR Factorizer Box-Cox Transformer Generating Dummies Data Splitting Bagged Modeling Random Forest Modeling XGBoost Modeling Cubist Modeling Modeling Bagged Trees Random ...

5479 sym 1 tbl

DATA622 HW2

20.10.2023

DATA 622 HW2 Introduction The Data Data Prep Data Modeling Decision Trees Random Forest Discussion Josh Iden 2023-10-20 Introduction This assignment creates two decision trees from a single dataset from https://excelbianalytics.com/wp/downloads-18-sample-csv-files-data-sets-...

6465 sym 4 img 1 tbl

DATA622 HW1

05.10.2023

DATA 622 HW1 Introduction The Data Overview Comparing the Data Model Selection Feature Selection. Data Modeling Selecting a Model Results Discussion / Essay Josh Iden 2023-10-05 Introduction This assignment explores and compares the structure and contents of two datasets from ...

9191 sym 22 img 6 tbl

DATA624 Project 2

14.07.2023

DATA 624 Project 2 Project Requirements The Data Exploratory Data Analysis Data Prep Data Modeling Selecting a Model Model Metrics Josh Iden 2023-07-14 library(tidyverse) library(readxl) library(caret) library(e1071) library(DMwR2) library(pls) library(elasticnet) libra...

5392 sym 9 img 6 tbl

DATA624 PROJECT 2

10.07.2023

DATA 624 Project 2 Project Requirements The Data Exploratory Data Analysis Data Prep Data Modeling Selecting a Model Model Metrics Josh Iden 2023-07-10 library(tidyverse) library(readxl) library(caret) library(e1071) library(DMwR2) library(pls) library(elasticnet) libra...

4388 sym 6 img 5 tbl

DATA624 Project 2

06.07.2023

DATA 624 Project 2 Project Requirements The Data Exploratory Data Analysis Josh Iden 2023-07-06 library(tidyverse) library(readxl) library(XLConnect) library(kableExtra) Project Requirements You are given a simple data set from a beverage manufacturing company. It consists of 2,571...

1807 sym 2 img 4 tbl