Publications by Ian VanWright

STA490_GroupProject2

18.05.2024

class: center, middle, inverse, title-slide .title[ # Comparing Potential Sampling Plans. ] .author[ ### Kyle Weber and Ian Vanwright ] .institute[ ### West Chester University of Pennsylvania ] .date[ ### Prepared for STA490: Data Visualization Slides available at: https://rpubs.com/KW986324 AND https://github.com/Kyle-Weber/STA...

6949 sym

STA490_Week6Homework

17.05.2024

For this report, I will be analyzing bank loan default data. The goal of this analysis is to clean up this data set and do some exploratory data analysis. First, we load all of the data. This data set is split into 9 different files. loan01 <- read.csv("https://pengdsci.github.io/datasets/SBAloan/w06-SBAnational01.csv", header = TRUE)[, -1] loa...

2150 sym 1 tbl

STA490_Week3Homework

10.05.2024

class: center, middle, inverse, title-slide .title[ # Multiple Linear Regression ] .subtitle[ ## Model Building and Analysis of Flight Delay Times ] .author[ ### Ian Vanwright ] .institute[ ### West Chester University of Pennsylvania ] .date[ ### Prepared for STA490: Data Visualization ] --- class: top, center # Tabl...

5914 sym

STA490_Week6Homework

10.05.2024

For this report, I will be analyzing bank loan default data. The goal of this analysis is to clean up this data set and do some exploratory data analysis. First, we load all of the data. This data set is split into 9 different files. loan01 <- read.csv("https://pengdsci.github.io/datasets/SBAloan/w06-SBAnational01.csv", header = TRUE)[, -1] loa...

1417 sym 1 tbl

STA490_Week7Homework

10.05.2024

For this report, i will again be analyzing Bank Loan Default Data. The goal of this analysis is to demonstrate 4 types of sampling: Simplified Random, Stratified Random, Systematic Random, and cluster sampling. Again, the data is still split into 9 seperate files. I will combine all of them into 1 data set. loan01 <- read.csv("https://pengdsci....

16678 sym 4 tbl

STA490_Week11And12Homework

10.05.2024

For this report, I will be analyzing survey data about student satisfaction and engagement. The goal for this analysis is to measure validity in terms of work load. survey_raw <- read.csv("at-risk-survey-data.csv", header = TRUE) nrow(survey_raw) ## [1] 664 1 Cleaning and Extracting Data 1.1 Cleaning the Data This data set originally has 332...

2605 sym 2 img 1 tbl

STA321_Week13Homework

16.12.2023

1 Introduction For this report, I will be analayzing Monthly Natural Gas Prices. Specifically I will be analyzing gas prices from January 2008 to August of 2020 To do this, I will build a time series with the data and then build and then compare various smoothing models. 2 Time Series Analysis 2.1 Training and Testing Data To start, I will s...

1704 sym 1 img 3 tbl

STA321 Week 3 Homework

16.12.2023

1 Introduction For this report, I will be using a data set consisting of delayed flight times of various airlines. In this data set, there are 11 variables and 3593 observations. The variables are: Carrier (Categorical) - The airline that the flight is taken with Airport_Distance (Numeric) - The Distance between the airports in miles Number_of...

4663 sym 4 img 3 tbl

STA321_Week5Homework

16.12.2023

1 Introduction For this report, I will be using a data set consisting of delayed flight times of various airlines. In this data set, there are 11 variables and 3593 observations. The variables are: Carrier (Categorical) - The airline that the flight is taken with Airport_Distance (Numeric) - The Distance between the airports in miles Number_of...

4957 sym Python (7954 sym/14 pcs) 5 img 7 tbl

STA321_Week6Homework

16.12.2023

1 Introduction For this report, I will be analyzing banks loans and whether or not a loan has been defaulted on. I will be using a data set that has documented 1000 loans and 16 variables. The variables in this data set are as follows: Checking_amount - Numeric Term (in months) - Numeric Credit_score - Numeric Gender - Categorical Marital_status...

3153 sym Python (2264 sym/8 pcs) 2 img 2 tbl