Publications by Ryan Pugliano

Assignment 9 - Random Forest

18.04.2024

Objective Predict Credit Card Attrition using the CreditCardData, using the randomforest algorithm from the randomForest package. Data Prep Split the data (10127 observations), into training and validation datasets (70%/30% , 7056/3071) ## Variable Importance The total transaction count is the variable with the greatest importance in determining...

1108 sym 1 img

Assignment 8

11.04.2024

Objective Predict Credit Card Attrition using the CreditCardData, using the classification algorithm from the rpart package. Data Prep Split the data (10127 observations), into training and validation datasets (70%/30% , 7084/3043) The Tree Interpretations from the tree The most likely existing customer: Has a total transaction count greater t...

791 sym 2 img

Assignment 7 - Linear Models

04.04.2024

Data prep for modeling Data summary: The variables: store, and holiday flag are continuous, but should be categorical ## Store Date Weekly_Sales Holiday_Flag ## Min. : 1 Length:6435 Min. : 209986 Min. :0.00000 ## 1st Qu.:12 Class :character 1st Qu.: 553350 1st Qu.:0.00000 ## Median :23 ...

1254 sym

Assignment 6

14.03.2024

1. Is there an association between leaving the company and having a work accident? ## ## ## Cell Contents ## |-------------------------| ## | N | ## | Chi-square contribution | ## | N / Row Total | ## | N / Col Total | ## | N / Table Total | ## |-------------------------| ## ## ## Total Obs...

928 sym

Assignment 4 - Correlations

07.03.2024

Correlogram of the HR Data 1. Create a correlation matrix Correlation Interpretations The correlation between number_project and average_monthly_hours is a medium strength positive correlation where the more projects you have, the more hours you work. The correlation between number_project and satisfaction_level is a weak negative correlation wh...

548 sym 2 img

Assignment 5

29.02.2024

1. Is there a difference in satisfaction level between those that left and those that stayed in the company? ## ## Welch Two Sample t-test ## ## data: HR$satisfaction_level by HR$Employee_Status ## t = -46.636, df = 5167, p-value < 2.2e-16 ## alternative hypothesis: true difference in means between group Left and group Stayed is not equal to 0 ...

558 sym

T-Test & Box Plots

26.02.2024

t-test of MPG by Transmission ## ## Welch Two Sample t-test ## ## data: cars1$mpg by cars1$am ## t = -3.7671, df = 18.332, p-value = 0.001374 ## alternative hypothesis: true difference in means between group 0 and group 1 is not equal to 0 ## 95 percent confidence interval: ## -11.280194 -3.209684 ## sample estimates: ## mean in group 0 mean ...

97 sym