Publications by Anthony Conrardy

DATA 622-Assignment #3

13.04.2025

Assignment Perform an analysis of the dataset used in Homework #2 using the SVM algorithm. Compare the results with the results from previous homework. ## education marital housing contact duration month age balance campaign pdays ## 1 tertiary married yes unknown 261 may 58 2143 1 -1 ## 2 secondary single ...

10016 sym

Assignment #3 R Code

13.04.2025

Assignment #3 R Code # Categorical to Factors categorical_vars <- c('job', 'marital', 'education', 'default', 'housing', 'loan', 'contact', 'month', 'poutcome', 'y') df[categorical_vars] <- lapply(df[categorical_vars], as.factor) # Feature Engineering df$previously_contacted <- ifelse(df$pdays == -1, "no", "yes") df$pr...

43 sym Python (9673 sym/17 pcs)

Story-3: Obesity and Healthcare Spending

07.03.2025

Instructions: Assignment: Investigating the Relationship Between Healthcare Spending and Adult Obesity Public health outcomes often reflect differences in policy priorities and resource allocation. This assignment uses state-level adult obesity prevalence data to explore whether differences in healthcare spending correlate with obesity rates. ...

3296 sym 1 img

Exploratory Data Analysis-Assignment #1 DATA 622

01.03.2025

Dataset A Portuguese bank conducted a marketing campaign (phone calls) to predict if a client will subscribe to a term deposit The records of their efforts are available in the form of a dataset. The objective here is to apply machine learning techniques to analyze the dataset and figure out most effective tactics that will help the bank in nex...

8004 sym 24 img

Homework #3-DATA 605

07.12.2024

Problem 1-Transportation Safety Scenario: You are a data analyst at a transportation safety organization. Your task is to analyze the relationship between the speed of cars and their stopping distance using the built-in R dataset cars. This analysis will help in understanding how speed affects the stopping distance, which is crucial for improvi...

42275 sym Python (9979 sym/47 pcs) 18 img

Homework #2-Data 605

06.11.2024

Problem 1 (Bayesian): A new credit scoring system has been developed to predict the likelihood of loan defaults. The system has a 90% sensitivity, meaning that it correctly identifies 90% of those who will default on their loans. It also has a 95% specificity, meaning that it correctly identifies 95% of those who will not default. The default r...

16849 sym Python (15284 sym/97 pcs) 1 img

Document

15.10.2024

Question #1 Customer monthly movement between your product and competitors is represented by the following monthly market transition matrix. Assume you and your two competitors have 100 customers each on the first month. About how many customers would you expect to have at the beginning of the third month. You are represented by the first row, ...

6124 sym

TLX-Analysis

11.07.2024

Read in Excel File with Task Load Index (TLX) Data Read in the Excel file and change column names for easier analysis. Create a Team Size measure. tlx_df <- read_excel("D:/Documents/R_Working_Directory/TLX Data Analysis/TLI Data(working).xlsx") colnames(tlx_df)[colnames(tlx_df)=="Number"] <- "Team" tlx_df <- tlx_df |> group_by(Team, Scenario) |>...

968 sym 20 img

Final Project Part I

06.05.2024

Introduction While the intent of this project was to link two very disparate data sources, it turned out to be much more complicated than expected. The data sources used for this project came from the student clinical evaluation Jotform data source located as a report on their site, and from a manual extraction of the testing data from the Plat...

3645 sym

Final Project Part II

06.05.2024

Introduction In Part II of the this final project, we will import the sterilized data sets from Part I that are located on the GitHub repository in the locations below. We will then merge the data sets into a unified set containing the required elements for further investigation and analysis. We will rename the variables to be something easier ...

4933 sym Python (7317 sym/13 pcs) 3 img