Publications by Banu Boopalan

BLOG4

19.12.2024

BLOG4: Multiple Linear Regression Multiple Regression summary using R Here, we have the diamonds dataset and we have ran through the multiple linear regression model. The 2nd model we have reduced clarity to see how the model performs.The R squared reduced when we removed a significant variable such as clarity from the original FULL model. Then we...

701 sym 1 img

BLOG5

18.12.2024

BLOG5 Non-Parametric regression Dataset: Swiss Here, I have ran through code similar to this blog to see how we can model fertility dependent on catholic value. There is a blog related here using ksmooth function. “http://users.stat.umn.edu/~helwig/notes/smooth-notes.html” “Nonparametric regression, like linear regression, estimates mean out...

801 sym 11 img

DATA 621 Homework 5

15.12.2024

Table of contents Running Code Overview DATA EXPLORATION DATA PREPARATION BUILD MODELS SELECT MODELS Make Predictions on the Test Evaluation Data set DATA 621 Homework 5, Group : Banu Boopalan, Gregg Maloy, Alexander Moyse, Umais Siddiqui CodeShow All CodeHide All CodeView Source Running Code Code knitr::opts_chunk$set(echo = TRUE, warning = FA...

5387 sym R (92766 sym/165 pcs) 43 img 10 tbl

DATA 624 Homework 9

18.11.2024

Table of contents Running Code DATA 624 Homework9 CodeShow All CodeHide All CodeView Source Running Code Code knitr::opts_chunk$set(echo = TRUE, warning = FALSE, error = FALSE) Do problems 8.1, 8.2, 8.3, and 8.7 in Kuhn and Johnson. 8.1. Recreate the simulated data from Exercise 7.2: Code set.seed(200) simulated <- mlbench.friedman1(200, sd...

7463 sym R (73601 sym/33 pcs) 2 img 7 tbl

DATA 624 Homework 8

11.11.2024

Table of contents Running Code DATA 624 Homework8 CodeShow All CodeHide All CodeView Source Running Code Code knitr::opts_chunk$set(echo = TRUE, warning = FALSE, error = FALSE) Do problems 7.2 and 7.5 in Kuhn and Johnson. There are only two but they have many parts. Please submit both a link to your Rpubs and the .rmd file. 7.2. Friedman (199...

3343 sym R (29113 sym/46 pcs) 11 img 4 tbl

DATA 624 Homework 7

04.11.2024

Table of contents Running Code DATA 624 Homework7 CodeShow All CodeHide All CodeView Source Running Code Code knitr::opts_chunk$set(echo = TRUE, warning = FALSE, error = FALSE) In Kuhn and Johnson do problems 6.2 and 6.3. There are only two but they consist of many parts. Please submit a link to your Rpubs and submit the .rmd file as well. 6.2...

5337 sym R (49359 sym/49 pcs) 8 img

DATA 624 Project 1

28.10.2024

Table of contents Project Deliverable PART A Forecasting Analysis Data Exploratory Analysis Review missing variables Review distributions Impute missing values TimeSeries Models Part B – Forecasting Power, ResidentialCustomerForecastLoad-624.xlsx Data Exploration & Plots Data Distributions Data Imputation Models Part C – Bonus Check for...

11993 sym Python (81726 sym/199 pcs) 85 img 26 tbl

DATA 624 Homework6 ARIMA

21.10.2024

Table of contents Running Code DATA 624 Homework6 CodeShow All CodeHide All CodeView Source Running Code Code knitr::opts_chunk$set(echo = TRUE, warning = FALSE, error = FALSE) Do exercises 9.1, 9.2, 9.3, 9.5, 9.6, 9.7, 9.8 in Hyndman. 9.1 Figure 9.32 shows the ACFs for 36 random numbers, 360 random numbers and 1,000 random numbers. Explai...

7859 sym R (30194 sym/57 pcs) 33 img

DATA 622 - Assignment1

19.10.2024

Table of contents Project Deliverable Essay Large Data Set (End to End ML Analysis) Data set Introduction Data Exploration & Plots Correlation Analysis Data Preparation (Imputation and Feature Engineering) Algorithm Selection/Build Models Decision Tree Model and metrics review Oversampling and fixing class imbalance and rerunning decision tree ...

20778 sym Python (120383 sym/174 pcs) 46 img 7 tbl

DATA 621 BLOG1

12.10.2024

BLOG1: Since I used to code in Base SAS, I wanted to connect to SAS online and use the commands via a tutorial to model logistic regression in SAS. It was extremely easy and I found the interface very easy to use and the output via tabular format on various statistics values was very easy to read and follow. Tutorial - https://stats.oarc.ucla...

437 sym