Publications by Banu Boopalan
BLOG4
BLOG4: Multiple Linear Regression Multiple Regression summary using R Here, we have the diamonds dataset and we have ran through the multiple linear regression model. The 2nd model we have reduced clarity to see how the model performs.The R squared reduced when we removed a significant variable such as clarity from the original FULL model. Then we...
701 sym 1 img
BLOG5
BLOG5 Non-Parametric regression Dataset: Swiss Here, I have ran through code similar to this blog to see how we can model fertility dependent on catholic value. There is a blog related here using ksmooth function. “http://users.stat.umn.edu/~helwig/notes/smooth-notes.html” “Nonparametric regression, like linear regression, estimates mean out...
801 sym 11 img
DATA 621 Homework 5
Table of contents Running Code Overview DATA EXPLORATION DATA PREPARATION BUILD MODELS SELECT MODELS Make Predictions on the Test Evaluation Data set DATA 621 Homework 5, Group : Banu Boopalan, Gregg Maloy, Alexander Moyse, Umais Siddiqui CodeShow All CodeHide All CodeView Source Running Code Code knitr::opts_chunk$set(echo = TRUE, warning = FA...
5387 sym R (92766 sym/165 pcs) 43 img 10 tbl
DATA 624 Homework 9
Table of contents Running Code DATA 624 Homework9 CodeShow All CodeHide All CodeView Source Running Code Code knitr::opts_chunk$set(echo = TRUE, warning = FALSE, error = FALSE) Do problems 8.1, 8.2, 8.3, and 8.7 in Kuhn and Johnson. 8.1. Recreate the simulated data from Exercise 7.2: Code set.seed(200) simulated <- mlbench.friedman1(200, sd...
7463 sym R (73601 sym/33 pcs) 2 img 7 tbl
DATA 624 Homework 8
Table of contents Running Code DATA 624 Homework8 CodeShow All CodeHide All CodeView Source Running Code Code knitr::opts_chunk$set(echo = TRUE, warning = FALSE, error = FALSE) Do problems 7.2 and 7.5 in Kuhn and Johnson. There are only two but they have many parts. Please submit both a link to your Rpubs and the .rmd file. 7.2. Friedman (199...
3343 sym R (29113 sym/46 pcs) 11 img 4 tbl
DATA 624 Homework 7
Table of contents Running Code DATA 624 Homework7 CodeShow All CodeHide All CodeView Source Running Code Code knitr::opts_chunk$set(echo = TRUE, warning = FALSE, error = FALSE) In Kuhn and Johnson do problems 6.2 and 6.3. There are only two but they consist of many parts. Please submit a link to your Rpubs and submit the .rmd file as well. 6.2...
5337 sym R (49359 sym/49 pcs) 8 img
DATA 624 Project 1
Table of contents Project Deliverable PART A Forecasting Analysis Data Exploratory Analysis Review missing variables Review distributions Impute missing values TimeSeries Models Part B – Forecasting Power, ResidentialCustomerForecastLoad-624.xlsx Data Exploration & Plots Data Distributions Data Imputation Models Part C – Bonus Check for...
11993 sym Python (81726 sym/199 pcs) 85 img 26 tbl
DATA 624 Homework6 ARIMA
Table of contents Running Code DATA 624 Homework6 CodeShow All CodeHide All CodeView Source Running Code Code knitr::opts_chunk$set(echo = TRUE, warning = FALSE, error = FALSE) Do exercises 9.1, 9.2, 9.3, 9.5, 9.6, 9.7, 9.8 in Hyndman. 9.1 Figure 9.32 shows the ACFs for 36 random numbers, 360 random numbers and 1,000 random numbers. Explai...
7859 sym R (30194 sym/57 pcs) 33 img
DATA 622 - Assignment1
Table of contents Project Deliverable Essay Large Data Set (End to End ML Analysis) Data set Introduction Data Exploration & Plots Correlation Analysis Data Preparation (Imputation and Feature Engineering) Algorithm Selection/Build Models Decision Tree Model and metrics review Oversampling and fixing class imbalance and rerunning decision tree ...
20778 sym Python (120383 sym/174 pcs) 46 img 7 tbl
DATA 621 BLOG1
BLOG1: Since I used to code in Base SAS, I wanted to connect to SAS online and use the commands via a tutorial to model logistic regression in SAS. It was extremely easy and I found the interface very easy to use and the output via tabular format on various statistics values was very easy to read and follow. Tutorial - https://stats.oarc.ucla...
437 sym