Publications by Banu Boopalan

DATA 624 Homework5

07.10.2024

DATA 624 Homework5 Running Code knitr::opts_chunk$set(echo = TRUE, warning = FALSE, error = FALSE) ── Attaching core tidyverse packages ──────────────────────── tidyverse 2.0.0 ── ✔ dplyr 1.1.4 ✔ readr 2.1.5 ✔ forcats 1.0.0 ✔ stringr 1.5.1 ✔ ggplot2 3.5.1 ...

8831 sym Python (22016 sym/94 pcs) 20 img

DATA 624 Homework4

30.09.2024

Do problems 3.1 and 3.2 in the Kuhn and Johnson book Applied Predictive Modeling. Please submit your Rpubs link along with your .pdf for your run code. 3.1 a Using visualizations, explore the predictor variables to understand their distributions as well as the relationships between predictors. ANSWER: I have used histograms and the correlation...

6190 sym 27 img 2 tbl

DATA 621 Homework1_MoneyBall_Analysis

28.09.2024

Assignment Introduction We have read in the 2 datasets provided as part of this project from Github. Our goal for this project is to analyze the variables in the baseball statistics columns provided on the training file, explore missing data, understand and prepare the data and build regression models to fit the data (predict our Y or dependen...

22132 sym Python (102536 sym/146 pcs) 61 img 7 tbl

DATA 624 Homework3

23.09.2024

Do exercises 5.1, 5.2, 5.3, 5.4 and 5.7 in the Hyndman book. 5.1 Produce forecasts for the following series using whichever of NAIVE(y), SNAIVE(y) or RW(y ~ drift()) is more appropriate in each case: Australian Population (global_economy) Bricks (aus_production) NSW Lambs (aus_livestock) Household wealth (hh_budget). Australian takeaway food t...

6696 sym Python (6978 sym/37 pcs) 25 img

DATA 624 Week3_Homework_2

16.09.2024

WEEK3 HOMEWORK Do exercises 3.1, 3.2, 3.3, 3.4, 3.5, 3.7, 3.8 and 3.9 from the online Hyndman book. Please include your Rpubs link along with.pdf file of your run code PROBLEM 3.1 Consider the GDP information in global_economy. Plot the GDP per capita for each country over time.  Which country has the highest GDP per capita? How has this cha...

5408 sym R (23109 sym/90 pcs) 41 img

DATA 624_Homework1

09.09.2024

WEEK2 HOMEWORK Please submit exercises 2.1, 2.2, 2.3, 2.4, 2.5 and 2.8 from the Hyndman online Forecasting book. Please submit both your Rpubs link as well as attach the .pdf file with your code.” PROBLEM 2.1 Explore the following four time series: Bricks from aus_production, Lynx from pelt, Close from gafa_stock, Demand from vic_elec. Use ? ...

4871 sym R (13071 sym/63 pcs) 30 img

DATA 608 Homework1

11.09.2022

Principles of Data Visualization and Introduction to ggplot2 I have provided you with data about the 5,000 fastest growing companies in the US, as compiled by Inc. magazine. lets read this in: inc <- read.csv("https://raw.githubusercontent.com/charleyferrari/CUNY_DATA_608/master/module1/Data/inc5000_data.csv", header= TRUE) And lets preview this...

2420 sym R (13860 sym/51 pcs) 10 img 3 tbl

Module 3 Homework

11.10.2022

I have provided you with data about mortality from all 50 states and the District of Columbia.Please access it at https://github.com/charleyferrari/CUNY_DATA608/tree/master/module3/data You are invited to gather more data from our provider, the CDC WONDER system, at https://wonder.cdc.gov This assignment must be done in R. It must be done using t...

2380 sym R (13582 sym/50 pcs) 2 img 2 tbl

DATA608 Final Project Proposal

24.10.2022

BANU BOOPALAN PROPOSAL: DATA 608 ##DATA SOURCES: NCHS - Leading Causes of Death: United States https://data.cdc.gov/NCHS/NCHS-Leading-Causes-of-Death-United-States/bi63-dtpu Summary of the dataset : This dataset presents the age-adjusted death rates for the 10 leading causes of death in the United States beginning in 1999. Data are based on info...

2321 sym

Data 606 Homework6

16.03.2020

2010 Healthcare Law. (6.48, p. 248) On June 28, 2012 the U.S. Supreme Court upheld the much debated 2010 healthcare law, declaring it constitutional. A Gallup poll released the day after this decision indicates that 46% of 1,012 Americans agree with this decision. At a 95% confidence level, this sample has a 3% margin of error. Based on this inf...

8011 sym R (1712 sym/38 pcs)