Publications by Amit Kapoor
Data624 - Homework5
Exercise 7.1 Consider the pigs series — the number of pigs slaughtered in Victoria each month. str(pigs) ## Time-Series [1:188] from 1980 to 1996: 76378 71947 33873 96428 105084 ... a) Use the ses() function in R to find the optimal values of \(\alpha\) and \(\ell_0\), and generate forecasts for the next four months. # Using ses for pigs pigs...
6641 sym R (14832 sym/72 pcs) 19 img
Data621 - Homework1
DATA EXPLORATION The data set contains approximately 2276 records. Each record represents a professional baseball team from the years 1871 to 2006 inclusive. Each record has the performance of the team for the given year, with all of the statistics adjusted to match the performance of a 162 game season.Below is a short description of the variable...
10622 sym R (72648 sym/64 pcs) 22 img 3 tbl
Data624 - Homework4
library(mlbench) library(VIM) library(DataExplorer) library(GGally) library(psych) library(caret) library(summarytools) library(naniar) library(dplyr) library(Amelia) 3.1 The UC Irvine Machine Learning Repository6 contains a data set related to glass identification. The data consist of 214 glass samples labeled as one of seven class categories. ...
4625 sym R (19949 sym/31 pcs) 11 img
Data621 - Homework2
1. Download Dataset Download the classification output data set (attached in Blackboard to the assignment) dataset_df <- read.csv('https://raw.githubusercontent.com/hillt5/DATA_621/master/HW2/classification-output-data.csv') 2. Confusion Matrix The data set has three key columns we will use: class: the actual class for the observation scored.cl...
4851 sym R (7195 sym/31 pcs) 2 img 1 tbl
Data 621 - Final Project
Abstract For the final assignment, the team will be looking at a dataset of forest fires identified in Montesinho Natural Park, located in the mountainous northeast of Portugal. The original paper was published by a nearby university with the intention of offering low-cost prediction based on available meteorological measurements. Their model use...
18731 sym R (26765 sym/25 pcs) 23 img 1 tbl
Data624 - Project2
Overview ABC Beverage has new regulations in place and the leadership team requires the data scientists team to understand the manufacturing process, the predictive factors and be able to report to them predictive model of PH. The selection of model depends upon various factors like model accuracy, data relevance, cross validation etc. R package...
12186 sym R (32747 sym/30 pcs) 19 img
Data621 - Blog5
Linear Regression “In statistics, linear regression is a linear approach to modelling the relationship between a scalar response and one or more explanatory variables” - Wikipedia Linear regression does model the relationship between two variables by fitting a linear equation to observed data. The independent variable is considered to be an e...
6683 sym R (7157 sym/12 pcs) 3 img
Data621 - Blog1
Principal Component Analysis Sometimes we have too many predictors and if we use all of them in our regression model, we would end up with issues and explanation could be difficult due to collinearity. It could also cause prediction performance degradation by using too many predictors. Hence, it has been proven better to reduce dimension of the d...
3376 sym R (6959 sym/22 pcs) 2 img
Data624 - Homework9
library(mlbench) library(randomForest) library(caret) library(party) library(gbm) library(Cubist) library(rpart) library(AppliedPredictiveModeling) library(tidyverse) library(naniar) library(rpart.plot) Exercise 8.1 Recreate the simulated data from Exercise 7.2 set.seed(317) simulated <- mlbench.friedman1(200, sd = 1) simulated <- cbind(simulate...
8637 sym R (27968 sym/75 pcs) 7 img
Data624 - Homework7
library(AppliedPredictiveModeling) library(tidyverse) library(caret) library(naniar) library(corrplot) Exercise 6.2 Developing a model to predict permeability (see Sect. 1.4) could save significant resources for a pharmaceutical company, while at the same time more rapidly identifying molecules that have a sufficient permeability to become a dru...
7699 sym R (29503 sym/80 pcs) 10 img