Publications by Taha Ahmad
DATA621 HW4 Insurance Predictions
Introduction We will explore, analyze and model a data set containing approximately 8000 records. Each record represents a customer at an auto insurance company. Each record has various predictor variables regarding the customer’s car, job, and demographics. The response variables within this dataset indicate if the customer was in a car cras...
16228 sym R (15675 sym/29 pcs) 11 img 11 tbl
Curse of Dimensionality
library(tidyverse) ## ── Attaching core tidyverse packages ──────────────────────── tidyverse 2.0.0 ── ## ✔ dplyr 1.1.2 ✔ readr 2.1.4 ## ✔ forcats 1.0.0 ✔ stringr 1.5.0 ## ✔ ggplot2 3.4.3 ✔ tibble 3.2.1 ## ✔ lubridate 1.9.2 ✔ tidyr 1.3.0...
4227 sym R (2125 sym/11 pcs) 8 img
DATA624 Project 1 Time Series Forecasting
library(fpp3) library(tidyverse) Context This project consists of 3 parts: Part A – ATM Forecast, ATM.xlsx In part A, I want you to forecast how much cash is taken out of 4 different ATM machines for May 2010. The data is given in a single file. The variable ‘Cash’ is provided in hundreds of dollars, other than that it is straight forwar...
32349 sym R (14963 sym/80 pcs) 30 img 9 tbl
DATA624 FPP Chapter 9
library(fpp3) Introduction In this document, we will be going through exercises 9.1, 9.2, 9.3, 9.5, 9.6, 9.7, 9.8 from Forecasting: Principles and Practice (3rd ed). Exercises 9.1 Figure 9.32 shows the ACFs for 36 random numbers, 360 random numbers and 1,000 random numbers. Figure 9.32 Explain the differences among these figures. Do they al...
13673 sym R (14064 sym/67 pcs) 30 img
DATA622 Exploratory Machine Learning Analysis
library(tidyverse) library(skimr) library(DataExplorer) library(corrplot) library(ggfortify) library(caret) set.seed(123) Introduction We will explore, analyze and model two sample sales datasets from (https://excelbianalytics.com/wp/downloads-18-sample-csv-files-data-sets-for-testing-sales/) containing 100 and 1,000,000 million records. L...
10294 sym R (32443 sym/33 pcs) 1 img 4 tbl
DATA624 FPP Chapter 8
library(fpp3) Introduction In this document, we will be going through exercises 8.1, 8.5, 8.6, 8.7, 8.8, and 8.9 from Forecasting: Principles and Practice (3rd ed). Exercises 8.1 Consider the the number of pigs slaughtered in Victoria, available in the aus_livestock dataset. Use the ETS() function to estimate the equivalent model for simple ...
6763 sym R (6613 sym/38 pcs) 11 img
DATA624 JK Chapter 3
library(tidyverse) library(mlbench) library(corrplot) library(MASS) Introduction In this document, we will be going through exercises 3.1 and 3.2 from Applied Predictive Modeling - Kuhn and Johnson. Exercises 3.1 The UC Irvine Machine Learning Repository6 contains a data set related to glass identification. The data consist of 214 glass sa...
4676 sym R (3987 sym/9 pcs) 5 img
DATA624 FPP Chapter 5
library(fpp3) Introduction In this document, we will be going through exercises 5.1, 5.2, 5.3, 5.4 and 5.7 from Forecasting: Principles and Practice (3rd ed). Exercises 5.1 Produce forecasts for the following series using whichever of NAIVE(y), SNAIVE(y) or RW(y ~ drift()) is more appropriate in each case: Australian Population (global_econo...
5851 sym R (5725 sym/32 pcs) 23 img
DATA621 HW1 Moneyball
library(tidyverse) library(olsrr) library(skimr) library(DataExplorer) library(corrplot) library(fabletools) library(ggfortify) Introduction We will explore, analyze and model a data set containing approximately 2200 records. Each record represents a professional baseball team from the years 1871 to 2006 inclusive. Each record has the per...
12074 sym R (13620 sym/36 pcs) 14 img 2 tbl
DATA624 FPP Chapter 3
library(fpp3) Introduction In this document, we will be going through exercises 3.1, 3.2, 3.3, 3.4, 3.5, 3.7, 3.8 and 3.9 from Forecasting: Principles and Practice (3rd ed). Exercises 3.1 Consider the GDP information in global_economy. Plot the GDP per capita for each country over time. Which country has the highest GDP per capita? How has t...
7471 sym R (5871 sym/36 pcs) 27 img