Publications by Abdelmalek Hajjam
Data624 - HW4
Homework 4 Exercise 3.1 The UC Irvine Machine Learning Repository6 contains a data set related to glass identification. The data consist of 214 glass samples labeled as one of seven class categories. There are nine predictors, including the refractive index and percentages of eight elements: Na, Mg, Al, Si, K, Ca, Ba, and Fe. The data can be acc...
3155 sym R (26709 sym/41 pcs) 3 img 4 tbl
Data624-HW6
Homework 6 Exercise 8.1 Figure 8.31 shows the ACFs for 36 random numbers, 360 random numbers and 1,000 random numbers. a. Explain the differences among these fgures. Do they all indicate that the data are white noise? The difference between figures is that they show different critical values presented by the blue dashed lines. x-axis show the co...
7114 sym R (12059 sym/139 pcs) 31 img
Data624_HW8
Exercise 7.2 Friedman (1991) introduced several benchmark data sets create by simulation. One of these simulations used the following nonlinear equation to create data: \(y = 10 sin(\pi x_1 x_2) + 20(x_3 - 0.5)^2 + 10x_4 + 5x_5 + N(0, \sigma^2)\) where the \(x\) values are random variables uniformly distributed between \([0, 1]\) (there are also...
3122 sym R (17601 sym/73 pcs) 9 img
Data624-Homework7
library(tidyverse) library(dbplyr) library(caret) # library(grid) # library(gridExtra) library(DMwR) library(pls) library(elasticnet) Exercise 6.2 Developing a model to predict permeability (see Sect. 1.4) could save significant resources for a pharmaceutical company, while at the same time more rapidly identifying molecules that have a ...
5109 sym R (30868 sym/49 pcs) 5 img
Data624_HW9
Execise 8.1 Recreate the simulated data from Exercise 7.2 library(mlbench) set.seed(200) simulated <- mlbench.friedman1(200, sd=1) simulated <- cbind(simulated$x, simulated$y) simulated <- as.data.frame(simulated) colnames(simulated)[ncol(simulated)] <- "y" (a) Fit a random forest model to all of the predictors, then estimate the variable im...
6761 sym R (8968 sym/71 pcs) 7 img
Data624_HW10
Imagine 10000 receipts sitting on your table. Each receipt represents a transaction with items that were purchased. The receipt is a representation of stuff that went into a customer’s basket - and therefore “Market Basket Analysis”. That is exactly what the Groceries Data Set contains: a collection of receipts with each line representing 1...
1431 sym R (3714 sym/11 pcs) 2 img 2 tbl
Project2
1 Project Problem Statement This is role playing. I am your new boss. I am in charge of production at ABC Beverage and you are a team of data scientists reporting to me. My leadership has told me that new regulations are requiring us to understand our manufacturing process, the predictive factors and be able to report to them our predictive model...
15929 sym R (38176 sym/116 pcs) 29 img 3 tbl