Publications by Mary Anna Kivenson + Elina v2 + Charls
Data624_wk8
Data 624 Week 8 Assignment Figure 8.31 shows the ACFs for 36 random numbers, 360 random numbers and 1,000 random numbers. Explain the differences among these figures. Do they all indicate that the data are white noise? Yes, they all indicate that the data are white noise, because the autocorrelations lie inside the threshold levels. Why are the cri...
6610 sym R (6375 sym/58 pcs) 33 img
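The Week 8 entry above judges white noise by whether the autocorrelations stay inside the threshold levels. As a minimal sketch of that check (in Python rather than the R used in the original document, and assuming the usual approximate 95% bounds of ±1.96/√T), the function names `acf` and `white_noise_bound`, the seed, and the choice of 20 lags are illustrative assumptions, not taken from the original:

```python
import math
import random


def acf(series, max_lag):
    """Return the sample autocorrelations r_1 .. r_max_lag of a sequence."""
    n = len(series)
    mean = sum(series) / n
    denom = sum((x - mean) ** 2 for x in series)
    return [
        sum((series[t] - mean) * (series[t - k] - mean) for t in range(k, n)) / denom
        for k in range(1, max_lag + 1)
    ]


def white_noise_bound(n):
    """Approximate 95% bound for white noise: 1.96 / sqrt(n) (assumed convention)."""
    return 1.96 / math.sqrt(n)


if __name__ == "__main__":
    random.seed(624)  # arbitrary seed, only for repeatability
    for n in (36, 360, 1000):
        noise = [random.gauss(0, 1) for _ in range(n)]
        bound = white_noise_bound(n)
        outside = sum(abs(r) > bound for r in acf(noise, 20))
        print(f"n={n}: bound=+/-{bound:.3f}, lags outside bound={outside} of 20")
```

Because the bound shrinks as the series gets longer, the same kind of random data produces a visibly narrower band for 1,000 points than for 36.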
Data624_wk12
Data 624 - Homework 9 8.1 Recreate the simulated data from Exercise 7.2: a) Fit a random forest model to all of the predictors, then estimate the variable importance scores: ## Warning: package 'randomForest' was built under R version 4.0.3 ## randomForest 4.6-14 ## Type rfNews() to see new features/changes/bug fixes. ## Loading required package...
8116 sym R (48580 sym/247 pcs) 5 img
Data624_wk10
R Markdown Problem 6.2 Developing a model to predict permeability (see Sect. 1.4) could save significant resources for a pharmaceutical company, while at the same time more rapidly identifying molecules that have a sufficient permeability to become a drug: a) Start R and use these commands to load the data: Load all fingerprints predictors are ...
5348 sym R (23195 sym/101 pcs) 8 img
Data624_wk11
Data 624 Week11 7.2 Friedman (1991) introduced several benchmark data sets created by simulation. One of these simulations used the following nonlinear equation to create data: $y = 10\sin(\pi x_1 x_2) + 20(x_3 - 0.5)^2 + 10x_4 + 5x_5 + N(0, \sigma^2)$ where the x values are random variables uniformly distributed between [0,1] (there are also 5 other non-informative variab...
4170 sym R (49115 sym/109 pcs) 12 img
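The Friedman (1991) equation in the Week 11 entry above can be simulated directly. The following is a rough sketch in Python (the original document used R); the function name `friedman1`, the default `sigma=1.0`, and the seed handling are assumptions made for illustration:

```python
import math
import random


def friedman1(n, sigma=1.0, n_noise=5, seed=None):
    """Simulate n rows: 5 informative and n_noise non-informative uniform predictors."""
    rng = random.Random(seed)
    X, y = [], []
    for _ in range(n):
        # All predictors are uniform on [0, 1); only the first five enter the signal.
        x = [rng.random() for _ in range(5 + n_noise)]
        signal = (
            10 * math.sin(math.pi * x[0] * x[1])
            + 20 * (x[2] - 0.5) ** 2
            + 10 * x[3]
            + 5 * x[4]
        )
        X.append(x)
        y.append(signal + rng.gauss(0, sigma))  # additive N(0, sigma^2) noise
    return X, y


if __name__ == "__main__":
    X, y = friedman1(200, seed=200)
    print(len(X), len(X[0]), round(sum(y) / len(y), 2))
```

Keeping the non-informative columns in `X` is what makes the data useful for checking whether a model's variable importance scores ignore them.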
Data624_Project 1
Data 624 Project Part A – ATM Forecast, ATM624Data.xlsx In part A, I want you to forecast how much cash is taken out of 4 different ATM machines for May 2010. The data is given in a single file. The variable ‘Cash’ is provided in hundreds of dollars; other than that, it is straightforward. I am being somewhat ambiguous on purpose to make t...
5417 sym R (18074 sym/69 pcs) 22 img
Data624_wk13
Problem statement: Imagine 10000 receipts sitting on your table. Each receipt represents a transaction with items that were purchased. The receipt is a representation of stuff that went into a customer’s basket - and therefore ‘Market Basket Analysis’. That is exactly what the Groceries Data Set contains: a collection of receipts with each...
2814 sym R (12375 sym/20 pcs) 1 img
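The receipt framing in the Week 13 entry above maps onto a simple support calculation: the share of receipts that contain a given pair of items. The sketch below is a toy Python illustration, not the original R analysis; the function name `pair_support` and the four made-up receipts are hypothetical and are not drawn from the Groceries Data Set:

```python
from collections import Counter
from itertools import combinations


def pair_support(receipts):
    """Return {(item_a, item_b): fraction of receipts containing both items}."""
    n = len(receipts)
    counts = Counter()
    for receipt in receipts:
        # De-duplicate and sort so each pair is counted once per receipt.
        for pair in combinations(sorted(set(receipt)), 2):
            counts[pair] += 1
    return {pair: count / n for pair, count in counts.items()}


# Hypothetical toy receipts, for illustration only.
toy_receipts = [
    ["milk", "bread"],
    ["milk", "bread", "eggs"],
    ["eggs"],
    ["milk", "eggs"],
]

if __name__ == "__main__":
    for pair, support in sorted(pair_support(toy_receipts).items()):
        print(pair, support)
```

Pairs with high support are the starting point for association rules; a full analysis would also look at measures such as confidence and lift.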