Publications by Adrianne Kristianto

Data Algorithms II - Assignment 5

01.04.2021

Question 2 For parts (a) through (c), indicate which of i. through iv. is correct. Justify your answer. 2A: The lasso, relative to least squares, is: i. More flexible and hence will give improved prediction accuracy when its increase in bias is less than its decrease in variance. ii. More flexible and hence will give improved prediction accur...

5454 sym R (3185 sym/22 pcs)

Data Algorithms II - Assignment 3

05.03.2021

Question 10: This question should be answered using the Weekly data set, which is part of the ISLR package. This data is similar in nature to theSmarket data from this chapter’s lab, except that it contains 1, 089 weekly returns for 21 years, from the beginning of 1990 to the end of 2010. 10A: Produce some numerical and graphical summaries of ...

6218 sym R (9779 sym/62 pcs) 5 img

Data Algorithms II - Assignment 4

25.03.2021

Question 3 We now review k-fold cross-validation 3A: Explain how k-fold cross-validation is implemented This approach involves randomly k-fold CV dividing the set of observations into k groups, or folds, of approximately equal size. The first fold is treated as a validation set, and the method is fit on the remaining k − 1 folds. 3B: What ar...

6734 sym R (6884 sym/43 pcs)

Data Algorithms II - Assignment 7 (Tree Based Methods)

04.05.2021

Question 3 Consider the Gini index, classification error, and entropy in a simple classification setting with two classes. Create a single plot that displays each of these quantities as a function of ˆpm1. The xaxis should display ˆpm1, ranging from 0 to 1, and the y-axis should display the value of the Gini index, classification error, and en...

4201 sym R (6727 sym/43 pcs) 6 img

Data Algorithms II - Assignment 6 (Nonlinear Models and Tree-based Methods)

22.04.2021

Question 6 In this exercise, you will further analyze the Wage data set considered throughout this chapter. a: Perform polynomial regression to predict wage using age. Use cross-validation to select the optimal degree d for the polynomial. What degree was chosen, and how does this compare to the results of hypothesis testing using ANOVA? Make a ...

1873 sym R (8087 sym/20 pcs) 6 img

Data Algorithms II - Assignment 8 (Support Vector Machines)

04.05.2021

Question 5 We have seen that we can fit an SVM with a non-linear kernel in order to perform classification using a non-linear decision boundary. We will now see that we can also obtain a non-linear decision boundary by performing logistic regression using non-linear transformations of the features. a: Generate a data set with n = 500 and p = 2, ...

3980 sym R (13097 sym/67 pcs) 12 img

Time Series Analysis II

11.05.2021

Exercise 1 Use the Electricity.RData examined in HW1 Exercise 2 (d)-(f). Fit the model using first order differenced log transformed series. a: By visually checking, decide what SARIMA models seem appropriate, i.e., specify p, d, q and P, D, Q in SARIMA model, ARIMA(p, d, q) ∗ ARIMA(P,D, Q)s. Choose the most appropriate two models and explain ...

8185 sym R (22875 sym/60 pcs) 19 img