Publications by Brittany Blankenship
STA4143Final
Background Information: A national veterans’ organization wishes to develop a predictive model to improve the cost-effectiveness of their direct marketing campaign. The organization, with its in-house database of over 13 million donors, is one of the largest direct-mail fundraisers in the United States. According to their recent mailing records, t...
4159 sym R (12516 sym/48 pcs) 24 img
STA4143HW8
We have seen that we can fit an SVM with a non-linear kernel in order to perform classification using a non-linear decision boundary. We will now see that we can also obtain a non-linear decision boundary by performing logistic regression using non-linear transformations of the features. (a) Generate a data set with n = 500 and p = 2, such that t...
4149 sym R (8432 sym/54 pcs) 26 img
STA4143HW7
Consider the Gini index, classification error, and entropy in a simple classification setting with two classes. Create a single plot that displays each of these quantities as a function of p̂_m1. The x-axis should display p̂_m1, ranging from 0 to 1, and the y-axis should display the value of the Gini index, classification error, and entropy. Hint: ...
3863 sym R (8647 sym/72 pcs) 15 img
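For reference, the three quantities named in the STA4143HW7 excerpt take the following standard forms in the two-class case. This is a sketch rather than the author's own solution: the symbol p is shorthand introduced here for p̂_m1, and the entropy is written with the natural logarithm, which is an assumption about the convention used.

```latex
% Two-class impurity measures as functions of p = \hat{p}_{m1}, with 0 \le p \le 1
\begin{align*}
  \text{Gini index:}           \quad & G(p) = 2\,p\,(1 - p) \\
  \text{Classification error:} \quad & E(p) = 1 - \max(p,\ 1 - p) \\
  \text{Entropy:}              \quad & D(p) = -\,p \log p \;-\; (1 - p)\log(1 - p)
\end{align*}
```

All three are zero at p = 0 and p = 1 (taking 0 log 0 = 0) and reach their maximum at p = 0.5.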
STA4143HW6
6.) In this exercise, you will further analyze the Wage data set considered throughout this chapter. a.) Perform polynomial regression to predict wage using age. Use cross-validation to select the optimal degree d for the polynomial. What degree was chosen, and how does this compare to the results of hypothesis testing using ANOVA? Make a plot of th...
1675 sym R (6437 sym/20 pcs) 5 img
4143Homework4
3. This problem relates to the QDA model, in which the observations within each class are drawn from a normal distribution with a class-specific mean vector and a class-specific covariance matrix. We consider the simple case where p = 1; i.e. there is only one feature. Suppose that we have K classes, and that if an observation belongs to the kth cl...
3660 sym
STA4143Homework3
This question should be answered using the Weekly data set, which is part of the ISLR package. This data contains 1,089 weekly stock returns for 21 years, from the beginning of 1990 to the end of 2010. A.) Produce some numerical and graphical summaries of the Weekly data. Do there appear to be any patterns? library(ISLR) summary(Weekly) ## Ye...
4939 sym R (18327 sym/70 pcs) 2 img