Publications by Erin Dane
Analysis of Salary Determinants
0.1 Executive Summary This report aims to predict salary using various employee demographic characteristics. Data preprocessing is completed to ensure the dataset is ready for analysis. Missing variables were deleted or set to the mean of the data attribute. Then dummy variables are created to ensure categorical variables can be used in the analy...
19803 sym R (19043 sym/62 pcs) 14 img 4 tbl
Pop Quiz March 23
Pop Quiz: Anscombes Quartet Erin 2022-03-23 Introduction This dataset is called Anscombes Quartet. This is a synthetic dataset that was created to show the importance of looking outside pvalue and R squared value when doing analysis. Given that the model is synthetic there is no missing values and no data qaulity issues. > head(four) x123 ...
3298 sym R (3846 sym/23 pcs) 8 img
Pop Quiz March 16th
Pop Quiz March 16th Erin Dane 2022-03-16 > Housing <- read.table("/Users/erindane/Desktop/R Studios /Table2.1HousePrices-NoID.csv", + header=TRUE, stringsAsFactors=TRUE, sep=",", na.strings="NA", dec=".", strip.white=TRUE) Determing Distribution of House Prices Using Histrogram From this hisogram it is safe to assume that the distribution i...
1775 sym R (977 sym/11 pcs) 4 img
Document
R Markdown This is an R Markdown document. Markdown is a simple formatting syntax for authoring HTML, PDF, and MS Word documents. For more details on using R Markdown see http://rmarkdown.rstudio.com. When you click the Knit button a document will be generated that includes both content as well as the output of any embedded R code chunks within t...
593 sym R (262 sym/2 pcs) 1 img
Assignment 2 MSCI3230
'Name: Erin Dane Course: MSCI3230' ## [1] "Name: Erin Dane\nCourse: MSCI3230" #Question 1 Housing <- read.csv("~/Desktop/R Studios /Table2.1HousePrices-NoID.csv") SQFT <- Housing$SqFt BNB <- Housing$Brick function1 <- function(x,y) { z <- which(SQFT >= x & BNB == y) w <- mean(Housing$Price[z]) print(paste('The average price of the home is',...
17 sym R (1722 sym/11 pcs) 1 img
Pop Quiz March 9th
Pop Quiz March 9th Erin Dane 2022-03-14 WestRox <- read.table("/Users/erindane/Desktop/westRoxbury.csv", header=TRUE, stringsAsFactors=TRUE, sep=",", na.strings="NA", dec=".", strip.white=TRUE) with(WestRox, hist(TOTAL.VALUE, groups=REMODEL, scale="frequency", breaks="Sturges", col="darkgray")) #These histograms display the difference fequenci...
70 sym R (2133 sym/13 pcs) 6 img
Lab Test 2
Lab Test 2 Erin Dane 2022-03-21 Normality and Correlation Determing Distribution of Amount Spent From this QQ plot we can see a bell shaped distribution. This indicated that the Amount spend is not normally distributed. Given that the shape of the QQ plot opens up means it reight sknewed. From the histogram we can see a large right sknewednes...
3179 sym R (2506 sym/17 pcs) 5 img
Lab Test: Credit Score Analysis
Credit Score Analysis MSCI 3230 Erin Dane 2022-04-06 Objectives The objective of this analysis is finding the best way to predict the credit score of different companies. A sample of the data set can be found below. The given data set contains the following variables. Variables Fleverage: Numeric Profit: Numeric Networth: Numeric Sales: Nume...
4846 sym R (5479 sym/27 pcs) 5 img 3 tbl