Publications by Erin Dane

Analysis of Salary Determinants

30.03.2022

0.1 Executive Summary This report aims to predict salary using various employee demographic characteristics. Data preprocessing is completed to ensure the dataset is ready for analysis. Missing variables were deleted or set to the mean of the data attribute. Then dummy variables are created to ensure categorical variables can be used in the analy...

19803 sym R (19043 sym/62 pcs) 14 img 4 tbl

Pop Quiz March 23

23.03.2022

Pop Quiz: Anscombes Quartet Erin 2022-03-23 Introduction This dataset is called Anscombes Quartet. This is a synthetic dataset that was created to show the importance of looking outside pvalue and R squared value when doing analysis. Given that the model is synthetic there is no missing values and no data qaulity issues. > head(four) x123 ...

3298 sym R (3846 sym/23 pcs) 8 img

Pop Quiz March 16th

16.03.2022

Pop Quiz March 16th Erin Dane 2022-03-16 > Housing <- read.table("/Users/erindane/Desktop/R Studios /Table2.1HousePrices-NoID.csv", + header=TRUE, stringsAsFactors=TRUE, sep=",", na.strings="NA", dec=".", strip.white=TRUE) Determing Distribution of House Prices Using Histrogram From this hisogram it is safe to assume that the distribution i...

1775 sym R (977 sym/11 pcs) 4 img

Document

14.02.2022

R Markdown This is an R Markdown document. Markdown is a simple formatting syntax for authoring HTML, PDF, and MS Word documents. For more details on using R Markdown see http://rmarkdown.rstudio.com. When you click the Knit button a document will be generated that includes both content as well as the output of any embedded R code chunks within t...

593 sym R (262 sym/2 pcs) 1 img

Assignment 2 MSCI3230

26.02.2022

'Name: Erin Dane Course: MSCI3230' ## [1] "Name: Erin Dane\nCourse: MSCI3230" #Question 1 Housing <- read.csv("~/Desktop/R Studios /Table2.1HousePrices-NoID.csv") SQFT <- Housing$SqFt BNB <- Housing$Brick function1 <- function(x,y) { z <- which(SQFT >= x & BNB == y) w <- mean(Housing$Price[z]) print(paste('The average price of the home is',...

17 sym R (1722 sym/11 pcs) 1 img

Pop Quiz March 9th

14.03.2022

Pop Quiz March 9th Erin Dane 2022-03-14 WestRox <- read.table("/Users/erindane/Desktop/westRoxbury.csv", header=TRUE, stringsAsFactors=TRUE, sep=",", na.strings="NA", dec=".", strip.white=TRUE) with(WestRox, hist(TOTAL.VALUE, groups=REMODEL, scale="frequency", breaks="Sturges", col="darkgray")) #These histograms display the difference fequenci...

70 sym R (2133 sym/13 pcs) 6 img

Lab Test 2

21.03.2022

Lab Test 2 Erin Dane 2022-03-21 Normality and Correlation Determing Distribution of Amount Spent From this QQ plot we can see a bell shaped distribution. This indicated that the Amount spend is not normally distributed. Given that the shape of the QQ plot opens up means it reight sknewed. From the histogram we can see a large right sknewednes...

3179 sym R (2506 sym/17 pcs) 5 img

Lab Test: Credit Score Analysis

06.04.2022

Credit Score Analysis MSCI 3230 Erin Dane 2022-04-06 Objectives The objective of this analysis is finding the best way to predict the credit score of different companies. A sample of the data set can be found below. The given data set contains the following variables. Variables Fleverage: Numeric Profit: Numeric Networth: Numeric Sales: Nume...

4846 sym R (5479 sym/27 pcs) 5 img 3 tbl