Publications by Laura Schollmeyer & Alexandra Paskowski

Mini Project

25.10.2024

Question 1 data(cars) median_speed <- median(cars$speed) median_speed ## [1] 15 Question 2 #install.packages("jsonlite") library("jsonlite") url <- "https://min-api.cryptocompare.com/data/v2/histoday?fsym=BTC&tsym=USD&limit=100" response <- fromJSON(url) btc_data <- response$Data$Data str(btc_data) ## 'data.frame': 101 obs. of 9 variables:...

878 sym R (7751 sym/23 pcs)

Assignment 6

18.10.2024

Question 1 pacman::p_load(nycflights13) #View(flights) summary(flights) ## year month day dep_time sched_dep_time ## Min. :2013 Min. : 1.000 Min. : 1.00 Min. : 1 Min. : 106 ## 1st Qu.:2013 1st Qu.: 4.000 1st Qu.: 8.00 1st Qu.: 907 1st Qu.: 906 ## Median :2013 Median : 7.00...

162 sym R (13972 sym/42 pcs)

Assignment 5

17.10.2024

Question 1 dirty_iris <- read.csv("https://raw.githubusercontent.com/edwindj/datacleaning/master/data/dirty_iris.csv") sum(is.na(dirty_iris$Petal.Length)) ## [1] 19 Question 2 complete_cases <- complete.cases(dirty_iris) num_complete <- sum(complete_cases) percent_complete <- (num_complete / nrow(dirty_iris)) * 100 num_complete ## [1] 96 percent...

255 sym R (3876 sym/31 pcs)

Assignment 4

04.10.2024

#bad_data <- str_subset(bike_complete$humidity, "[a-z A-Z]") #bike_complete$humidity <- str_replace_all(bike_complete$humidity,bad_data,"61") Question 7 cast <- read.csv("raw_cast.csv") Question 8/9 library(stringr) seperated_columns <- str_split_fixed(cast$Name.1, "-", 1) cast$Name2 <-str_split_fixed(cast$Name.2, "\n",2)[,1] cast$Epsiode <- str...

51 sym R (516488 sym/6 pcs)

Assignment 3

27.09.2024

Question 6 pacman::p_load(robotstxt, rvest) Step 1: Read the HTML document- webpage paths_allowed("https://www.imdb.com/title/tt7235466/fullcredits?ref_=tt_cl_sm") ## www.imdb.com ## [1] TRUE imdb_html <- read_html("https://www.imdb.com/title/tt7235466/fullcredits?ref_=tt_cl_sm") Step 2: Select the HTML elements table_html <- html_elements(imdb_...

806 sym Python (25928 sym/21 pcs)

Document

20.09.2024

library(quantmod) Loading required package: xts Loading required package: zoo Attaching package: ‘zoo’ The following objects are masked from ‘package:base’: as.Date, as.Date.numeric Loading required package: TTR Registered S3 method overwritten by 'quantmod': method from as.zoo.data.frame zoo getSymbols(c("GOOG","AMZ...

4398 sym R (1568 sym/20 pcs) 1 img

Assignment 1_LS

17.09.2024

bike_sharing_data <- read.csv(“bike_sharing_data.csv”) summary(bike_sharing_data) datetime season holiday workingday weather Length:17379 Min. :1.000 Min. :0.00000 Min. :0.0000 Min. :1.000 Class :character 1st Qu.:2.000 1st Qu.:0.00000 1st Qu.:0.0000 1st Qu.:1.000 Mo...

1180 sym

Document

11.04.2024

Objective Predict cheaters using the Affairs data from the ‘AER’ package, using the classification algorithm from the ‘rpart’ package. Data Prep Created a new binary variable ‘cheater’ by classifying observations with 0 affairs as ‘0’ , and with one or more affairs as ‘1’. Split the data (601 observations), into training and v...

1002 sym 2 img

Assignment 6

14.03.2024

1. Is there an association between leaving the company and having a work accident? ## ## ## Cell Contents ## |-------------------------| ## | N | ## | Chi-square contribution | ## | N / Row Total | ## | N / Col Total | ## | N / Table Total | ## |-------------------------| ## ## ## Total Obs...

981 sym

Assignment 5

29.02.2024

1. Is there a difference in satisfaction level between those that left and those that stayed in the company? Employees that stayed, on average, are more satisfied ## ## Welch Two Sample t-test ## ## data: hr$satisfaction_level by hr$Employee_Status ## t = -46.636, df = 5167, p-value < 2.2e-16 ## alternative hypothesis: true difference in means ...

542 sym