Publications by Jordyn Carpenter

Term Project

06.05.2020

This is an extension of the tidytuesday assignment you have already done. Complete the questions below, using the screencast you chose for the tidytuesday assignment. Import data library(tidyverse) library(lubridate) brewing_materials <- readr::read_csv('https://raw.githubusercontent.com/rfordatascience/tidytuesday/master/data/2020/2020-03-31/br...

1050 sym R (3894 sym/5 pcs) 2 img

Quiz 5

03.05.2020

Replicate a case study of marketing analytics: https://www.linkedin.com/learning/the-data-science-of-marketing/cluster-analysis-with-r?u=2232593 Q1 Import data myClusterData <- read.csv("/cloud/project/cluster-r.csv") myClusterData ## Email Behavior.3 ## 1 nisl@adipisci...

559 sym R (53186 sym/6 pcs) 1 img

Tidytuesday

27.04.2020

Choose one of David Robinson’s tidytuesday screencasts, watch the video, and summarise it. https://www.youtube.com/channel/UCeiiqmVK07qhY-wvg3IZiZQ Instructions You must follow the instructions below to get credit for this assignment. Read the document posted in Moodle before answering the following questions. Write in your own words. Multiple ...

5220 sym

Quiz 4

22.04.2020

Make sure to include the units of the values whenever appropriate. Q1 Build a regression model to predict life expectancy using GDP per capita. library(tidyverse) options(scipen=999) data(gapminder, package="gapminder") gdp_lm <- lm(lifeExp ~ gdpPercap, data = gapminder) # View summary of model 1 summary(gdp_lm) ## ## Call: ##...

1830 sym R (1837 sym/2 pcs)

Regression Analysis

22.04.2020

library(tidyverse) options(scipen=999) data(SaratogaHouses, package="mosaicData") houses_lm <- lm(price ~ bedrooms, data = SaratogaHouses) # View summary of model 1 summary(houses_lm) ## ## Call: ## lm(formula = price ~ bedrooms, data = SaratogaHouses) ## ## Residuals: ## Min 1Q Median 3Q Max ## -266388 -...

1366 sym R (1715 sym/2 pcs)

Reading on Regression

05.04.2020

Instructions You must follow the instructions below to get credit for this assignment. Read the document (example of regression analysis) posted in Moodle before answering the following questions. Write in your own words. Multiple identical answers will get zero. Elaborate on your answer. One- or two-sentence answers won’t get credit. Make sure t...

5805 sym

Quiz 3

01.04.2020

The data set is from a case-control study of smoking and Alzheimer’s disease. The data set has two variables of main interest: smoking, a factor with four levels, “None”, “<10”, “10-20”, and “>20” (cigarettes per day); and disease, a factor with three levels, “Alzheimer”, “Other dementias”, and “Other diagnoses”. Q1 Describ...

2331 sym 3 img

Mosaic Plot

23.03.2020

In this exercise you will learn to visualize the pairwise relationships between a set of categorical variables. To this end, you will make your own notes on section 8.5, Mosaic plots, from Data Visualization with R. Mosaic charts can display the relationship between categorical variables using: rectangles whose areas represent the proportion of cases for ...

2314 sym 2 img

Quiz 2

04.03.2020

# Load packages library(tidyquant) library(tidyverse) # Import stock prices stock_prices <- tq_get(c("WMT", "TGT", "AMZN"), get = "stock.prices", from = "2020-01-01") # Calculate daily returns stock_returns <- stock_prices %>% group_by(symbol) %>% tq_mutate(select = adjusted, mutate_fun = periodReturn, period = "daily") stock_retur...

1062 sym R (3958 sym/12 pcs) 4 img

Bivariate Graphs

26.02.2020

Q1 Plot the distribution of daily returns by stock using kernel density plots. Q2 Plot the distribution of daily returns by stock using boxplots. Q3 Based on the boxplot above, which of the two stocks would you invest in? I would invest in Microsoft, because its median daily return is higher. The middle 50% is higher than Apple’s daily retu...

851 sym 5 img