Publications by Cam O’Shaughnessy

Quiz 2

04.05.2020

# Load packages library(tidyquant) library(tidyverse) # Import stock prices stock_prices <- tq_get(c("WMT", "TGT", "MSFT"), get = "stock.prices", from = "2020-01-01") # Calculate daily returns stock_returns <- stock_prices %>% group_by(symbol) %>% tq_mutate(select = adjusted, mutate_fun = periodReturn, period = "daily") stock_retur...

1982 sym R (2477 sym/9 pcs) 3 img

Quiz 2

04.05.2020

# Load packages library(tidyquant) library(tidyverse) # Import stock prices stock_prices <- tq_get(c("WMT", "TGT", "AMZN"), get = "stock.prices", from = "2020-01-01") # Calculate daily returns stock_returns <- stock_prices %>% group_by(symbol) %>% tq_mutate(select = adjusted, mutate_fun = periodReturn, period = "daily") stock_return...

2011 sym R (2476 sym/9 pcs) 3 img

Quiz 5

03.05.2020

Replicate a case study of marketing analytics: https://www.linkedin.com/learning/the-data-science-of-marketing/cluster-analysis-with-r?u=2232593 Q1 Import data myClusterData <- read.csv("/cloud/project/cluster-r.csv") myClusterData ## Email Behavior.3 ## 1 nisl@adipiscin...

599 sym R (53178 sym/6 pcs) 1 img

Tidytuesday

25.04.2020

Choose one of David Robinson’s tidytuesday screencasts, watch the video, and summarise. https://www.youtube.com/channel/UCeiiqmVK07qhY-wvg3IZiZQ Instructions You must follow the instructions below to get credits for this assignment. Read the document posted in Moodle before answering the following questions. Write in your own words. Multiple ...

2033 sym

Quiz 4

16.04.2020

Make sure to include the unit of the values whenever appropriate. Q1 Build a regression model to predict life expectancy using gdp per capita. Hint: The variables are available in the gapminder data set from the gapminder package. Note that the data set and package both have the same name, gapminder. library(tidyverse) options(scipen=999) data(...

2099 sym R (1845 sym/2 pcs)

Reading on Regression

08.04.2020

Instructions You must follow the instructions below to get credits for this assignment. Read the document (example of regression analysis) posted in Moodle before answering the following questions. Write in your own words. Multiple identical answers will get zero. Elaborate your answer. One or two sentence answers won’t get credit. Make sure t...

2741 sym

Quiz 3

26.03.2020

The data set is from a case-control study of smoking and Alzheimer’s disease. The data set has two variables of main interest: smoking a factor with four levels “None”, “<10”, “10-20”, and “>20” (cigarettes per day) disease a factor with three levels “Alzheimer”, “Other dementias”, and “Other diagnoses”. Q1 Describ...

2185 sym 2 img

Mosaic Plot

24.03.2020

In this exercise you will learn to visualize the pairwise relationships between a set of quantitative variables. To this end, you will make your own note of 8.5 Mosaic plots from Data Visualization with R. Mosaic charts can display the relationship between categorical variables using: rectangles whose areas represent the proportion of cases for ...

2167 sym 2 img

Bivariate Graphs

25.02.2020

In this exercise you will learn to plot data using the ggplot2 package. To answer the questions below, use Chapter 4.3 Categorical vs. Quantitative Data Visualization with R. Q1 Plot the distribution of daily returns by stock using kernel density plots. Hint: See the code in 4.3.2 Grouped kernel density plots. Q2 Plot the distribution of daily...

1550 sym 5 img

Introduction to ggplot2

20.02.2020

In this exercise you will learn to plot data using the ggplot2 package. To answer the questions below, use Chapter 4.3 Categorical vs. Quantitative Data Visualization with R. # Load packages library(tidyquant) library(tidyverse) # Import stock prices stock_prices <- tq_get(c("AAPL", "MSFT"), get = "stock.prices", from = "2020-01-01") stock_pri...

1134 sym R (2673 sym/10 pcs) 6 img