Publications by Daniel Lee
Term Project
This is an extension of the tidytuesday assignment you have already done. Complete the questions below, using the screencast you chose for the tidytuesday assigment. Import data myClusterData <- read.csv("/cloud/project/pizza_jared.csv") myClusterData ## polla_qid answer votes pollq_id question ## 1 ...
469 sym R (55345 sym/1 pcs)
Quiz 2
# Load packages library(tidyquant) ## Loading required package: lubridate ## ## Attaching package: 'lubridate' ## The following objects are masked from 'package:base': ## ## date, intersect, setdiff, union ## Loading required package: PerformanceAnalytics ## Loading required package: xts ## Loading required package: zoo ## ## Attaching pac...
2142 sym R (4269 sym/29 pcs) 3 img
Quiz 1
Use the given code below to answer the questions. ## Load package library(tidyverse) # for cleaning, plotting, etc ## ── Attaching packages ────────────────────────────────────────── tidyverse 1.3.0 ── ## ✓ ggplot2 3.3.0 ✓ purrr 0.3.4 ## ✓ tibb...
2307 sym R (4749 sym/31 pcs) 2 img
Quiz 5
Replicate a case study of marketing analytics: https://www.linkedin.com/learning/the-data-science-of-marketing/cluster-analysis-with-r?u=2232593 Q1 Import data myClusterData <- read.csv("/cloud/project/cluster-r.csv") myClusterData ## Email Behavior.3 ## 1 nisl@adipiscin...
667 sym R (53177 sym/6 pcs) 1 img
Tidytuesday
Choose one of David Robinson’s tidytuesday screencasts, watch the video, and summarise. https://www.youtube.com/channel/UCeiiqmVK07qhY-wvg3IZiZQ Instructions You must follow the instructions below to get credits for this assignment. Read the document posted in Moodle before answering the following questions. Write in your own words. Multiple ...
3041 sym
Quiz 4
Make sure to include the unit of the values whenever appropriate. Q1 Build a regression model to predict life expectancy using gdp per capita. Hint: The variables are available in the gapminder data set from the gapminder package. Note that the data set and package both have the same name, gapminder. library(tidyverse) options(scipen=999) data(...
2463 sym R (1769 sym/2 pcs)
Quiz 3
The data set is from a case-control study of smoking and Alzheimer’s disease. The data set has two variables of main interest: smoking a factor with four levels “None”, “<10”, “10-20”, and “>20” (cigarettes per day) disease a factor with three levels “Alzheimer”, “Other dementias”, and “Other diagnoses”. Q1 Describ...
2404 sym 3 img
Document
Instructions You must follow the instructions below to get credits for this assignment. Read the document (example of regression analysis) posted in Moodle before answering the following questions. Write in your own words. Multiple identical answers will get zero. Elaborate your answer. One or two sentence answers won’t get credit. Make sure t...
5248 sym