Publications by Daniel Lee
week 02
Welcome Ch1 Introduction The data science project workflow Prerequisites R RStudio r packages Install the tidyverse package Running R code 1+2 ## [1] 3 Getting help Google Stackoverflow Ch2 Introduction to Data Exploration Ch3 Data Visualization Set up data library(tidyverse) mpg ## # A tibble: 234 × 11 ## manufacturer model ...
Apply it to your data
Welcome 1+2 ## [1] 3 Getting help Google Stackoverflow Ch3 Data Visualization data library(tidyverse) mpg ## # A tibble: 234 × 11 ## manufacturer model displ year cyl trans drv cty hwy fl class ## <chr> <chr> <dbl> <int> <int> <chr> <chr> <int> <int> <chr> <chr> ## 1 audi a4 1.8 1999 ...
Apply1
Import stock prices stocks <- tq_get(c("AAPL", "NFLX"), get = "stock.prices", from = "2020-01-01", to = "2021-01-01") stocks ## # A tibble: 506 × 8 ## symbol date open high low close volume adjusted ## <chr> <date> <dbl> <dbl> <dbl> <dbl> <dbl> <dbl> ## 1 AAPL 2...
Real world applications 1
Bank of New Hampshire State the name of your company. The Bank of New Hampshire is located in Belknap County, NH, in the 1st Congressional District. The company has 250 employees in total, but only 70 work at this location. Its annual sales are $85,902,000, a $21,236,000 increase from 3 years ago when their an...
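The two dollar figures quoted in this entry imply an earlier sales level and a growth rate. A minimal sketch of that arithmetic in base R, using only the numbers stated above; the variable names are illustrative and do not come from the original publication.

```r
# Figures quoted in the entry above
sales_now <- 85902000   # current annual sales, in dollars
increase  <- 21236000   # increase relative to 3 years ago, in dollars

# Implied annual sales 3 years ago
sales_then <- sales_now - increase

# Growth over the 3-year span, as a percentage of the earlier figure
growth_pct <- 100 * increase / sales_then

sales_then
round(growth_pct, 1)
```

Under these figures, the implied sales level 3 years ago is $64,666,000, and the increase is roughly 32.8% of that amount.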
APPLY1
Import stock prices stocks <- tq_get(c("LULU", "CLX", "MMC"), get = "stock.prices", from = "2019-06-01", to = "2020-06-01") stocks ## # A tibble: 753 × 8 ## symbol date open high low close volume adjusted ## <chr> <date> <dbl> <dbl> <dbl> <dbl> <dbl> <dbl> ## 1 LULU...
How to Measure Economic Performance
2022-09-13 Data? What is it for? To measure something: changes over time, changes over cross-sections. How do we measure economic performance? Ideally, we want to count how much stuff we produced. Alternatives: sales, expenditures, employment. How do we find the right data? Major factors to consider: Industry Geography Rec...
CH4
Coding basics R as a calculator 1000/10 ## [1] 100 x <- 1000/10 # x assignment symbol What’s in a name? Calling functions Use of TAB seq(from = 1, to = 10, by =1) ## [1] 1 2 3 4 5 6 7 8 9 10 continuation character, + seq(from = 1, to = 10) ## [1] 1 2 3 4 5 6 7 8 9 10 Printing to screen y <- seq(from = 1, to = 10) y ##...
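The calls previewed in this entry can be tried in any R console. A minimal sketch using only base R that mirrors the snippets above; the names `x` and `y` follow the preview, and the comments are added here as explanation rather than taken from the original.

```r
# R as a calculator: the result is printed because it is not assigned
1000 / 10

# `<-` is the assignment operator; assigning prints nothing
x <- 1000 / 10

# Calling a function with named arguments
seq(from = 1, to = 10, by = 1)

# `by` defaults to 1, so this yields the same values
y <- seq(from = 1, to = 10)

# Typing the object name prints it to the screen
y
```

Wrapping an assignment in parentheses, as in `(y <- seq(from = 1, to = 10))`, assigns and prints in one step.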
CH5
#Import data flights ## # A tibble: 336,776 × 19 ## year month day dep_time sched_de…¹ dep_d…² arr_t…³ sched…⁴ arr_d…⁵ carrier ## <int> <int> <int> <int> <int> <dbl> <int> <int> <dbl> <chr> ## 1 2013 1 1 517 515 2 830 819 11 UA ## 2 2013 1 1 ...
Code Along 2 Ch 5
Import data flights ## # A tibble: 336,776 × 19 ## year month day dep_time sched_de…¹ dep_d…² arr_t…³ sched…⁴ arr_d…⁵ carrier ## <int> <int> <int> <int> <int> <dbl> <int> <int> <dbl> <chr> ## 1 2013 1 1 517 515 2 830 819 11 UA ## 2 2013 1 1 5...
Code Along 2 Ch4
Coding basics R as a calculator 1000 / 10 ## [1] 100 x <- 1000 / 10 # x assignment symbol What’s in a name? Calling functions Use of TAB continuation character, + Printing to screen ...