Publications by Daniel Lee

Predicting Litigation Risk based on Company Reviews

23.11.2019

Major findings The larger the company, the greater litigation risk is. The extent of litigation risk is likely higher for some industries than others. Not sure whether company reviews can be used as a predictor for litigation risk. Future Research Add data on the type of lawsuits. Increase the accuracy of the lawsuits data. The number of lawsu...

5711 sym R (25097 sym/1 pcs) 15 img 1 tbl

Predicting Litigation Risk based on Company Reviews

27.11.2019

Major findings The larger the company, the greater litigation risk is. The extent of litigation risk is likely higher for some industries than others. Not sure whether company reviews can be used as a predictor for litigation risk. Future Research Add data on the type of lawsuits. Increase the accuracy of the lawsuits data. The number of lawsu...

5953 sym R (25097 sym/1 pcs) 21 img 1 tbl

Analyzing US Census Data in R

22.01.2020

Disclaimer: The content of this RMarkdown note came from a course called Analyzing US Census Data in R in datacamp. Analysts across industries rely on data from the United States Census Bureau in their work. In this course, students will learn how to work with Census tabular and spatial data in the R environment. The course focuses on the tidycen...

2525 sym R (4696 sym/10 pcs) 7 img

ESG

28.02.2020

This is a note from Matt’s PP slides https://github.com/business-science/presentations/blob/master/2019_05_17_RFinance_Tidyquant_Portfolio_Optimization/R_Finance_tidyquant_matt_dancho.pdf?utm_source=Business+Science+-+Combined+List&utm_campaign=7407a74aed-RFINANCE_TALK_EMAIL&utm_medium=email&utm_term=0_a4e5b7c52f-7407a74aed-62996495&mc_cid=7407...

1069 sym R (7700 sym/5 pcs)

COVID Lawsuits: Cases of Businesses

29.09.2020

Research design Things to think about firm versus establishment I want to be able to say something like, if you operate in CA and in the retail industry, your probability of getting sued is 79%. A potential problem is that many firms have more than one establishment in a state. For example, if Walmart gets sued five times in New Hampshire, do we...

9151 sym R (380629 sym/15 pcs)

Explaining COVID-19 Lawsuits

02.01.2021

Load packages Import data Lawsuits data State lawsuits data came from HAK. StateCases.csv was prepared in cleanUp.Rmd. Excess deaths by the Economist It was imported from the sars2pack package. State Government responses to COVID It was imported from the sars2pack package. Preprep data for regression Removed policy_binary datasets because s...

7317 sym R (806 sym/4 pcs) 18 img

Explaining COVID-19 Lawsuits

04.01.2021

Load packages Import data Lawsuits data State lawsuits data came from HAK. StateCases.csv was prepared in cleanUp.Rmd. Excess deaths by the Economist It was imported from the sars2pack package. State Government responses to COVID It was imported from the sars2pack package. Preprep data for regression Removed policy_binary datasets because s...

7732 sym R (898 sym/4 pcs) 22 img

DALC January Workshop

05.01.2021

Ch1: Data Preparation 1.1 Importing data 1.2 Cleaning data library(dplyr) ## ## Attaching package: 'dplyr' ## The following objects are masked from 'package:stats': ## ## filter, lag ## The following objects are masked from 'package:base': ## ## intersect, setdiff, setequal, union data(starwars) # keep the variables name, heig...

161 sym R (8711 sym/34 pcs)

DALC January Workshop: Data Visualization

07.01.2021

library(ggplot2) data(Marriage, package = "mosaicData") # plot the distribution of race ggplot(Marriage, aes(x = race)) + geom_bar(fill = "cornflowerblue", color="black") + labs(x = "Race", y = "Frequency", title = "Participants by race") data(mpg, package = "ggplot2") # stacked bar chart ggplot(mpg, ...

32 sym R (2410 sym/16 pcs) 10 img

Common Mental Illness Treatments in the U.S.

19.03.2021

Unit 3: Evalution of Metal Illness Treatments Daniel Lee 3/8/2021 Import data Clean data Join data Visualize data Definition of variables Source of data https://www.datafiles.samhsa.gov/study-series/national-mental-health-services-survey-n-mhss-nid13521 The data came from SAMHDA, National Mental Health Services Survey, 2019 and 2010. Selected ar...

1625 sym R (3436 sym/10 pcs) 1 img