Publications by Sung Lee

Data 607 Final Project

05.05.2020

Assignment on RPubs Rmd on Github Introduction NYC (New Amsterdam) was founded on a land deal and from then on has been driven by profit and commerce. NYC has since weathered many booms and busts. It is with this I endeavor to analyze the impact on businesses of recessions in the United States. My father is a small business owner and I have fri...

7837 sym R (11168 sym/17 pcs) 20 img 6 tbl

Sung Lee Data 607 Week 12 Assignment

29.04.2020

Assignment on RPubs Rmd on Github Introduction The purpose of this assignment is to report on a recommender system. Perform a Scenario Design analysis as described below. Consider whether it makes sense for your selected recommender system to perform scenario design twice, once for the organization (e.g. Amazon.com) and once for the organizat...

3451 sym

Data 607 Project 4 Assignment

24.04.2020

Assignment on RPubs Rmd on Github Introduction The purpose of this project is to get our feet wet in document classification. One application of document classification is identifying “spam” and “ham”. Spam is “any kind of unwanted, unsolicited digital communication, often an email, that gets sent out in bulk.”1 Ham would be the opp...

4264 sym R (20517 sym/23 pcs) 2 img

Sung Lee Data 607 Extend Assignment

20.04.2020

Assignment on RPubs Create Rmd on Github Extend Rmd on Github Purpose This project is to extend Michael Munguia’s stringr CREATE project. It will provide additional demonstration functions for the stringr package. Data This Rmd will employ Michael’s csv data that is already a part of this repository. The csv file is endorsements-2020.csv a...

1362 sym R (809 sym/8 pcs)

Sung Lee Data 607 Week 10 Assignment

01.04.2020

Assignment on RPubs Rmd on Github Introduction Assignment In Text Mining with R, Chapter 2 looks at Sentiment Analysis. In this assignment, you should start by getting the primary example code from chapter 2 working in an R Markdown document. You should provide a citation to this base code. You’re then asked to extend the code in two ways: W...

1994 sym R (38133 sym/31 pcs) 2 img

Data 606 Sung Lee Presentation 8.25

01.04.2020

Problem 8.25 Murders and poverty, Part I. The following regression output is for predicting annual murders per million from percentage living in poverty in a random sample of 20 metropolitan areas. Charts Questions Write out the linear model. We get the values from the chart. The intercept is -29.901 and the slope is 2.559 \(\hat{y} = -29.901 ...

1233 sym 1 img

Data 607 Week 9 Assignment

24.03.2020

Assignment on RPubs Rmd on Github Introduction This asignment will employ the New York Times’ set of APIs to read data into JSON and transform it into an R data frame. We will use the New York Times Top Stories API and select world stories. An example of the call would use this API call https://api.nytimes.com/svc/topstories/v2/world.json?api...

837 sym R (1314 sym/12 pcs)

Data 607 Project 3

21.03.2020

Assignment on RPubs Rmd on GitHub Introduction The purpose of this assignment is to answer the question, “Which are the most valued data science skills?” Within the team we have had experiences applying for jobs, but we were each relatively new to the data science vernacular. For example, we have one seasoned programmer and a student with a...

17542 sym R (17647 sym/63 pcs) 14 img 1 tbl

Test Project 3

19.03.2020

References: [http://www.sthda.com/english/wiki/text-mining-and-word-cloud-fundamentals-in-r-5-simple-steps-you-should-know] [https://www.tidytextmining.com/nasa.html#word-co-ocurrences-and-correlations] library(tm) ## Loading required package: NLP library(SnowballC) library(wordcloud) ## Loading required package: RColorBrewer library(RColorBrewe...

247 sym R (2248 sym/24 pcs) 1 img

Data 607: Tidyverse Create

13.03.2020

Purpose Not all web pages are created the same. Oftentimes, in trying to analyze data, we will need a handy utility to scrape web pages. The package rvest is a handy Tidyverse package that will save time having to “harvest”1 web pages. Preparation We will need to load the library for rvest. library(rvest) # Tidyverse package for rv...

1764 sym R (95883 sym/7 pcs) 1 img