Publications by Sung Lee

Sung Lee Data 607 Week 5 Assignment

29.02.2020

Assignment on RPubs Rmd on Github Introduction This assignment’s purpose is to tidy and transform the flights wide format table below through the use of tidyr and dplyr. The wide format of the table has been created beforehand as a csv file named airlines.csv. #Calling the stringr, tidyr, dplyr, ggplot2 libraries library(stringr) library(t...

1041 sym R (3604 sym/17 pcs) 2 img

Sung Lee Project 1

15.02.2020

Project on RPubs Rmd on Github Introduction This project will read the Chess tournament text file hosted on Github at https://raw.githubusercontent.com/logicalschema/DATA607/master/Project%201/tournamentinfo.txt and will generate a CSV file. We will only be interested in the player’s wins, losses, and draws with other opponents. The following...

861 sym R (4635 sym/6 pcs) 1 tbl

Data 607 Week 2 Assignment

07.02.2020

Assignment on RPubs Rmd on Github Setup: Recreating the database The database created for this assignment is called movies. A mysqldump of the database was taken from a MariaDB Server 5.5.64. However, the dump would be compatible with a MySQL server. To recreate the database, one would only need to create a new database CREATE DATABASE my_new_d...

1528 sym R (881 sym/7 pcs) 3 img

Sung Lee Data 607 Week 1 Assignment

31.01.2020

Week 1 Assignment on RPubs Rmd on Github Introduction The article I selected is Marriage Isn’t Dead — Yet by Ben Casselman. The article reports on the decline of marriage in America by utlizing marriage rates by variables such as gender, education, children, and race. require(plyr) ## Loading required package: plyr #Reads data from Github ...

1448 sym R (2996 sym/10 pcs) 1 img

R Winter Workshop Final Project

20.01.2020

Final Assignment on RPubs Rmd on Github Data Exploration: This should include summary statistics, means, medians, quartiles, or any other relevant information about the data set. Please include some conclusions in the R Markdown text. require(plyr) ## Loading required package: plyr require(ggplot2) ## Loading required package: ggplot2 z_file <...

1847 sym R (2072 sym/12 pcs) 3 img

Week 2 Assignment

10.01.2020

Week 2 Assignment on RPubs Rmd on Github require(plyr) ## Loading required package: plyr # Load data # US Births in 1969 - 1988 z_file <- gzcon(url("https://raw.githubusercontent.com/logicalschema/r_winter_projects/master/Birthdays.csv.gz")) raw_csv <- textConnection(readLines(z_file)) close(z_file) birthday_data <- read.table(raw_csv, h...

1429 sym R (3042 sym/29 pcs) 2 img

Sung Lee Week 1 Assignment

08.01.2020

RPubs URL Write a loop that calculates 12-factorial. #The factorial function takes in integer values (negative or positive) factorial <- function(x) { if ( x < 0 ) return (-1 * factorial(abs(x))) if (x == 0 | x == 1) return (1) else { factorial_value <- 1 for (i in 1:x){ factorial_value <- factorial_value * i } ...

527 sym R (1155 sym/10 pcs)

Sung Lee Week 3 Assignment

12.02.2020

Assignment on RPubs Rmd on Github Using the 173 majors listed in fivethirtyeight.com’s College Majors dataset [https://fivethirtyeight.com/features/the-economic-guide-to-picking-a-college-major/], provide code that identifies the majors that contain either “DATA” or “STATISTICS” library(stringr) majorCSV <- read.csv("https://raw.g...

1782 sym R (3718 sym/22 pcs)

Sung Lee Data 607 Project 2

07.03.2020

Assignment on RPubs Rmd on GitHub Introduction The purpose of this project is to work with three datasets, tidy the data, and repurpose for data analysis. The three datasets I will use are: Leo Yi’s climate data from https:/www.usclimatedata.com, Philip Tanofsky’s Coronavirus data https://github.com/CryptoKass/ncov-data/blob/master/world.la...

4666 sym R (11816 sym/58 pcs) 11 img

Data 607 Week 6 Assignment

14.03.2020

Assignment on RPubs Rmd on Github Introduction This week’s assignment is to work with the web uibiquitous forms of html, xml, and json. This assignment will work with and identify differences between three files: books.html, books.xml, and books.json. The books in the brief inventory are from the dystopian science fiction genre. One book Look...

2365 sym R (3640 sym/24 pcs)