Publications by Chun San Yip
Data624 HW2
Exercises 3.1, 3.2, 3.3, 3.4, 3.5, 3.7, 3.8 and 3.9 from the Hyndman online Forecasting book. Ex 3.1 Consider the GDP information in global_economy. Plot the GDP per capita for each country over time. Which country has the highest GDP per capita? How has this changed over time? library(fpp3) gdp_per_capita<-global_economy %>% drop_na(GDP, P...
4645 sym Python (4359 sym/25 pcs) 22 img
Data624 HW1
Exercises 2.1, 2.2, 2.3, 2.4, 2.5 and 2.8 from the Hyndman online Forecasting book. Ex 2.1 Use the help function to explore what the series gafa_stock, PBS, vic_elec and pelt represent. Use autoplot() to plot some of the series in these data sets. What is the time interval of each series? library(fpp3) autoplot(gafa_stock, Close) PBS%>% ...
2099 sym 11 img
Data 605 Final Exam
library(tidyverse) library(cubature) library(expm) library(igraph) library(readr) library(Rfast) library(nnet) library(corrplot) library(ggrepel) library(Matrix) library(MASS) library(Metrics) library(OpenImageR) library(caret) 1.PageRank Form the A matrix. Then, introduce decay and form the B matrix as we did in the course notes. ...
5971 sym R (40475 sym/96 pcs) 42 img
Final Project for CUNY607
Introduction My company is spending a lot of money on Digital Marketing and I am the main person to purchase impressions and clicks from online media. While I am using an agency to purchase for us, I would like to find out the relationship between click and demography of users. I need to build a predictor model or build a basic recommender. It wi...
1710 sym R (7759 sym/29 pcs) 5 img
CUNY 607 Week 10 Assignment - Text Mining
Data Acquisition and Management Weekly Assignment - Wk10 Chun San Yip 2020/04/04 Overview: The assignment for this week is related to Text Mining. In this assignment, I start by getting the primary example code from chapter 2 of Text Mining with R working in an R Markdown document. I will then extend the code in two ways: Work with a different ...
1911 sym R (2364 sym/8 pcs) 3 img
CUNY 607 Week 9 Assignment - Web API
Overview: The assignment for this week is related to API. The task is to choose one of the New York Times APIs, construct an interface in R to read in the JSON data, and transform it into an R DataFrame. The New York Times web site provides a rich set of APIs, as described here:https://developer.nytimes.com/apis I need to start by signing up for ...
434 sym R (16133 sym/3 pcs)
CUNY 607 Project 3 Chun Yip
Overview: It is a group project to provide answer on the question “Which are the most valued data science skills?” Load all the required packages. library(tidyverse) I google the questions on the web and looked at 10 websites. I manually collect those data and put it in a csv file. Read the CSV file theFile <- "https://raw.githubusercontent.c...
566 sym R (1146 sym/5 pcs) 1 img
CUNY 607 Week 7 Assignment - work on HTML, XML and JSON with R
Overview: The assignment for this week is working with HTML, XML and JSON in R Load all the required packages. library(tidyverse) library(RCurl) library(rvest) library(XML) library(RJSONIO) Read data from 3 manually created HTML, XML and JSON file hfile <- "https://raw.githubusercontent.com/ferrysany/CUNY607A7/master/books.html" #"XML" func...
477 sym R (3248 sym/12 pcs)
CUNY 607 Project 1
Overview: The project is to create an R Markdown file that generates a .CSV from a text file with chess tournament results where the information has some structure. The following is the definition of game result from Google: W - win, worth 1 point L - lose, worth 0 points D - draw, worth 0.5 points B - full point bye, worth 1 point (given to the ...
1756 sym R (2444 sym/7 pcs)
CUNY 607 wk3 assignment
Overview: The assignment for this week is related to Regular Expressions. Load the Tidyverse packages. library(tidyverse) Read majors data from CSV theUrl <- "https://raw.githubusercontent.com/fivethirtyeight/data/master/college-majors/majors-list.csv" majors <- read.csv(file=theUrl, header=TRUE, sep=",", stringsAsFactors = FALSE) head(majors) ...
1485 sym R (2608 sym/14 pcs)