Publications by Chun San Yip

Data624 HW2

12.02.2023

Exercises 3.1, 3.2, 3.3, 3.4, 3.5, 3.7, 3.8 and 3.9 from the Hyndman online Forecasting book. Ex 3.1 Consider the GDP information in global_economy. Plot the GDP per capita for each country over time. Which country has the highest GDP per capita? How has this changed over time? library(fpp3) gdp_per_capita<-global_economy %>% drop_na(GDP, P...

4645 sym Python (4359 sym/25 pcs) 22 img

Data624 HW1

28.01.2023

Exercises 2.1, 2.2, 2.3, 2.4, 2.5 and 2.8 from the Hyndman online Forecasting book. Ex 2.1 Use the help function to explore what the series gafa_stock, PBS, vic_elec and pelt represent. Use autoplot() to plot some of the series in these data sets. What is the time interval of each series? library(fpp3) autoplot(gafa_stock, Close) PBS%>% ...

2099 sym 11 img

Data 605 Final Exam

18.12.2022

library(tidyverse) library(cubature) library(expm) library(igraph) library(readr) library(Rfast) library(nnet) library(corrplot) library(ggrepel) library(Matrix) library(MASS) library(Metrics) library(OpenImageR) library(caret) 1.PageRank Form the A matrix. Then, introduce decay and form the B matrix as we did in the course notes. ...

5971 sym R (40475 sym/96 pcs) 42 img

Final Project for CUNY607

11.05.2020

Introduction My company is spending a lot of money on Digital Marketing and I am the main person to purchase impressions and clicks from online media. While I am using an agency to purchase for us, I would like to find out the relationship between click and demography of users. I need to build a predictor model or build a basic recommender. It wi...

1710 sym R (7759 sym/29 pcs) 5 img

CUNY 607 Week 10 Assignment - Text Mining

06.04.2020

Data Acquisition and Management Weekly Assignment - Wk10 Chun San Yip 2020/04/04 Overview: The assignment for this week is related to Text Mining. In this assignment, I start by getting the primary example code from chapter 2 of Text Mining with R working in an R Markdown document. I will then extend the code in two ways: Work with a different ...

1911 sym R (2364 sym/8 pcs) 3 img

CUNY 607 Week 9 Assignment - Web API

29.03.2020

Overview: The assignment for this week is related to API. The task is to choose one of the New York Times APIs, construct an interface in R to read in the JSON data, and transform it into an R DataFrame. The New York Times web site provides a rich set of APIs, as described here:https://developer.nytimes.com/apis I need to start by signing up for ...

434 sym R (16133 sym/3 pcs)

CUNY 607 Project 3 Chun Yip

22.03.2020

Overview: It is a group project to provide answer on the question “Which are the most valued data science skills?” Load all the required packages. library(tidyverse) I google the questions on the web and looked at 10 websites. I manually collect those data and put it in a csv file. Read the CSV file theFile <- "https://raw.githubusercontent.c...

566 sym R (1146 sym/5 pcs) 1 img

CUNY 607 Week 7 Assignment - work on HTML, XML and JSON with R

15.03.2020

Overview: The assignment for this week is working with HTML, XML and JSON in R Load all the required packages. library(tidyverse) library(RCurl) library(rvest) library(XML) library(RJSONIO) Read data from 3 manually created HTML, XML and JSON file hfile <- "https://raw.githubusercontent.com/ferrysany/CUNY607A7/master/books.html" #"XML" func...

477 sym R (3248 sym/12 pcs)

CUNY 607 Project 1

22.02.2020

Overview: The project is to create an R Markdown file that generates a .CSV from a text file with chess tournament results where the information has some structure. The following is the definition of game result from Google: W - win, worth 1 point L - lose, worth 0 points D - draw, worth 0.5 points B - full point bye, worth 1 point (given to the ...

1756 sym R (2444 sym/7 pcs)

CUNY 607 wk3 assignment

17.02.2020

Overview: The assignment for this week is related to Regular Expressions. Load the Tidyverse packages. library(tidyverse) Read majors data from CSV theUrl <- "https://raw.githubusercontent.com/fivethirtyeight/data/master/college-majors/majors-list.csv" majors <- read.csv(file=theUrl, header=TRUE, sep=",", stringsAsFactors = FALSE) head(majors) ...

1485 sym R (2608 sym/14 pcs)