Publications by CHUNJIE NAN
DATA 624 HW2
Do exercises 3.1, 3.2, 3.3 and 3.8 from the online Hyndman book. Please include your Rpubs link along with your rmd file. 3.1 For the following series, find an appropriate Box-Cox transformation in order to stabilise the variance. usnetelec usgdp mcopper enplanements library(fpp2) ## Loading required package: ggplot2 ## Loading required package:...
2392 sym R (2889 sym/43 pcs) 14 img
DATA 624 HW4
3.1 The UC Irvine Machine Learning Repository6 contains a data set related to glass identification. The data consist of 214 glass samples labeled as one of seven class categories. There are nine predictors, including the refractive index and percentages of eight elements: Na, Mg, Al, Si, K, Ca, Ba, and Fe. The data can be accessed via: library(ml...
2596 sym R (10724 sym/50 pcs) 11 img
DATA624HW1
2.1 Use the help function to explore what the series gold, woolyrnq and gas represent. library(forecast) ## Registered S3 method overwritten by 'quantmod': ## method from ## as.zoo.data.frame zoo library(ggplot2) help(gold) # Daily morning gold prices in US dollars from Jan 1st 1985 to Mar 31st 1989. help(woolyrnq) # Quarterly ...
4774 sym R (4073 sym/54 pcs) 14 img
Reducing Traffic Mortality in the USA
1. The raw data files and their format While the rate of fatal road accidents has been decreasing steadily since the 80’s, the past ten years have seen a stagnation in this reduction. Coupled with the increase in number of miles driven in the nation, the total number of traffic related-fatalities has now reached a ten year high and is rapidly i...
9276 sym R (14060 sym/50 pcs) 7 img
Dr. Semmelweis and the Discovery of Handwashing
1. Meet Dr. Ignaz Semmelweis This is Dr. Ignaz Semmelweis, a Hungarian physician born in 1818 and active at the Vienna General Hospital. If Dr. Semmelweis looks troubled it’s probably because he’s thinking about childbed fever: A deadly disease affecting women that just have given birth. He is thinking about it because in the early 1840s a...
4522 sym R (5523 sym/24 pcs) 3 img
Final_Project_DATA607_Chunjie_Nan
0.1 Reasearch Question: Is increase in life expectancy associated with higher economic growth? The week 2 blackboard discussion triggered my interest in the relationships between economy and life expectancy, as I have mentioned on the discussion board, I believe the strength of a country’s economy is the proxy measure of life expectancy. In thi...
12233 sym R (5093 sym/46 pcs) 1 img
Discussion: Tinder Recommender System
Discussion - Tinder Recommender Systems Overview Your task is to analyze an existing recommender system that you find interesting. You should: Perform a Scenario Design analysis as described below. Consider whether it makes sense for your selected recommender system to perform scenario design twice, once for the organization (e.g. Amazon.com) a...
5343 sym
Assignment10_DATA607_Chunjie_Nan
Rererence# Silge, J., & Robinson, D. (2017). Text mining with R: A tidy approach. O’Reilly Media. 1. Textbook Code 2.2 Sentiment analysis with inner join get_sentiments("afinn") # value from -5 to 5 ## # A tibble: 2,477 × 2 ## word value ## <chr> <dbl> ## 1 abandon -2 ## 2 abandoned -2 ## 3 abandons -2 ...
924 sym R (12326 sym/67 pcs) 7 img
Assignment 9_DATA607_Chunjie Nan
The New York Times web site provides a rich set of APIs, as described here: https://developer.nytimes.com/apis You’ll need to start by signing up for an API key. Your task is to choose one of the New York Times APIs, construct an interface in R to read in the JSON data, and transform it into an R DataFrame. library(httr) library(jsonlite) From ...
882 sym R (15624 sym/3 pcs)
Project3_DATA607_Chunjie_Nan
Background What are the most valued data science skills? The purpose of the project was to effectively collaborate on acquiring appropriate datasets then tidying and transforming to analyze and visualize the dataset in effort to answer the questions. Data Source: “Data Scientist Jobs” from Kaggle (url: https://www.kaggle.com/andrewmvd/data-s...
3543 sym R (5320 sym/20 pcs) 3 img