Publications by CHUNJIE NAN

Assignment3_DATA607_Chunjie_Nan

12.09.2021

1. Using the 173 majors listed in fivethirtyeight.com’s College Majors dataset [https://fivethirtyeight.com/features/the-economic-guide-to-picking-a-college-major/], provide code that identifies the majors that contain either “DATA” or “STATISTICS”. library(stringr) url<-"https://raw.githubusercontent.com/fivethirtyeight/data/master/coll...

1677 sym R (2690 sym/17 pcs)
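
The Assignment 3 task above (finding majors whose names contain either “DATA” or “STATISTICS”) can be sketched roughly as follows. This is a minimal illustration, not the author's code: the `majors` vector is a small made-up sample rather than the 173-row dataset, and base R `grep()` is used so the sketch needs no packages (the excerpt loads `stringr`, whose `str_detect()` would serve the same purpose).

```r
# Hypothetical sample of major names; the real task reads all 173 majors from a CSV.
majors <- c("GENERAL AGRICULTURE",
            "COMPUTER PROGRAMMING AND DATA PROCESSING",
            "STATISTICS AND DECISION SCIENCE",
            "PHILOSOPHY AND RELIGIOUS STUDIES")

# The alternation "DATA|STATISTICS" matches names containing either keyword;
# value = TRUE returns the matching names instead of their positions.
matches <- grep("DATA|STATISTICS", majors, value = TRUE)
print(matches)
```

The pattern is case-sensitive, which suits upper-case major names; `ignore.case = TRUE` could be added if the casing were mixed.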

Assignment2_DATA607_Chunjie_Nan

06.09.2021

I asked five friends to take a simple survey rating recent movies they had seen. Six movies were picked, and each friend rated them according to how satisfied they were. The table was created with MySQL and exported to GitHub as a CSV file. The SQL code on GitHub is here: https://raw.githubusercontent.com/nancunjie...

1289 sym R (3709 sym/22 pcs) 1 img

Assignment1_Data607_Chunjie_Nan

29.08.2021

The author of “How Americans Like Their Steak” asks whether risk-taking behavior is associated with a preference for rare steak. Along with other variables, he found that risk-taking behavior has no statistically significant association with steak rareness. https://fivethirtyeight.com/features/how-americans-like-their-steak/ Import data as c...

1338 sym R (4440 sym/24 pcs) 2 img

Project1_DATA607_Chunjie_Nan

18.09.2021

In this project, you’re given a text file with chess tournament results where the information has some structure. Your job is to create an R Markdown file that generates a .CSV file (that could for example be imported into a SQL database) with the following information for all of the players: Player’s Name, Player’s State, Total Number of Po...

2035 sym R (8227 sym/16 pcs) 1 img

Assignment5_DATA607_Chunjie_Nan

26.09.2021

Import data library(dplyr) library(tidyr) vaccine<-read.csv("https://raw.githubusercontent.com/nancunjie4560/Data607/master/israe...

2516 sym R (4179 sym/32 pcs) 1 img

Project2_DATA607_Chunjie_Nan

03.10.2021

Task Choose any three of the “wide” datasets identified in the Week 6 Discussion items. (You may use your own dataset; please don’t use my Sample Post dataset, since that was used in your Week 6 assignment!) For each of the three chosen datasets: Create a .CSV file (or optionally, a MySQL database!) that includes all of the information incl...

3706 sym R (9769 sym/38 pcs) 2 img
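
Project 2 above works with “wide” datasets. One common step with such data is reshaping it to long form, sketched below under stated assumptions: the `wide` data frame, its column names, and its values are invented for illustration, and base R `reshape()` is used so the example needs no packages (`tidyr::pivot_longer()` is a frequently used alternative). It is not taken from the author's project.

```r
# Hypothetical wide table: one row per city, one column per year.
wide <- data.frame(
  city  = c("A", "B"),
  y2019 = c(10, 20),
  y2020 = c(11, 21),
  stringsAsFactors = FALSE
)

# Stack the year columns into a single "value" column,
# recording which year each value came from in "year".
long <- reshape(
  wide,
  direction = "long",
  varying   = c("y2019", "y2020"),
  v.names   = "value",
  timevar   = "year",
  times     = c(2019, 2020),
  idvar     = "city"
)
rownames(long) <- NULL  # drop the composite row names reshape() generates
print(long)
```

The long table has one row per city-year pair, which is usually easier to filter, group, and summarize than the wide layout.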

Assignment7_DATA607_Chunjie_Nan

08.10.2021

Pick three of your favorite books on one of your favorite subjects. At least one of the books should have more than one author. For each book, include the title, authors, and two or three other attributes that you find interesting. Take the information that you’ve selected about these three books, and separately create three files which store t...

1247 sym R (2258 sym/13 pcs)

DATA607_Project4_Chunjie_Nan

14.11.2021

0.1 Task It can be useful to be able to classify new “test” documents using already classified “training” documents. A common example is using a corpus of labeled spam and ham (non-spam) e-mails to predict whether or not a new document is spam. For this project, you can start with a spam/ham dataset, then predict the class of new document...

8449 sym R (4653 sym/20 pcs) 2 img
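
The spam/ham task in Project 4 above can be illustrated with a toy word-count Naive Bayes classifier. This is only a sketch under stated assumptions, not the method used in the project: the four training messages are invented, and `tokenize()`, `word_logprob()`, and `classify()` are hypothetical helper names written in base R.

```r
# Invented training data: two spam and two ham messages.
train <- data.frame(
  text  = c("win cash prize now", "cheap prize offer now",
            "meeting agenda for monday", "lunch on monday"),
  label = c("spam", "spam", "ham", "ham"),
  stringsAsFactors = FALSE
)

# Lower-case a message and split it into alphabetic tokens.
tokenize <- function(x) {
  words <- strsplit(tolower(x), "[^a-z]+")[[1]]
  words[nzchar(words)]
}

vocab   <- unique(unlist(lapply(train$text, tokenize)))
classes <- c("spam", "ham")

# Per-class log-probability of each vocabulary word, with add-one smoothing.
word_logprob <- function(cls) {
  words  <- unlist(lapply(train$text[train$label == cls], tokenize))
  counts <- table(factor(words, levels = vocab))
  counts <- setNames(as.numeric(counts), names(counts))
  log((counts + 1) / (sum(counts) + length(vocab)))
}
logp  <- setNames(lapply(classes, word_logprob), classes)
prior <- sapply(classes, function(cls) log(mean(train$label == cls)))

# Score a new document under each class and return the higher-scoring label.
classify <- function(doc) {
  words  <- tokenize(doc)
  words  <- words[words %in% vocab]  # words unseen in training are ignored
  scores <- sapply(classes, function(cls) prior[[cls]] + sum(logp[[cls]][words]))
  names(which.max(scores))
}

print(classify("win a prize now"))
print(classify("agenda for monday lunch"))
```

A real corpus would call for a held-out test set and a proper document-term matrix; the sketch only shows the scoring idea.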

Degrees That Pay You Back

11.01.2022

1. Which college majors will pay the bills? Wondering if that Philosophy major will really help you pay the bills? Think you’re set with an Engineering degree? Choosing a college major is a complex decision evaluating personal interest, difficulty, and career prospects. Your first paycheck right out of college might say a lot about your salary ...

9075 sym R (10228 sym/26 pcs) 7 img

Text Mining America's Toughest Game Show

17.01.2022

1. This… is… Jeopardy! “This… is… Jeopardy!” These words will ring a bell for anyone who has watched the American game show Jeopardy! This iconic TV show could be described as quizbowl with gambling. In each 30-minute episode, three contestants compete in answering questions with specific monetary value, accumulating and wagering t...

6336 sym R (6930 sym/24 pcs) 3 img