Publications by Peter
assignment9-607
Introduction The New York Times web site provides a rich set of APIs, as described here: https://developer.nytimes.com/apis. You’ll need to start by signing up for an API key. Your task is to choose one of the New York Times APIs, construct an interface in R to read in the JSON data, and transform it into an R DataFrame. Loading of the require...
971 sym R (16321 sym/11 pcs)
FinalProject_606
1) Introduction Hate crimes and racism are among the biggest social issues the world is suffering from. In the US there was a surge in extremist groups and hate crimes after the election of Donald Trump, the last US president. In this project I will look into the hate crimes data between 9 and 18 Nov 2016 and try to see any relationship between the...
3074 sym R (6625 sym/21 pcs) 5 img
homework9-606
Baby weights, Part I. (9.1, p. 350) The Child Health and Development Studies investigate a range of topics. One study considered all pregnancies between 1960 and 1967 among women in the Kaiser Foundation Health Plan in the San Francisco East Bay area. Here, we study the relationship between smoking and weight of the baby. The variable smoke is c...
7163 sym R (1353 sym/20 pcs) 2 img
project4-607
Introduction For this project, you can start with a spam/ham dataset, then predict the class of new documents (either withheld from the training dataset or from another source such as your own spam folder). One example corpus: https://spamassassin.apache.org/old/publiccorpus/ Loading of the required libraries library("easypackages") library("st...
930 sym R (7656 sym/62 pcs) 2 img
tidyverse_part2-607
Introduction This vignette is focused on the purrr and forcats packages from the Tidyverse, which were chosen based on the simple logic that I am the least familiar with them, so it seemed like a good opportunity to change that. Load packages Loading the two packages directly, though you could also load the whole tidyverse package library(purrr) ...
1717 sym R (23414 sym/25 pcs) 1 img
Lab8-606
The Human Freedom Index is a report that attempts to summarize the idea of “freedom” through a bunch of different variables for many countries around the globe. It serves as a rough objective measure for the relationships between the different types of freedom - whether it’s political, religious, economical or personal freedom - and other s...
10387 sym R (2484 sym/20 pcs) 6 img
assignment10-607
Introduction In Text Mining with R, Chapter 2 looks at Sentiment Analysis. In this assignment, you should start by getting the primary example code from chapter 2 working in an R Markdown document. You should provide a citation to this base code. You’re then asked to extend the code in two ways: Work with a different corpus of your choosing,...
1075 sym R (6989 sym/47 pcs) 5 img
homework8-606
Nutrition at Starbucks, Part I. (8.22, p. 326) The scatterplot below shows the relationship between the number of calories and amount of carbohydrates (in grams) Starbucks food menu items contain. Since Starbucks only lists the number of calories on the display items, we are interested in predicting the amount of carbs a menu item has based on i...
6122 sym R (267 sym/8 pcs) 9 img
project-proposal-606
library(dplyr) ## ## Attaching package: 'dplyr' ## The following objects are masked from 'package:stats': ## ## filter, lag ## The following objects are masked from 'package:base': ## ## intersect, setdiff, setequal, union library(readr) library(ggplot2) Data Preparation initial <- read_csv("https://raw.githubusercontent.com/fiv...
1026 sym R (8266 sym/17 pcs) 1 img
Lab9-606
Grading the professor Many college courses conclude by giving students the opportunity to evaluate the course and the instructor anonymously. However, the use of these student evaluations as an indicator of course quality and teaching effectiveness is often criticized because these measures may reflect the influence of non-teaching related charac...
14806 sym R (1397 sym/22 pcs) 13 img