Publications by Christian Uriostegui
Data 607 Assignment Week 3
data <- 'https://raw.githubusercontent.com/curiostegui/CUNY-SPS/main/Data%20607/Assignment%20Week%203/majors-list.csv' majors <- read.csv(file = data, header = TRUE, sep = ",") library(stringr) Exercise 1 Using the 173 majors listed in fivethirtyeight.com’s College Majors dataset, provide code that identifies the majors that contain either ...
1616 sym R (3148 sym/13 pcs)
Project 1 Data 607
Introduction Using a data set that contains the results a chess tournament, our job is to create an R Markdown file that generates a .CSV files with the following information for all players: Player’s Name, Player’s State, Total Number of Points, Player’s Pre-Rating, and Average Pre Chess Rating of Opponents The following is an example o...
1769 sym R (4113 sym/18 pcs) 1 tbl
Lab 3 Class 606
Load Packages library(tidyverse) ## -- Attaching packages --------------------------------------- tidyverse 1.3.2 -- ## v ggplot2 3.3.6 v purrr 0.3.4 ## v tibble 3.1.6 v dplyr 1.0.9 ## v tidyr 1.2.0 v stringr 1.4.0 ## v readr 2.1.2 v forcats 0.5.1 ## -- Conflicts ------------------------------------------ tidyverse_c...
3822 sym R (2840 sym/22 pcs) 2 img
Lab 4 Class 606
Load Packages library(tidyverse) ## -- Attaching packages --------------------------------------- tidyverse 1.3.2 -- ## v ggplot2 3.3.6 v purrr 0.3.4 ## v tibble 3.1.6 v dplyr 1.0.9 ## v tidyr 1.2.0 v stringr 1.4.0 ## v readr 2.1.2 v forcats 0.5.1 ## -- Conflicts ------------------------------------------ tidyverse_c...
4264 sym R (3859 sym/39 pcs) 12 img
607 Lab
Introduction Using the csv data of arrival times in the Alaska and AM West airlines, I will transform from it into a tidy table and perform analysis. Download Library library(tidyr) library(dplyr) ## ## Attaching package: 'dplyr' ## The following objects are masked from 'package:stats': ## ## filter, lag ## The following objects are ...
1162 sym R (1414 sym/14 pcs)
Project 2 Lab 607
Task Using three datasets identified in the discussion board items or your own dataset, create a .CSV file that includes all of the information in the dataset; read the information from your CSV file into R and use tidyr and dplyr as needed to to tidy and transform data; perform analysis, and present conclusions. Introduction Anthony Fontano...
3460 sym R (10224 sym/32 pcs)
Web APIs Data 607
Assignment The New York Times web site provides a rich set of APIs, as described here: https://developer.nytimes.com/apis You’ll need to start by signing up for an API key. Your task is to choose one of the New York Times APIs, construct an interface in R to read in the JSON data, and transform it into an R DataFrame Load library library(ht...
1071 sym R (5270 sym/18 pcs)
Data 607 Sentiment Analysis
Assignment Use the primary example code from Chapter 2 of “Text Mining with R” from Julia Silge & David Robinson and extended the code in two ways: Work with a different corpus of your choosing Incorporate at least one additional sentiment lexicon Text Mining with R Example code taken from https://www.tidytextmining.com/sentiment.html “...
3269 sym R (6950 sym/47 pcs) 3 img
Project 4 DATA 607
Introduction In my document, I will be detailing the creation and utilization of a Naive Bayes classifier to identify spam and ham emails. The spam and ham datasets utilized can be downloaded from https://spamassassin.apache.org/old/publiccorpus/. The two files I downloaded are easy_ham (which contains 2,551 email files) and spam (which contai...
4152 sym R (6072 sym/29 pcs) 2 img