Publications by Esteban Aramayo
Week 3 Assignment - R Character Manipulation and Date Processing
1. Overview
Please deliver links to an R Markdown file (in GitHub and rpubs.com) with solutions to the problems below. You may work in a small group, but please submit separately with names of all group participants in your submission.
2. Filtering data using Regular Expressions (regex)
Using the 173 majors listed in fivethirtyeight.com’s Coll...
2011 sym R (4437 sym/14 pcs) 10 tbl
Project 01 - Data Analysis - DATA607
Overview
In this project, we are given a text file with chess tournament results where the information has some structure. Our job is to create an R Markdown file that generates a .CSV file (that could, for example, be imported into a SQL database) with the following information for all of the players: Player’s Name, Player’s State, Total Num...
1783 sym R (6035 sym/14 pcs) 3 tbl
Week 09 Assignment - Web APIs
Assignment Description
The New York Times web site provides a rich set of APIs, as described here: https://developer.nytimes.com/apis
You’ll need to start by signing up for an API key. Your task is to choose one of the New York Times APIs, construct an interface in R to read in the JSON data, and transform it into an R DataFrame.
NY Times API ...
4361 sym R (3124 sym/8 pcs) 1 img
Week 07 Assignment - Working with HTML, XML, and JSON
# Packages to be used
library(tidyverse)
library(rvest) # used to manipulate HTML
#> Warning: package 'rvest' was built under R version 4.0.4
library(XML)
#> Warning: package 'XML' was built under R version 4.0.4
library(xml2)
library(jsonlite)
Problem Overview
Pick three of your favorite books on one of your favorite subjects. At leas...
2138 sym R (672 sym/10 pcs) 3 tbl
Week Five Assignment - Working with Tidy Data
1. Overview
Create a .CSV file (or optionally, a MySQL database!) that includes all of the information above. Use a “wide” structure similar to how the information appears below, so that tidying and transformations can be used on the data.
# Libraries to be used
library(tidyverse)
2. Read untidy data
2.1 Read the arrival data
Read the arr...
1433 sym R (1376 sym/8 pcs) 3 img 4 tbl
Project 2 - Data Transformation 1 of 3
Step 1: Problem Statement
In this project, the goal is to practice data transformation on untidy data. For this purpose, I chose a classmate’s untidy dataset from our discussion board. Using that dataset, the following items will be addressed: Create a .CSV or .XLSX file (or optionally, a MySQL database!) that includes all of the inform...
3758 sym R (5274 sym/7 pcs) 2 img 4 tbl
Week 10 Assignment - Text Mining
Assignment Description In Text Mining with R, Chapter 2 looks at Sentiment Analysis. In this assignment, you should start by getting the primary example code from chapter 2 working in an R Markdown document. You should provide a citation to this base code. You’re then asked to extend the code in two ways: Work with a different corpus of your c...
3004 sym R (4453 sym/15 pcs) 2 img
Week 11 Assignment - Recommender Systems
Recommender Systems - Analyzing Wayfair’s Visual Complements Model recommender system
I have been looking for furniture for my new living room, but I was not sure what kind of furniture to get. I needed some ideas and/or advice from someone. I looked at multiple websites, where I found an overwhelming variety of styles, sizes, and prices that make ...
9462 sym 2 img
Project 4 - Document Classification
library(tidyverse)
library(tidymodels)
library(parsnip)
Project Overview
It can be useful to be able to classify new “test” documents using already classified “training” documents. A common example is using a corpus of labeled spam and ham (non-spam) e-mails to predict whether or not a new document is spam. For this project, we will ta...
3187 sym R (7387 sym/33 pcs)
DATA605 - Homework 06
library(gtools)
Exercise 1
A bag contains 5 green and 7 red jellybeans. How many ways can 5 jellybeans be withdrawn from the bag so that the number of green ones withdrawn will be less than 2?
green_beans <- 5
red_beans <- 7
total_beans <- green_beans + red_beans
zero_green <- nrow(combinations(n = red_beans, r = 5, v = 1:red_beans))
on...
10036 sym R (781 sym/4 pcs)
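As a cross-check on Exercise 1 of DATA605 - Homework 06, the count can be worked by hand. This is a sketch of the standard counting argument and not necessarily the author's full solution, which is truncated in the listing: "fewer than 2 green" means either 0 green or exactly 1 green, and the two cases are disjoint. The symbol N for the total is introduced here for illustration.

```latex
\begin{align*}
N &= \underbrace{\binom{5}{0}\binom{7}{5}}_{\text{0 green, 5 red}}
   + \underbrace{\binom{5}{1}\binom{7}{4}}_{\text{1 green, 4 red}} \\
  &= 1 \cdot 21 + 5 \cdot 35 \\
  &= 196
\end{align*}
```

The first term matches the zero_green quantity in the excerpt, which counts 5-element combinations drawn from the 7 red jellybeans.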