Publications by David Moste
607 Final
Introduction For this project, I wanted to take on a task at the school where I work that has been of great interest. The reason I’m in this program (besides finding the material fun and interesting) is so that I can use some of these skills to help my school improve. To that end, I wanted to build a simple predictor that would be able to predi...
5476 sym R (9218 sym/39 pcs) 6 img
Tidyverse EXTEND
Introduction We will see some uses of the dplyr package by loading a data set of contestants on the Bachelorette season’s 11-15. library(dplyr) ## ## Attaching package: 'dplyr' ## The following objects are masked from 'package:stats': ## ## filter, lag ## The following objects are masked from 'package:base': ## ## intersect, set...
1415 sym R (28078 sym/30 pcs)
Problem 8.33
Problem 8.33David Moste04.22.20 The Problem The scatterplot below summarizes husbands' and wives' heights in a random sample of 170 married couples in Britain, where both partners' ages are below 65 years. Summary output of the least squares fit for predicting wife's height from husband's height is also provided in the table. Part A ( a ) I...
2195 sym 1 img
Sentiment Analysis
Introduction For this assignment, I am tasked with getting an example from Text Mining with R running and then extending the example to a new corpus and a neew sentiment lexicon. Sections 1-6 are directly from Text Mining with R1. Section 7 is my extension. 1. Sentiments Dataset library(tidytext) ## Warning: package 'tidytext' was built under R ...
2663 sym R (14369 sym/77 pcs) 15 img
Web APIs
Introduction In this assignment, I was asked to connect with the NYT API and import data into an R dataframe. I decided to import data from three different NYT API sections: their live feed, most viewed, and most shared (via facebook). Code After signing up for a NYT API key, the first step was to loas libraries and download each API. library(js...
1539 sym R (3375 sym/11 pcs) 3 img
Coronavirus
Introduction I used the provided coronavirus dataset for this tidying. Code I first loaded the required libraries and the dataset. # load required libraries library(tidyverse) ## -- Attaching packages -------------------------------------------- tidyverse 1.3.0 -- ## v ggplot2 3.2.1 v purrr 0.3.3 ## v tibble 2.1.3 v dplyr 0.8.3 #...
1344 sym R (4886 sym/24 pcs) 11 img
MBTA
Introduction I used the MBTA dataset for this cleaning and tidying project. Code I started by loading in the libraries and dataset. # load required libraries library(tidyverse) ## -- Attaching packages -------------------------------------------- tidyverse 1.3.0 -- ## v ggplot2 3.2.1 v purrr 0.3.3 ## v tibble 2.1.3 v dplyr 0.8.3 ...
838 sym R (2921 sym/14 pcs) 4 img
Tidying and Transforming Data
Introduction I was tasked with creating a database to hold some untidy airline data, then import that data into R, tidy the data, and analysize the performance of the airlines. The two airlines were Alaska and AM West and I was simply comparing on time/delayed data for several destinations. Code First, I imported the data from a MySQL database u...
2489 sym R (3230 sym/13 pcs) 5 img
Project 1
Introduction For this project, I was asked to import a text file containing data from a chess tournament into R ands output a csv file with columns for the player names, states, total points, pre tournament ranking, and average ranking of their opponents. Code I started by importing the text file and breaking up the data into two data frames bas...
826 sym R (2213 sym/9 pcs) 1 img
R Character Manipulation
Question 1 Start by reading the csv into a data frame. Then use str_view to see all the majors that contain either DATA or STATISTICS. library(tidyverse) ## -- Attaching packages ------------------------------------------------ tidyverse 1.3.0 -- ## v ggplot2 3.2.1 v purrr 0.3.3 ## v tibble 2.1.3 v dplyr 0.8.3 ## v tidyr 1.0.0 ...
1250 sym R (1281 sym/18 pcs)