Publications by Devin Teran

Data606-FinalProject

13.05.2020

Final Project DATA 606 Part 1 - Introduction Part 2 - Data Part 3 - Exploratory data analysis Import & Clean Data Variables Multiple Regression Part 4: Inference Part 5: Conclusion References Devin Teran Part 1 - Introduction We’re going to be investigating whether specific characteristics of county residents can help us predict whether or not a res...

6988 sym R (8742 sym/24 pcs) 12 img

FinalProject607

10.05.2020

Final Project Research Objective Data Sources: Necessary R Packages: Gather Data Daily Presidential White House Briefings Stock Market Data Trump Approval Ratings Cleaning Data Clean WH Briefing Data Clean Approval Ratings Data Analysis White House Briefing & Sentiment Analysis Trump approval ratings analysis Stock Market Performance Twitter ...

10256 sym R (36153 sym/89 pcs) 15 img

Document

27.04.2020

Project 4 - Spam Email Overview Libraries Needed: Get Spam Data Create spam corpus Get Non-Spam (Ham) Email Data Create ham corpus Remove Stop Words Combine the ham and spam document Spam Word Cloud Setting up test & training data Prediction Resources Devin Teran 4/25/2020 Overview During this project we’re going to use a set of spam and...

2030 sym R (4883 sym/16 pcs) 1 img

Document

19.04.2020

Introduction We will see some uses of the dplyr package by loading a data set of contestants on the Bachelorette seasons 11-15. library(dplyr) ## ## Attaching package: 'dplyr' ## The following objects are masked from 'package:stats': ## ## filter, lag ## The following objects are masked from 'package:base': ## ## intersect, setdiff,...

2288 sym R (10429 sym/22 pcs) 1 img

Data607-RecommenderSystem

15.04.2020

Netflix Recommender System Introduction Scenario Design for Netflix: Scenario Design for Customers: Reverse Engineer Site Homepage Layout Image Personalization Recommendations on Improvements: Resources: Devin Teran & Gabe Abreu Introduction Netflix is a pioneering media streaming service. It contains thousands of movies and TV series from a...

7427 sym 2 img

Assignment10-ReplicaCode

05.04.2020

Overview This code is a replica of the code found in the book, Text Mining with R, in Chapter 2: Sentiment Analysis with Tidy Data. tidy_books <- austen_books() %>% group_by(book) %>% mutate(linenumber = row_number(), chapter = cumsum(str_detect(text, regex("^chapter [\\divxlc]", igno...

416 sym R (6780 sym/43 pcs) 5 img

Assignment10_NewExample

05.04.2020

Overview This code begins by loading the book, Alice in Wonderland, from Project Gutenberg. This site has released over 60,000 free books online for the public to read. From there we will analyze the sentiment of each sentence in the book, using the package Syuzhet. This package will score each sentence with a negative number being ...

2470 sym R (4911 sym/9 pcs) 3 img 2 tbl

RColorBrewer

01.04.2020

Designing Graphs for User Accessibility What is Color Blindness? Color blindness is when people can’t distinguish colors Affects approximately 8% of men and 0.5% of women Difficulty distinguishing red & green is most common Color Brewer Online https://colorbrewer2.org RColorBrewer Package dis...

1338 sym R (662 sym/7 pcs) 7 img

Data607-NYT API

29.03.2020

Data607-Assignment9 Overview Connect to API View Raw Data Cleaner Data Devin Teran 3/28/2020 Overview The goal for this assignment is to connect to the New York Times Movie Review API and retrieve JSON data. Once connected, we want to put this data into a dataframe and view the dataframe. Necessary libraries: library(jsonlite) library(st...

767 sym R (1945 sym/10 pcs) 4 tbl

Tidyverse_Create

29.03.2020

stringr: a package used to manipulate strings Ordering Strings Combining Strings Replacing Strings Get the Length of a String Devin Teran 2020-03-28 Getting started First we need to load these packages: tidyverse stringr dplyr - used for subsetting data in our analysis rmdformats - used for styling the HTML document We’re going to load ...

2934 sym R (2486 sym/19 pcs)