Publications by Joshua Registe
DATA606ProjectProposal
Research Question In the project proposal i intend to use the dataset from fivethirtyeight called “hate_crimes”. The dataset is described below. The research question i would like to answer is are their any significant relationships between hatecrimes in the US to other parameters in the dataset such as unemployment, median household income,...
1324 sym R (7523 sym/16 pcs)
Data 608 Final Project Proposal
1 Proposal As part of this final project, I will be looking at emission patterns nationally over the past decade looking at potential major contributors to the cause of global warming. Engineers and scientists often get posed questions along the lines of infrastructure resiliency and protection against the effects of storms, rising sea level, ai...
1229 sym 1 img
Data 606 Final Project
Part 1 - Introduction Research Question In the project proposal I intend to use the dataset from fivethirtyeight called “hate_crimes”. The dataset is described below. The research question i would like to answer is are their any significant relationships between hatecrimes in the US to other parameters in the dataset such as unemployment, me...
11481 sym R (17230 sym/29 pcs) 21 img
Data612 Project 3
library(tidyverse) library(readr) library(sqldf) library(dplyr) library(tidyr) library(tinytex) library(recommenderlab) library(kableExtra) library(gridExtra) Project 3 - Dimensionality Reduction Question The task is to implement a matrix factorization method - such as singular value decomposition (SVD) or Alternating Least Squares (ALS...
5574 sym R (7410 sym/15 pcs) 3 img 2 tbl
Data612 Discussion 2
Discussion 2 Summary of "Music Recommendations at Scale with Spark - Christopher Johnson (Spotify) Spotify is a Swedish music streatming and media services provider and a lot of their systems revolve around recommender systems to keep users engaged. A Youtube presentation from Chris Johnson Machine Learning Guy discusses how they do recommenda...
4289 sym
Data612 Discussion 1
IMDB Recommender Describe how you think it works (content-based, collaborative filtering, etc). Does the technique deliver a good experience or are the recommendations off-target? The IMDB rating system works by allowing users to cast votes on individual titles (movies or TV shows) that are aggregated into a rating. It is not the only tool that...
2683 sym 1 img
Data612 Project2
Project 2 Dataset and Setup The following assignment dives into the use of recommender systems via the r package recommenderlab. the vignette for this library can be found here. The dataset that will be used will be pulled directly from the recommender systems package and this project will expand upon examples produced in Chapter 3 and Chapter ...
5004 sym R (10322 sym/23 pcs) 7 img 1 tbl
Data605 Assignment 1
#Problem 1 You can think of vectors representing many dimensions of related information. For instance, Netflix might store all the ratings a user gives to movies in a vector. This is clearly a vector of very large dimensions (in the millions) and very sparse as the user might have rated only a few movies. Similarly, Amazon might store the items p...
2110 sym R (1134 sym/12 pcs)
Data612 Final Project
library(tidyverse) library(readr) library(sqldf) library(dplyr) library(tidyr) library(tinytex) library(recommenderlab) library(kableExtra) library(gridExtra) Final Project Goal The goal for the final project is for you to build out a recommender system using a large dataset (ex: 1M+ ratings or 10k+ users, 10k+ items. There are three de...
7143 sym R (10817 sym/19 pcs) 5 img 2 tbl
Data612 Discussion 4
Research Assignment 4 - How to counter the radicalizing effects of recommender systems or ways to prevent algorithmic discrimination. The article by Zeynep Tufekci from the New York Times suggest that youtube is a immense radicalization effects on society today. This is notable when going through the videos that the popular video streaming web...
1723 sym