Publications by Joshua Registe

Data 612 Final Project Proposal

03.07.2020

Final Project Goal The goal for the final project is for you to build out a recommender system using a large dataset (ex: 1M+ ratings or 10k+ users, 10k+ items. There are three deliverables, with separate dates: [1] Planning Document Find an interesting dataset and describe the system you plan to build out. If you would like to use one of the d...

2837 sym

Data 612 Project 4

29.06.2020

Project 4 Deliverables: 1. As in previous assignments, compare the accuracy of at least two recommender systems algorithms against your offline data. Implement support for at least one buisiness or user experience goal such as increased serendipity, novelty, or diversity. Compare and report on any change in accuracy before and after you’ve ma...

10093 sym R (10821 sym/21 pcs) 5 img

Data612 Project 5

08.07.2020

Introduction The goal of this project is to practice working with a distributed recommender system. This project adapts one of the recommendation systems previously used with spark. This markdown will discuss algorithims and implementation as well as the necessity of moving to a Spark platform. library(tidyverse) library(readr) library(sqldf) ...

2047 sym R (3445 sym/18 pcs)

Data 612 Discussion 3

13.07.2020

Research Discussion Assignment 3 As more systems and sectors are driven by predictive analytics, there is increasing awareness of the possibility and pitfalls of algorithmic discrimination. In what ways do you think Recommender Systems reinforce human bias? Recommendation systems can reinforce human biases because similarity matrices are inheren...

2917 sym

Data605 Assignment 2

07.09.2020

Problem Set 1 Show that AT A 6= AAT in general. (Proof and demonstration.) The following example will compute a Matrix A, and the transpose of that matrix and check for equality library(matlib) ## Warning: package 'matlib' was built under R version 4.0.2 A<-matrix(c(1,7,4,63,6,7,2,9,67),nrow = 3) A_T<-t(A) A_T%*%A == A%*%A_T ## [,1] [...

1484 sym R (1701 sym/20 pcs)

Data605 Assignment 8

19.10.2020

Assignment: #11 and #14 on page 303 of probability text, and #1 on page 320-321 Question 1 A company buys 100 lightbulbs, each of which has an exponential lifetime of 1000 hours. What is the expected time for the first of these bulbs to burn out? (See Exercise 10.) Solution For exponential distribution, \(E[X] = \frac{1}{\lambda}\), so \(\frac{...

2746 sym

Data605 Assignment 7

11.10.2020

1. Let \(X_1, X_2, . . . , X_n\) be \(n\) mutually independent random variables, each of which is uniformly distributed on the integers from \(1\) to \(k\). Let \(Y\) denote the minimum of the \(X_i\)’s. Find the distribution of \(Y\). Solution We first consider how many ways we can assign \(Y\) to one of the \(x_i\) variables. Since there are...

3353 sym R (1616 sym/4 pcs)

Data605 assignment 5

28.09.2020

Choose independently two numbers B and C at random from the interval [0, 1] with uniform density. Prove that B and C are proper probability distributions. Note that the point (B,C) is then chosen at random in the unit square. Find the probability that (a) B + C < 1/2 B and C are have the boundary interval [0,1]and are independent and thus: Lets ...

2903 sym

Data 621 Homework 2 (Group 3)

27.09.2020

Source code: https://github.com/djlofland/DS621_F2020_Group3/tree/master/Homework_2 Overview This assignment will present various classification metrics through creating functions in R that will carry out these calculations. These calculations will be will be compared against built-in functions from various R packages and a graphical representat...

2257 sym R (9132 sym/41 pcs) 2 img

Data605 Assignment 4

21.09.2020

Problem Set 1) In this problem, we will verify using R that SVD and Eigenvalues are related as worked out in the weekly module. Given a 3 x 3 matrix A \[ A=\begin{bmatrix} 1 & 2 & 3 \\ -1 & 0 & 4 \end{bmatrix} \] Write code in r to compute \[ X=AA^{T}\\ Y=A^{T}A \] Then compute the eigenvalues and eigenvectors of X and Y using build in comma...

2284 sym R (2682 sym/23 pcs)