Publications by Jordan Glendrange
data606projectproposal
## ── Attaching packages ─────────────────────────────────────── tidyverse 1.3.0 ── ## ✓ ggplot2 3.3.3 ✓ purrr 0.3.4 ## ✓ tibble 3.0.6 ✓ dplyr 1.0.4 ## ✓ tidyr 1.1.2 ✓ stringr 1.4.0 ## ✓ readr 1.4.0 ✓ forcats 0.5.1 ## ─...
2112 sym R (1517 sym/10 pcs) 1 img
Data 607 Final Project
Introduction For my final project I decided to create a webscraper that will go through some pages on Yelp in 4 difference cities: Los Angeles, New York City, Riverside, and San Francisco. I understand that yelp has a powerful API, and it would probably work for a better analysis, but I used an API for my data 606 project and I wanted to mix it u...
2597 sym R (3807 sym/13 pcs) 4 img
Data 606 Lab 9
library(tidyverse) library(openintro) library(GGally) Exercise 1 This would be an observational study because there is no control group. Since there isn’t a control group to compare against we can only know if the two variables are correlated. I would rephrase the question to be: “Is beauty correlated to differences in course evaluations?”...
10299 sym R (11475 sym/34 pcs) 13 img
Data 607 Project 4
library(tidyverse) library(tm) library(purrr) library(randomForest) library(caTools) Reading email data Here I am reading the email data directly from 2 separate folders. 1 for spam data and the other for my easy ham data. I ran quite a few functions on each set to get the data formatted correctly. Notice I am encoding each file to ‘latin1’....
1431 sym R (2812 sym/9 pcs)
Data 606 Lab 8
library(tidyverse) library(openintro) data('hfi', package='openintro') Exercise 1 The dimensions of the dataset are 1,458 rows by 123 columns. dim(hfi) ## [1] 1458 123 Exercise 2 The appropriate plot would be a scatter plot. The relationship looks linear, so I would be comfortable using a linear regression here. ggplot(hfi, aes(x=pf_score, y=p...
5650 sym R (8377 sym/30 pcs) 8 img
Stitch Fix Recommender System
Stitch Fix Overview Stitch fix is a subscription company that sends clothing based on user preferences at some cadence. Their recommender system is pivotal to their business model, which we will be reviewing. Reverse Engineer The recommender system starts with a survey the user takes when registering their account. Since the product is sending c...
2098 sym
Data 606 Project
## ── Attaching packages ─────────────────────────────────────── tidyverse 1.3.0 ── ## ✓ ggplot2 3.3.3 ✓ purrr 0.3.4 ## ✓ tibble 3.0.6 ✓ dplyr 1.0.4 ## ✓ tidyr 1.1.2 ✓ stringr 1.4.0 ## ✓ readr 1.4.0 ✓ forcats 0.5.1 ## ─...
7350 sym R (5432 sym/30 pcs) 11 img 2 tbl
Data 605 Final
Problem 1: Playing With PageRank Form the A matrix. Then, introduce decay and form the B matrix as we did in the course notes. First let’s form A: A <- matrix(c(0, 1/2, 1/2, 0, 0, 0, 0, 0, 0, 0, 0, 0, 1/3, 1/3, 0, 0, 1/3, 0, 0, 0, 0, 0, 1/2, 1/2, 0, 0, 0, 1/2, 0, 1/2, 0, 0, 0...
5053 sym R (31056 sym/120 pcs) 42 img
Data 605 Discussion 15
Chapter 12.1 Question 13 Give the domain and range of the multi-variable function \[f(x,y) = \sqrt{9-x^2 - y^2}\] Since we are taking the square root we need the result to be ge to 0 \[0\le 9-x^2-y^2\] \[x^2+y^2\le 9\] \[D = \{(x,y)|x^2+y^2\le9\}\] So the max output is 3 (when x and y = 0) \[R: [0,3]\] ...
311 sym
Data 605 Homework 11
Regression Model library(tidyverse) ## ── Attaching packages ─────────────────────────────────────── tidyverse 1.3.1 ── ## ✓ ggplot2 3.3.5 ✓ purrr 0.3.4 ## ✓ tibble 3.1.4 ✓ dplyr 1.0.7 ## ✓ tidyr 1.1.3 ✓ stringr 1.4.0 ## ✓ readr ...
767 sym R (1635 sym/15 pcs) 6 img