Publications by Matthew Lucich

Assignment Week 9 - Web APIs

11.04.2021

library(tidyverse) library(jsonlite) Overview The objective is to create an interface with one of the NY Times APIs, read in the JSON response, and convert it into a data frame. I chose to call the Archive API, specifically querying articles from April 1921, then filtering the results further for articles from April 10th 1921. Finally, to make t...

1127 sym R (9391 sym/7 pcs)
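
The JSON-to-data-frame step described in this entry can be sketched roughly as follows. This is a minimal illustration, not the author's code: it parses a small inline JSON string instead of calling the API, the two records are placeholders, and the field names (`response`, `docs`, `pub_date`, `headline.main`, `web_url`) are assumed to mirror the Archive API response shape.

```r
library(jsonlite)

# Placeholder JSON standing in for an Archive API response (assumed shape,
# not real NY Times data).
sample_json <- '{
  "response": {
    "docs": [
      {"web_url": "https://example.com/a",
       "pub_date": "1921-04-09T05:00:00+0000",
       "headline": {"main": "Placeholder headline A"}},
      {"web_url": "https://example.com/b",
       "pub_date": "1921-04-10T05:00:00+0000",
       "headline": {"main": "Placeholder headline B"}}
    ]
  }
}'

# Parse the JSON, pull out the articles, and flatten nested columns
# so that headline$main becomes a plain column named headline.main.
docs <- fromJSON(sample_json)$response$docs
docs <- jsonlite::flatten(docs)

# Keep only the articles published on April 10th 1921.
pub_day <- as.Date(substr(docs$pub_date, 1, 10))
april_10 <- docs[pub_day == as.Date("1921-04-10"), , drop = FALSE]

print(april_10[, c("pub_date", "headline.main")])
```

With a real request, `sample_json` would be replaced by the API URL including an API key; the rest of the steps would stay the same under the assumptions above.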

GoodReads.com Recommender System

21.04.2021

Introduction It may seem counterintuitive to have a recommender system that intentionally does not recommend the items users will likely want most. Nevertheless, in the right circumstances I believe this can be beneficial not only to the users but to the organization as well. The proliferation of echo chambers in recent years is often attributed t...

3341 sym 1 img

Assignment 10 – Sentiment Analysis

19.04.2021

library(tidyverse) library(ggplot2) library(tidytext) library(textdata) library(janeaustenr) library(gutenbergr) library(SentimentAnalysis) Overview The objective was to re-create the code supplied in chapter 2 of Text Mining with R, then to extend the exercise with a new corpus and lexicon. Recreating the analysis in chapter 2 came with no issu...

2882 sym R (12843 sym/31 pcs) 7 img
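
The lexicon-based approach this entry refers to can be illustrated with a small base R sketch. It is not the tidytext code from Text Mining with R: the sentence and the three-word lexicon are made up for illustration, and the helper name `net_sentiment` is hypothetical.

```r
# Toy lexicon (an assumption for illustration, not bing, AFINN, or nrc).
lexicon <- data.frame(
  word = c("happy", "good", "terrible"),
  sentiment = c("positive", "positive", "negative"),
  stringsAsFactors = FALSE
)

# Hypothetical helper: tokenize text, look each token up in the lexicon,
# and return positive matches minus negative matches.
net_sentiment <- function(text, lexicon) {
  tokens <- strsplit(tolower(text), "[^a-z]+")[[1]]
  tokens <- tokens[tokens != ""]
  matched <- lexicon$sentiment[match(tokens, lexicon$word)]
  sum(matched == "positive", na.rm = TRUE) -
    sum(matched == "negative", na.rm = TRUE)
}

text <- "The happy crew found a good harbor but feared the terrible storm"
score <- net_sentiment(text, lexicon)
print(score)
```

Words missing from the lexicon, such as "feared" here, contribute nothing to the score, which is one reason results can shift noticeably when the lexicon changes.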

Data 605 HW1

13.01.2022

Build the first letters for both your first and last name using point plots in R. # ML x <- c(rep(-1.5,500), seq(-0.75,-1.5,length.out=500), seq(-0.75,0,length.out=500), rep(1,500), seq(2,1,length.out=500), rep(0,500)) y <- c(seq(-2,1,length.out=500), seq(-2,1,length.out=500), seq(-2,1,length...

636 sym R (1717 sym/5 pcs) 5 img
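
As a rough illustration of how a letter can be built from point sequences, the sketch below draws only an "L" from two strokes. The coordinates and the point count `n` are assumptions chosen for this example and are not taken from the truncated code above.

```r
n <- 200  # points per stroke (arbitrary choice)

# Vertical stroke: x fixed at 0, y running from 1 down to -2.
x_vert <- rep(0, n)
y_vert <- seq(1, -2, length.out = n)

# Horizontal stroke: y fixed at -2, x running from 0 to 1.
x_base <- seq(0, 1, length.out = n)
y_base <- rep(-2, n)

# Combine both strokes into one set of points and plot them.
x <- c(x_vert, x_base)
y <- c(y_vert, y_base)
plot(x, y, xlim = c(-3, 3), ylim = c(-3, 3), pch = 20, asp = 1)
```

Each stroke pairs a constant coordinate (`rep`) with a varying one (`seq`), so the two vectors always have the same length.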

Data 621 - Final

27.11.2021

Data Cleaning df_wb <- read_csv("world-bank.csv") #glimpse(df_wb) # Filter for Brazil, Russia, India, China, and South Africa df_brics <- df_wb %>% filter(grepl("BRA|CHN|IND|RUS|ZAF", `Country Code`)) variables_to_include <- "NY.GDP.PCAP.KD.ZG|FS.AST.CGOV.GD.ZS|FP.CPI.TOTL|NY.GDP.MKTP.KD.ZG|MS.MIL.XPND.GD.ZS|SE.PRM.AGES|DT.DOD.DSTC.ZS|SP.POP....

2645 sym R (104329 sym/240 pcs) 34 img
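
The country filter in this entry can be sketched in base R as shown below. The data frame is a toy stand-in containing only country codes (no World Bank values), and the exact-match variant using `%in%` is an alternative added here for comparison, not something taken from the original.

```r
# Toy stand-in for the World Bank data: codes only, no indicator values.
df_wb <- data.frame(
  `Country Code` = c("BRA", "USA", "CHN", "IND", "RUS", "ZAF", "FRA"),
  check.names = FALSE,        # keep the space in the column name
  stringsAsFactors = FALSE
)

# Regex filter, as in the excerpt: keeps rows whose code contains any pattern.
by_regex <- df_wb[grepl("BRA|CHN|IND|RUS|ZAF", df_wb[["Country Code"]]), ,
                  drop = FALSE]

# Exact-match alternative: keeps rows whose code equals one of the five codes.
brics_codes <- c("BRA", "CHN", "IND", "RUS", "ZAF")
by_set <- df_wb[df_wb[["Country Code"]] %in% brics_codes, , drop = FALSE]

print(by_regex)
```

Both filters agree on three-letter codes. The `%in%` form is stricter, since `grepl` would also match any longer string that merely contains one of the codes.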

Data 621 - Homework #2

04.10.2021

Overview In this homework assignment, you will work through various classification metrics. You will be asked to create functions in R to carry out the various calculations. You will also investigate some functions in packages that will let you obtain the equivalent results. Finally, you will create graphical output that can also be used to evalu...

5938 sym R (8814 sym/39 pcs) 4 img
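
The kind of hand-written metric functions this assignment asks for might look like the sketch below. The function name `classification_metrics`, the 0/1 coding of the classes, and the example vectors are assumptions for illustration, not the author's solution.

```r
# Hypothetical helper: compute common classification metrics from
# 0/1 vectors of actual and predicted classes (1 = positive class).
classification_metrics <- function(actual, predicted) {
  tp <- sum(actual == 1 & predicted == 1)
  tn <- sum(actual == 0 & predicted == 0)
  fp <- sum(actual == 0 & predicted == 1)
  fn <- sum(actual == 1 & predicted == 0)

  precision   <- tp / (tp + fp)
  sensitivity <- tp / (tp + fn)   # also called recall
  c(
    accuracy    = (tp + tn) / (tp + tn + fp + fn),
    error_rate  = (fp + fn) / (tp + tn + fp + fn),
    precision   = precision,
    sensitivity = sensitivity,
    specificity = tn / (tn + fp),
    f1          = 2 * precision * sensitivity / (precision + sensitivity)
  )
}

# Made-up example vectors.
actual    <- c(1, 1, 1, 0, 0, 0, 1, 0)
predicted <- c(1, 0, 1, 0, 1, 0, 1, 0)
metrics <- classification_metrics(actual, predicted)
print(round(metrics, 3))
```

The sketch does not guard against division by zero, for example when no positives are predicted, so a fuller version would need to handle those cases.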

HW 1 Rough Draft

21.09.2021

In this data set we are trying to identify good and bad teams in a Major League Baseball season. We assume some of the predictors will be higher for good teams. We will try to predict how many games a team will win in a season. DATA EXPLORATION: We can observe that the response variable (TARGET_WINS) looks to be normally distributed. ...

7806 sym R (17602 sym/48 pcs) 10 img

Publish Document

18.09.2021

library(dplyr) library(Metrics) library(MLmetrics) library(leaps) library(car) library(MASS) library(tidyverse) library(caret) library(leaps) Load the Data train_raw <-read.csv("https://raw.githubusercontent.com/akarimhammoud/Data_621/main/Assignment_1/data/moneyball-training-data.csv") evaluation_raw <-read.csv("https://raw.githubusercontent....

3103 sym R (16647 sym/43 pcs) 7 img

HW 1 - Model Selection

15.09.2021

library(dplyr) library(Metrics) library(MLmetrics) library(leaps) library(car) Load Data Clean data # Review NA counts colSums(is.na(data)) ## INDEX TARGET_WINS TEAM_BATTING_H TEAM_BATTING_2B ## 0 0 0 0 ## TEAM_BATTING_3B TEAM_BATTING_HR TEAM_BATTING_BB TEAM_BA...

2619 sym R (5149 sym/20 pcs) 5 img

Data 621 - Homework #4 Data Cleaning DRAFT

01.11.2021

library(tidyverse) library(ggplot2) library(mice) library(car) library(Hmisc) library(corrplot) library(pscl) library(boot) library(nlme) Load data # Load insurance csv df_ins_raw <- read.csv("insurance_training_data.csv") # Removing index as instructed df_ins_raw <- subset(df_ins_raw, select = -c(INDEX)) # Preview data glimpse(df_ins_raw) ## ...

792 sym R (38253 sym/74 pcs) 12 img