Publications by George Cruz

DS607 - Project 2 - Part 1

03.10.2020

library(tidyr) library(dplyr) Introduction The goal of this assignment is to practice in preparing different datasets for downstream analysis work. We were tasked with choosing three wide datasets identified in the Week 5 Discussion items. The datasets I chose were: NYC Subway Ridership by Cameron Smith US Gross Domestic Product by Quarter (20...

18532 sym R (14683 sym/37 pcs) 6 img

ds607-hw7

10.10.2020

Data 607 Homework 7 Pick three of your favorite books on one of your favorite subjects. At least one of the books should have more than one author. For each book, include the title, authors, and two or three other attributes that you find interesting. Take the information that you’ve selected about these three books, and separately create three...

1112 sym R (3249 sym/10 pcs)

DS606-HW5

21.10.2020

Foundations for Inference library(tidyverse) library(openintro) Heights of adults. (7.7, p. 260) Researchers studying anthropometry collected body girth measurements and skeletal diameter measurements, as well as age, weight, height and gender, for 507 physically active individuals. The histogram below shows the sample distribution of heights ...

24231 sym R (1777 sym/37 pcs) 6 img

DS606 - HW6

22.10.2020

library(tidyverse) library(openintro) library(scales) 6.6 2010 Healthcare Law. On June 28, 2012 the U.S. Supreme Court upheld the much debated 2010 healthcare law, declaring it constitutional. A Gallup poll released the day after this decision indicates that 46% of 1,012 Americans agree with this decision. At a 95% confidence level, this sampl...

16509 sym R (1484 sym/19 pcs)

DS607-HW8

25.10.2020

Assignment – Web APIs The task is to choose one of the New York Times APIs, construct an interface in R to read in the JSON data, and transform it into an R DataFrame. Setting up I will store the api-key value in an environment variable by using Sys.setenv library(tidyverse) library(openintro) library(httr) ## Warning: package 'httr' was bui...

5375 sym R (3453 sym/13 pcs) 1 img

Facebook Recommender System

07.11.2020

Facebook Facebook is an American Social media platform which currently has over 2.74 billion monthly active users. It started as a social network and grew into a conglomerate that offers several social media platforms: Facebook, Instagram, WhatsApp, Messenger, among others. Scenario Analysis Who are your target users? Facebook’s target user ...

4287 sym

DS607-HW10

01.11.2020

Week 10 Assignment Reproduce and extend Sentiment analysis with tidy data library(tidytext) library(janeaustenr) library(dplyr) library(stringr) library(tidyr) Starting analysis Get Sentiment tables AFINN from Finn Årup Nielsen ## # A tibble: 2,477 x 2 ## word value ## <chr> <dbl> ## 1 abandon -2 ## 2 abandon...

23022 sym R (11268 sym/60 pcs) 7 img

DS606 - Data Project Proposal

02.11.2020

Data Preparation # load data library(tidyverse) library(scales) library(infer) library(psych) library(httr) library(jsonlite) The Data Set was obtained from Kaggle. This dataset was collected using the YouTube API. Loading the Data. #Get the videos csv raw_video_df <- read_csv(file="https://raw.githubusercontent.com/georg4re/ds606/main/dat...

12985 sym R (7486 sym/20 pcs) 2 img

DS606 - Lab 6

02.11.2020

#Inference for categorical data library(tidyverse) library(openintro) library(infer) library(scales) Exercise 1 What are the counts within each category for the amount of days these students have texted while driving within the past 30 days? data(yrbss) text_while_driving <- yrbss %>% filter(text_while_driving_30d == 30) summary(text_w...

15148 sym R (7307 sym/35 pcs) 4 img

DS605-HW12

26.04.2021

Introduction to Data 1. ScatterPlot and Initial Linear Regression 2. Which Model is Better? 3. Forecast 4. Build Models. 5. Forecast LifeExp Introduction to Data The attached who.csv dataset contains real-world data from 2008. The variables included follow. Country: name of the country LifeExp: average life expectancy for the country in years In...

4141 sym R (4413 sym/16 pcs) 4 img