Publications by Coffy Andrews-Guo
NYCHPD Multiple Dwelling Registrations
Project Proposal Where is NYC landscape changing faster in multi-dwelling (residential rental unit) housing? Origin of the Data set The Department of Housing Preservation and Development collects registration information from owners of residential rental unit. The data set has 171708 observations and 16 variables beginning from 1993 to prese...
2252 sym 2 img 2 tbl
apache
D607 Project 4 Coffy Andrews-Guo, Krutika Patel 14 November, 2021 Document Classification - Ham / Spam This assignment is to classify new “test” documents using already classified “training” documents from an open source anti-spam platform, Apache SpamAssassin. Load Libraries We will being to load libraries library(tidyverse) library(e...
5009 sym R (11538 sym/56 pcs) 2 img
LinkedIn Engine
Recommender Engine Linkedln Learning platform is a large-scale content recommendation engine developed by the organization’s AI team. The learning platform currently serves over 690 LinkedIn members and customers based on a hyper-personalized model to help learners access courses based on their personal and professional developmental interest...
4177 sym 3 img
Project Proposal
Data Preparation library(readr) Comparable_20Returns_20to_20Education_20sheet3 <- read_csv("https://raw.githubusercontent.com/candrewxs/Project_Proposal_D606/main/data/Comparable%20Returns%20to%20Education%20sheet3.csv") comp_edu <- Comparable_20Returns_20to_20Education_20sheet3 View(comp_edu) Research question Will the returns on education i...
2281 sym R (6031 sym/23 pcs) 3 img
Vignette
Kagle Data - Tidyverse Coffy Andrews-Guo 2021-10-24 Source: Kagle data - Data Analysis Jobs, Based on NYC Jobs - October 2021 This dataset contains current job postings available on the City of New York’s official jobs site ( http://www.nyc.gov/html/careers/html/search/search.shtml ). Internal postings available to city employees and external po...
2126 sym R (3392 sym/9 pcs) 2 img
NYTimes Best Selling Books
Source: GitHub RPubs Load packages library("tidyverse") library("httr") library("reactable") library("jsonlite") library("tibble") Make a request to the API Server In this assignment, The New York Times, JSON data was pulled using the Developer zone Book API on their Best Sellers list https://developer.nytimes.com/apis. A request for data w...
1923 sym R (2542 sym/15 pcs) 1 img
D607 - Working with XML & JSON
Working with XML and JSON in R Coffy Andrews-Guo October 10, 2021 Assignment Three of my favorite books on ‘Self Development’ were selected this assignment. The assignment involves working with XML and JSON files with these criteria: (1) At least one of the books should have more than one author. (2)For each book, include the title, authors, a...
1058 sym R (5167 sym/21 pcs)
Vaccination
Assignment # Read XLSX file from Github github_link <- "https://github.com/candrewxs/Vaccination/blob/main/israeli_vaccination_data_analysis_start.xlsx?raw=true" library(httr) temp_file <- tempfile(fileext = ".xlsx") req <- GET(github_link, # authenticate using GITHUB_PATH authenticate(Sys.getenv("GITHUB_PATH"), ""), ...
1741 sym R (9257 sym/31 pcs)
Document
Project 1 In this project, you’re given a text file with chess tournament results where the information has some structure. Your job is to create an R Markdown file that generates a .CSV file (that could for example be imported into a SQL database) with the following information for all of the players: Player’s Name, Player’s State, Total N...
1125 sym R (3774 sym/6 pcs)
Summarizing Data
Stats scores. (2.33, p. 78) Below are the final exam scores of twenty introductory statistics students. 57, 66, 69, 71, 72, 73, 74, 77, 78, 78, 79, 79, 81, 81, 82, 83, 83, 88, 89, 94 Create a box plot of the distribution of these scores. The five number summary provided below may be useful. ## Min. 1st Qu. Median Mean 3rd Qu. Max. ##...
6351 sym R (103 sym/1 pcs) 5 img