Publications by Philip Tanofsky

DATA 607 Project 2 Part II Submission

08.03.2020

Introduction In Part II of the Project 2, the assignment presents the effort to tidy the CSV file suggested by myself based on coronoavirus data at https://github.com/CryptoKass/ncov-data/blob/master/world.latest.bno.csv from roughly a month ago. The initial CSV required multiple steps to perform a proper tidy and clean of the data. The analysis ...

5793 sym R (7481 sym/20 pcs) 7 img

DATA 607 Project 2 Part I Submission

08.03.2020

Introduction In Part I of the Project 2, the assignment presents the effort to tidy the CSV file suggested by colleague Sung Lee based on the data at https://data.cityofnewyork.us/Education/2010-AP-College-Board-School-Level-Results/itfs-ms3e. The initial CSV required multiple steps to perform a proper tidy and clean of the data. The analysis of ...

7208 sym R (9549 sym/33 pcs) 7 img

DATA 607 Project 2 Part III Submission

08.03.2020

Introduction In Part III of the Project 2, the assignment presents the effort to tidy the CSV file suggested by colleague Justin Hsi based on the Medium article at https://medium.com/analytics-vidhya/untidy-data-a90b6e3ebe4c. The initial CSV required multiple steps to perform a proper tidy of the data along with the creation of two separate table...

4638 sym R (10888 sym/38 pcs) 5 img

DATA 607 Week 5 Assignment

02.03.2020

Introduction This assignment presents the capabilities of the R packages of tidyr and dplyr in order to tidy an input CSV file and provide analysis through transformations of the tidy dataframe. The tidy nature of the data follows the definition of Tidy Data as defined by Hadley Wickham. The analysis compares the flight arrivals at five American ...

4894 sym R (11815 sym/32 pcs)

DATA 607 Project 1 Submission

24.02.2020

Introduction This project solution creates a .CSV files based on a text file with chess tournament results where the information has a defined structure. The final output in the .CSV files contains one line for each player which includes the player’s name, player’s state, player’s total number of points, player’s rating before the tournam...

2150 sym R (8967 sym/12 pcs)

Assignment01_TestRun_Publish

02.02.2020

library(RCurl) ## Loading required package: bitops movings_ratings_url <- getURL("https://raw.githubusercontent.com/fivethirtyeight/data/master/fandango/fandango_score_comparison.csv") movie_ratings <-data.frame(read.csv(text=movings_ratings_url, header=T)) dim(movie_ratings) ## [1] 146 22 head(movie_ratings) ## FILM ...

1058 sym R (3452 sym/8 pcs) 1 img

DATA607_Week01_Assignment

02.02.2020

Introduction The article “Be Suspicious Of Online Movie Ratings, Especially Fandango’s” by Walt Hickey published at https://fivethirtyeight.com/features/fandango-movies-ratings/ outlines the disparity in ratings between the movie ticket-selling company Fandango against the ratings of aggregation sites such as Rotten Tomatoes, Metacritic, an...

4208 sym R (17902 sym/17 pcs)

DATA 607 Week 3 Assignment

16.02.2020

Problem 1 Using the 173 majors listed in fivethirtyeight.com’s College Majors dataset [https://fivethirtyeight.com/features/the-economic-guide-to-picking-a-college-major/], provide code that identifies the majors that contain either “DATA” or “STATISTICS” # Import RCurl to pull the csv file from the Github repo library(RCurl) ## Loadi...

1908 sym R (2392 sym/17 pcs)

DATA 608 Assignment 1

06.09.2020

Principles of Data Visualization and Introduction to ggplot2 I have provided you with data about the 5,000 fastest growing companies in the US, as compiled by Inc. magazine. lets read this in: inc <- read.csv("https://raw.githubusercontent.com/charleyferrari/CUNY_DATA_608/master/module1/Data/inc5000_data.csv", header= TRUE) And lets preview this...

11335 sym R (12064 sym/37 pcs) 4 img

Tanofsky Module 2

21.09.2020

Assignment 2¶Author: Philip Tanofsky Date: September 19, 2020 In [1]: import datashader as ds import datashader.transfer_functions as tf import datashader.glyphs from datashader import reductions from datashader.core import bypixel from datashader.utils import lnglat_to_meters as webm, export_image from datashader.colors import colormap_selec...

16436 sym R (8944 sym/28 pcs) 7 img