Publications by Orli Khaimova
DATA 608 Module 1
Principles of Data Visualization and Introduction to ggplot2 I have provided you with data about the 5,000 fastest growing companies in the US, as compiled by Inc. magazine. lets read this in: inc <- read.csv("https://raw.githubusercontent.com/charleyferrari/CUNY_DATA_608/master/module1/Data/inc5000_data.csv", header= TRUE) And lets preview thi...
1792 sym Python (4963 sym/12 pcs) 3 img
The Effect of Gasoline Prices on Public Transit Ridership in New York City
Loading Data INSERT DESCRIPTION OF DATA HERE mta2020 <- read.csv("https://raw.githubusercontent.com/okhaimova/DATA698/main/MTA_Daily_Ridership_Data__Beginning_2020.csv") gas <- read.csv("https://raw.githubusercontent.com/okhaimova/DATA698/main/NYC%20Weekly%20Retail%20Gasoline%20and%20Diesel.csv", skip = 2, header = TRUE) bicycl...
2914 sym Python (39669 sym/80 pcs) 23 img
Week 3 Final Project
The presentation approach is up to you but it should contain the following: Data Exploration: This should include summary statistics, means, medians, quartiles, or any other relevant information about the data set. Please include some conclusions in the R Markdown text. Data wrangling: Please perform some basic transformations. They will need to...
5847 sym R (6161 sym/21 pcs) 5 img
Week 1 Homework
Write a loop that calculates 12-factorial Factorial <- 1 for(i in 1:12) { Factorial <- Factorial * i } Show how to create a numeric vector that contains the sequence from 20 to 50 by 5. NumVector <- seq(from = 20, to = 50, by = 5) NumVector ## [1] 20 25 30 35 40 45 50 NumVector2 <- seq(20, 50, 5) NumVector2 ## [1] 20 25 30 35 40 45 50...
304 sym R (1087 sym/14 pcs)
Week 2 Homework
#install.packages("Stat2Data") #install.packages("tidyverse") library (Stat2Data) library(dplyr) ## ## Attaching package: 'dplyr' ## The following objects are masked from 'package:stats': ## ## filter, lag ## The following objects are masked from 'package:base': ## ## intersect, setdiff, setequal, union library(tidyverse) ## --...
884 sym R (4290 sym/43 pcs)
DATA 607 Assignment 1
A Handful Of Cities Are Driving 2016’s Rise In Murders Overview In 2016, there was preliminary evidence that showed that the number of murders for 2016 rose by over 10 percent by October of that year. It rose by 10.8 percent prior to that year, which was a large increase. This data was collected prior to the FBI releasing the official data and...
1175 sym R (3826 sym/12 pcs)
Project 3
Team Members: John Mazon, Orli Khaimova, Shana Green, Dominika Markowska-Desvallons, Mark Gonsalves Introduction This project’s goal is to determine the most valued data science skills. To do this, we pulled data from Kaggle.com. The key steps for this project were: We first pulled the data from Kaggle.com and moved the data to our GitHub.com...
4329 sym R (5011 sym/14 pcs) 5 img
DATA 607 Week 7: Working with XML and JSON in R
Task Description Separately create three files which store the book’s information in HTML , XML, and JSON formats containing book information and include the title, authors, and two or three other attributes that you find interesting. With R and using any packages, load information from the three sources into separate data frames. Check if all ...
2774 sym R (637 sym/5 pcs)
DATA 607 Project 2: “Data Transformation”
Income vs Religion Task An analysis that could be performed is once the data is ‘tidy’ we can see the columns as ‘religion’, ‘income’ then ‘frequency’, with each row being an entry(unique) to that person or entry point. We can then grab this data and analyze mathematically amounts, averages, etc" Reading Data This data can also ...
5499 sym R (9692 sym/40 pcs) 6 img
Problem 3.31
3.31 Hearts win. In a new card game, you start with a well-shuffled full deck and draw 3 cards without replacement. If you draw 3 hearts, you win $50. If you draw 3 black cards, you win $25. For any other draws, you win nothing. Create a probability model for the amount you win at this game, and find the expected winnings. Also compute the stand...
1239 sym R (85 sym/1 pcs)