Publications by Orli Khaimova

DATA 608 Module 1

13.02.2023

Principles of Data Visualization and Introduction to ggplot2 I have provided you with data about the 5,000 fastest growing companies in the US, as compiled by Inc. magazine. lets read this in: inc <- read.csv("https://raw.githubusercontent.com/charleyferrari/CUNY_DATA_608/master/module1/Data/inc5000_data.csv", header= TRUE) And lets preview thi...

1792 sym Python (4963 sym/12 pcs) 3 img

The Effect of Gasoline Prices on Public Transit Ridership in New York City

12.12.2022

Loading Data INSERT DESCRIPTION OF DATA HERE mta2020 <- read.csv("https://raw.githubusercontent.com/okhaimova/DATA698/main/MTA_Daily_Ridership_Data__Beginning_2020.csv") gas <- read.csv("https://raw.githubusercontent.com/okhaimova/DATA698/main/NYC%20Weekly%20Retail%20Gasoline%20and%20Diesel.csv", skip = 2, header = TRUE) bicycl...

2914 sym Python (39669 sym/80 pcs) 23 img

Week 3 Final Project

01.08.2020

The presentation approach is up to you but it should contain the following: Data Exploration: This should include summary statistics, means, medians, quartiles, or any other relevant information about the data set. Please include some conclusions in the R Markdown text. Data wrangling: Please perform some basic transformations. They will need to...

5847 sym R (6161 sym/21 pcs) 5 img

Week 1 Homework

19.07.2020

Write a loop that calculates 12-factorial Factorial <- 1 for(i in 1:12) { Factorial <- Factorial * i } Show how to create a numeric vector that contains the sequence from 20 to 50 by 5. NumVector <- seq(from = 20, to = 50, by = 5) NumVector ## [1] 20 25 30 35 40 45 50 NumVector2 <- seq(20, 50, 5) NumVector2 ## [1] 20 25 30 35 40 45 50...

304 sym R (1087 sym/14 pcs)

Week 2 Homework

27.07.2020

#install.packages("Stat2Data") #install.packages("tidyverse") library (Stat2Data) library(dplyr) ## ## Attaching package: 'dplyr' ## The following objects are masked from 'package:stats': ## ## filter, lag ## The following objects are masked from 'package:base': ## ## intersect, setdiff, setequal, union library(tidyverse) ## --...

884 sym R (4290 sym/43 pcs)

DATA 607 Assignment 1

01.09.2020

A Handful Of Cities Are Driving 2016’s Rise In Murders Overview In 2016, there was preliminary evidence that showed that the number of murders for 2016 rose by over 10 percent by October of that year. It rose by 10.8 percent prior to that year, which was a large increase. This data was collected prior to the FBI releasing the official data and...

1175 sym R (3826 sym/12 pcs)

Project 3

18.10.2020

Team Members: John Mazon, Orli Khaimova, Shana Green, Dominika Markowska-Desvallons, Mark Gonsalves Introduction This project’s goal is to determine the most valued data science skills. To do this, we pulled data from Kaggle.com. The key steps for this project were: We first pulled the data from Kaggle.com and moved the data to our GitHub.com...

4329 sym R (5011 sym/14 pcs) 5 img

DATA 607 Week 7: Working with XML and JSON in R

10.10.2020

Task Description Separately create three files which store the book’s information in HTML , XML, and JSON formats containing book information and include the title, authors, and two or three other attributes that you find interesting. With R and using any packages, load information from the three sources into separate data frames. Check if all ...

2774 sym R (637 sym/5 pcs)

DATA 607 Project 2: “Data Transformation”

04.10.2020

Income vs Religion Task An analysis that could be performed is once the data is ‘tidy’ we can see the columns as ‘religion’, ‘income’ then ‘frequency’, with each row being an entry(unique) to that person or entry point. We can then grab this data and analyze mathematically amounts, averages, etc" Reading Data This data can also ...

5499 sym R (9692 sym/40 pcs) 6 img

Problem 3.31

30.09.2020

3.31 Hearts win. In a new card game, you start with a well-shuffled full deck and draw 3 cards without replacement. If you draw 3 hearts, you win $50. If you draw 3 black cards, you win $25. For any other draws, you win nothing. Create a probability model for the amount you win at this game, and find the expected winnings. Also compute the stand...

1239 sym R (85 sym/1 pcs)