Publications by Mohamed Hassan-El Serafi

Data 606 Lab 5b - Foundations of Statistical Inference

12.03.2023

If you have access to data on an entire population, say the opinion of every adult in the United States on whether or not they think climate change is affecting their local community, it’s straightforward to answer questions like, “What percent of US adults think climate change is affecting their local community?” Similarly, if you had demog...

13824 sym 1 img 1 tbl

Data 607 Assignment 7

12.03.2023

The three books I selected were all politically themed. For each book, I focused on the number of pages, the year it was published, and its current price. Two of the books had multiple authors. I used HTML, XML, and JSON as separate file types to input the information for each book and create the tables you will see below. I will explain further th...

863 sym R (990 sym/4 pcs) 4 tbl

New York State Gas Price Breakdown

06.03.2023

My analysis focuses on weekly gas price data for New York State counties from 2007 to 2023. I will examine the monthly gas price averages and identify which year had the highest gas prices. I obtained this dataset from the New York State Data website. library(tidyverse) ## ── Attaching packages ──────────────────...

2665 sym R (15304 sym/44 pcs) 3 img 1 tbl

Data 606 Lab 4

06.03.2023

In this lab, you’ll investigate the probability distribution that is most central to statistics: the normal distribution. If you are confident that your data are nearly normal, that opens the door to many powerful statistical methods. Here we’ll use the graphical tools of R to assess the normality of our data and also learn how to generate rand...

10510 sym 11 img

Data 607 Project 2 - MTA Daily Ridership

06.03.2023

Since the Covid-19 pandemic began in March 2020, the mass transit system in New York has struggled to return to pre-pandemic levels of ridership. Recently, the MTA has proposed cutting subway service on Fridays and Mondays, arguing that there are fewer commuters on those days than on the other three days of the workweek. For pur...

3666 sym Python (2018338 sym/49 pcs) 10 img 11 tbl

Affects of Covid-19 in New York City

06.03.2023

Since the end of February 2020, Covid-19 has affected the way of life for billions of people worldwide. New York was the epicenter of the pandemic, with thousands, then millions, of people infected. This dataset, taken from the Open Data NYC website, tracks cases, hospitalizations, and deaths across the five boroughs of New York. I will look at the ...

5220 sym R (28100 sym/93 pcs) 24 img 2 tbl

MTA Ridership Breakdown

28.02.2023

library(tidyverse) ## ── Attaching packages ─────────────────────────────────────── tidyverse 1.3.2 ── ## ✔ ggplot2 3.4.1 ✔ purrr 1.0.1 ## ✔ tibble 3.1.8 ✔ dplyr 1.1.0 ## ✔ tidyr 1.3.0 ✔ stringr 1.5.0 ## ✔ readr 2.1.4 ✔ for...

972 sym R (22996 sym/46 pcs) 4 img 4 tbl

Data 606 Lab 3

26.02.2023

The Hot Hand. Basketball players who make several baskets in succession are described as having a hot hand. Fans and players have long believed in the hot hand phenomenon, which refutes the assumption that each shot is independent of the next. However, a 1985 paper by Gilovich, Vallone, and Tversky collected evidence that contradicted this belief an...

12167 sym 3 img

Data 607 Week 5 Assignment

26.02.2023

library(tidyverse) ## ── Attaching packages ─────────────────────────────────────── tidyverse 1.3.2 ── ## ✔ ggplot2 3.4.1 ✔ purrr 1.0.1 ## ✔ tibble 3.1.8 ✔ dplyr 1.1.0 ## ✔ tidyr 1.3.0 ✔ stringr 1.5.0 ## ✔ readr 2.1.4 ✔ for...

2375 sym R (9272 sym/39 pcs) 4 img 3 tbl

Data 607 Project 1

20.02.2023

Reading Raw Data File. Using readLines, I read in the file of Chess Tournament data: df <- readLines("https://raw.githubusercontent.com/moham6839/Data_607_Project_1/main/tournamentinfo.txt") ## Warning in ## readLines("https://raw.githubusercontent.com/moham6839/Data_607_Project_1/main/tournamentinfo.txt"): ## incomplete final line found on ## 'ht...

1114 sym