Publications by Mohamed Hassan-El Serafi
Data 606 Lab 5b - Foundations of Statistical Inference
If you have access to data on an entire population, say the opinion of every adult in the United States on whether or not they think climate change is affecting their local community, it’s straightforward to answer questions like, “What percent of US adults think climate change is affecting their local community?”. Similarly, if you had demog...
13824 sym 1 img 1 tbl
Data 607 Assignment 7
The three books I selected were all politically-themed, focusing on the number of pages, the year each book was published, and the current price of each book. Two of the books had multiple authors. I used html, xml, and json as separate file types to input the information for each book and create tables you will see below. I will explain further th...
863 sym R (990 sym/4 pcs) 4 tbl
New York State Gas Price Breakdown
My analysis focuses on weekly gas price data of New York State counties from 2007-2023. I will focus on the monthly gas price averages, as well as which year had the highest gas prices. I obtained this dataset from the New York State Data website. library(tidyverse) ## ── Attaching packages ──────────────────...
2665 sym R (15304 sym/44 pcs) 3 img 1 tbl
Data 606 Lab 4
In this lab, you’ll investigate the probability distribution that is most central to statistics: the normal distribution. If you are confident that your data are nearly normal, that opens the door to many powerful statistical methods. Here we’ll use the graphical tools of R to assess the normality of our data and also learn how to generate rand...
10510 sym 11 img
Data 607 Project 2 - MTA Daily Ridership
Since the Covid-19 pandemic began in March 2020, the mass transit system in New York has struggled to maintain the same levels of ridership from pre-pandemic levels. Recently, the MTA has proposed cutting subway service on Fridays and Mondays, arguing that there are less commuters on those days than the other three days during the workweek. For pur...
3666 sym Python (2018338 sym/49 pcs) 10 img 11 tbl
Affects of Covid-19 in New York City
Since the end of February 2020, Covid-19 has affected the way of life for billions of people worldwide. New York was the epicenter of the pandemic, infecting thousands, then millions of people. This dataset was taken from the Open Data NYC website, tracking Cases, Hospitalizations, and Deaths among the five boroughs of New York. I will look at the ...
5220 sym R (28100 sym/93 pcs) 24 img 2 tbl
MTA Ridership Breakdown
library(tidyverse) ## ── Attaching packages ─────────────────────────────────────── tidyverse 1.3.2 ── ## ✔ ggplot2 3.4.1 ✔ purrr 1.0.1 ## ✔ tibble 3.1.8 ✔ dplyr 1.1.0 ## ✔ tidyr 1.3.0 ✔ stringr 1.5.0 ## ✔ readr 2.1.4 ✔ for...
972 sym R (22996 sym/46 pcs) 4 img 4 tbl
Data 606 Lab 3
The Hot Hand Basketball players who make several baskets in succession are described as having a hot hand. Fans and players have long believed in the hot hand phenomenon, which refutes the assumption that each shot is independent of the next. However, a 1985 paper by Gilovich, Vallone, and Tversky collected evidence that contradicted this belief an...
12167 sym 3 img
Data 607 Week 5 Assignment
library(tidyverse) ## ── Attaching packages ─────────────────────────────────────── tidyverse 1.3.2 ── ## ✔ ggplot2 3.4.1 ✔ purrr 1.0.1 ## ✔ tibble 3.1.8 ✔ dplyr 1.1.0 ## ✔ tidyr 1.3.0 ✔ stringr 1.5.0 ## ✔ readr 2.1.4 ✔ for...
2375 sym R (9272 sym/39 pcs) 4 img 3 tbl
Data 607 Project 1
Reading Raw Data File Using readLines, I input the file of the Chess Tournament data: df <- readLines("https://raw.githubusercontent.com/moham6839/Data_607_Project_1/main/tournamentinfo.txt") ## Warning in ## readLines("https://raw.githubusercontent.com/moham6839/Data_607_Project_1/main/tournamentinfo.txt"): ## incomplete final line found on ## 'ht...
1114 sym