Publications by Julia Ferris

Project 2 - Tidy Data

04.10.2023

library(readr) library(janitor) ## ## Attaching package: 'janitor' ## The following objects are masked from 'package:stats': ## ## chisq.test, fisher.test library(dplyr) ## ## Attaching package: 'dplyr' ## The following objects are masked from 'package:stats': ## ## filter, lag ## The following objects are masked from 'package:ba...

11540 sym R (20171 sym/53 pcs) 27 img 1 tbl

Assignment 5 - Tidy Data

30.09.2023

Introduction This assignment uses a small data set containing the number of on-time and delayed flight arrivals for two airlines. The data set is originally untidy and is then converted to a tidy data set. Then, the delays for both airlines are compared to determine which airline has fewer delays on average and in total. Pull in data from ...

1236 sym R (1520 sym/14 pcs)

Lab 4 - Probability Distributions

28.09.2023

library(tidyverse) library(openintro) library(ggplot2) Exercise 1 Make a plot (or plots) to visualize the distributions of the amount of calories from fat of the options from these two restaurants. How do their centers, shapes, and spreads compare? The distribution of the McDonald’s data has a center around 250, it is right-skewed, and it ...

17781 sym 21 img

Project 1 - Chess

27.09.2023

Introduction In this document, a text file containing information about players in a chess tournament is imported and then formatted in a specific way. First, the text file is imported. After it is imported, it is formatted into a new data frame that contains all relevant columns. Once the new data frame is completed, it is then output as a CSV ...

1343 sym R (2929 sym/6 pcs) 1 tbl

Extra Credit - Movie Ratings

18.09.2023

Extra Credit - DATA 607 Julia Ferris 2023-09-18 Step 1: Load the packages needed. Two packages are needed. readr is used to import the data. gt is used to format the data nicely. library(readr) library(gt) Step 2: Import the data. movies <- read_csv("https://raw.githubusercontent.com/juliaDataScience-22/cuny-fall-23/manage-acquire-data/newMo...

1634 sym 3 tbl

Assignment 3 - Regular Expressions

21.09.2023

Introduction In this document, each of the four parts of Assignment 3 is answered with code, either as the answer itself or as an example. Part 1: Identify the majors that contain DATA or STATISTICS. In this section, the readr package is used to import data from a CSV file that is located in a GitHub repository. Then, majors with DATA or STATISTICS are filtered o...

2637 sym R (2574 sym/22 pcs) 2 tbl

Lab 2 - NYC Flights

14.09.2023

library(tidyverse) library(openintro) library(ggplot2) Exercise 1 All three histograms are right-skewed. In the first histogram, the first bin has the highest count, and the counts decrease with each bin. A bin width smaller than the default 30 would provide more information. In the second histogram, the second bin has the highest count. The f...

12971 sym Python (5489 sym/50 pcs) 7 img 1 tbl

Assignment 2 - Movie Ratings

13.09.2023

Importing Data The data loaded for this document comes from the MySQL .CSV file saved after running the MySQL script. library(readr) movies <- read_csv("C:/ProgramData/MySQL/MySQL Server 8.0/Uploads/movieFile1.csv", show_col_types = FALSE) Replacing NA Values Some of the people included in the data set did not see the movies in the list. This m...

2280 sym R (4064 sym/14 pcs) 11 img 1 tbl

Bob Ross Paintings - Statistical Analysis

09.09.2023

Bob Ross Paintings September 9, 2023 Author Julia Ferris Introduction The data set included in this document displays information about the work of Bob Ross. The article titled “A Statistical Analysis of the Work of Bob Ross” describes the data analysis and interview conducted by the author, Walt Hickey. In the article, Hickey stated that B...

4825 sym 3 tbl

Final Project - Summer Bridge

26.07.2023

library(readr) schoolData <- read.csv("https://raw.githubusercontent.com/juliaDataScience-22/cuny-summer-23/main/Early.csv") View(schoolData) schoolData <- schoolData[,3:5] head(schoolData) ## cog age trt ## 1 103 1.0 Y ## 2 119 1.5 Y ## 3 96 2.0 Y ## 4 106 1.0 Y ## 5 107 1.5 Y ## 6 96 2.0 Y Question: Did the infants in t...

2167 sym R (4255 sym/46 pcs) 5 img