Publications by Ariba

Lab 5a- Data 606

12.10.2024

In this lab, you will investigate the ways in which the statistics from a random sample of data can serve as point estimates for population parameters. We’re interested in formulating a sampling distribution of our estimate in order to learn about the properties of the estimate, such as its distribution. Setting a seed: We will take some random ...

12141 sym 4 img

Assignment 4- Data 607

10.10.2024

R Markdown # Load necessary libraries library(readr) library(dplyr) ## ## Attaching package: 'dplyr' ## The following objects are masked from 'package:stats': ## ## filter, lag ## The following objects are masked from 'package:base': ## ## intersect, setdiff, setequal, union library(tidyr) # Read the CSV file (use `skip = 1` to skip the...

434 sym R (3057 sym/18 pcs) 1 img

Project 1- DATA 607

09.10.2024

# Load necessary libraries library(tidyverse) ## ── Attaching core tidyverse packages ──────────────────────── tidyverse 2.0.0 ── ## ✔ forcats 1.0.0 ✔ readr 2.1.5 ## ✔ ggplot2 3.5.1 ✔ tibble 3.2.1 ## ✔ lubridate 1.9.3 ✔ tidyr 1.3.1 ## ✔ purrr 1.0.2 ...

12 sym R (69798 sym/9 pcs)

Assignment 4- Data 607

08.10.2024

R Markdown # Load necessary libraries library(readr) library(dplyr) ## ## Attaching package: 'dplyr' ## The following objects are masked from 'package:stats': ## ## filter, lag ## The following objects are masked from 'package:base': ## ## intersect, setdiff, setequal, union library(tidyr) # Read the CSV file (use `skip = 1` to skip the...

39 sym R (3057 sym/18 pcs) 1 img

Lab 04 - Data 606

07.10.2024

In this lab, you’ll investigate the probability distribution that is most central to statistics: the normal distribution. If you are confident that your data are nearly normal, that opens the door to many powerful statistical methods. Here we’ll use the graphical tools of R to assess the normality of our data and also learn how to generate rand...

10907 sym Python (4973 sym/22 pcs) 8 img

Lab 03- Data 606

07.10.2024

The Hot Hand Basketball players who make several baskets in succession are described as having a hot hand. Fans and players have long believed in the hot hand phenomenon, which refutes the assumption that each shot is independent of the next. However, a 1985 paper by Gilovich, Vallone, and Tversky collected evidence that contradicted this belief an...

12204 sym 1 img

Assignment 3- Data 607

18.09.2024

Provide an example of at least three dataframes in R that demonstrate normalization. The dataframes can contain any data, either real or synthetic. Although normalization is typically done in SQL and relational databases, you are expected to show this example in R, as it is our main work environment in this course. Unnormalized Data: Contains non-...

1672 sym R (4733 sym/36 pcs)

Assignment 2 - Introduction to Data

18.09.2024

Some define statistics as the field that focuses on turning information into knowledge. The first step in that process is to summarize and describe the raw information – the data. In this lab we explore flights, specifically a random sample of domestic flights that departed from the three major New York City airports in 2013. We will generate sim...

10971 sym 9 img

Lab 02 - Data 06

15.09.2024

#Part 1: Build Table • Choose six recent popular movies. #I choose following movies: Barbie, Spider Man, Mission Impossible, Super Mario, Garden of Galaxy • Ask at least five people that you know (friends, family, classmates, imaginary friends if necessary) to rate each of these movies that they have seen on a scale of 1 to 5. #Part 2: Store da...

1047 sym

Assignment 1 - Loading Data to Dataframe

06.09.2024

##Introduction: The data set offers predictions of soccer match outcomes based on FiveThirtyEight’s Soccer Power Index (SPI). This index is used to rate teams’ offensive and defensive capabilities, which are then combined to generate win probabilities and expected scores for upcoming matches. spi_matches.csv contains match-by-match SPI ratings ...

804 sym