Publications by Josh Iden

DATA 607 Project 1

25.09.2022

Introduction In this project, we take chess tournament results data from a text file and prepare it for export as .csv Strategy My strategy for this project was to identify the individual steps I needed to take to transform the data from the original text file. The steps I identified to complete: 1. Removing |’s and -’s 2. Isolating the play...

2270 sym R (13315 sym/30 pcs) 1 img

DATA 607 HW6

30.09.2022

Assignment: Tidying and Transforming Data Introduction For this assignment, we’re given the below data and tasked with reading the file into R as a .CSV, preparing the data and performing a basic analysis comparing delay rates for two airlines: Loading the Data First we load packages and read the data into R: library(tidyverse) library(ggplot...

907 sym R (3530 sym/8 pcs) 3 img

DATA 605 HW6

02.10.2022

Homework 6 1. A bag contains 5 green and 7 red jellybeans. How many ways can 5 jellybeans be withdrawn from the bag so that the number of green ones withdrawn will be less than 2? choose(5,0)*choose(7,5)+choose(5,1)*choose(7,4) ## [1] 196 2. A certain congressional committee consists of 14 senators and 13 representatives. How many ways can a su...

3072 sym

DATA 606 Lab4

02.10.2022

In this lab, you’ll investigate the probability distribution that is most central to statistics: the normal distribution. If you are confident that your data are nearly normal, that opens the door to many powerful statistical methods. Here we’ll use the graphical tools of R to assess the normality of our data and also learn how to generate ra...

10316 sym 9 img

DATA 605 HW7

09.10.2022

HW7 1. Let X1, X2, . . . , Xn be n mutually independent random variables, each of which is uniformly distributed on the integers from 1 to k. Let Y denote the minimum of the Xi’s. Find the distribution of Y Let \(k^n\) be the combinations for \(Xi\) for integers \(1\) to \(k\) Let \(j\) be the combinations for minimum \(k^n\) \(Y = 1 \le j \le...

1942 sym

DATA 607 Project 2

09.10.2022

Project 2 The goal of this assignment is to practice preparing untidy datasets for downstream analysis. Load packages: library(tidyverse) library(stringr) library(readxl) Dataset 1 For the first dataset, I chose the following data selected by Jhalak Das source:: Read the file into R: file <- "https://raw.githubusercontent.com/josh1den/DATA-607...

1833 sym R (8783 sym/24 pcs) 5 img

DATA 607 HW7

13.10.2022

Introduction For this assignment, I picked three books from one of my favorite genres: horror fiction, and created three files in which I stored each book’s information, and read each file into R: HTML XML JSON Reading Into R For this assignment, I utilized the tidyverse package in order to pipe my data into R for readability. The individual p...

1085 sym R (3240 sym/10 pcs)

DATA 605 HW8

15.10.2022

Question 1 μ = 1000 (mean of exponential density of individual variable ) n = 100 (number of light bulbs) The density of minimum value among n independent random variables with an exponential density has mean \(\frac{μ}{n}\), So the mean of the minimum value (first bulb to burn out) is: μ = 1000 n = 100 print(μ/n) ## [1] 10 Question 2 \(f...

1819 sym 4 img

DATA 606 Lab6

16.10.2022

Getting Started Load packages In this lab, we will explore and visualize the data using the tidyverse suite of packages, and perform statistical inference using infer. The data can be found in the companion package for OpenIntro resources, openintro. Let’s load the packages. library(tidyverse) library(openintro) library(infer) The data You wi...

9318 sym Python (6169 sym/46 pcs) 1 img

DATA 605 HW9

23.10.2022

n = 364 #rest of year m = 0 #mean EX = 100 #expected value var = 1/4 #σ^2 barvar = n * var barsd = sqrt(barvar) \(>= 100\): z = (100 - EX) / barsd pnorm(z, lower.tail=FALSE) ## [1] 0.5 \(>= 110\): z = (110 - EX) / barsd pnorm(z, lower.tail=FALSE) ## [1] 0.1472537 \(>= 120\) z = (120 - EX) / barsd pnorm(z, lower.tail=FALSE) ## [1] 0.01801584 Giv...

1904 sym 3 img