Publications by Josh Iden
DATA 607 Project 1
Introduction In this project, we take chess tournament results data from a text file and prepare it for export as .csv Strategy My strategy for this project was to identify the individual steps I needed to take to transform the data from the original text file. The steps I identified to complete: 1. Removing |’s and -’s 2. Isolating the play...
2270 sym R (13315 sym/30 pcs) 1 img
DATA 607 HW6
Assignment: Tidying and Transforming Data Introduction For this assignment, we’re given the below data and tasked with reading the file into R as a .CSV, preparing the data and performing a basic analysis comparing delay rates for two airlines: Loading the Data First we load packages and read the data into R: library(tidyverse) library(ggplot...
907 sym R (3530 sym/8 pcs) 3 img
DATA 605 HW6
Homework 6 1. A bag contains 5 green and 7 red jellybeans. How many ways can 5 jellybeans be withdrawn from the bag so that the number of green ones withdrawn will be less than 2? choose(5,0)*choose(7,5)+choose(5,1)*choose(7,4) ## [1] 196 2. A certain congressional committee consists of 14 senators and 13 representatives. How many ways can a su...
3072 sym
DATA 606 Lab4
In this lab, you’ll investigate the probability distribution that is most central to statistics: the normal distribution. If you are confident that your data are nearly normal, that opens the door to many powerful statistical methods. Here we’ll use the graphical tools of R to assess the normality of our data and also learn how to generate ra...
10316 sym 9 img
DATA 605 HW7
HW7 1. Let X1, X2, . . . , Xn be n mutually independent random variables, each of which is uniformly distributed on the integers from 1 to k. Let Y denote the minimum of the Xi’s. Find the distribution of Y Let \(k^n\) be the combinations for \(Xi\) for integers \(1\) to \(k\) Let \(j\) be the combinations for minimum \(k^n\) \(Y = 1 \le j \le...
1942 sym
DATA 607 Project 2
Project 2 The goal of this assignment is to practice preparing untidy datasets for downstream analysis. Load packages: library(tidyverse) library(stringr) library(readxl) Dataset 1 For the first dataset, I chose the following data selected by Jhalak Das source:: Read the file into R: file <- "https://raw.githubusercontent.com/josh1den/DATA-607...
1833 sym R (8783 sym/24 pcs) 5 img
DATA 607 HW7
Introduction For this assignment, I picked three books from one of my favorite genres: horror fiction, and created three files in which I stored each book’s information, and read each file into R: HTML XML JSON Reading Into R For this assignment, I utilized the tidyverse package in order to pipe my data into R for readability. The individual p...
1085 sym R (3240 sym/10 pcs)
DATA 605 HW8
Question 1 μ = 1000 (mean of exponential density of individual variable ) n = 100 (number of light bulbs) The density of minimum value among n independent random variables with an exponential density has mean \(\frac{μ}{n}\), So the mean of the minimum value (first bulb to burn out) is: μ = 1000 n = 100 print(μ/n) ## [1] 10 Question 2 \(f...
1819 sym 4 img
DATA 606 Lab6
Getting Started Load packages In this lab, we will explore and visualize the data using the tidyverse suite of packages, and perform statistical inference using infer. The data can be found in the companion package for OpenIntro resources, openintro. Let’s load the packages. library(tidyverse) library(openintro) library(infer) The data You wi...
9318 sym Python (6169 sym/46 pcs) 1 img
DATA 605 HW9
n = 364 #rest of year m = 0 #mean EX = 100 #expected value var = 1/4 #σ^2 barvar = n * var barsd = sqrt(barvar) \(>= 100\): z = (100 - EX) / barsd pnorm(z, lower.tail=FALSE) ## [1] 0.5 \(>= 110\): z = (110 - EX) / barsd pnorm(z, lower.tail=FALSE) ## [1] 0.1472537 \(>= 120\) z = (120 - EX) / barsd pnorm(z, lower.tail=FALSE) ## [1] 0.01801584 Giv...
1904 sym 3 img