Publications by IvanTikhonov
607_Assignment3
#1. Using the 173 majors listed in fivethirtyeight.com’s College Majors dataset [https://fivethirtyeight.com/features/the-economic-guide-to-picking-a-college-major/], provide code that identifies the majors that contain either “DATA” or “STATISTICS” data_majors<- read.csv("https://raw.githubusercontent.com/fivethirtyeight/data/master...
1768 sym
DATA605_Assignment_4
With the attached data file, build and visualize eigenimagery that accounts for 80% of the variability. Provide full R code and discussion. library(doParallel) ## Loading required package: foreach ## Loading required package: iterators ## Loading required package: parallel library(foreach) library(iterators) library(parallel) library(jpeg) ...
695 sym R (1882 sym/16 pcs) 3 img
605_Assignment_5
(Bayesian). A new test for multinucleoside-resistant (MNR) human immunodeficiency virus type 1 (HIV-1) variants was recently developed. The test maintains 96% sensitivity, meaning that, for those with the disease, it will correctly report “positive” for 96% of them. The test is also 98% specific, meaning that, for those without the diseas...
4225 sym
DATA607_Project1
Project Description In this project, you’re given a text file with chess tournament results where the information has some structure. Your job is to create an R Markdown file that generates a .CSV file (that could for example be imported into a SQL database) with the following information for all of the players: Player’s Name, Player’s ...
1221 sym R (5268 sym/15 pcs)
DATA606_Lab3
library(usdata) library(cherryblossom) library(tidyverse) ## ── Attaching packages ─────────────────────────────────────── tidyverse 1.3.2 ── ## ✔ ggplot2 3.3.6 ✔ purrr 0.3.4 ## ✔ tibble 3.1.8 ✔ dplyr 1.0.10 ## ✔ tidyr 1.2.1 ...
6730 sym R (3182 sym/24 pcs) 2 img
DATA607_Assignment_5
Assignment – Tidying and Transforming Data Load required packages library(tidyr) library('dplyr') ## ## Attaching package: 'dplyr' ## The following objects are masked from 'package:stats': ## ## filter, lag ## The following objects are masked from 'package:base': ## ## intersect, setdiff, setequal, union Create a .CSV file (or...
638 sym R (5522 sym/18 pcs)
DATA605_Assignment6
A bag contains 5 green and 7 red jellybeans. How many ways can 5 jellybeans be withdrawn from the bag so that the number of green ones withdrawn will be less than 2? # For the draw that contains 1 greenjelly beans we can have 5 choose 1 where 5 jelly beans can have 1 position choose(5,1) ## [1] 5 # Then for other spots must be filled with red...
2579 sym Python (5026 sym/56 pcs)
DATA606_Lab4
The normal distribution In this lab, you’ll investigate the probability distribution that is most central to statistics: the normal distribution. If you are confident that your data are nearly normal, that opens the door to many powerful statistical methods. Here we’ll use the graphical tools of R to assess the normality of our data and a...
9900 sym R (4253 sym/34 pcs) 13 img
DATA605HW7
Let X1, X2, . . . , Xn be n mutually independent random variables, each of which is uniformly distributed on the integers from 1 to k. Let Y denote the minimum of the Xi’s. Find the distribution of Y . For 1≤j≤k,m(j)=((k−j+1)n−(k−j)n)/kn Since Y is the minimum value of Xi over all of the Xi’s, then in order to find the distribut...
2132 sym
DATA606_Lab5_Sampling distributions
Foundations for statistical inference - Sampling distributions In this lab, you will investigate the ways in which the statistics from a random sample of data can serve as point estimates for population parameters. We’re interested in formulating a sampling distribution of our estimate in order to learn about the properties of the estimate,...
11671 sym R (4906 sym/31 pcs) 5 img