Publications by Yina Qiao
Data 606 lab 7
In this lab, I will explore and visualize the data using the tidyverse suite of packages, and perform statistical inference using infer. The data can be found in the companion package for OpenIntro resources, openintro. LOAD Packages library(tidyverse) library(openintro) library(infer) The Data Every two years, the Centers for Disease Control and...
17823 sym 5 img
Data 606 final project proposal
Data Preparation Install Packages install.packages("tidyverse") ## Installing package into '/cloud/lib/x86_64-pc-linux-gnu-library/4.2' ## (as 'lib' is unspecified) library(tidyverse) ## load data my_loan_data<- read.csv("https://raw.githubusercontent.com/yinaS1234/data-606/main/606%20final%20project/loan_data.csv") head(my_loan_data) ## Loan_...
7059 sym 5 img
Data 606 Lab 6
LOAD Packages library(tidyverse) library(openintro) library(infer) set.seed(74226) The Data I will be analyzing the same dataset as in the previous lab, where you delved into a sample from the Youth Risk Behavior Surveillance System (YRBSS) survey, which uses data from high schoolers to help discover health patterns. The dataset is called yrbss. ...
12560 sym Python (6552 sym/31 pcs) 7 img
DATA 606 LAB 5 Part A
Introduction In this lab, you will investigate the ways in which the statistics from a random sample of data can serve as point estimates for population parameters. We’re interested in formulating a sampling distribution of our estimate in order to learn about the properties of the estimate, such as its distribution. Setting a seed: We will take ...
6995 sym R (8528 sym/36 pcs) 4 img
Data 606 lab 5 Part B
Introduction In this lab, we will explore and visualize the data using the tidyverse suite of packages, and perform statistical inference using infer. Libraries library(tidyverse) ## ── Attaching packages ─────────────────────────────────────── tidyverse 1.3.2 ──...
5866 sym R (3051 sym/25 pcs)
DATA 607 lab 7
Introduction Pick three of your favorite books on one of your favorite subjects. At least one of the books should have more than one author. For each book, include the title, authors, and two or three other attributes that you find interesting. Take the information that you’ve selected about these three books, and separately create three files wh...
1275 sym R (4253 sym/10 pcs)
Data 606 lab 4
Load packages knitr::opts_chunk$set(eval = TRUE, message = FALSE, warning = FALSE) library(tidyverse) ## ── Attaching packages ─────────────────────────────────────── tidyverse 1.3.2 ── ## ✔ ggplot2 3.4.1 ✔ purrr 0.3.4 ## ✔ tibble 3.1.7 ✔ dplyr ...
4453 sym R (5864 sym/37 pcs) 17 img
data 607 project 2
Introduction The goal of this assignment is to practice preparing different datasets for downstream analysis. DataSet 1 NYC MTA Subway Ridership from 2013 load packages # Load required libraries library(tidyverse) load data url <- 'https://raw.githubusercontent.com/yinaS1234/data-607/main/project%202/Annual20Subway20Ridership.csv' dfMTA ...
1215 sym R (8772 sym/22 pcs) 4 img
Data 606 lab 3
The Hot Hand Basketball players who make several baskets in succession are described as having a hot hand. Fans and players have long believed in the hot hand phenomenon, which refutes the assumption that each shot is independent of the next. However, a 1985 paper by Gilovich, Vallone, and Tversky collected evidence that contradicted this belief an...
12080 sym R (5146 sym/30 pcs) 2 img
DATA 607 LAB 5
Introduction This assignment will import uncleaned data from a .csv file. My task is to tidy and transform data as described below. (1) Read the information from a .CSV file into R, and use tidyr and dplyr as needed to tidy and transform the data. (2) Perform analysis to compare the arrival delays for the two airlines Install and load packages libr...
975 sym R (3979 sym/14 pcs)