Publications by Steph Bradley
learning-r-wk6
1) Compare Two Means 1.1) Create two sample sets Create two samples from the gcookbook::heightweight data set , each of size 15, but one Female only, and the other Male only. # Create initial data set from gcookbook pop <-gcookbook::heightweight # Set seed and sample number 'N' set.seed(1515) N <- 15 # Female sample of 15 students samp_fem <- ...
2891 sym Python (6974 sym/8 pcs) 2 img
learning-r-wk8
About the data The World Happiness Report is a survey of the state of global happiness. It ranks countries by how happy their citizens perceive themselves to be, primarily using data from the Gallup World Poll. The first report was published in 2012, and has focused on a specific theme each subsequent year. Data is collected from people in over 1...
6140 sym 6 img 1 tbl
learning-r-wk5
1) Histogram and mean of population Use the data set gcookbook::heightweight as the ‘population’ pop <- gcookbook::heightweight 1.1) Calculate the count, mean, and standard deviation of the population weightLb pop_count <- nrow(pop) pop_mean_wt <- round(mean(pop$weightLb),2) pop_sd_wt <- round(sd(pop$weightLb),2) There are 236 observations in...
1410 sym 2 img
learning-r-wk4
Data Aquisition I restructured the data from Wikipedia’s Anscombe’s Quartet in Excel to create a new .xlsx file with 44 observations of 3 variables, and read the new file into my main data frame. Note: My new file, called “aquartet.xlsx”, is included in the submitted zip file. df <- read_excel("aquartet.xlsx") Summary Statitics I created...
868 sym 1 img 1 tbl
learning-r-wk3
1. Data Aquisition Automate the download of 2 files “File Layout”: cc-est2020-alldata6.pdf dated 27-JUL-2021 “All States”: cc-est2020-alldata6.csv dated 27-JUL-2021 Note: This is a 135MB file, ~836K rows with 50 columns This chunk of code does the following: Checks if a folder called “datafiles” exists, if not, creates it Checks i...
3314 sym
learning-r-wk2
Libraries used: tidyverse kableExtra cowplot Load data and present summary statistics df <- read.csv("hw2_DataSet.csv") For each data set, calculate mean, median, and 1st and 3rd quartile values. I first printed a summary of the data so that I would have correct results to compare my code to. summary(df) ## sample random_A ...
638 sym 1 img 1 tbl
learning-r-wk1
For this assignment, I used the following libraries: plyr tidyverse ggplot2 Aquire data set Download and import data from https://public.tableau.com/en-us/s/resources url <- "https://public.tableau.com/s/sites/default/files/media/titanic%20passenger%20list.csv" dest <- "titanic passenger list.csv" download.file(url, dest) Import data as titanic...
1690 sym 1 img 1 tbl