Publications by PK O’Flaherty
RBridgeWK1HW1
1. Write a loop that calculates 13-factorial. Bonus - try to do it two different ways (for example use a for loop and a while loop). Do not use the standard factorial function. The goal is to learn about how R uses loops. #calculate 13! with a for loop result1 <-1 for (x in 1:13) { result1 <- result1*x } #calculate 13! with a while loop result...
572 sym R (1382 sym/12 pcs)
RBridgeWK3Project
Project Contents Question for Analysis Data Exploration data selection summary statistics initial conclusions Data Wrangling manipulating strings manipulating numbers Graphics table of urban/rural divide by sex of head of household boxplot and pie chart of expenses by type histogram of age of head of household scatterplot of size of househo...
5711 sym R (6478 sym/19 pcs) 8 img
DATA607 WK1 Assignment
Overview In Walt Hickey’s 2014 article, How Americans Like Their Steak, he uses a survey of risk taking behavior along with how well done the respondent likes their steak to investigate the question, “Are risk-averse people more likely to order their steak well done?” Hickey (2014) How Americans Like Their Steak He didn’t see any relation...
2465 sym R (2719 sym/9 pcs) 1 img
DATA606 WK1 Lab
library(tidyverse) library(openintro) Exercise 1 arbuthnot$girls ## [1] 4683 4457 4102 4590 4839 4820 4928 4605 4457 4952 4784 5332 5200 4910 4617 ## [16] 3997 3919 3395 3536 3181 2746 2722 2840 2908 2959 3179 3349 3382 3289 3013 ## [31] 2781 3247 4107 4803 4881 5681 4858 4319 5322 5560 5829 5719 6061 6120 5822 ## [46] 5738 5717 5847 6203 6033 ...
5115 sym R (2395 sym/14 pcs) 3 img
DATA606 Data Project Proposal
Data Preparation # load data schsb <- read.csv(file = 'https://raw.githubusercontent.com/pkofy/DATA606/main/Data%20Project/F_SCH_SB_2020_latest.csv') Research question You should phrase your research question in a way that matches up with the scope of inference your dataset allows for. Are pension plans with higher liabilities more likely to be ...
2805 sym R (578 sym/5 pcs) 2 img
DATA607 Project 3
Getting Started Instructions Use data to answer the question, “Which are the most valued data science skills?” Proposed Implementation Our preliminary goal was to use multiple data sets from Kaggle to: create a rough association between job titles and pay bands identify skill sets in job listings using a codex assign a value to individual ...
8260 sym R (45115 sym/40 pcs)
DATA607 WK7 Assignment
Working with XML and JSON in R Getting Started Instructions Pick three of your favorite books on one of your favorite subjects. At least one of the books should have more than one author. For each book, include the title, authors, and two or three other attributes that you find interesting. Take the information that you’ve selected about the...
2194 sym R (5711 sym/15 pcs)
DATA606 WK6 Lab
Inference for Categorical Data Getting Started Load packages In this lab, we will explore and visualize the data using the tidyverse suite of packages, and perform statistical inference using infer. The data can be found in the companion package for OpenIntro resources, openintro. Let’s load the packages. library(tidyverse) library(openintro)...
11482 sym R (5396 sym/36 pcs) 2 img
DATA607 Project 2
Getting Started Instructions Choose any three of the “wide” datasets identified in the Week 6 Discussion items. For each of the three chosen datasets: Create a .CSV file (or optionally, a MySQL database!) that includes all of the information included in the dataset. You’re encouraged to use a “wide” structure similar to how the infor...
10285 sym R (19049 sym/62 pcs) 2 tbl
DATA 606 WK5A Lab
Foundations for statistical inference - Sampling distributions Getting Started Load packages We are operating in the tidyverse. # Load packages ---------------------------------------------------------------- library(tidyverse) library(openintro) library(infer) # Needed for resampling library(shiny) # Needed for the shiny app The data A 2019...
8137 sym R (5319 sym/19 pcs) 5 img