Publications by Marilyn Macy

RollerCoasters_Project1_clean

15.03.2021

The starting data set
The original file of roller coaster data from Kaggle, Kaggle roller coaster data, is a tidy dataset with 14 variables in 157 rows. The data includes:
Coaster name, character
Theme or amusement park, city and state where it’s located, all character
Year opened, four-digit integer
Type of construction, categorical: wood or ...

7724 sym 6 img 8 tbl

Data 101: Feb 25 Lab pivot exercises - Macy

03.03.2021

library(tidyverse)
## ── Attaching packages ─────────────────────────────────────── tidyverse 1.3.0 ──
## ✓ ggplot2 3.3.3     ✓ purrr   0.3.4
## ✓ tibble  3.1.0     ✓ dplyr   1.0.4
## ✓ tidyr   1.1.2     ✓ stringr 1.4.0
## ✓ readr   1.4.0     ✓ f...

101 sym R (15717 sym/46 pcs)

Data 101 - Tidy and Untidy datasets - Macy

03.03.2021

Assignment: Find two examples of untidy datasets and two of tidy datasets. In R, tidy the untidy ones, and be prepared to discuss them in class next week.
library(tidyverse)
## -- Attaching packages --------------------------------------- tidyverse 1.3.0 --
## v ggplot2 3.3.3     v purrr   0.3.4
## v tibble  3.1.0     v dplyr   1.0.4
## v tidyr   1.1.2     ...

1563 sym R (10401 sym/19 pcs)

Data 110-Project 1: RollerCoasters

07.03.2021

pacman::p_load(knitr, readr, RColorBrewer, tidyverse)
The starting data set
The original csv file of roller coaster data from Kaggle, Kaggle roller coaster data, is a tidy dataset with 14 variables in 157 rows.
rc <- as_tibble(read.csv("C:/Users/Owner/Desktop/My Documents/School/DataFiles/RollerCoaster/RollerCoaster.csv"))
str(rc)
## tibble [15...

8465 sym R (17041 sym/51 pcs) 6 img 5 tbl

Data 101 Chapter 13 Exercises - Macy

10.03.2021

pacman::p_load(tidyverse, nycflights13, maps, RColorBrewer, lubridate, knitr)
Exercises 13.4.6
https://r4ds.had.co.nz/relational-data.html#exercises-30
1. Compute the average delay by destination, then join on the airports data frame so you can show the spatial distribution of delays.
delays <- filter(flights, arr_delay > 0) %>% # delays...

3559 sym R (11697 sym/39 pcs) 8 img 2 tbl
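The exercise previewed above (average delay by destination, then a join to the airports table) can be sketched in base R without loading any packages. This is only an illustration under stated assumptions: `flights_toy` and `airports_toy` are hypothetical stand-ins for `nycflights13::flights` and `nycflights13::airports`, their values are made up, and `aggregate()` plus `merge()` are swapped in for the dplyr pipeline the publication itself uses.

```r
# Hypothetical toy tables standing in for nycflights13::flights and airports.
# All values are invented for illustration; coordinates are approximate.
flights_toy <- data.frame(
  dest      = c("ATL", "ATL", "ORD", "ORD", "LAX"),
  arr_delay = c(10, 30, 5, NA, 20)
)
airports_toy <- data.frame(
  faa = c("ATL", "ORD", "LAX"),
  lat = c(33.64, 41.98, 33.94),
  lon = c(-84.43, -87.90, -118.41)
)

# Keep only delayed arrivals; subset() drops rows where the condition is NA.
delayed <- subset(flights_toy, arr_delay > 0)

# Mean arrival delay per destination.
avg_delay <- aggregate(arr_delay ~ dest, data = delayed, FUN = mean)

# Join the per-destination averages to airport coordinates (dest matches faa).
delays_geo <- merge(avg_delay, airports_toy, by.x = "dest", by.y = "faa")
print(delays_geo)
```

The resulting table has one row per destination with its mean delay, latitude, and longitude, which is the shape needed to plot delays on a map.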

Data 101 Chapter 14 Exercises - Macy

25.03.2021

pacman::p_load(tidyverse)
14.2.5 Exercises
In code that doesn’t use stringr, you’ll often see paste() and paste0(). What’s the difference between the two functions? What stringr function are they equivalent to? How do the functions differ in their handling of NA?
paste() concatenates with a space between elements, and changes NA value to c...

3464 sym R (31417 sym/181 pcs)
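The difference between `paste()` and `paste0()` described in the preview above can be checked with a few lines of base R. This is a minimal sketch, not the publication's own answer; the variable names are hypothetical, and stringr is deliberately left out so the block runs without extra packages.

```r
# paste() joins its arguments with a single space by default.
with_space <- paste("data", "science")

# paste0() joins its arguments with no separator.
no_space <- paste0("data", "science")

# Both functions coerce NA to the literal string "NA" before joining.
na_space <- paste("x", NA)
na_plain <- paste0("x", NA)

print(c(with_space, no_space, na_space, na_plain))
```

By contrast, stringr's `str_c()` propagates missing values, so `str_c("x", NA)` returns `NA` rather than a string containing "NA".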

Data 110 - Unit 9 Webscraping

03.04.2021

Read IMDb webpage for ranked movies
pacman::p_load(rvest, ggplot2, tidyverse, knitr) # Loading packages
# Specifying the url for desired website to be scraped
url <- 'http://www.imdb.com/search/title?count=100&release_date=2016,2016&title_type=feature'
webpage <- read_html(url) # Reading the HT...

1511 sym R (15528 sym/83 pcs) 3 img 3 tbl

Data 101 - Spring Semester 2021 Final Project

07.05.2021

Parkinson’s Disease Speech Data
Abstract: A study of speech analysis measurements for untreated Parkinson’s disease patients and subjects with REM sleep behavior disorder reveals that an automated tool for early diagnosis of Parkinson’s Disease can be reliable. The study is presented in the article “Automated analysis of connected speech...

9853 sym R (24718 sym/32 pcs) 15 img 6 tbl

Data 110 - Project 2 Parkinson Speech Analysis

19.04.2021

Data 110 - Project 2
Automating Parkinson’s Disease diagnosis using Speech Recognition
Dysarthria is a speech disorder where the muscles used to produce speech are damaged, paralyzed, or weakened. Close to 90% of people with Parkinson’s Disease (PD) present with hypokinetic dysarthria, evidenced by reduced vocal loudness, monotone, reduced f...

8241 sym R (23492 sym/24 pcs) 13 img 5 tbl

Data 110 - Project 3 Parkinson's Speech Analysis continued

09.05.2021

Overview
In “Data 110 Project 2 - Parkinson’s Speech Analysis” [3], I approximated a diagram from the article “Automated analysis of connected speech reveals early biomarkers of Parkinson’s disease in patients with rapid eye movement sleep behaviour disorder” [1] using the data associated with the study. In this extension of Project 2, I’ll...

8954 sym R (19704 sym/25 pcs) 11 img 12 tbl