Publications by Vyanna Hill
library(tidyverse) library(openintro) library(infer) library(dplyr) data("yrbss",package="openintro") Exercise 1 Overall, there were 4792 students who did not text while driving and 7873 students who texted while driving in the past 30 days. yrbss%>%count(text_while_driving_30d) ## # A tibble: 9 x 2 ## text_while_driving_30d n ## <c...
library(tidyverse) library(openintro) library(infer) library(ggplot2) library(dplyr) data('yrbss', package='openintro') Exercise 1 There are 13,583 cases in the data set. glimpse(yrbss) ## Rows: 13,583 ## Columns: 13 ## $ age <int> 14, 14, 15, 15, 15, 15, 15, 14, 15, 15, 15, 1~ ## $ gender <chr> "female...
library(tidyverse) library(dplyr) library(scales) Intro| Loading Excel sheet In order to look inside the vaccination chart, I created my own sheet in a “wide” format. Let’s load the work below. data<-read.csv("Israel.csv",header = TRUE) print(data) ## Age Population Percentage severe ## 1 <50 1116834|3501118 0.233|0.73 ...
library(tidyverse) library(dplyr) library(lemon) library(rmarkdown) library(stringr) library(lubridate) library(ggpubr) library(ggplot2) <- lemon_print Intro to wide data set collection For project 3, I selected three data sets from our discussion boards. I will play with problems given by Benson, Tora, and Jay. B...
library(tidyverse) library(rvest) library(dplyr) library(lemon) library("XML") library("methods") library("rjson") <- lemon_print Introduction to formats outside of .csv For this assignment, I will read a collection of books from three different files with R. The files will be .XML, .JSON, and .HTML. All the books wi...
library(tidyverse) library(dplyr) library(tidytext) library(janeaustenr) library(dplyr) library(stringr) library(tm) library(textdata) library(gutenbergr) library(stopwords) library(ggplot2) library(wordcloud) library(lexicon) Introduction to sentiments and text analysis From Chapter 2 in “Text Mining with R: A Tidy Approach”...
Data 606 Final Project Vyanna Hill 4/20/2022 Abstract In reflection, the data set is a collection of randomly-picked surveys structured by the Behavioral Risk Factor Surveillance System (BRFSS). For the statistical analysis, the null hypothesis was used to determine if there was a relationship between the survey takers’ education and their rout...
Food Deserts in the US Vyanna Hill 2022-05-11 library(tidyverse) library(ggplot2) library(tidycensus) library(data.table) library(readr) library(dplyr) library(ggpubr) library(sf) census_api_key("3d3549857b43fb9f12d359e3822a007cb6ad8ca9") Introduction For this project, I looked into the long-term problem of food inequality in low incom...
Introduction For my recommender system of choice, I selected Twitter’s search engine. It is one of the few social media platforms where users can choose between types of media in their search results. For example, users can search through specific types of media (both organic and promoted tweets). The tweets can be organized by users’ profiles, tweets,...
Data607 Final Proposal Vyanna Hill 4/23/2022 Abstract For my final project, I decided to revisit a data set from earlier in the semester. I wanted a data set where I could use location-based analysis. For my project’s data, I chose Chicago’s Food Inspection 1. There are two main objectives for my analysis. If there’s a present correlation...
