Publications by LeTicia Cancel

DATA622_Homework1

29.10.2022

Libraries library(rcompanion) library(datasets) library(caTools) library(dplyr) library(party) library(rpart.plot) library(caret) Import Data df100 <- read.csv("https://raw.githubusercontent.com/ltcancel/DATA622/main/Homework1/1000%20Sales%20Records.csv") df1000 <- read.csv("https://raw.githubusercontent.com/ltcancel/DATA622/main/Homework...

1812 sym R (16526 sym/29 pcs) 1 img

DATA622 Homework 4

24.12.2022

Homework #4 Assignment: You get to decide which dataset you want to work on. The data set must be different from the ones used in previous homework. You can work on a problem from your job, or something you are interested in. You may also obtain a dataset from sites such as Kaggle, Data.Gov, Census Bureau, USGS or other open data portals. Sele...

4889 sym R (7505 sym/34 pcs) 14 img

DATA607_HWK1_F20

30.08.2020

Intro The data for this assignment is from the latest Governor Polls data found on the FiveThirtyEight site. Load libraries #install.packages("RCurl") library(RCurl) library(ggplot2) Load Governors Polls Data from GitHub and create a dataframe x <- getURL("https://raw.githubusercontent.com/ltcancel/DATA607_HWK1/master/Week1/polls/governor_po...

544 sym R (5650 sym/5 pcs) 2 img

HWK2_Data607_F20

06.09.2020

Introduction This is an analysis of movie reviews taken from five imaginary individuals. Each “reviewer” was asked to share their opinion of up to six films on a scale of 1 – 5 with 1 being “Below Average” and 5 being “Excellent”. There is also a response of N/A – Did not watch film which is meant to catch any reviewers who left t...

2892 sym R (2724 sym/10 pcs) 2 img

HWK7_Data607_F20

10.10.2020

Libraries library(RCurl) library(xml2) library(dplyr) library(tidyr) library(jsonlite) library(rvest) Import Data Get all 3 book files from GitHub #Book 1 - XML file xmlFile <- "https://raw.githubusercontent.com/ltcancel/Homework7_Data607_F20/main/Book1.xml" book1 <- read_xml(xmlFile) #Book 2 - HTML file htmlFile <- "https://raw.github...

162 sym R (3459 sym/18 pcs)

Project2_Data607_F20

04.10.2020

Introduction Below is an analysis of different data sets provided by classmates. This analysis will attempt to answer specific questions from the person who provided the data, or a question that seems appropriate during data tidying. Libraries needed for analysis. library(sf) library(DBI) library(RCurl) library(readr) library(ggplot2) library...

2754 sym R (7019 sym/20 pcs) 5 img

HWK3_Data607_F20

12.09.2020

Question 1 Code that identifies majors that contain either “DATA” or “STATISTICS” from [Five Thirty Eight](https://fivethirtyeight.com/features/the-economic-guide-to-picking-a-college-major/). library(rvest) library(xml2) library(tidyverse) # Get URL of College Majors list site <- "https://projects.fivethirtyeight.com/mid-levels/col...

1111 sym R (2757 sym/11 pcs)

DATA607_Project1_F20

20.09.2020

# Libraries #library(tidyverse) library(RCurl) library(dplyr) library(stringr) library(tidyr) Import Tournament Data The tournament text file is saved in GitHub. The data is saved to the table t_df and the columns are renamed. #get URL and import data from GitHub site <- getURL("https://raw.githubusercontent.com/ltcancel/Project1_Data607_F...

878 sym R (10935 sym/16 pcs)

HWK5_Data607_F20

26.09.2020

#Libraries needed library(RCurl) library(dplyr) library(tidyverse) library(tidyfast) library(tidyr) library(ggplot2) Import Flights Data Import Flights data from GitHub site <- getURL("https://raw.githubusercontent.com/ltcancel/Homework-5/master/Airline%20Data.csv") df <- read.csv(text = site, sep = ",", header = TRUE) head(df) ## ...

732 sym R (6237 sym/24 pcs) 3 img

Project3_DATA607_F20

17.10.2020

Introduction Below is code for scraping Data Scientist job postings from LinkedIn.com using rvest. The goal of this scrape is to answer the question “Which are the most valued data science skills?” The code below was also used by my teammates to create functions that could be called to extract the same data, or to possibly create an R packag...

2086 sym R (4712 sym/11 pcs) 2 img