Publications by Jamal Rogers
Applied Data Science: Module 1 Lesson 4 Abstraction 1
Table of contents Practical advice Other arguments Other file types Reading Data from File Author Jamal Rogers Published May 16, 2023 We load the tidyverse package to begin. library(tidyverse) An external comma-separated values (csv) can be read into R using read_csv() function from the readr package, a core package of the tidyverse. The ...
4940 sym
Applied Data Science: Module 1 Lesson 4 Abstraction 2
Table of contents Guessing types Missing values, column types, and problems Column types Controlling Column Types Author Jamal Rogers Published May 16, 2023 We load the tidyverse package to continue using the readr package. library(tidyverse) A CSV file doesn’t contain any information about the type of each variable (i.e. whether it’...
4093 sym
Applied Data Science: Module 1 Lesson 4 Abstraction 3
Reading Data from Multiple Files Author Jamal Rogers Published May 16, 2023 We load the tidyverse package to continue using the readr package. library(tidyverse) Sometimes your data is split across multiple files instead of being contained in a single file. For example, you might have sales data for multiple months, with each month’s data...
1351 sym
Applied Data Science: Module 1 Lesson 4 Abstraction 4
Table of contents Writing to a file Data Entry Writing to a File and Data Entry Author Jamal Rogers Published May 16, 2023 We load the tidyverse package to continue using the readr package. library(tidyverse) Writing to a file readr also comes with two useful functions for writing data back to disk: write_csv() and write_tsv(). The most i...
2107 sym
Applied Data Science: Module 1 Lesson 4 Application
Table of contents Star Wars Module 1 Lesson 4 Application Author Jamal Rogers Published May 16, 2023 library(tidyverse) library(rvest) Star Wars rvest includes a very simple example in vignette(“starwars”). This is a simple page with minimal HTML so it’s a good place to start. I’d encourage you to navigate to that page now and use...
1373 sym 1 img
Applied Data Science: Module 1 Lesson 3 Application
Table of contents A Real-life Example Dataset Saving the dataset as csv Module 1 Lesson 3 Application Author Jamal Rogers Published May 12, 2023 library(tidyverse) A Real-life Example Dataset The data comes from the 2014 World Health Organization Global Tuberculosis Report, available at http://www.who.int/tb/country/data/download/en/. Ther...
899 sym
Applied Data Science: Module 1 Lesson 2 Abstraction 4
Table of contents group_by() summarize() The slice_ functions Grouping by multiple variables Ungrouping dplyr Group Operations Author Jamal Rogers Published May 12, 2023 We shall use the flights dataset from the nycflights13 package. The tidyverse is loaded for the dplyr package. library(nycflights13) library(tidyverse) So far you’ve l...
4302 sym
Applied Data Science: Module 1 Lesson 2 Abstraction 1
Table of contents filter() arrange() distinct() dplyr Row Operations Author Jamal Rogers Published May 12, 2023 We shall use the flights dataset from the nycflights13 package. The tidyverse is loaded for the dplyr package. library(nycflights13) library(tidyverse) The most important verbs that operate on rows of a dataset are filter(), wh...
3300 sym
Applied Data Science: Module 1 Lesson 2 Abstraction 3
dplyr Pipe Operations Author Jamal Rogers Published May 12, 2023 We shall use the flights dataset from the nycflights13 package. The tidyverse is loaded for the dplyr package. library(nycflights13) library(tidyverse) We’ve shown you simple examples of the pipe above, but its real power arises when you start to combine multiple verbs. For...
961 sym
Applied Data Science: Module 1 Lesson 2 Abstraction 2
Table of contents mutate() select() rename() relocate() dplyr Column Operations Author Jamal Rogers Published May 12, 2023 We shall use the flights dataset from the nycflights13 package. The tidyverse is loaded for the dplyr package. library(nycflights13) library(tidyverse) There are four important verbs that affect the columns without c...
4361 sym