Publications by Jamal Rogers

Applied Data Science: Module 1 Lesson 4 Abstraction 1

16.05.2023

Table of contents Practical advice Other arguments Other file types Reading Data from File Author Jamal Rogers Published May 16, 2023 We load the tidyverse package to begin. library(tidyverse) An external comma-separated values (csv) can be read into R using read_csv() function from the readr package, a core package of the tidyverse. The ...

4940 sym

Applied Data Science: Module 1 Lesson 4 Abstraction 2

16.05.2023

Table of contents Guessing types Missing values, column types, and problems Column types Controlling Column Types Author Jamal Rogers Published May 16, 2023 We load the tidyverse package to continue using the readr package. library(tidyverse) A CSV file doesn’t contain any information about the type of each variable (i.e. whether it’...

4093 sym

Applied Data Science: Module 1 Lesson 4 Abstraction 3

16.05.2023

Reading Data from Multiple Files Author Jamal Rogers Published May 16, 2023 We load the tidyverse package to continue using the readr package. library(tidyverse) Sometimes your data is split across multiple files instead of being contained in a single file. For example, you might have sales data for multiple months, with each month’s data...

1351 sym

Applied Data Science: Module 1 Lesson 4 Abstraction 4

16.05.2023

Table of contents Writing to a file Data Entry Writing to a File and Data Entry Author Jamal Rogers Published May 16, 2023 We load the tidyverse package to continue using the readr package. library(tidyverse) Writing to a file readr also comes with two useful functions for writing data back to disk: write_csv() and write_tsv(). The most i...

2107 sym

Applied Data Science: Module 1 Lesson 4 Application

16.05.2023

Table of contents Star Wars Module 1 Lesson 4 Application Author Jamal Rogers Published May 16, 2023 library(tidyverse) library(rvest) Star Wars rvest includes a very simple example in vignette(“starwars”). This is a simple page with minimal HTML so it’s a good place to start. I’d encourage you to navigate to that page now and use...

1373 sym 1 img

Applied Data Science: Module 1 Lesson 3 Application

12.05.2023

Table of contents A Real-life Example Dataset Saving the dataset as csv Module 1 Lesson 3 Application Author Jamal Rogers Published May 12, 2023 library(tidyverse) A Real-life Example Dataset The data comes from the 2014 World Health Organization Global Tuberculosis Report, available at http://www.who.int/tb/country/data/download/en/. Ther...

899 sym

Applied Data Science: Module 1 Lesson 2 Abstraction 4

12.05.2023

Table of contents group_by() summarize() The slice_ functions Grouping by multiple variables Ungrouping dplyr Group Operations Author Jamal Rogers Published May 12, 2023 We shall use the flights dataset from the nycflights13 package. The tidyverse is loaded for the dplyr package. library(nycflights13) library(tidyverse) So far you’ve l...

4302 sym

Applied Data Science: Module 1 Lesson 2 Abstraction 1

12.05.2023

Table of contents filter() arrange() distinct() dplyr Row Operations Author Jamal Rogers Published May 12, 2023 We shall use the flights dataset from the nycflights13 package. The tidyverse is loaded for the dplyr package. library(nycflights13) library(tidyverse) The most important verbs that operate on rows of a dataset are filter(), wh...

3300 sym

Applied Data Science: Module 1 Lesson 2 Abstraction 3

12.05.2023

dplyr Pipe Operations Author Jamal Rogers Published May 12, 2023 We shall use the flights dataset from the nycflights13 package. The tidyverse is loaded for the dplyr package. library(nycflights13) library(tidyverse) We’ve shown you simple examples of the pipe above, but its real power arises when you start to combine multiple verbs. For...

961 sym

Applied Data Science: Module 1 Lesson 2 Abstraction 2

12.05.2023

Table of contents mutate() select() rename() relocate() dplyr Column Operations Author Jamal Rogers Published May 12, 2023 We shall use the flights dataset from the nycflights13 package. The tidyverse is loaded for the dplyr package. library(nycflights13) library(tidyverse) There are four important verbs that affect the columns without c...

4361 sym