Publications by Eric

VDE2

17.05.2024

knitr::opts_chunk$set(echo = TRUE) netflix = read.csv("netflix_titles.csv") library(ggplot2) ## Warning: package 'ggplot2' was built under R version 4.3.3 custom_col = c("red", "green", "blue") pie <- ggplot(netflix, aes(x = "", fill = factor(type))) + geom_bar(width = 1) + theme(axis.line = element_blank(), plot.title = element_...

45 sym R (5119 sym/33 pcs) 9 img

VDE1

17.05.2024

test1 knitr::opts_chunk$set(echo = TRUE) orgdat <- read.csv("organdata.csv", header = T) head(orgdat) nrow(orgdat) ## [1] 238 ncol(orgdat) ## [1] 21 summary(orgdat) ## country year donors pop ## Length:238 Length:238 Min. : 5.20 Min. : 3514 ## Class :character Class ...

48 sym R (9541 sym/29 pcs) 4 img

The R-Podcast Episode 5: Basic Package Management

15.04.2012

After a brief delay here’s episode 5 of the R-Podcast. In this episode: R 2.15.0 released, listener feedback, and discussion on basic package management. I discuss helpful resources for finding packages, installation procedures, and how to determine what packages are installed in your R system, among other considerations. If you are interested...

2289 sym R (395 sym/1 pcs)

The R-Podcast Episode 6: Importing Data from External Sources

29.04.2012

In this episode: Listener feedback and importing data from external sources into R. We dive into the basics of importing delimited text files using read.table and its varients. We also discuss recommendations for importing MS Excel spreadsheet files, relational databases such as MySQL, data from HTML tables, and files produced by other statisti...

2228 sym Python (573 sym/1 pcs)

The R-Podcast Episode 7: Best Practices for Workflow Management

28.05.2012

Hello everybody, I am finally back with a new episode! In this episode: Hardware issues, major update to RStudio, new forums, and discussion on managing your workflow for projects. I discuss useful functions for executing R scripts and saving/loading R objects for future sessions, and summarize different solutions for organizing R code based on...

2284 sym R (449 sym/1 pcs)

The R-Podcast Episode 8: Visualization with ggplot2

20.06.2012

I’m happy to present this jam-packed episode of the R-Podcast dedicated to using the ggplot2 package for visualization. This episode will have a companion screencast released in the next few days. I use data from the Hockey Summary Project to demonstrate how to create a series of boxplots of NHL regular season attendance for each team. The R ...

2505 sym R (484 sym/1 pcs)

The R-Podcast Screencast 2: Visualization with ggplot2

23.06.2012

Here is the second screencast episode of the R-Podcast to accompany episode 8 of the R-Podcast: Visualization with ggplot2. In this screencast I demonstrate a real-time session of using ggplot2 to create boxplots for a visualization of hockey attendance in the NHL. The R code created in this screencast is available in our GitHub repository, and ...

2300 sym

The R-Podcast Episode 9: Adventures in Data Munging Part 1

05.08.2012

It’s great to be back with a new episode after an eventful break! This episode begins a series on my adventures in data munging, a.k.a data processing. I discuss three issues that demonstrate the flexibility and versatility R brings for recoding messy values, important inconsistent data files, and pinpointing problematic observations and vari...

1580 sym R (528 sym/1 pcs)

The R-Podcast Episode 10: Adventures in Data Munging Part 2

16.09.2012

I’m happy to present episode 10 of the R-Podcast! Season 1 of the R-Podcast concludes with part 2 of my series on data munging, in which I discuss issues surrounding importing data sets contained in HTML tables. I share how I used the XML and RCurl packages to validate and import data from hockey-reference.com for storage into a MySQL databas...

2306 sym R (525 sym/1 pcs)

The R-Podcast Episode 11: Reproducible Analysis Part 1 (Introduction)

13.11.2012

Season 2 of the R-Podcast is up and running! This episode begins a multi-part series on reproducible analysis using R. In this episode I discuss the usage of Sweave and LaTeX for producing reproducible reports, an introduction to the capabilities of the knitr package (more episodes will be coming dedicated to this package), and my motivation for...

2197 sym R (485 sym/1 pcs)