Publications by Karsten W.

Convenient access to Gapminder’s datasets from R

16.07.2012

In April, Hans Rosling examined the influence of religion on fertility. I used R to replicate a graphic from his talk:

> library(datamart)
> gm <- gapminder()
> #queries(gm)
> #
> # babies per woman
> tmp <- query(gm, "TotalFertilityRate")
> babies <- as.vector(tmp["2008"])
> names(babies) <- names(tmp)
> babies <- babies[!is.na(babies)]
> countries...

803 sym R (1127 sym/1 pcs) 2 img

ScraperWiki in R

29.07.2012

ScraperWiki describes itself as an online tool for gathering, cleaning and analysing data from the web. It takes a programming-oriented approach: users can implement ETL processes in Python, PHP or Ruby, share these processes with the community (or pay for privacy), and schedule automated runs. The software behind the service is open source, and the...

5378 sym R (1412 sym/5 pcs)

Some of Excel’s Finance Functions in R

16.02.2013

Last year I took a free online class on finance by Gautam Kaul. I recommend it, although I cannot compare it to other classes. The instructor made a great effort to motivate the concepts, structure the material, and encourage critical thinking and intuition. I believe this is an advantage of video lectures over books. Textbooks often co...

1462 sym R (156 sym/1 pcs)

Unit conversion in R

17.05.2013

Last weekend I submitted an update of my R package datamart to CRAN. It has been more than half a year since the last update, but there are only minor advances. The package is still in its early stages and very experimental. One new feature is the function uconv. Think iconv, but instead of converting character vectors between different encod...

1479 sym R (338 sym/5 pcs)

Consecutive Numbers in Lottery Draws

02.03.2014

A historian, a data scientist, a programmer, a mathematician, and a philosopher discuss how likely it is that a lottery draw (6 out of 49) contains two consecutive numbers.

The historian

The historian argues that from 1955 up to 2011, there were 5026 lottery draws in Germany, every Saturday, and from 2000 on, two draws every Wednesd...

3999 sym R (721 sym/6 pcs)
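
The question in the excerpt above can be checked with a short calculation. The following is a minimal sketch, not the code from the post itself: it assumes a standard counting argument (6-number draws without any consecutive pair correspond one-to-one to 6-number draws from 49 - 6 + 1 = 44 numbers) and compares the result with a simulation. The names p_exact, p_sim, n_sim and has_consecutive are illustrative, and the seed and simulation size are arbitrary choices.

```r
# Exact probability that a 6-out-of-49 draw contains at least one pair of
# consecutive numbers, using the complement "no two numbers are consecutive".
# Assumption: draws without consecutive numbers map one-to-one to
# 6-subsets of 44 numbers (shift the i-th smallest number down by i - 1).
p_exact <- 1 - choose(44, 6) / choose(49, 6)

# Simulation check (hypothetical helper, not from the original post):
# TRUE if the sorted draw has two neighbours that differ by exactly 1.
has_consecutive <- function(draw) any(diff(sort(draw)) == 1)

set.seed(1)        # arbitrary seed, for reproducibility only
n_sim <- 20000     # arbitrary number of simulated draws
p_sim <- mean(replicate(n_sim, has_consecutive(sample(49, 6))))

print(round(p_exact, 3))
print(round(p_sim, 3))
```

Under these assumptions the exact value comes out at roughly 0.495, so about every second draw should contain a consecutive pair; the simulated share should land close to that.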

Age of U.S. President Candidates

08.01.2016

This is a remake of a chart posted on reddit 6 months ago. I had an idea back then but did not work it out, and now the discussion is closed. The data comes from Wikipedia, dimdat and NYT. The graph was created with R; here is the source code.

637 sym 2 img

Intrinsic time for cryptocurrency data

20.01.2019

This week, I attended a three-day hacking event of the Crypto Traders Berlin Meetup group. The aim was to find relationships between sentiment data from bitcointalk and Twitter and the price of cryptocurrencies. In practice, this turns out not to be that easy. For starters, it is not easy to determine if the social media discussion spik...

1865 sym 2 img

Reviewing my First Shiny Project (1/n) – Buttons

03.03.2022

391 sym