Publications by smarterpoland
A link that can tell more than dozens of lines of R code – what’s new in archivist?
Can you spot the difference between this plot: And this one: You are right! The latter has an embedded piece of R code. What for? It’s a call to the function aread() from archivist – a package that manages external copies of R objects. This piece of code was added by the function addHooksToPrint(), which enriches knitr reports with links to all ob...
2841 sym R (1405 sym/5 pcs) 4 img
MinechaRts #1 (Minecraft + R + Edgar Anderson’s Iris Data)
How can you use R to draw 3D scatterplots in Minecraft? Let’s see. Minecraft is a game about placing blocks and going on adventures (source). Blocks are usually placed by players, but there are add-ons that allow blocks to be added, modified, or removed through an external API. This feature is used in educational materials that show how to use Minecraft t...
2000 sym R (1095 sym/3 pcs) 6 img
Program of the European R users meeting [only 7 days to go]
The European R users meeting [eRum] starts in just 7 days. We expect over 250 participants, 10 invited talks, 47 regular talks, 13 lightning talks and 12 posters. To handle that much content, we have scheduled 18 sessions [+ workshops]. Find the program of the conference here or here. In the second sheet you will find a detailed lis...
884 sym 4 img
ggmail + forecast = how many emails I will get tomorrow?
During eRum 2016, Adam Zagdański gave a very good tutorial on time series modeling. Among other things, I learned that the forecast package (created by Rob Hyndman) got cool new plots based on the ggplot2 package. Let’s use it to play with mailbox statistics for my Gmail account! 1. Get the data Follow this link to download the data...
2168 sym 14 img
PISA 2015 – how to read/process/plot the data with R
Yesterday the OECD published results and data from the PISA 2015 study (Programme for International Student Assessment). It’s a very cool study – over 500 000 pupils (15 years old) are examined every 3 years. Raw data is publicly available, and one can easily access detailed information about pupils’ academic performance and detailed data from s...
1314 sym R (1358 sym/1 pcs) 2 img
How to weigh a dog with a ruler? (looking for translators)
We are working on a series of comic books that introduce statistical thinking and can be used as activity booklets in primary schools. The stories are built around the adventures of siblings: Beta (skilled mathematician) and Bit (data hacker). What is the connection between these comic books and R? All plots are created with ggplot2. The first story (...
1375 sym 10 img
Is it a job offer for a Data Scientist?
TL;DR Konrad Więcko and Krzysztof Słomczyński (with a little help from me) have created a system that tracks which skills are currently in demand in job offers for data scientists in Poland: which skills, how frequently they appear, and how the demand changes over time. The full description of how this was done: static, +shiny. Here: The shiny applica...
2192 sym 2 img
DIY – cheat sheets
I recently found that, in addition to a great list of cheatsheets designed by RStudio, one can also download a template for new cheatsheets from the RStudio Cheat Sheets webpage. With this template you can design your own cheatsheet and submit it to the collection of Contributed Cheatsheets (Garrett Grolemund will help to improve the submission if n...
1025 sym 4 img
intsvy: PISA for research and PISA for teaching
The Programme for International Student Assessment (PISA) is a worldwide study of 15-year-old school pupils’ scholastic performance in mathematics, science, and reading. Every three years more than 500 000 pupils from 60+ countries are surveyed along with their parents and school representatives. The study yields more than 1000 variables con...
3283 sym 6 img
Explain! Explain! Explain!
Predictive modeling is fun. With random forest, xgboost, lightgbm and other elastic models… Problems start when someone asks how predictions are calculated. Well, some black boxes are hard to explain. And this is why we need good explainers. In June, Aleksandra Paluszynska defended her master’s thesis Structure mining and knowledge extrac...
1708 sym 4 img