Publications by inside data blog
Using association rule mining in forensic accounting
A few years ago I started to use the R programming language more intensive while writing my master thesis. I used the wonderful arules package for mining association rules and frequent item sets from Michael Hahsler and others. I used this package in the field of forensic accounting. Forensic data analysis is a branch of digital forensics. It exa...
2019 sym 2 img
How to import a subset of a (too huge) csv file
Sometimes people create csv files that are just too huge to upload them into your R session while most of the times you just need a subset of this data set. Recently I tapped into this problem and first I tried to import the whole file with functions like fread or the classic read.csv but this did not help much as the file was just too big and my...
1367 sym Python (103 sym/1 pcs) 2 img
How to import multiple data files (the fast way)
Sometimes you have your data stored in multiple csv files and want to load them into a single data frame in R. There are several answers on the web to this questions and I recently found a fast solution to this problem. # packages require(data.table) # set wd setwd("PathToYourFolder") # import files files = list.files(pattern="*.csv") dataset =...
895 sym R (243 sym/1 pcs) 2 img
Doing a Twitter Analysis with R
Recently I took part at Coding Durer, a five days international and interdisciplinary hackathon for art history and information science. The goal of this hackathon is to bring art historians and information scientists together to work on data. It is kind of an extension to the cultural hackathon CodingDaVinci where I participated in the past. The...
2767 sym R (2119 sym/4 pcs) 4 img
Shiny App for cultural hackathon
Recently I took part at Coding Durer, a five days international and interdisciplinary hackathon for art history and information science. The goal of this hackathon is to bring art historians and information scientists together to work on data. It is kind of an extension to the cultural hackathon CodingDaVinci where I participated in the past. I a...
2343 sym 4 img
Creating abstract city maps for Leaflet usage
Leaflet is a great way to display spatial information in an interactive way. If you want to display the difference between different neighborhoods you would usually get the proper shapefiles on the web and connect your data to them. But sometimes it does not need detailed shapefiles and you want more abstraction to get your information across. I ...
1289 sym R (2590 sym/1 pcs) 4 img