Publications by Longhow Lam
A “poor man’s video analyzer”…
Introduction Not so long ago there was a nice dataiku meetup with Pierre Gutierrez talking about transfer learning. RStudio recently released the keras package, an R interface to work with keras for deep learning and transfer learning. Both events inspired me to do some experiments at my work here at RTL and explore the usability of it for us at ...
2580 sym 16 img
Interactive sunbuRst graphs in Power BI in 5 minutes!!
Introduction If I mention Power BI to fellow data scientists I often get strange looks. However, I quite like the tool, it is an easy and fast way to share results, KPI’s and graphs with others. With the latest release, Power BI now supports interactive R graphs, and they are easy to create as well. Steps to follow 1. Install Node.JS from ...
1993 sym R (95 sym/3 pcs) 14 img
Oil leakage… those old BMW’s are bad :-)
Introduction My first car was a 13 year Mitsubishi Colt, I paid 3000 Dutch Guilders for it. I can still remember a friend that would not like me to park this car in front of his house because of possible oil leakage. Can you get an idea of which cars will likely to leak oil? Well, with open car data from the Dutch RDW you can. RDW is the Nethe...
2448 sym 12 img
The one function call you need to know as a data scientist: h2o.automl
Introduction Two things that recently came to my attention were AutoML (Automatic Machine Learning) by h2o.ai and the fashion MNIST by Zalando Research. So as a test, I ran AutoML on the fashion mnist data set. H2o AutoML As you all know a large part of the work in predictive modeling is in preparing the data. But once you have done that, ideall...
2539 sym R (142 sym/1 pcs) 8 img
The Bold & Beautiful Character Similarities using Word Embeddings
Introduction I often see advertisement for The Bold and The Beautiful, I have never watched a single episode of the series. Still, even as a data scientist you might be wondering how these beautiful ladies and gentlemen from the show are related to each other. I do not have the time to watch all these episodes to find out, so I am going to use wo...
2679 sym Python (627 sym/4 pcs) 10 img
Dataiku 4.1.0: More support for R users!
Introduction Recently, Dataiku 4.1.0 was released, it now offers much more support for R users. But wait a minute, Data-what? I guess some of you do not know Dataiku, so what is Dataiku in the first place? It is a collaborative data science platform created to design and run data products at scale. The main themes of the product are: Collaboratio...
5211 sym 20 img
Association rules using FPGrowth in Spark MLlib through SparklyR
Introduction Market Basket Analysis or association rules mining can be a very useful technique to gain insights in transactional data sets, and it can be useful for product recommendation. The classical example is data in a supermarket. For each customer we know what the individual products (items) are that he has bought. With association rules m...
4612 sym R (1105 sym/7 pcs) 16 img
The ‘I-Love-IKEA’ – web app, built at the IKEA Hackaton with R and Shiny
Introduction On the 8th, 9th and 10th of December I participated at the IKEA hackaton. In one word it was just FANTASTIC! Well organized, good food, and participants from literally all over the world, even the heavy snow fall on Sunday did not stop us from getting there! I formed a team with Jos van Dongen and his son Thomas van Dongen and we cr...
2934 sym 20 img
t-sne dimension reduction on Spotify mp3 samples
Introduction Not long ago I was reading on t-Distributed Stochastic Neighbor Embedding (t-sne), a very interesting dimension reduction technique, and on Mel frequency cepstrum a sound processing technique. Details of both techniques can be found here and here. Can we combine the two in a data analysis exercise? Yes, and with not too much R code ...
3780 sym R (609 sym/5 pcs) 10 img
Is that a BMW or a Peugeot?
Introduction My son is 8 years old and he has shown a lot of interest in cars, which is strange because I have zero interest in cars. But he is driving me crazy when we have a car ride: “dad is that an Peugeot?“, “dad, that is an Audi” and “that is a BMW, right?“, “That is another cool BMW, why don’t we have a BMW?“. He is prett...
2570 sym R (644 sym/3 pcs) 8 img