Publications by Longhow Lam

A “poor man’s video analyzer”…

16.06.2017

Introduction: Not so long ago there was a nice Dataiku meetup with Pierre Gutierrez talking about transfer learning. RStudio recently released the keras package, an R interface to Keras for deep learning and transfer learning. Both events inspired me to do some experiments at my work here at RTL and explore its usability for us at ...

2580 sym 16 img

Interactive sunbuRst graphs in Power BI in 5 minutes!!

13.07.2017

Introduction: If I mention Power BI to fellow data scientists, I often get strange looks. However, I quite like the tool: it is an easy and fast way to share results, KPIs and graphs with others. With the latest release, Power BI now supports interactive R graphs, and they are easy to create as well. Steps to follow: 1. Install Node.js from ...

1993 sym R (95 sym/3 pcs) 14 img

Oil leakage… those old BMW’s are bad :-)

18.08.2017

Introduction: My first car was a 13-year-old Mitsubishi Colt; I paid 3000 Dutch guilders for it. I can still remember a friend who did not want me to park this car in front of his house because of possible oil leakage. Can you get an idea of which cars are likely to leak oil? Well, with open car data from the Dutch RDW you can. RDW is the Nethe...

2448 sym 12 img

The one function call you need to know as a data scientist: h2o.automl

30.08.2017

Introduction: Two things that recently came to my attention were AutoML (Automatic Machine Learning) by h2o.ai and the Fashion MNIST data set by Zalando Research. So as a test, I ran AutoML on the Fashion MNIST data set. H2O AutoML: As you all know, a large part of the work in predictive modeling is in preparing the data. But once you have done that, ideall...

2539 sym R (142 sym/1 pcs) 8 img

The Bold & Beautiful Character Similarities using Word Embeddings

12.10.2017

Introduction: I often see advertisements for The Bold and the Beautiful, but I have never watched a single episode of the series. Still, even as a data scientist you might be wondering how these beautiful ladies and gentlemen from the show are related to each other. I do not have the time to watch all these episodes to find out, so I am going to use wo...

2679 sym Python (627 sym/4 pcs) 10 img

Dataiku 4.1.0: More support for R users!

20.11.2017

Introduction: Recently, Dataiku 4.1.0 was released, and it now offers much more support for R users. But wait a minute, Data-what? I guess some of you do not know Dataiku, so what is Dataiku in the first place? It is a collaborative data science platform created to design and run data products at scale. The main themes of the product are: Collaboratio...

5211 sym 20 img

Association rules using FPGrowth in Spark MLlib through SparklyR

23.11.2017

Introduction: Market basket analysis, or association rules mining, can be a very useful technique to gain insights into transactional data sets, and it can be useful for product recommendation. The classical example is supermarket data. For each customer we know which individual products (items) he has bought. With association rules m...

4612 sym R (1105 sym/7 pcs) 16 img

The ‘I-Love-IKEA’ – web app, built at the IKEA Hackaton with R and Shiny

10.12.2017

Introduction: On the 8th, 9th and 10th of December I participated in the IKEA hackathon. In one word, it was just FANTASTIC! Well organized, good food, and participants from literally all over the world; even the heavy snowfall on Sunday did not stop us from getting there! I formed a team with Jos van Dongen and his son Thomas van Dongen and we cr...

2934 sym 20 img

t-sne dimension reduction on Spotify mp3 samples

31.01.2018

Introduction: Not long ago I was reading about t-Distributed Stochastic Neighbor Embedding (t-SNE), a very interesting dimension reduction technique, and about the Mel-frequency cepstrum, a sound processing technique. Details of both techniques can be found here and here. Can we combine the two in a data analysis exercise? Yes, and with not too much R code ...

3780 sym R (609 sym/5 pcs) 10 img

Is that a BMW or a Peugeot?

13.04.2018

Introduction: My son is 8 years old and he has shown a lot of interest in cars, which is strange because I have zero interest in cars. But he drives me crazy when we go for a car ride: “Dad, is that a Peugeot?”, “Dad, that is an Audi” and “That is a BMW, right?”, “That is another cool BMW, why don’t we have a BMW?”. He is prett...

2570 sym R (644 sym/3 pcs) 8 img