Publications by Pablo Casas

Fast data exploration for predictive modeling

18.09.2019

The problem: Before modeling, we need to check or change numerical and categorical variables, NAs, variables with a single unique value, and high-cardinality variables. Version 1.9.2 of funModeling was released to assist with this step, which comes before creating machine learning models. Introduction: The data_integrity function provides information about the format of all t...

2597 sym R (3536 sym/9 pcs) 6 img

Automatic data types checking in predictive models

14.10.2019

The problem: We have data, and we need to create models (xgboost, random forest, regression, etc.). Each of them has its own constraints regarding data types. Many strange errors appear when we are creating models just because of the data format. The new version of funModeling, 1.9.3 (Oct 2019), aims to provide quick and clean assistance with this. Cover...

2932 sym R (1799 sym/8 pcs) 4 img

How Auth0’s Data Team uses R and Python

03.12.2019

The Data team is responsible for crunching, reporting, and serving data. The team also integrates data with other systems and creates machine learning and deep learning models. With this post, we intend to share our favorite tools, which are proven to run with thousands of millions of data points. Scaling processes in real-world scenarios is a hot...

7706 sym 10 img

Tips before migrating to a newer R version

28.04.2020

This post is based on real events. Several times, when I installed the latest version of R and proceeded to install all the packages I had in the previous version, I encountered problems. The same applies when updating packages after a while. I decided to write this post after seeing the community's reception of a quick post I made: This post -also ...

5394 sym 12 img

funModeling: New site, logo and version

15.06.2020

Hi there! {tl;dr} Website, here ✅ In case you don’t know, funModeling is the package I’ve been developing over the last few years. It’s focused on exploratory data analysis, data preparation, and the evaluation of models. News: Yesterday I published the latest version, which fixes one of the plots in cross_plot. But that’s not as funny as the...

1920 sym 14 img

funModeling: New site, logo and version

15.06.2020

Hi there! {tl;dr} Website, here ✅ In case you don’t know, funModeling is the package I’ve been developing over the last few years. It’s focused on exploratory data analysis, data preparation, and the evaluation of models. News: Yesterday I published the latest version, which fixes one of the plots in cross_plot. But that’s not as funny as the...

1925 sym 14 img