Publications by Pablo Casas
Fast data exploration for predictive modeling
The problem: Before modeling, we need to check and, where necessary, change numerical and categorical variables, NAs, variables with a single unique value, and high-cardinality variables. funModeling 1.9.2 was released to assist with this step, which precedes the creation of machine learning models. Introduction: The data_integrity function provides information about the format of all t...
2597 sym R (3536 sym/9 pcs) 6 img
Automatic data types checking in predictive models
The problem: We have data, and we need to create models (xgboost, random forest, regression, etc.). Each of them has its own constraints on data types, and many strange errors during model creation are caused solely by the data format. funModeling 1.9.3 (Oct 2019) aims to provide quick and clean assistance with this. Cover...
2932 sym R (1799 sym/8 pcs) 4 img
How Auth0’s Data Team uses R and Python
The Data team is responsible for crunching, reporting, and serving data. The team also integrates data with other systems and creates machine learning and deep learning models. With this post, we intend to share our favorite tools, which are proven to run on thousands of millions of data points. Scaling processes in real-world scenarios is a hot...
7706 sym 10 img
Tips before migrating to a newer R version
This post is based on real events. Several times, after installing the latest version of R and reinstalling all the packages I had in the previous version, I ran into problems. The same applies when updating packages after a while. I decided to write this post after seeing the community's reception of a quick post I made: This post -also ...
5394 sym 12 img
funModeling: New site, logo and version
Hi there! {tl;dr} Website, here ✅ In case you don’t know, funModeling is the package I’ve been developing over the last few years. It’s focused on exploratory data analysis, data preparation and the evaluation of models. News: Yesterday I published the latest version, which fixes one of the plots in cross_plot. But that’s not as fun as the...
1920 sym 14 img
funModeling: New site, logo and version
Hi there! {tl;dr} Website, here ✅ In case you don’t know, funModeling is the package I’ve been developing over the last few years. It’s focused on exploratory data analysis, data preparation and the evaluation of models. News: Yesterday I published the latest version, which fixes one of the plots in cross_plot. But that’s not as fun as the...
1925 sym 14 img