Publications by Business Science
Cut your EDA time into 5 minutes with Exploratory DataXray Analysis (EDXA)
Do you know how long EDA (exploratory data analysis) used to take me? Not hours, not days… A full week! Listen, you don’t know how good you have it. With this new R package I’m about to show you (plus one BONUS hack), you’ll cut your EDA time into 5 minutes. Here’s how. Table of Contents Today I’m going to show you how to use dataxray...
9800 sym 30 img
SweetViz: Automated Exploratory Data Analysis (EDA) in Python
SweetViz is a Python library that makes exploratory data analysis (EDA) fast and effective. Learn how to investigate feature relationships using correlation and associations in the automated SweetViz report.Python Tips WeeklyThis article is part of Python-Tips Weekly, a bi-weekly video tutorial that shows you step-by-step how to do common Python ...
4079 sym 20 img
3 Ways to Read Multiple CSV Files: For-Loop, Map, List Comprehension
Reading many CSV files is a common task for a data scientist. In this free tutorial, we show you 3 ways to streamline reading CSV files in Python. You’ll read and combine 15 CSV Files using the top 3 methods for iteration.Python Tips WeeklyThis article is part of Python-Tips Weekly, a bi-weekly video tutorial that shows you step-by-step how to ...
4823 sym 16 img
explore: simplified exploratory data analysis (EDA) in R
When I began applying data science to the company I worked for in 2015, exploratory data analysis (the critical process for performing initial investigations to find important relationships in my data, cleaning my data, spotting anomalies, and checking my assumptions through data visualization) would take me any where from 1 day to 1 week. It was...
11688 sym 34 img
ggradar: radar plots with ggplot in R
Businesses have tons of data. But 90% have no clue how to use it. Here’s what they are overlooking… Within-Group Analysis. Groups are important (think customer segments). But group-wise comparisons can be tricky. Thankfully an R package exists to help us compare within groups using a visualization called a Radar Plot. What R package is it? I...
10541 sym 36 img
The Most Overlooked R Package (That Can Get You Through A Data Science Job Interview)
If you are looking to learn about the most useful R package that can help you get through a data science job interview AND you probably don’t know it yet, you’ve come to the right place, my friend! Here’s what’s in store for you today: If you want a job in data science, I’m going to show you how THIS R package can help you get through ...
10381 sym 40 img
Survival Analysis in R (in under 10-minutes)
Making a survival analysis can be a challenge even for experienced R users, but the good news is I’ll help you make beautiful, publication-quality survival plots in under 10-minutes. Here’s what WE are going to do: Make your first survival model and plot your survival analysis with survminer And I’ll give you 3 SPECIAL BONUSES! (that will ...
7996 sym 28 img
modelDown: Automate Explainable AI (Machine Learning) in R
Machine learning is great… until you have to explain it. Stakeholders are normally non-technical, C-suites that ultimately want to know what the model does for the business. And how it helps increase revenue or decrease costs. A new R package, modelDown can help. And, I want to teach you how to extract that precious business value that company�...
13365 sym 48 img
ggdensity: A new R package for plotting high-density regions
As data scientists, it can be downright impossible to drill into messy data. Fortunately, there’s a new R package that helps us focus on a “high-density region”, which is simply an area in a scatter plot defined by a high percentage of the data points. It’s called ggdensity. High Density Regions on a Scatter Plot In this R-tip, I’m goi...
6746 sym 26 img
Python Integration in RStudio – Data Science IDE Review
The two major data science languages, Python and R, have historically taken two separate paths when it comes to where data scientists are doing the coding. The R language has the RStudio IDE, which is a great IDE for data science because of its feature rich setup for efficiently developing analyses. The Python language has the Jupyter Notebook (a...
6920 sym R (6147 sym/25 pcs) 14 img 1 tbl