Publications by Nagdev

MinIO for Machine Learning Model Storage using Python

13.01.2020

MinIO is a object storage database which uses S3(from Amazon). This is a very convenient tool in for data scientists or machine learning engineers to easily collaborate and share data and machine learning models. MinIO is a cloud storage server compatible with Amazon S3, released under Apache License v2. As an object store, MinIO can store unstru...

2891 sym Python (3330 sym/24 pcs)

Using Cassandra Through R

11.12.2018

In the last couple of years, there has been a lot of buzz around open source community. Almost every day, there are a lot of tools being open sourced. With a ton of open source tools in the market, don’t expect to have drivers built for every platform. I am a big fan of open source and the main reason is the huge community behind it. I came acr...

2135 sym R (626 sym/1 pcs)

Statistical Process Control (SPC) in R

26.02.2019

Statistical Process Control (SPC) is a quality control technique that uses statistical techniques to monitor and control the process and product quality. Although this is an age old technique, this is widely used in various applications such as manufacturing, health care, banking and other service related industries. In this blog post, I will not...

1004 sym

Top 8 Docker Images for Data Science

01.03.2019

Dockerizing Data Science: Introduction PreReqs: Docker, images, and containers Dockerizing data science packages have become more relevant these days mainly because you can isolate your data science projects without breaking anything. Dockerizing data science projects also make most of your projects portable and sharable and not worrying about in...

5748 sym

Loops! Loops! Loops in R. A Microbenchmark

03.05.2019

Loops are the holy grail in data science. You might use it when you want to repeat your task or a function or build a model say “n” times or iterations. There are quite few types of loops and most common ones are for and while. The main difference between while and for is, in while you run it until a condition is met like “run until you fin...

1988 sym 4 img

How to test the integrity of your clusters?

10.10.2019

Machine learning (ML) and AI has become the new buzz word in town. With that being said, there is a lot of demand for data scientists and machine learning engineers across various industries including IT, telecom, automotive, manufacturing and many more. Today, there are hundreds to thousands of machine learning online courses that are being offe...

3376 sym 2 img

minio.s3: A MinIO connector package for R

27.10.2019

MinIO is a high performance, distributed object storage system. It is software-defined, runs on industry standard hardware and is 100% open source under the Apache V2 license[1]. Today, MinIO is deployed globally with over 272.5M+ docker pulls and 18K+ git commits. MinIO is written in “go” language. So, expect it to have fast response. You ca...

4982 sym R (1159 sym/7 pcs) 4 img

EnsembleML: An R package for Parallel Ensemble Modeling in R

02.11.2019

Ensemble in machine learning is being used for a while. Ensemble is a concept of training multiple machine learning models and using them for predicting using either voting or feeding the prediction result to a different machine learning model. You could also build ensemble of ensembles. So, this is pretty cool! Why do we ever need the concept of...

4343 sym R (4774 sym/10 pcs) 2 img

Data Science in Manufacturing: An Overview

27.02.2020

Original article published in opendatascience.com In the last couple of years, data science has seen an immense influx in various industrial applications across the board. Today, we can see data science applied in health care, customer service, governments, cyber security, mechanical, aerospace, and other industrial applications. Among these, man...

10288 sym 6 img

Convolutional Neural Network under the Hood

27.02.2020

Neural networks have really taken over for solving image recognition and high sample rate data problems in the last couple of years. In all honesty, I promise I won’t be teaching you what neural networks are or CNN’s are. There are hundred’s of resources that are published everyday explaining them. I’ll post few links below. I am a seriou...

4749 sym R (5770 sym/12 pcs) 10 img