Publications by suresh kumar Gorakala

Time Series Analysis using R – forecast package

17.04.2014

In today’s blog post, we shall look into time series analysis using R package – forecast. Objective of the post will be explaining the different methods available in forecast package which can be applied while dealing with time series analysis/forecasting. What is Time Series?A time series is a collection of observations of well-defined data...

7287 sym R (739 sym/3 pcs) 16 img 1 tbl

Basic recommendation engine using R

25.05.2014

In our day to day life, we come across a large number of Recommendation engines like Facebook Recommendation Engine for Friends’ suggestions, and suggestions of similar Like Pages, Youtube recommendation engine suggesting videos similar to our previous searches/preferences. In today’s blog post I will explain how to build a basic recommender ...

3317 sym 8 img

Regression Analysis using R

04.10.2014

What is a Prediction Problem?A business problem which involves predicting future events by extracting patterns in the historical data. Prediction problems are solved using Statistical techniques, mathematical models or machine learning techniques.For example: Forecasting stock price for the next week, predicting which football team wins the world...

4815 sym Python (981 sym/5 pcs) 8 img

Exposing R-script as API

08.04.2015

R is getting popular programming language in the area of Data Science. Integrating Rscript with web UI pages is a challenge which many application developers are facing. In this blog post I will explain how we can expose R script as an API, using rApache and Apache webserver. rApache is a project supporting web application development using the ...

2384 sym 16 img

Introduction to Logistic Regression with R

06.10.2015

In my previous blog I have explained about linear regression. In today’s post I will explain about logistic regression.         Consider a scenario where we need to predict a medical condition of a patient (HBP) ,HAVE HIGH BP or NO HIGH BP, based on some observed symptoms – Age, weight, Issmoking, Systolic value, Diastolic value, RACE, et...

5814 sym R (41 sym/1 pcs) 16 img

Data Mining Standard Process across Organizations

18.10.2015

Recently I have come across a term, CRISP-DM – a data mining standard. Though this process is not a new one but I felt every analyst should know about commonly used Industry wide process. In this post I will explain about different phases involved in creating a data mining solution. CRISP-DM, an acronym for Cross Industry Standard Process for D...

7254 sym 2 img

Item Based Collaborative Filtering Recommender Systems in R

18.11.2015

In the series of implementing Recommendation engines, in my previous blog about recommendation system in R, I have explained about implementing user based collaborative filtering approach using R. In this post, I will be explaining about basic implementation of Item based collaborative filtering recommender systems in r. Intuition:Item based Coll...

4889 sym R (145 sym/2 pcs) 12 img

Data Science with R

24.12.2015

As R programming language becoming popular more and more among data science group, industries, researchers, companies embracing R, going forward I will be writing posts on learning Data science using R. The tutorial course will include topics on data types of R, handling data using R, probability theory, Machine Learning, Supervised – unSupervi...

2484 sym 10 img

Basic Data Types in r

16.02.2016

As part of tutorial series on Data Science with R from Data Perspective, this first tutorial introduces the very basics of R programming language about basic data types in R.What we learn:Assignment OperatorNumericIntegerComplex numberlogicalCharacterFactorVectorData FrameAfter the end of the chapter, you are provided with R console so that you c...

4872 sym R (5323 sym/25 pcs)

Principal Component Analysis using R

27.02.2016

Curse of Dimensionality:One of the most commonly faced problems while dealing with data analytics problem such as recommendation engines, text analytics is high-dimensional and sparse data. At many times, we face a situation where we have a large set of features and fewer data points, or we have data with very high feature vectors. In such scena...

5295 sym Python (2375 sym/7 pcs) 18 img