Publications by Giorgio Garziano

Financial time series forecasting – an easy approach

21.03.2017

Financial time series analysis and their forecasting have an history of remarkable contributions. It is then quite hard for the beginner to get oriented and capitalize from reading such scientific literature as it requires a solid understanding of basic statistics, a detailed study of the ground basis of time series analysis tools and the knowled...

9650 sym R (3322 sym/11 pcs) 10 img

Visualizing Tennis Grand Slam Winners Performances

01.05.2017

Data visualization of sports historical results is one of the means by which champions strengths and weaknesses comparison can be outlined. In this tutorial, we show what plots flavors may help in champions performances comparison, timeline visualization, player-to-player and player-to-tournament relationships. We are going to use the Tennis Gran...

5386 sym R (11266 sym/18 pcs) 24 img

Weather forecast with regression models – part 1

02.06.2017

In this tutorial we are going to analyse a weather dataset to produce exploratory analysis and forecast reports based on regression models. We are going to take advantage of a public dataset which is part of the exercise datasets of the “Data Mining and Business Analytics with R” book (Wiley) written by Johannes Ledolter. In doing that, we ar...

7767 sym R (9524 sym/20 pcs) 26 img

Weather forecast with regression models – part 2

05.06.2017

In the first part, I introduced the weather dataset and outlining its exploratory analysis. In the second part of our tutorial, we are going to build multiple logistic regression models to predict weather forecast. Specifically, we intend to produce the following forecasts: tomorrow’s weather forecast at 9am of the current day tomorrow’s wea...

8053 sym R (22214 sym/34 pcs)

Weather forecast with regression models – part 3

11.06.2017

In the previous part of this tutorial, we build several models based on logistic regression. One aspect to be further considered is the decision threshold tuning that may help in reaching a better accuracy. We are going to show a procedure able to determine an optimal value for such purpose. ROC plots will be introduced as well. Decision Threshol...

4489 sym R (11642 sym/17 pcs) 8 img

Weather forecast with regression models – part 4

16.06.2017

Results so far obtained allow us to predict the RainTomorrow Yes/No variable. In the first part, we highlighted that such factor variable can be put in relationship with the Rainfall quantitative one by: all.equal(weather_data$Rainfall > 1, weather_data$RainToday == "Yes") As a consequence, we are able so far to predict if tomorrow rainfall sha...

9131 sym R (24855 sym/33 pcs) 16 img

Structural Changes in Global Warming

17.07.2017

In time series analysis, structural changes represent shocks impacting the evolution with time of the data generating process. That is relevant because one of the key assumptions of the Box-Jenkins methodology is that the structure of the data generating process does not change over time. How can structural changes be identified ? The strucchange...

8950 sym R (17230 sym/37 pcs) 30 img

ARIMA models and Intervention Analysis

07.10.2017

In my previous tutorial Structural Changes in Global Warming I introduced the strucchange package and some basic examples to date structural breaks in time series. In the present tutorial, I am going to show how dating structural changes (if any) and then Intervention Analysis can help in finding better ARIMA models. Dating structural changes con...

15828 sym R (24541 sym/48 pcs) 46 img

Outliers Detection and Intervention Analysis

04.12.2017

In my previous tutorial Arima Models and Intervention Analysis we took advantage of the strucchange package to identify and date time series level shifts structural changes. Based on that, we were able to define ARIMA models with improved AIC metrics. Furthermore, the attentive analysis of the ACF/PACF plots highlighted the presence of seasonal ...

7259 sym R (5643 sym/21 pcs) 24 img

Tennis Grand Slam Tournaments Champions Basic Analysis

11.12.2017

The present tutorial analyses the Tennis Grand Slam tournaments main results from the statistical point of view. Specifically, I try to answer the following questions: – How to fit the distribution of the Grand Slam tournaments number of victories across players? – How to compute the probability of having player’s victories greater than a ...

7339 sym R (11891 sym/32 pcs) 20 img