Publications by John Pauline Pineda
Methods : Selecting Informative Predictors Using Univariate Filters
1. Table of Contents This document implements univariate filters for selecting informative predictors using various helpful packages in R. 1.1 Sample Data The AlzheimerDisease dataset from the AppliedPredictiveModeling package was used for this illustrated example. Preliminary dataset assessment: [A] 333 rows (observations) ...
26314 sym 137 img 10 tbl
Methods : Selecting Informative Predictors Using Simulated Annealing and Genetic Algorithms
1. Table of Contents This document implements simulated annealing and genetic algorithms for selecting informative predictors using various helpful packages in R. 1.1 Sample Data The AlzheimerDisease dataset from the AppliedPredictiveModeling package was used for this illustrated example. Preliminary dataset assessment: [A] 333 rows (o...
16200 sym 136 img 10 tbl
Methods : Remedial Procedures for Skewed Data with Extreme Outliers
1. Table of Contents This document presents a non-exhaustive list of remedial procedures applied for data with high skewness and extreme outliers using various helpful packages in R. 1.1 Sample Data The Solubility dataset from the AppliedPredictiveModeling package was used for this illustrated example. The original numeric response was...
19381 sym 57 img 11 tbl
Methods : Extracting Information Using Dimensionality Reduction Algorithms
1. Table of Contents This document implements dimensionality reduction algorithms for extracting information using various helpful packages in R. 1.1 Sample Data The NCI6 dataset from the ISLR package was used for this illustrated example. Only a subset of observations representing major cancer types was used for the analysis. Prelimi...
8922 sym 253 img
Methods : Formulating Segmented Groups Using Clustering Algorithms
1. Table of Contents This document implements clustering algorithms for formulating segmented groups using various helpful packages in R. 1.1 Sample Data The NCI6 dataset from the ISLR package was used for this illustrated example. Only a subset of observations representing major cancer types was used for the analysis. Preliminary dat...
9812 sym 264 img
Methods : Implementing Gradient Descent Algorithm in Estimating Regression Coefficients
1. Table of Contents This document implements the gradient descent algorithm to estimate regression coefficients using various helpful packages in R. 1.1 Sample Data The Solubility dataset from the AppliedPredictiveModeling package was used for this illustrated example. Other original predictors were removed from the dataset leaving onl...
17339 sym 15 img 10 tbl
Methods : Data Quality Assessment, Preprocessing and Exploration for a Classification Modelling Problem
1. Table of Contents This document presents a non-exhaustive list of various data quality assessment, preprocessing and exploration methods for a classification modelling problem using various helpful packages in R. 1.1 Sample Data The schedulingData dataset from the AppliedPredictiveModeling package was used for this illustrated example. Pre...
9442 sym R (38356 sym/75 pcs) 14 img 31 tbl
Methods : Survival Analysis and Descriptive Modelling for a Three-Group Right-Censored Data with Time-Independent Variables Using Cox Proportional Hazards Model
1. Table of Contents This document presents a non-exhaustive list of survival analysis and descriptive modelling methods for a three-group right-censored data with time-independent variables using various helpful packages in R. 1.1 Sample Data The Cancer dataset from the book Supervised Machine Learning was used for this illustrated example. ...
16977 sym R (147883 sym/161 pcs) 23 img 14 tbl
Methods : Survival Analysis and Descriptive Modelling for a Two-Group Right-Censored Data with Time-Independent Variables Using Cox Proportional Hazards Model
1. Table of Contents This document presents a non-exhaustive list of survival analysis and descriptive modelling methods for a two-group right-censored data with time-independent variables using various helpful packages in R. 1.1 Sample Data The Leukemia dataset from the book Survival Analysis : A Self-Learning Text was used for this illustrat...
11091 sym R (66503 sym/127 pcs) 18 img 11 tbl
Methods : Data Quality Assessment, Preprocessing and Exploration for a Regression Modelling Problem
1. Table of Contents This document presents a non-exhaustive list of various data quality assessment, preprocessing and exploration methods for a regression modelling problem using various helpful packages in R. 1.1 Sample Data The ChemicalManufacturingProcess dataset from the AppliedPredictiveModeling package was used for this illustrated exa...
12500 sym R (82051 sym/85 pcs) 117 img 38 tbl