Publications by John Pauline Pineda

Methods : Exploring Penalized Models for Handling High-Dimensional Survival Data

27.07.2023

1. Table of Contents This project explores different regularization methods for minimizing model complexity by promoting coefficient sparsity in high-dimensional survival data using various helpful packages in R. Using a Cox Proportional Hazards Regression model structure, penalty functions applied during coefficient estimation included the L...

35831 sym Python (1741529 sym/386 pcs) 69 img 12 tbl

Case Study : Characterizing Life Expectancy Drivers Across Countries Using Model-Agnostic Interpretation Methods for Black-Box Models

27.07.2023

1. Table of Contents 1.1 Introduction Life expectancy is a statistical measure that represents the average number of years a person is expected to live from birth, assuming current mortality rates remain constant along the entire life course. It provides an estimation of the overall health and well-being of a population and is often reflecti...

67191 sym 145 img 10 tbl

Methods : Exploring and Visualizing Extracted Dimensions from Principal Component Algorithms

04.06.2023

1. Table of Contents This project explores the various principal component-based dimensionality reduction algorithms for extracting and visualizing information using various helpful packages in R. Methods applied in the analysis to transform and reduce high dimensional data included the Principal Component Analysis, Correspondence Analysis, ...

33297 sym Python (242197 sym/318 pcs) 93 img

Methods : Sample Size and Power Calculations for Tests Comparing Proportions in Clinical Research

21.05.2023

1. Table of Contents This document presents a non-exhaustive list of sample size and power calculations for clinical research proportion comparison tests using various helpful packages in R. Proportion comparison tests applied during clinical research refer to trials evaluated in terms of discrete clinical endpoints. The objectives of the in...

30436 sym Python (87964 sym/264 pcs) 16 img

Methods : Sample Size and Power Calculations for Tests Comparing Means in Clinical Research

15.05.2023

1. Table of Contents This document presents a non-exhaustive list of sample size and power calculations for clinical research mean comparison tests using various helpful packages in R. Mean comparison tests applied during clinical research refer to trials evaluated in terms of mean responses of certain primary study endpoints. The objectives...

27200 sym Python (80473 sym/236 pcs) 14 img

Methods : Comparing Oversampling and Undersampling Algorithms for Class Imbalance Treatment

30.04.2023

1. Table of Contents This document presents a non-exhaustive list of oversampling and undersampling algorithms applied for severe class imbalance using various helpful packages in R. Oversampling and undersampling algorithms address imbalanced classification problems by augmenting the data set used for model training based on its inherent ch...

31637 sym Python (230885 sym/233 pcs) 32 img 12 tbl

Methods : Exploring Performance Evaluation Metrics for Survival Prediction

10.03.2023

1. Table of Contents This document implements a comparison of evaluation metrics for assessing survival model performance using various helpful packages in R. Survival analysis aims to study the relationship between independent covariates and survival time and event outcomes. Evaluation metrics in survival analysis are used to ascertain the a...

17136 sym 24 img 12 tbl

Methods : Exploring Robust Logistic Regression Models for Handling Quasi-Complete Separation

19.02.2023

1. Table of Contents This document presents a non-exhaustive list of robust model variants applied to handle quasi-complete or complete separation during logistic regression modelling using various helpful packages in R. 1.1 Sample Data The sex2 dataset from the logistf package was used for this illustrated example. One of the original c...

16570 sym Python (119567 sym/130 pcs) 16 img 2 tbl

Methods : Identifying Multivariate Outliers Using Density-Based Clustering Algorithms

08.02.2023

1. Table of Contents This document implements density-based clustering algorithms for identifying multivariate outliers using various helpful packages in R. 1.1 Sample Data The NCI6 dataset from the ISLR package was used for this illustrated example. Only a subset of observations representing major cancer types was used for the analysi...

16391 sym 19 img 3 tbl

Methods : Clinical Research Prediction Model Development and Evaluation for Prognosis

14.09.2022

1. Table of Contents This document presents a non-exhaustive list of procedures for developing valid clinical prediction models for prognosis using various helpful packages in R. The methods and results presented in the chapter entitled Case Study on Survival Analysis: Prediction of Cardiovascular Events using the Second Manifestations ...

20985 sym Python (360338 sym/463 pcs) 10 img 2 tbl