Publications by BayesianN

Introduction to SQL and databases

12.11.2023

0.1 Introduction We do realise that most of the softwares around are designed to solve a problem!. To better understand what databases are ,we need to first understand the problems that necessitated the advent of databases. The answer to this is quite obvious ! you have some data or large amounts of information that you need to store, this dat...

7624 sym 5 img 1 tbl

HR Analytics with SQL and R and machine learning

10.11.2023

All models are wrong, but some are useful. - George Box 0.0.1 Setup library(tidyverse) library(odbc) library(DBI) library(RSQLite) library(tvthemes) library(ggthemes) library(scales) dat_new <- read_csv("./input/WA_Fn-UseC_-HR-Employee-Attrition.csv") dat_new <- dat_new %>% mutate_if(is.character, as_factor) %>% mutate( Environ...

4074 sym 8 img 5 tbl

HR Analytics with SQL and R

06.11.2023

library(tidyverse) library(odbc) library(DBI) library(RSQLite) library(tvthemes) library(ggthemes) library(scales) dat_new <- read_csv("./input/WA_Fn-UseC_-HR-Employee-Attrition.csv") dat_new <- dat_new %>% mutate_if(is.character, as_factor) %>% mutate( EnvironmentSatisfaction = factor(EnvironmentSatisfaction, ordered = TRUE), ...

2268 sym 5 img 3 tbl

About me

05.11.2023

MY PASSION LIES IN SOLVING BUSINESS PROBLEMS USING STATISTICS AND DATA SCIENCE MY NAME IS BONGANI NCUBE I Have a passion for Utilizing Data Global Covid-19 Projections Descriptive Analytics How much sales generated last month? What is the top product sold? What is the least profitable product? What is the average spending of a customer? Pred...

2433 sym 29 img

doing correlations

03.11.2023

Statistical correlations Bongani Ncube 2023-11-03 Data Scientist Volucentric Consultancy r195334vncube@gmail.com Correlation Correlation measures the strength and direction of association between two variables. There are three common correlation tests: the Pearson product moment (Pearson’s r), Spearman’s rank-order (Spearman’s rho), ...

2902 sym 3 img 5 tbl

MANOVA

30.10.2023

Multivariate Analysis Of Variance Data science nuggets Multivariate Analysis Of Variance Introduction MANOVA Multivariate Analysis of Variance(MANOVA) MANOVA IN R Fitting A manova in R look to see which ones differ Bongani Ncube (Data Analyst) - Volucentric Introduction my previous blog talked about comparing two means u...

6164 sym 2 img 3 tbl

test for association

26.10.2023

Library setup library(tidyverse) library(flextable) Motivation On one occassion i helped a student do her statistics assignment , up to know i don’t like how university concentrate so much on the theory without students doing the practical aspect of the methods. This tutorial shows how to calculate : frequencies proportions Test for associ...

5681 sym 3 tbl

Regression with tidymodels

22.10.2023

Linear Regression Modeling with A case study Bongani Ncube 2023-10-22 library Setup library(tidyverse) library(tidymodels) library(glue) library(lubridate) library(patchwork) library(scales) library(summarytools) library(glmnet) library(randomForest) library(xgboost) library(conflicted) library(flextable) library(tvthemes) slice <- d...

31951 sym 25 img 8 tbl

A guide to statistical inference

17.10.2023

A guide to statistical inference - Part 1 Neat Elite Research and data analytics consultancy Bongani Ncube (Data Scientist) 2023-10-17 About The author I am a data scientist ,statistical and data literacy trainer . Did my honours at the University Of Zimbabwe and certified as a Proffesional data scientist by datacamp. I currently work as a d...

8011 sym 3 tbl

R and Excel

12.10.2023

Automating excel outputs R with Excel Automating excel outputs Introduction Creating a temporary file Exporting as an excel table Further customisation options Modifying an existing file Split the dataframe and write each group to a different worksheet Bongani Ncube Introduction In this tutorial i wil...

1870 sym 6 img