Publications by Zane Dax
Canadian Cyber Crime Dashboard
Canadian Cyber Security Dashboard Enterprise MeasuresIncidentsReportingSpending Anti-malware software 75.8% Email Security 75.4% Network Security 68.8% Enterprise Cyber Security Measures This plot shows various cyber security measures enterprises have in place by industry and size of enterprise Source: StatsCan Enterprise measures details...
6146 sym 4 img
CustomerBehaviouralAnalysis
Customer Personality Analysis Customer Personality Analysis is a detailed analysis of a company’s ideal customers. It helps a business to better understand its customers and makes it easier for them to modify products according to the specific needs, behaviors and concerns of different types of customers. Customer personality analysis helps a bus...
21128 sym R (35686 sym/116 pcs) 24 img 6 tbl
Reviews Sentiment Analysis
Sentiment Analysis Sentiment analysis, also known as opinion mining, computationally identifies and categorizes opinions expressed in text data. It is the use of natural language processing, text analysis, computational linguistics, and biometrics to systematically identify, extract, quantify, and study affective states and subjective information. ...
6502 sym R (11924 sym/34 pcs) 5 img
Data Mining
Tidy Text Content from Text Mining by Julia Silge & David Robinson. Simple rules: - each variable is a column - each observation is a row ( 1 token per row) - each type of observational unit is a table text mining approaches: String: Text can, of course, be stored as strings, i.e., character vectors, within R, and often text data is first read int...
1722 sym R (7043 sym/29 pcs) 1 img
Markdown & Netlify
Allison Hill’s YouTube tutorials A series of instructions on how to deploy websites with Netlify using Rmd files. Day 1 - Postcards Postcards is a simple landing website, basic quick information about you and social media links. Steps install.packages('postcards') go to GitHub, make a new repo with a README file and default settings. copy th...
4021 sym
Text Analysis with Quanteda
Source: University of Virginia Library, Leah Malkovich. Nov 27, 2018. Beginner’s Guide Quanteda has 3 major components of text: the corpus document-feature-matrix (dfm) tokens The corpus is the entire text body object, such as a book or chapter of a book. Document feature matrix organizes tokenized words into columns, which makes doing analy...
9633 sym R (6845 sym/28 pcs)
Red Flags of Fraud
What are Red Flags of Fraud ? Frauds can be detected, a particularly effective and inexpensive way is to identify fraudsters by scrutinizing personnel behaviour for peculiarities typical of fraudsters. checklist of behavioural red flags of fraud based on data collected, 16 different behavioural red flags exhibited by fraudsters. An examination of...
32930 sym 1 img 2 tbl
Machine Learning in R
Machine Learning in R by Udemy step 1 - import dataset dataset = read.csv('Data.csv') step 2 - missing data Often datasets have missing data, and the common practice is to remove missing data rows but it has a negative impact on observations. Best practice is to take the means of each columns with missing data. to select the dataframe columns,...
100032 sym R (45016 sym/117 pcs) 31 img 1 tbl
Document
The basics Symmetrical plots: left skewed and right skewed Population size = N Central tendency is where the data gathers around mean is the average median the middle value mode most frequent occurring data item range largest data item - smallest item variance average distance from the mean, \(\sigma^2\) standard deviation square root of varianc...
11982 sym R (89 sym/6 pcs) 2 img
The Frog Princess: An Analysis
Story from Grimm’s Fairy Tales by Jacob & Wilhelm Grimm The Story Start A young princess who is enamoured with a bouncy ball loses it in a well, a talking frog appears and right off tells her that if she loves him with a few conditions - in his favour, she will get her ball back. This princess thinks for a minutes about the situation, realizes...
5947 sym