Publications by finnstats

Exploratory Data Analysis (EDA)

04.05.2021

Exploratory Data Analysis is the critical process of performing initial investigations on data. The basic idea is to discover patterns, spot anomalies, test hypotheses, and check assumptions with the help of summary statistics and graphical representations. The main aims of exploratory data analysis are to maximize insight into a...

2427 sym R (1952 sym/14 pcs) 14 img
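
As a rough illustration of the summary-statistics side of EDA described in the entry above, here is a minimal base R sketch. The built-in `mtcars` data set and the chosen columns are assumptions made for the example, not the data used in the post.

```r
# Assumption: mtcars stands in for the data; the post's own data set is not shown here.
data(mtcars)

# Summary statistics for one variable (min, quartiles, mean, max).
mpg_summary <- summary(mtcars$mpg)
print(mpg_summary)

# Flag possible anomalies with the usual 1.5 * IQR boxplot rule.
hp_outliers <- boxplot.stats(mtcars$hp)$out
print(hp_outliers)

# Quick check of a distributional assumption (normality of mpg).
normality <- shapiro.test(mtcars$mpg)
print(normality$p.value)
```

A graphical check such as `boxplot(mtcars$hp)` or `hist(mtcars$mpg)` would normally accompany these numbers.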

Stock Prediction-Intraday Trading

05.05.2021

Intraday trading is one of the trading norms of the stock market: buy shares when the market opens and sell them when it closes on the same day. Today we are working with a data set of daily data covering seven years, from 2014 to 2021. We are going to use a simple machine learning algorithm to underst...

2927 sym R (4858 sym/16 pcs) 8 img

Class Imbalance-Handling Imbalanced Data in R

06.05.2021

Class imbalance refers to a classification predictive modeling problem where the number of observations in the training dataset for each class is not balanced. In other words, the class distribution is not equal or close to equal and is skewed toward one particular class. So, the prediction model will be accurate for the skewed (majority) class and we ...

4794 sym R (6701 sym/26 pcs) 2 img
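
To make the skewed class distribution in the entry above concrete, the sketch below builds a simulated 90/10 data set and balances it by random oversampling of the minority class. The data, the class names, and the choice of oversampling are all assumptions for illustration. The post may use a different technique or package.

```r
# Assumption: simulated data with a 90/10 class split; not the post's data.
set.seed(1)
df <- data.frame(
  x     = rnorm(100),
  class = factor(rep(c("majority", "minority"), times = c(90, 10)))
)

# Inspect the class distribution.
print(prop.table(table(df$class)))

# Random oversampling: resample minority rows with replacement
# until both classes have the same number of observations.
minority <- df[df$class == "minority", ]
n_extra  <- sum(df$class == "majority") - nrow(minority)
extra    <- minority[sample(nrow(minority), n_extra, replace = TRUE), ]
balanced <- rbind(df, extra)

print(table(balanced$class))
```

Resampling of this kind should be applied to the training split only, so that duplicated rows do not leak into the evaluation data.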

Principal component analysis (PCA) in R

07.05.2021

PCA is used in exploratory data analysis and for making decisions in predictive models. PCA is commonly used for dimensionality reduction by projecting each data point onto only the first few principal components (in most cases the first and second dimensions) to obtain lower-dimensional data while keeping as much of the data’s variation as possible. The firs...

4833 sym R (3786 sym/22 pcs) 6 img
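
The projection onto the first two principal components described in the entry above can be sketched with base R's `prcomp()`. The `iris` data set and the decision to scale the variables are assumptions made for the example.

```r
# Assumption: the four numeric iris columns stand in for the post's data.
X <- iris[, 1:4]

# Center and scale, then compute the principal components.
pca <- prcomp(X, center = TRUE, scale. = TRUE)

# Keep only the first two components (the lower-dimensional data).
scores <- pca$x[, 1:2]

# Share of total variance carried by each component, and the running total.
var_explained <- pca$sdev^2 / sum(pca$sdev^2)
cum_var       <- cumsum(var_explained)
print(round(cum_var, 3))
```

`cum_var[2]` shows how much of the data's variation survives when only the first two dimensions are kept.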

Power analysis in Statistics with R

07.05.2021

Power analysis in statistics: there is a probability of committing an error when making a decision about a hypothesis. Two types of errors can occur in hypothesis testing: Type I error and Type II error. The probability of a Type I error is denoted α and the probability of a Type II error is β. Type I Error: P(reject H0 | H0 is true) = α. Type II Error...

3480 sym R (608 sym/7 pcs) 2 tbl
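
Since power is 1 − β, the α and β from the entry above can be turned into a sample-size calculation with base R's `power.t.test()`. The effect size, standard deviation, and target error rates below are assumed values chosen only for illustration.

```r
# Assumed inputs: alpha = 0.05, beta = 0.20, a mean difference of 0.5 and sd of 1.
alpha <- 0.05   # P(reject H0 | H0 is true)
beta  <- 0.20   # P(fail to reject H0 | H0 is false)

# Solve for the per-group sample size of a two-sample, two-sided t test.
res <- power.t.test(
  delta       = 0.5,
  sd          = 1,
  sig.level   = alpha,
  power       = 1 - beta,
  type        = "two.sample",
  alternative = "two.sided"
)

# Round up, because a fractional observation cannot be collected.
n_per_group <- ceiling(res$n)
print(n_per_group)
```

Leaving out a different argument (for example `power` instead of `n`) makes the same function solve for that quantity instead.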

How to Measure Contingency-Coefficient (Association Strength)

08.05.2021

Association strength: when the hypothesis of independence of attributes in a contingency table is rejected by a chi-square test, an association between the two attributes is established. In such situations it is desirable to calculate the strength of that association. For this, a measure known as the coefficient of contin...

3260 sym R (1589 sym/8 pcs) 4 img
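
One common way to measure the association strength mentioned in the entry above is Pearson's contingency coefficient, C = sqrt(χ² / (χ² + n)). The sketch below computes it from a chi-square test in base R. The 2 × 2 table is made-up illustrative data, and the post may define or compute the coefficient differently.

```r
# Assumption: an invented 2 x 2 table of counts, for illustration only.
tab <- matrix(c(30, 10,
                15, 45),
              nrow = 2, byrow = TRUE,
              dimnames = list(attribute_a = c("yes", "no"),
                              attribute_b = c("yes", "no")))

# Chi-square test of independence (no continuity correction).
test <- chisq.test(tab, correct = FALSE)
chi2 <- unname(test$statistic)
n    <- sum(tab)

# Pearson's contingency coefficient: 0 means no association;
# the upper bound is below 1 and depends on the table size.
C <- sqrt(chi2 / (chi2 + n))
print(round(C, 4))
```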

Completely Randomized Experimental Design

10.05.2021

Experimental design: when do you call an experimental design a randomized design? Experimental designs in which the treatments are allocated randomly to the experimental units come under the category of randomized designs. Randomized designs are classified as completely randomized design, randomized block design, Latin square design, split p...

3258 sym R (428 sym/2 pcs) 1 tbl
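
The random allocation of treatments to experimental units described in the entry above can be sketched in base R as follows. The treatment labels, the number of replicates, and the simulated response are all assumptions for the example.

```r
# Assumption: three hypothetical treatments with four replicates each.
set.seed(42)
treatments <- c("A", "B", "C")
reps       <- 4

# Completely randomized design: shuffle the treatment labels across all units.
layout <- data.frame(
  unit      = seq_len(length(treatments) * reps),
  treatment = factor(sample(rep(treatments, each = reps)))
)

# Simulated response, used only so that the analysis step can run.
layout$response <- rnorm(nrow(layout), mean = 10, sd = 1)

# One-way ANOVA is the standard analysis for this design.
fit <- aov(response ~ treatment, data = layout)
print(summary(fit))
```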

Methods in Quantitative Statistical Analysis

11.05.2021

Methods in Quantitative Statistical Analysis: what is hypothesis testing? Hypothesis testing is an act in statistics whereby an analyst tests an assumption regarding a population parameter. Four major principles are involved in a statistical test: evolving a test statistic; knowing the sampling distribution of the test statistic; setting of hypotheses...

5004 sym R (481 sym/4 pcs)

Visualization Graphs-ggside with ggplot

12.05.2021

Visualization graphs: a huge amount of information is being collected through data in the business world, and we need a tool to picture that data so we can interpret it and make decisions on time. Data visualization provides a clear idea of what the information means by giving it visual context through maps or graphs. Visualization allows humans to ident...

2294 sym R (1551 sym/7 pcs) 10 img

Correlation Analysis Different Types of Plots in R

13.05.2021

Correlation analysis: correlation is a measure of the strength of a linear relationship between two quantitative variables. Pearson’s Product-Moment Correlation: one of the most common measures of correlation is Pearson’s product-moment correlation, which is commonly referred to simply as the correlation, or just by the letter r. C...

2772 sym R (1983 sym/18 pcs) 32 img
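
Pearson's product-moment correlation from the entry above can be computed in base R with `cor()`, or by hand as the covariance divided by the product of the standard deviations. The `mtcars` variables used here are an assumption for illustration, not the post's data.

```r
# Assumption: mtcars mpg and wt stand in for two quantitative variables.
x <- mtcars$wt
y <- mtcars$mpg

# Built-in Pearson correlation.
r <- cor(x, y, method = "pearson")

# The same quantity from its definition: cov(x, y) / (sd(x) * sd(y)).
r_manual <- cov(x, y) / (sd(x) * sd(y))

# Test of H0: the true correlation is zero.
ct <- cor.test(x, y, method = "pearson")

print(c(r = r, r_manual = r_manual, p_value = ct$p.value))
```

A scatter plot such as `plot(x, y)` is a useful companion, because r only captures the linear part of the relationship.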