Publications by finnstats
Exploratory Data Analysis (EDA)
Exploratory Data Analysis is one of the critical processes of performing initial investigations on data analysis. Basic idea is to discover the patterns, anomalies, test hypotheses, and check the assumptions with the help of summary statistics and graphical representations. The main idea about exploratory data analysis are maximize insight into a...
2427 sym R (1952 sym/14 pcs) 14 img
Stock Prediction-Intraday Trading
Stock Prediction-Intraday is one of the trading norms of the stock market, buy shares at the opening time of the market and then sell the same at the closing time of the same day. Today we are dealing with one of the data sets, based on daily data of seven years from 2014 to 2021. We are going to use a simple machine learning algorithm to underst...
2927 sym R (4858 sym/16 pcs) 8 img
Class Imbalance-Handling Imbalanced Data in R
Class Imbalance classification refers to a classification predictive modeling problem where the number of observations in the training dataset for each class is not balanced. In other words, the class distribution is not equal or close and it is skewed into one particular class. So, the prediction model will be accurate for skewed classes and we ...
4794 sym R (6701 sym/26 pcs) 2 img
Principal component analysis (PCA) in R
PCA is used in exploratory data analysis and for making decisions in predictive models. PCA commonly used for dimensionality reduction by using each data point onto only the first few principal components (most cases first and second dimensions) to obtain lower-dimensional data while keeping as much of the data’s variation as possible. The firs...
4833 sym R (3786 sym/22 pcs) 6 img
Power analysis in Statistics with R
Power analysis in Statistics, there is a probability of committing an error in making a decision about a hypothesis. Hence two types of errors can occur in hypothesis, Type I error and Type II Error. The probability of Type I error is denoted as α and the probability of Type II error is β. Type 1 Error:- p(reject H0/H0 is true)=α Type II Error...
3480 sym R (608 sym/7 pcs) 2 tbl
How to Measure Contingency-Coefficient (Association Strength)
Association strength, when the hypothesis of independence of attributes in a contingency table is rejected by performing a chi-square test, ensures the association between two attributes. Such kinds of situations interested to calculate the strength of association and it is a desideratum. For this a measure is known as the coefficient of contin...
3260 sym R (1589 sym/8 pcs) 4 img
Completely Randomized Experimental Design
Experimental Design, when do you call an experimental design a randomized design? Experimental designs in which the treatments are allocated randomly to the experimental units that come under the category of randomized designs. Randomized designs are classified as completely randomized design, randomized block design, Latin square design, split p...
3258 sym R (428 sym/2 pcs) 1 tbl
Methods in Quantitative Statistical Analysis
Methods in Quantitative Statistical Analysis, What is hypothesis testing?. Hypothesis testing is an act in statistics whereby an analyst tests an assumption regarding a population parameter. Major four principles involved in a statistical test. Evolving a test statisticTo know the sampling distribution of the test statisticSetting of hypotheses...
5004 sym R (481 sym/4 pcs)
Visualization Graphs-ggside with ggplot
Visualization graphs, Huge information is being collected through data in the business world, we must need a tool to picture of that data so we can interpret it and make decisions on time. Data visualization provides a clear idea of what the information means by giving it visual context through maps or graphs. Visualization allows humans to ident...
2294 sym R (1551 sym/7 pcs) 10 img
Correlation Analysis Different Types of Plots in R
Correlation analysis, correlation is a term that is a measure of the strength of a linear relationship between two quantitative variables. Pearson’s Product-Moment Correlation One of the most common measures of correlation is Pearson’s product-moment correlation, which is commonly referred to simply as the correlation, or just the letter r. C...
2772 sym R (1983 sym/18 pcs) 32 img