Publications by Rishav Dhariwal
Milestone Report
Exploratory Analysis of the Corpus. Introduction: This documentation describes the exploratory analysis and data preprocessing performed in response to the project’s requirements. The analysis is part of an ongoing project aimed at developing a prediction algorithm and a Shiny app. The main goal here is to demonstrate an understanding of the d...
3094 sym R (5108 sym/5 pcs) 3 img 1 tbl
Swiftkey app (text prediction)
2023-11-10 How SwiftText Works
1. Algorithm Overview: Utilizes n-gram (unigram, bigram, trigram, 4-gram) frequency tables with stupid backoff for predictive modeling. Implements a dictionary for efficient word matching. Employs a Jaccard-metric string distance for handling erroneous text input.
2. Data Preprocessing: Data undergoes thorough ...
1207 sym 2 img
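The algorithm overview in the SwiftText entry above names two techniques: stupid backoff over n-gram frequency tables and a Jaccard string distance for mistyped input. The following is a minimal Python sketch of both ideas, not the app's actual code (which the listing does not show). The function names, the backoff factor of 0.4, the use of character bigrams for the Jaccard distance, and the toy corpus are all assumptions made for illustration.

```python
from collections import Counter

ALPHA = 0.4  # assumed backoff factor; the listing does not state one


def build_counts(tokens, max_n=4):
    """Count every n-gram of order 1..max_n in a list of tokens."""
    counts = Counter()
    for n in range(1, max_n + 1):
        for i in range(len(tokens) - n + 1):
            counts[tuple(tokens[i:i + n])] += 1
    return counts


def score(counts, total, context, word):
    """Stupid backoff score of `word` following `context` (a tuple of tokens)."""
    penalty = 1.0
    while context:
        seen = counts[context + (word,)]
        if seen > 0:
            # Relative frequency of the longest matching n-gram, discounted
            # once for every order we had to back off.
            return penalty * seen / counts[context]
        context = context[1:]  # drop the oldest word and back off
        penalty *= ALPHA
    return penalty * counts[(word,)] / total  # unigram fallback


def predict(counts, vocab, total, text, max_n=4):
    """Return the highest-scoring next word for the typed text."""
    words = text.lower().split()
    k = max_n - 1
    context = tuple(words[-k:]) if k > 0 else ()
    return max(sorted(vocab), key=lambda w: score(counts, total, context, w))


def qgrams(s, q=2):
    """Set of character q-grams of a string (q=2 is an assumption)."""
    return {s[i:i + q] for i in range(len(s) - q + 1)}


def jaccard_distance(a, b, q=2):
    """1 - |A & B| / |A | B| over the q-gram sets of the two strings."""
    ga, gb = qgrams(a, q), qgrams(b, q)
    if not ga and not gb:
        return 0.0
    return 1 - len(ga & gb) / len(ga | gb)


def closest_word(word, vocab, q=2):
    """Dictionary word with the smallest Jaccard distance to a typo."""
    return min(sorted(vocab), key=lambda w: jaccard_distance(word, w, q))


# Toy corpus, purely illustrative.
tokens = "the cat sat on the mat and the cat sat on the sofa".split()
counts = build_counts(tokens)
vocab = set(tokens)
print(predict(counts, vocab, len(tokens), "the cat"))
print(closest_word("sofaa", vocab))
```

In this sketch a mistyped word would first be mapped to its nearest dictionary entry with `closest_word`, and the corrected context would then be passed to `predict`. Stupid backoff scores are not normalized probabilities, which is acceptable when they are only used to rank candidates.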
Milestone Report
Exploratory Analysis of the Corpus. Introduction: This documentation describes the exploratory analysis and data preprocessing performed in response to the project’s requirements. The analysis is part of an ongoing project aimed at developing a prediction algorithm and a Shiny app. The main goal here is to demonstrate an understanding of the d...
3089 sym 3 img 1 tbl
Document
a
## Analysis of Deviance Table
##
## Model 1: GDPIMPR ~ 1
## Model 2: GDPIMPR ~ lag(li1) + lag(li2)
##   Resid. Df Resid. Dev Df Deviance  Pr(>Chi)
## 1       242     307.94
## 2       240     271.84  2   36.099 1.449e-08 ***
## ---
## Signif. codes:  0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
## Analysis...
2256 sym
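The numbers in the analysis of deviance table above can be checked by hand. The sketch below, written in Python for illustration rather than the R that produced the output, recomputes the deviance drop from the two residual deviances and the p-value from the reported deviance. It relies on the fact that the upper tail of a chi-square distribution with 2 degrees of freedom is exp(-x/2). The variable names are hypothetical.

```python
import math

# Values copied from the table.
resid_df_null, resid_dev_null = 242, 307.94   # Model 1: GDPIMPR ~ 1
resid_df_full, resid_dev_full = 240, 271.84   # Model 2: adds lag(li1) + lag(li2)
reported_deviance = 36.099

# The test statistic is the drop in residual deviance between the models.
df = resid_df_null - resid_df_full
deviance_drop = resid_dev_null - resid_dev_full  # about 36.10 from the rounded inputs

# For exactly 2 degrees of freedom the chi-square upper tail is exp(-x / 2).
p_value = math.exp(-reported_deviance / 2)

print(f"df = {df}, deviance drop = {deviance_drop:.2f}, p = {p_value:.3e}")
```

The recomputed drop differs from the printed 36.099 only in the last digit because the residual deviances in the table are themselves rounded to two decimals.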
Cleaning Data Pitch Presentation
2023-07-20 Cleaning Data Made Easy with Shiny App
Welcome to our Shiny App that simplifies data cleaning tasks!
Cleaning Workflow: Uploading Your Data; Data Cleaning Functions (Split Columns, Variable Substitution, Changing Variable Types); Downloading Cleaned Data.
Streamline Data Cleaning Process: Upload your CSV file. Choose whether the file has he...
956 sym 4 img
Document
Synopsis: This project explores the properties of the exponential distribution and compares the behaviour of its sample averages with the predictions of the Central Limit Theorem, using simulations in R. We will investigate the distribution of averages of 40 exponentials and analyze the sample mean, variance, and the approximate normality of the distribution. Additionally, we will conduct bas...
4204 sym R (7623 sym/44 pcs) 2 img
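The simulation described in the synopsis above can be sketched in a few lines. This Python version is illustrative only, since the original analysis is in R and its code is not shown in the listing. The rate of 0.2, the 1000 simulated averages, and the seed are assumptions; only the sample size of 40 comes from the synopsis.

```python
import random
import statistics

RATE = 0.2            # assumed rate parameter; the synopsis does not give one
N_PER_MEAN = 40       # from the synopsis: averages of 40 exponentials
N_SIMULATIONS = 1000  # assumed number of simulated averages


def simulate_means(rate=RATE, n=N_PER_MEAN, sims=N_SIMULATIONS, seed=1):
    """Draw `sims` averages, each over `n` exponential variates."""
    rng = random.Random(seed)
    return [sum(rng.expovariate(rate) for _ in range(n)) / n for _ in range(sims)]


means = simulate_means()

# Compare the simulated averages with what the theory predicts.
sample_mean = statistics.mean(means)
sample_var = statistics.variance(means)
theoretical_mean = 1 / RATE                   # mean of an exponential is 1/rate
theoretical_var = (1 / RATE) ** 2 / N_PER_MEAN  # variance of the mean is sigma^2 / n

print(f"mean:     simulated {sample_mean:.3f}, theoretical {theoretical_mean:.3f}")
print(f"variance: simulated {sample_var:.3f}, theoretical {theoretical_var:.3f}")
```

Under these assumptions the averages should centre near 5 with a variance near 0.625, and a histogram of `means` should look roughly bell-shaped even though individual exponential draws are strongly skewed.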
Effect of major storms and weather events in the United States from 1950 - 2011
Synopsis: In this data analysis project, we aim to explore the U.S. National Oceanic and Atmospheric Administration’s storm database, which tracks major storms and weather events in the United States. Our objective is to determine which types of events pose the greatest risks to population health and which events have the most significant econ...
3120 sym R (6548 sym/12 pcs) 2 img