Publications by Irina White

Reproducible Research Final Project

19.03.2021

Synopsis: The key goal of the analysis is to identify the most harmful weather event(s) across the USA from both public and economic perspectives. The data (Storm Data [47Mb]) for this analysis was obtained from the U.S. National Oceanic and Atmospheric Administration’s (NOAA) storm database. The key variables included year of the observation, levels of...

2544 sym R (6838 sym/20 pcs) 2 img

NLP project part 1

22.09.2021

Introduction: This is the first phase of the project, which demonstrates text mining skills. The data is the SwiftKey dataset, which contains files in English, Russian, Finnish, and German. For the purpose of this project, only the English files were considered: en_US.twitter.txt, en_US.blogs.txt, en_US.news.txt. In addition, the profanity file [https://github.c...

1967 sym R (6231 sym/23 pcs) 3 img

1st use of Plotly

10.09.2021

05/08/2021 Data Set Overview: The data used for this presentation can be freely obtained from the Covid19 GitHub page (https://github.com/owid/covid-19-data/tree/master/public/data). The full explanation of the data can be found on the same webpage. The dataset is a CSV file that was downloaded and modified as follows: - Se...

1097 sym R (1393 sym/4 pcs)

Human Activity Recognition

10.09.2021

PROJECT QUESTION: The goal of the project is to predict the manner in which the participants did the exercise. The data is split into two sets for training and testing purposes. After the initial data overview and preparation, a machine learning algorithm is applied to obtain a satisfactory level of performance, first on the training set and ...

2983 sym R (8105 sym/38 pcs) 2 img

Motor Trend, Regression Models Course Project

10.09.2021

Introduction: This model has been constructed to answer two key questions using regression models and exploratory data analyses: Is an automatic or manual transmission better for MPG? How large is the MPG difference between automatic and manual transmissions? Summary: For the purpose of this analysis the data set mtcars has been used with the vari...

3012 sym R (3627 sym/17 pcs) 2 img
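
The two questions in the Motor Trend entry above can be illustrated with a minimal sketch. It assumes only R's built-in mtcars data set and its am column (0 = automatic, 1 = manual). The single-predictor model and the names cars, fit, and mpg_diff are illustrative assumptions, not the model from the report itself, which may adjust for other variables.

```r
# Minimal sketch (an assumption, not the report's actual model):
# compare fuel economy by transmission type using the built-in mtcars data.
cars <- transform(
  mtcars,
  am = factor(am, levels = c(0, 1), labels = c("automatic", "manual"))
)

# Simple linear regression of MPG on transmission type only.
fit <- lm(mpg ~ am, data = cars)

# Coefficient for "manual": difference in mean MPG relative to automatic.
mpg_diff <- unname(coef(fit)["ammanual"])
print(round(mpg_diff, 2))

# 95% confidence interval for that unadjusted difference.
print(confint(fit)["ammanual", ])
```

An unadjusted difference like this ignores other variables in mtcars, such as weight and horsepower, so it is a starting point for the comparison rather than a final answer.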

Leaflet sample

10.09.2021

Overview and Data preparation: The project uses the free dataset from the SimpleMaps website, which includes latitudes and longitudes of major cities and towns around the world, in addition to other information.
# Download temp zip file to obtain data
temp <- tempfile()
download.file("https://simplemaps.com/static/data/world-cities/basic/simple...

556 sym R (1075 sym/4 pcs)