Publications by Vladimir Nimchenko

Document

22.09.2022

DATA 607 - Project # 1 Vladimir Nimchenko Introduction: The chess cross table data in the text file was unstructured and not suitable for analysis. Using regular expressions, I parsed the necessary data and transformed it into the structure needed for analysis. I also performed the necessary calculations to get the total and average play...

1099 sym R (4241 sym/9 pcs)

Document

28.09.2022

DATA 607 - Homework Assignment # 4 Vladimir Nimchenko 1. Create a .CSV file (or optionally, a MySQL database!) that includes all of the information above. You’re encouraged to use a “wide” structure similar to how the information appears above, so that you can practice tidying and transformations as described below. library(tidyr) libra...

2271 sym R (4287 sym/22 pcs) 1 img

Document

05.10.2022

DATA 607 - Project # 2 Vladimir Nimchenko INTRODUCTION: The air quality data set (I chose only a small subset of it for the purposes of this project) shows the amounts of the different categories of air quality (Ozone, Solar.r, Wind, and Temp) and their frequencies. I will tidy/transform the data to prepare it for analysis. DATA LOAD library(tidyr...

2802 sym R (2369 sym/25 pcs) 4 img

Document

05.10.2022

DATA 607 - Project # 2 Vladimir Nimchenko INTRODUCTION: The alcohol consumption by country data set (I chose only a small subset of it for the purposes of this project) shows how much of each type of alcohol (three categories: beer, spirit, and wine) is consumed. I will tidy/transform the data to prepare it for analysis. DATA LOAD lib...

1964 sym R (2576 sym/11 pcs) 1 img

Document

05.10.2022

DATA 607 - Project # 3 Vladimir Nimchenko INTRODUCTION: The alcohol consumption by country data set (I chose only a small subset of it for the purposes of this project) shows how much of each type of alcohol (three categories: beer, spirit, and wine) is consumed. I will transform and tidy my data in a few ways. The reason being is I ...

3545 sym R (3743 sym/13 pcs) 2 img

Document

12.10.2022

DATA 607 - Homework Assignment # 5 Vladimir Nimchenko INTRODUCTION: Below, I have created three files: HTML, JSON, and XML. All three files contain the title, author, and attributes of three of my favorite books. I uploaded the files to GitHub and then retrieved them and put them into a data frame. Finally, I printed the output of each file. l...

925 sym R (1023 sym/4 pcs) 4 tbl

Document

27.10.2022

DATA 607 - Homework Assignment # 6 Vladimir Nimchenko INTRODUCTION: I am utilizing the Movie Reviews API to read JSON data into a data frame and tidy/transform the data. Loading the needed libraries library(jsonlite) library(dplyr) ## ## Attaching package: 'dplyr' ## The following objects are masked from 'package:stats': ## ## filter,...

330 sym R (2534 sym/7 pcs)

Document

02.11.2022

DATA 607 - Discussion/Assignment # 11 Vladimir Nimchenko 1) Perform a Scenario Design analysis as described below. Consider whether it makes sense for your selected recommender system to perform scenario design twice, once for the organization (e.g. Amazon.com) and once for the organization’s customers. The recommender system I want to per...

4860 sym

Document

03.11.2022

DATA 607 - Homework Assignment # 7 Vladimir Nimchenko INTRODUCTION: For my data, I took an Amazon reviews file with the columns review title and review text. The data comes from Kaggle. I shortened the file to 100 rows for simplicity. Loading the needed libraries library(tidyverse) ## -- Attaching packages --...

739 sym R (4469 sym/22 pcs) 2 img

Document

17.11.2022

DATA 607 - Project # 4 Vladimir Nimchenko INTRODUCTION: I downloaded two folders from the public corpus directory (ham and spam). I then unzipped the folders. I started from this ham/spam data set and predicted new documents. I then put them into their respective (Ham and Spam) data frames adding a spam column to each data frame - 1 in spam ...

2595 sym R (5408 sym/35 pcs)