Publications by Abdellah Ait Elmouden
Final Project
Will lockdowns change the fitness industry, and what to expect of post-pandemic workouts. Table of Contents: Introduction; The Data; Extract Reddit URLs; Extract Reddit Comments; Clean the Text Data; Fixing the Date Format; Remove Punctuation and Unicode Characters; Sentiment Analysis; R Markdown and other files. Introduction: As lockdown was announced...
7054 sym R (2029 sym/16 pcs)
Project 4: Document Classification
Introduction: Document classification, or document categorization, assigns documents to one or more classes or categories, either manually or algorithmically. Here we use already classified “training” documents from a spam/ham dataset to predict the class of new “test” documents. classDiagram The ...
2710 sym R (9052 sym/38 pcs) 4 img
proposal
library(dplyr) ## ## Attaching package: 'dplyr' ## The following objects are masked from 'package:stats': ## ## filter, lag ## The following objects are masked from 'package:base': ## ## intersect, setdiff, setequal, union Data Preparation # load data library("readxl") data <- read.csv("https://data.cityofnewyork.us/api/views/jb7j-dta...
2120 sym R (3269 sym/13 pcs) 2 img
Document
I created an API key for the NYTimes Books API. I will be looking for books in the health category. I use Postman to preview the output of the web API call as nicely formatted JSON, as many developers do. Loading libraries: library(httr) library(dplyr) ## ## Attaching package: 'dplyr' ## The followin...
356 sym R (1672 sym/10 pcs) 1 tbl
Project 2 - Data Transformation
Objective: The goal of this project is to use R tools to transform and prepare datasets for downstream analysis work. Three datasets will be selected and used in this project. Data I: National Agricultural Statistics Service Dataset. Description: The data will be extracted using the tabulizer library from the following land values report published in ...
3722 sym R (23370 sym/57 pcs) 2 img
Week 5 assignment
Objective of the Assignment: The goal of this assignment is to use R to tidy (reshape) data, using “base R” functionality and Hadley Wickham’s tidyr and dplyr packages. The data used in this assignment describes arrival delays for two airlines across five destinations. Los.Angeles Phoenix San.Diego San.Franciso Seattle A...
1182 sym R (2052 sym/16 pcs) 1 img 1 tbl
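As a rough illustration of the reshaping described in the Week 5 excerpt above, the sketch below turns a wide airline table into long form with tidyr and computes a delay rate with dplyr. The table `wide`, its counts, and the column names `status`, `destination`, and `flights` are assumptions made up for this example; they are not the assignment's actual data or code.

```r
library(tidyr)
library(dplyr)

# Hypothetical wide table: the counts are invented for illustration only
# and only two of the five destinations are shown.
wide <- data.frame(
  airline     = c("A", "A", "B", "B"),
  status      = c("on time", "delayed", "on time", "delayed"),
  Los.Angeles = c(10, 2, 20, 5),
  Phoenix     = c(30, 3, 40, 8),
  stringsAsFactors = FALSE
)

# One row per airline, status, and destination.
long <- wide %>%
  pivot_longer(cols = c(Los.Angeles, Phoenix),
               names_to = "destination",
               values_to = "flights")

# Spread the status back out so the two counts sit side by side,
# then compute the share of delayed flights per airline and destination.
delay_rate <- long %>%
  pivot_wider(names_from = status, values_from = flights) %>%
  mutate(delay_rate = delayed / (delayed + `on time`))

print(delay_rate)
```

Keeping the data long until the final step makes it easy to add the remaining destinations without touching the summary code.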
Project1
Objective of the Assignment: The objective of this project is to process a text file of chess tournament results, extract the needed information, and create an R script that generates a .CSV file. Data source: The provided chess tournament results text file was uploaded to GitHub, and it will ...
1945 sym R (11715 sym/38 pcs)
Week 3 assignment
Objective of the Assignment: In previous assignments we analyzed data that comes in tables. The objective of this assignment is to use regular expressions and essential string functions to analyze data that is not available as a neatly organized dataset but only as plain text. Exercise 1: Using the 173 majors listed in fivethirtyeight.com’s College...
3417 sym R (2079 sym/24 pcs)
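To make the regular-expression idea in the Week 3 excerpt above concrete, here is a minimal base R sketch that filters a character vector of major names by pattern. The vector `majors` and the pattern `"DATA|STATISTICS"` are assumptions chosen for illustration; the excerpt is truncated, so the actual exercise may ask for something different.

```r
# Hypothetical sample of major names; not the full fivethirtyeight list.
majors <- c(
  "DATA SCIENCE",
  "STATISTICS AND DECISION SCIENCE",
  "ART HISTORY",
  "COMPUTER SCIENCE"
)

# grep() with value = TRUE returns the matching strings themselves
# rather than their positions. "|" means "either alternative".
matches <- grep("DATA|STATISTICS", majors, value = TRUE)

print(matches)
```

`grepl()` with the same pattern would return a logical vector instead, which is handy for filtering a data frame column.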
DATA607 | Assignment 1 : Loading Data into a Data Frame
Objective of the Assignment: The goal of this assignment is to practice loading data into a data frame and to study the dataset and its associated description. The data was available from the fivethirtyeight.com datasets. The selected data is behind an article about picking a college major. All data is from American Community Survey 2010-2012 P...
1032 sym R (14603 sym/15 pcs) 2 img 1 tbl
LAB 1 | Introduction to R and RStudio
The goal of this lab is to introduce you to R and RStudio, which you’ll be using throughout the course both to learn the statistical concepts discussed in the textbook and also to analyze real data and come to informed conclusions. To straighten out which is which: R is the name of the programming language itself and RStudio is a convenient inte...
12415 sym R (6745 sym/51 pcs) 7 img