Publications by Souleymane Doumbia, Group Member: Fomba Kassoh

DATA 607 FINAL PROJECT

13.12.2023

Final Project Fomba Kassoh & Souleymane Doumbia 2023-12-10 Abstract The analysis provides a comprehensive examination of audiobook data. It employs web scraping techniques, utilizing Selenium and Scraper Spider, to gather audiobook information. The data, initially saved as CSV and JSON, is processed using R, following the OSEMN framework and Had...

15948 sym Python (13847 sym/45 pcs) 10 img

Vignette: Tidyverse Create

13.11.2023

##1. Loading the Tidyverse packages and the data into R: library(tidyverse) ## ── Attaching core tidyverse packages ──────────────────────── tidyverse 2.0.0 ── ## ✔ dplyr 1.1.3 ✔ readr 2.1.4 ## ✔ forcats 1.0.0 ✔ stringr 1.5.0 ## ✔ ggplot2 3.4.4 ✔ tibble ...

3538 sym R (4590 sym/12 pcs) 1 img

Sentiment Analysis

13.11.2023

library(dplyr) ## ## Attaching package: 'dplyr' ## The following objects are masked from 'package:stats': ## ## filter, lag ## The following objects are masked from 'package:base': ## ## intersect, setdiff, setequal, union library(tidytext) library(stringr) library(XML) library(jsonlite) library(rvest) library(httr) library(tidyverse) ##...

4913 sym R (15041 sym/66 pcs) 6 img

Working with html, xml, and json in R

22.10.2023

#install.packages('rvest') #install.packages('XML') #install.packages('jsonlite') library(rvest) library(XML) library(jsonlite) library(httr) # Reading from HTML, also you can access this html text file directly on the web by clicking on the link below. gitHtml <- "https://raw.githubusercontent.com/Doumgit/Working-With-XML-and-JSO...

75 sym R (4451 sym/14 pcs)

statistical inference - Sampling distributions

12.10.2023

knitr::opts_chunk$set(root.dir = getwd()) In this lab, you will investigate the ways in which the statistics from a random sample of data can serve as point estimates for population parameters. We’re interested in formulating a sampling distribution of our estimate in order to learn about the properties of the estimate, such as its distribution. ...

15259 sym 5 img

Project2 Data Tidying, Transformation and Analysis

16.10.2023

Introduction World Development Indicators The here, we will use the world_development_indicators.csv dataset. Load the relevant libraries We start by installing the relevant packages and loading the libraries as below library(tidyverse) ## ── Attaching core tidyverse packages ──────────────────────�...

3375 sym R (16179 sym/31 pcs) 11 img

Tidying and Transforming Data

09.10.2023

Choice of questions for analysis What is the total number of flights by each airline to a destination? What proportion of flights are delayed for each airline per destination? Illustrate using a bar chart. If you want to get to your destination on time, which airlines will you use? What carrier provides the most service and what is the proportion ...

2655 sym R (7366 sym/35 pcs) 3 img

Tidying & Transforming Data

09.10.2023

Choice of questions for analysis What is the total number of flights by each airline to a destination? What proportion of flights are delayed for each airline per destination? Illustrate using a bar chart. If you want to get to your destination on time, which airlines will you use? What carrier provides the most service and what is the proportion ...

2645 sym R (8276 sym/33 pcs) 3 img

DATA 607, Project 1

30.09.2023

library(tidyverse) ## ── Attaching core tidyverse packages ──────────────────────── tidyverse 2.0.0 ── ## ✔ dplyr 1.1.2 ✔ readr 2.1.4 ## ✔ forcats 1.0.0 ✔ stringr 1.5.0 ## ✔ ggplot2 3.4.3 ✔ tibble 3.2.1 ## ✔ lubridate 1.9.2 ✔ tidyr 1.3.0 ## ...

138 sym R (8833 sym/8 pcs)