Publications by Souleymane Doumbia, Group Member: Fomba Kassoh
DATA 607 FINAL PROJECT
Final Project Fomba Kassoh & Souleymane Doumbia 2023-12-10 Abstract The analysis provides a comprehensive examination of audiobook data. It employs web scraping techniques, utilizing Selenium and Scraper Spider, to gather audiobook information. The data, initially saved as CSV and JSON, is processed using R, following the OSEMN framework and Had...
15948 sym Python (13847 sym/45 pcs) 10 img
Vignette: Tidyverse Create
##1. Loading the Tidyverse packages and the data into R: library(tidyverse) ## ── Attaching core tidyverse packages ──────────────────────── tidyverse 2.0.0 ── ## ✔ dplyr 1.1.3 ✔ readr 2.1.4 ## ✔ forcats 1.0.0 ✔ stringr 1.5.0 ## ✔ ggplot2 3.4.4 ✔ tibble ...
3538 sym R (4590 sym/12 pcs) 1 img
Sentiment Analysis
library(dplyr) ## ## Attaching package: 'dplyr' ## The following objects are masked from 'package:stats': ## ## filter, lag ## The following objects are masked from 'package:base': ## ## intersect, setdiff, setequal, union library(tidytext) library(stringr) library(XML) library(jsonlite) library(rvest) library(httr) library(tidyverse) ##...
4913 sym R (15041 sym/66 pcs) 6 img
Working with html, xml, and json in R
#install.packages('rvest') #install.packages('XML') #install.packages('jsonlite') library(rvest) library(XML) library(jsonlite) library(httr) # Reading from HTML, also you can access this html text file directly on the web by clicking on the link below. gitHtml <- "https://raw.githubusercontent.com/Doumgit/Working-With-XML-and-JSO...
75 sym R (4451 sym/14 pcs)
statistical inference - Sampling distributions
knitr::opts_chunk$set(root.dir = getwd()) In this lab, you will investigate the ways in which the statistics from a random sample of data can serve as point estimates for population parameters. We’re interested in formulating a sampling distribution of our estimate in order to learn about the properties of the estimate, such as its distribution. ...
15259 sym 5 img
Project2 Data Tidying, Transformation and Analysis
Introduction World Development Indicators The here, we will use the world_development_indicators.csv dataset. Load the relevant libraries We start by installing the relevant packages and loading the libraries as below library(tidyverse) ## ── Attaching core tidyverse packages ──────────────────────�...
3375 sym R (16179 sym/31 pcs) 11 img
Tidying and Transforming Data
Choice of questions for analysis What is the total number of flights by each airline to a destination? What proportion of flights are delayed for each airline per destination? Illustrate using a bar chart. If you want to get to your destination on time, which airlines will you use? What carrier provides the most service and what is the proportion ...
2655 sym R (7366 sym/35 pcs) 3 img
Tidying & Transforming Data
Choice of questions for analysis What is the total number of flights by each airline to a destination? What proportion of flights are delayed for each airline per destination? Illustrate using a bar chart. If you want to get to your destination on time, which airlines will you use? What carrier provides the most service and what is the proportion ...
2645 sym R (8276 sym/33 pcs) 3 img
DATA 607, Project 1
library(tidyverse) ## ── Attaching core tidyverse packages ──────────────────────── tidyverse 2.0.0 ── ## ✔ dplyr 1.1.2 ✔ readr 2.1.4 ## ✔ forcats 1.0.0 ✔ stringr 1.5.0 ## ✔ ggplot2 3.4.3 ✔ tibble 3.2.1 ## ✔ lubridate 1.9.2 ✔ tidyr 1.3.0 ## ...
138 sym R (8833 sym/8 pcs)