Publications by Keeno Glanville

DATA606 Project Proposal

23.10.2022

Data Preparation
# load data
library(ggplot2)
library(corrplot)
library(manifestoR)
library(readr)
library(dplyr)
url.test <- "https://raw.githubusercontent.com/kglan/MSDS/main/DATA606/DATA%20606%20Data%20Project%20Proposal/test.csv"
testdata <- read_csv(url(url.test))
url.train <- "https://raw.githubusercontent.com/kglan/MSDS/main/DATA606...

1271 sym R (6079 sym/8 pcs) 2 img

DATA 607 Project 3: In-Demand skills for Data Science roles

24.10.2022

The main goal of the project is to determine which skills employers value most in the field of data science. To answer this question, we decided to examine current job postings and look for the skills that employers most frequently requested and required. Since a data set with the...

1893 sym Python (10680 sym/17 pcs) 4 img

Working with Web APIs

30.10.2022

Setup url and API key
url <- 'https://api.nytimes.com/svc/books/v3/lists/current/hardcover-fiction.json'
api <- 'GUz2FMGJsWSqNi5ZC8ZGDAUkqoscMzAI'
bookurl <- paste(url, '?api-key=', api, sep = '')
Change data from list to Dataframe
booksrough <- fromJSON(bookurl, flatten = TRUE)
books <- data.frame(booksrough$results$books)
View(books)
books...

67 sym

Tidy-text-mining - John F Kennedy’s Inaugural

07.11.2022

Data I tried forking the repository to get the data; however, when I ran the code it was very hard to run because the dependencies seem outdated, so I essentially copied the main functions needed. Reference for expenditure: John F Kennedy’s Inaugural Address: https://www.gutenberg.org/cache/epub/3/pg3-images.html Reference for repository: https...

1269 sym R (14451 sym/85 pcs) 8 img

Coursera Recommendation System

07.11.2022

Coursera Recommendation System Background: Coursera is an educational platform that individuals use to gain knowledge on topics of their choosing. It has become a strong tool for certification, and many individuals use its resources to gain career-changing knowledge. The reason I chose this platform system is due to my person...

4877 sym 3 img

Multiple linear regression

09.11.2022

Grading the professor Many college courses conclude by giving students the opportunity to evaluate the course and the instructor anonymously. However, the use of these student evaluations as an indicator of course quality and teaching effectiveness is often criticized because these measures may reflect the influence of non-teaching related ch...

11011 sym Python (11225 sym/31 pcs) 22 img

Tidyverse EXTEND

21.11.2022

library(tidyverse)
#> Warning: package 'tidyverse' was built under R version 4.2.2
#> ── Attaching packages ─────────────────────────────────────── tidyverse 1.3.2 ──
#> ✔ ggplot2 3.4.0 ✔ purrr 0.3.5
#> ✔ tibble 3.1.8 ✔ dplyr 1.0.10 ...

3173 sym R (7783 sym/18 pcs) 10 img

Document Classification

23.11.2022

It can be useful to be able to classify new “test” documents using already classified “training” documents. A common example is using a corpus of labeled spam and ham (non-spam) e-mails to predict whether or not a new document is spam. For this project, you can start with a spam/ham dataset, then predict the class of new documents (ei...

1381 sym 1 img