Publications by Kenan Sooklall

SPS R Bridge Homework 2

06.01.2021

library(RCurl) Dataset - Fertility and Contraception Data - https://vincentarelbundock.github.io/Rdatasets/csv/carData/Robey.csv Docs - https://vincentarelbundock.github.io/Rdatasets/doc/carData/Robey.html Column data: region -A factor with levels: Africa; Asia, Asia and Pacific; Latin.Amer, Latin America and Caribbean; Near.East, Near East and...

1799 sym R (5682 sym/24 pcs)

SPS R Bridge Homework 1

15.12.2020

Write a loop that calculates 12-factorial fact <- 1 for (i in c(1:12)) { fact <- fact * i } print(factorial(12) == fact) ## [1] TRUE sprintf('12 factorial = %i', fact) ## [1] "12 factorial = 479001600" Show how to create a numeric vector that contains the sequence from 20 to 50 by 5. vect1 <- c(20:50) vect1 <- as.numeric(vect1) vect1 <- vect...

317 sym R (711 sym/12 pcs)

Homework 3

11.01.2021

Storm track dataset As the planet temperature has been increasing due to human-made global warming it has created conditions that increase the chances of extreme weather. This report is will investigate the claim of increasing storm rates. Description This data is a subset of the NOAA Atlantic hurricane database best track data, https://www.nhc.n...

2449 sym R (3520 sym/10 pcs) 7 img

DATA-606 Lab 1

02.02.2021

library(tidyverse) library(openintro) Exercise 1 arbuthnot$girls ## [1] 4683 4457 4102 4590 4839 4820 4928 4605 4457 4952 4784 5332 5200 4910 4617 ## [16] 3997 3919 3395 3536 3181 2746 2722 2840 2908 2959 3179 3349 3382 3289 3013 ## [31] 2781 3247 4107 4803 4881 5681 4858 4319 5322 5560 5829 5719 6061 6120 5822 ## [46] 5738 5717 5847 6203 6033 ...

6295 sym R (1805 sym/13 pcs) 3 img

DATA-607 Homework 1

05.02.2021

## ── Attaching packages ─────────────────────────────────────── tidyverse 1.3.0 ── ## ✓ ggplot2 3.3.3 ✓ purrr 0.3.4 ## ✓ tibble 3.0.4 ✓ dplyr 1.0.2 ## ✓ tidyr 1.1.2 ✓ stringr 1.4.0 ## ✓ readr 1.4.0 ✓ forcats 0.5.0 ## ─...

3750 sym R (5118 sym/16 pcs) 7 img

DATA-607 Project 5

07.05.2021

The data used in this analysis came from two sources - IMDB - Schrute library Both data sources were parsed and combined into one data set TheOffice. IMDB calls the office “A mockumentary on a group of typical office workers, where the workday consists of ego clashes, inappropriate behavior, and tedium.” I have seen the office several times a...

5138 sym R (7507 sym/22 pcs) 13 img

DATA-607 Project 1

27.02.2021

Description The goal of this project is to parse chess tournament results into: Player Number, Player’s Name, Player’s State, Total Number of Points, Player’s Pre-Rating, and Average Pre Chess Rating of Opponents Read the file with read lines, the data can be obtained from my github lines <- read_lines("https://raw.githubusercontent.com/kso...

1354 sym R (4630 sym/14 pcs) 3 img

DATA-605 Homework 5

27.02.2021

Problem 1 - Bayesian A new test for multinucleoside-resistant (MNR) human immunodeficiency virus type 1 (HIV-1) variants was recently developed. The test maintains 96% sensitivity, meaning that, for those with the disease, it will correctly report “positive” for 96% of them. The test is also 98% specific, meaning that, for those without the d...

6683 sym R (1271 sym/45 pcs)

DATA-605 Homework 4

22.02.2021

The goal of this homework is to show how PCA can take advantage of the linear combination between pixel values to create a lower dimensional representation. Set up global variables library(EBImage) library(OpenImageR) ## ## Attaching package: 'OpenImageR' ## The following objects are masked from 'package:EBImage': ## ## readImage, writeImag...

1334 sym R (2127 sym/13 pcs) 2 img

DATA-607 Project 3

03.04.2021

Fulltime Job Listings in LA The first 2 data sets we decided to work with are filltime job listing scraped from job sites. We decided to use LA and NY. The job description column is what we are interested in. After reading the data into a dataframe we clean it up by removing any html and new line characters. jobDescriptions <- read.csv('https://r...

3476 sym R (9724 sym/27 pcs) 13 img