Publications by Vanita Thompson
DATA 605 Final
Load Libraries library(kableExtra) library(tidyverse) ## -- Attaching packages --------------------------------------- tidyverse 1.3.0 -- ## v ggplot2 3.3.0 v purrr 0.3.3 ## v tibble 3.0.0 v dplyr 0.8.4 ## v tidyr 1.0.2 v stringr 1.4.0 ## v readr 1.3.1 v forcats 0.5.0 ## -- Conflicts ----------------------------------...
6086 sym R (2444096 sym/102 pcs) 7 img 1 tbl
DATA 605 HW 15
Find the equation of the regression line for the given points. Round any final values to the nearest hundredth, if necessary. ( 5.6, 8.8 ), ( 6.3, 12.4 ), ( 7, 14.8 ), ( 7.7, 18.2 ), ( 8.4, 20.8 ) df = data.frame(rbind(c( 5.6, 8.8 ), c( 6.3, 12.4 ), c( 7, 14.8 ), c( 7.7, 18.2 ), c( 8.4, 20.8 ))) model <- lm(df$X2 ~ df$X1, df) plot(df$X2 ~ df...
2247 sym R (275 sym/3 pcs) 1 img
Blog 4
Total COVID-19 Cases vs Population Introduction In this project I will analyze the relationship between the Novel Coronavirus (COVID-19)and population around the world, to demonstrate dataprocessing and visualisation with R, tidyverse and ggplot2. Loading Libraries library(magrittr) # pipe operations library(tidyverse) # ggplot2, tidyr, dplyr....
3088 sym R (11772 sym/49 pcs) 8 img 3 tbl
DATA 606 Data Project
Introduction The purpose of this study is to answer the question, “Which are the most valued data science skills?” I have had experiences applying for jobs, but was relatively new to the data science vernacular. For example, I am student with a background in chemical engineering. This project is my endeavor in how I answered this question. A...
15953 sym R (12273 sym/51 pcs) 11 img
DATA 605 HW 13
Use integration by substitution to solve the integral below: ∫4e−7xdx u=−7x du=−7dx dx=du−7 ∫4eudu−7 4−7∫eudu 4−7eu+C 4−7e−7x+C Biologists are treating a pond contaminated with bacteria. The level of contamination is changing at a rate of dNdt=−3150t4−220 bacteria per cubic centimeter per day, where t is the number o...
2046 sym R (231 sym/3 pcs) 1 img
DATA 605 HW 12
The attached who.csv dataset contains real-world data from 2008. The variables included follow. Country: name of the country LifeExp: average life expectancy for the country in years InfantSurvival: proportion of those surviving to one year or more Under5Survival: proportion of those surviving to five years or more TBFree: proportion of the popul...
1946 sym R (7430 sym/23 pcs) 3 img
DATA 607 Data In Context
Data Reduction, Latent Information and Predictions Background Chaoter 12 of "Data Science For Business", discusses data reduction, latent information, and how they can be a useful tool. It can be useful to manipulate, or "tidy" larger datasets and replace them with a smaller data, while preserving information from the larger dataset. In many...
2947 sym R (33604 sym/37 pcs) 1 img 5 tbl
DATA 607 HW10
Introduction For this assignment, I am tasked with getting an example from Text Mining with R running and then extending the example to a new corpus and a neew sentiment lexicon. Sections 1-6 are directly from Text Mining with R1. I attempted using a mendeley github, and encountered numerous errors. remotes::install_github("zeehio/mendeleyr") ## ...
898 sym R (10515 sym/119 pcs)
DATA 607 Tidyverse
Tidyverse vignette with stringr The stringr package is the package of choice for working with character strings in Tidyverse. We will be using the pipe notation %>%for the vignette. library(tidyverse) Load the data into R from Kaggle containing information about breweries in the United States from the following link: https://www.kaggle.com/brkurz...
866 sym R (6598 sym/25 pcs)
DATA 607 HW 9
Assignment 9 Creating an interface with the NY Times API and read in the json data to extract some information to be stored in a data frame. Loading Libraries Load required libraries library(tidyverse) library(httr) library(kableExtra) library(readxl) Displaying tables showtable <- function(data, title) { kable(data, caption = title) %>% ...
1295 sym R (2179 sym/8 pcs) 2 tbl