Publications by Laura Burenkov

Sentiment Analysis

06.11.2023

In this assignment, you should start by getting the primary example code from chapter 2 working in an R Markdown document. You should provide a citation to this base code. You’re then asked to extend the code in two ways: Work with a different corpus of your choosing, and Incorporate at least one additional sentiment lexicon (possibly from an...

10757 sym Python (12130 sym/69 pcs) 7 img

Project proposal

30.10.2023

Data preparation #Loading packages suppressWarnings({ library(tidyverse) library(openintro) }) Loading dataset url <- "https://raw.githubusercontent.com/lburenkov/Diabetes/main/diabetes_012_health_indicators_BRFSS2015.csv" # Replace with the URL of your CSV data data <- read.csv(url) Research question Can we predict diabetes based on he...

3295 sym

Web APIs

30.10.2023

Introduction The New York Times web site provides a rich set of APIs, as described here: https://developer.nytimes.com/apis You’ll need to start by signing up for an API key. Your task is to choose one of the New York Times APIs, construct an interface in R to read in the JSON data, and transform it into an R DataFrame. #Loading packages lib...

2318 sym Python (108104 sym/24 pcs)

Web APIs

30.10.2023

Introduction The New York Times web site provides a rich set of APIs, as described here: https://developer.nytimes.com/apis You’ll need to start by signing up for an API key. Your task is to choose one of the New York Times APIs, construct an interface in R to read in the JSON data, and transform it into an R DataFrame. #Loading packages lib...

2360 sym 1 tbl

Project 3

25.10.2023

Introduction W. Edwards Deming said, “In God we trust, all others must bring data.” Please use data to answer the question, “Which are the most valued data science skills?” Consider your work as an exploration; there is not necessarily a “right answer.” Through data exploration and analysis, I explore the domain of employment opport...

25640 sym Python (101562 sym/51 pcs) 3 img

Lab 7

22.10.2023

library(tidyverse) ## Warning: package 'tidyverse' was built under R version 4.2.3 ## Warning: package 'ggplot2' was built under R version 4.2.3 ## Warning: package 'tibble' was built under R version 4.2.3 ## Warning: package 'tidyr' was built under R version 4.2.3 ## Warning: package 'readr' was built under R version 4.2.3 ## Warning: package 'pur...

19622 sym Python (10008 sym/95 pcs) 4 img

Lab 6

16.10.2023

Loading dataset library(tidyverse) ## Warning: package 'tidyverse' was built under R version 4.2.3 ## Warning: package 'ggplot2' was built under R version 4.2.3 ## Warning: package 'tibble' was built under R version 4.2.3 ## Warning: package 'tidyr' was built under R version 4.2.3 ## Warning: package 'readr' was built under R version 4.2.3 ## Warni...

14171 sym Python (6299 sym/58 pcs) 4 img

Working with XML and JSON in R

17.10.2023

Introduction Pick three of your favorite books on one of your favorite subjects. At least one of the books should have more than one author. For each book, include the title, authors, and two or three other attributes that you find interesting. Take the information that you’ve selected about these three books, and separately create three file...

7845 sym

Project 2

09.10.2023

Information The goal of this assignment is to give you practice in preparing different data sets for downstream analysis work. Your task is to: Choose any three of the “wide” datasets identified in the Week 6 Discussion items. (You may use your own dataset; please don’t use my Sample Post dataset, since that was used in your Week 5 assignm...

13335 sym 6 img

Data 607 week 5

02.10.2023

Assignment Create a .CSV file (or optionally, a MySQL database!) that includes all of the information above. You’re encouraged to use a “wide” structure similar to how the information appears above, so that you can practice tidying and transformations as described below. Read the information from your .CSV file into R, and use tidyr and d...

969 sym Python (6620 sym/35 pcs) 1 img