Publications by Shamecca Marshall

TidyVerse CREATE assignment

15.11.2023

In this assignment, you’ll practice collaborating around a code project with GitHub. You could consider our collective work as building out a book of examples on how to use TidyVerse functions. GitHub repository: https://github.com/peterkowalchuk/FALL2023TIDYVERSE FiveThirtyEight.com datasets. Kaggle datasets. Your task here is to Create an Examp...

1133 sym R (1807 sym/10 pcs) 3 img

Data 607 - Sentiment Analysis

13.11.2023

library(tidytext) library(textdata) library(janeaustenr) library(dplyr) ## ## Attaching package: 'dplyr' ## The following objects are masked from 'package:stats': ## ## filter, lag ## The following objects are masked from 'package:base': ## ## intersect, setdiff, setequal, union library(stringr) library(tidyr) library(ggplot2) library(t...

79 sym R (12856 sym/67 pcs) 7 img

Lab 8 - Introduction to linear regression

13.11.2023

The Human Freedom Index is a report that attempts to summarize the idea of “freedom” through a bunch of different variables for many countries around the globe. It serves as a rough objective measure for the relationships between the different types of freedom - whether it’s political, religious, economical or personal freedom - and other soc...

10096 sym 7 img

Data 607 - Project 2 UPDATE

26.10.2023

The goal of this assignment is to give you practice in preparing different datasets for downstream analysis work. Your task is to: (1) Choose any three of the “wide” datasets identified in the Week 5 Discussion items. (You may use your own dataset; please don’t use my Sample Post dataset, since that was used in your Week 6 assignment!) For ea...

1544 sym Python (108892 sym/20 pcs) 2 img 2 tbl

Data 606 - Lab 6: Inference for categorical data

15.10.2023

Getting Started Load packages In this lab, we will explore and visualize the data using the tidyverse suite of packages, and perform statistical inference using infer. The data can be found in the companion package for OpenIntro resources, openintro. Let’s load the packages. library(tidyverse) library(openintro) library(infer) The data You will...

8931 sym Python (2677 sym/25 pcs) 1 img

Data 607 - Working with XML and JSON in R

11.10.2023

Instructions Load libary library(DT) library(dplyr) ## ## Attaching package: 'dplyr' ## The following objects are masked from 'package:stats': ## ## filter, lag ## The following objects are masked from 'package:base': ## ## intersect, setdiff, setequal, union library(htmltools) library(readr) library(aws.s3) library(jsonlite) library(RC...

98 sym R (2405 sym/18 pcs) 1 img

Data 607 - Project 2

09.10.2023

The goal of this assignment is to give you practice in preparing different datasets for downstream analysis work. Your task is to: (1) Choose any three of the “wide” datasets identified in the Week 5 Discussion items. (You may use your own dataset; please don’t use my Sample Post dataset, since that was used in your Week 6 assignment!) For ea...

1529 sym R (1801 sym/7 pcs) 1 img

Data 606 - Lab 5a

08.10.2023

In this lab, you will investigate the ways in which the statistics from a random sample of data can serve as point estimates for population parameters. We’re interested in formulating a sampling distribution of our estimate in order to learn about the properties of the estimate, such as its distribution. Setting a seed: We will take some random ...

11267 sym 2 img

Data 606 - Lab 5b

08.10.2023

If you have access to data on an entire population, say the opinion of every adult in the United States on whether or not they think climate change is affecting their local community, it’s straightforward to answer questions like, “What percent of US adults think climate change is affecting their local community?”. Similarly, if you had demog...

9475 sym 1 img 1 tbl

Data 606 - Lab 4

08.10.2023

In this lab, you’ll investigate the probability distribution that is most central to statistics: the normal distribution. If you are confident that your data are nearly normal, that opens the door to many powerful statistical methods. Here we’ll use the graphical tools of R to assess the normality of our data and also learn how to generate rand...

9973 sym Python (3815 sym/31 pcs) 13 img