Publications by Shamecca Marshall

Data 607 - Project 4

04.12.2023

Instructions It can be useful to be able to classify new “test” documents using already classified “training” documents. A common example is using a corpus of labeled spam and ham (non-spam) e-mails to predict whether or not a new document is spam. For this project, you can start with a spam/ham dataset, then predict the class of new docume...

2732 sym Python (5202 sym/14 pcs)

Data 607 - Spotify's Recommender Systems

03.12.2023

Assignment Instructions Your task is to analyze an existing recommender system that you find interesting. You should: Perform a Scenario Design analysis as described below. Consider whether it makes sense for your selected recommender system to perform scenario design twice, once for the organization (e.g. Amazon.com) and once for the organization...

3070 sym 4 img

Lab 8 - Multiple linear regression

26.11.2023

Grading the professor Many college courses conclude by giving students the opportunity to evaluate the course and the instructor anonymously. However, the use of these student evaluations as an indicator of course quality and teaching effectiveness is often criticized because these measures may reflect the influence of non-teaching related characte...

12007 sym 13 img

Data 607 - TidyVerse CREATE assignment

15.11.2023

In this assignment, you’ll practice collaborating around a code project with GitHub. You could consider our collective work as building out a book of examples on how to use TidyVerse functions. GitHub repository: https://github.com/peterkowalchuk/FALL2023TIDYVERSE FiveThirtyEight.com datasets. Kaggle datasets. Your task here is to Extend an Exist...

597 sym R (1807 sym/10 pcs) 3 img

TidyVerse CREATE assignment

15.11.2023

In this assignment, you’ll practice collaborating around a code project with GitHub. You could consider our collective work as building out a book of examples on how to use TidyVerse functions. GitHub repository: https://github.com/peterkowalchuk/FALL2023TIDYVERSE FiveThirtyEight.com datasets. Kaggle datasets. Your task here is to Create an Examp...

1133 sym R (1807 sym/10 pcs) 3 img

Data 607 - Sentiment Analysis

13.11.2023

library(tidytext) library(textdata) library(janeaustenr) library(dplyr) ## ## Attaching package: 'dplyr' ## The following objects are masked from 'package:stats': ## ## filter, lag ## The following objects are masked from 'package:base': ## ## intersect, setdiff, setequal, union library(stringr) library(tidyr) library(ggplot2) library(t...

79 sym R (12856 sym/67 pcs) 7 img

Lab 8 - Introduction to linear regression

13.11.2023

The Human Freedom Index is a report that attempts to summarize the idea of “freedom” through a bunch of different variables for many countries around the globe. It serves as a rough objective measure for the relationships between the different types of freedom - whether it’s political, religious, economical or personal freedom - and other soc...

10096 sym 7 img

Data 607 - Project 2 UPDATE

26.10.2023

The goal of this assignment is to give you practice in preparing different datasets for downstream analysis work. Your task is to: (1) Choose any three of the “wide” datasets identified in the Week 5 Discussion items. (You may use your own dataset; please don’t use my Sample Post dataset, since that was used in your Week 6 assignment!) For ea...

1544 sym Python (108892 sym/20 pcs) 2 img 2 tbl

Data 606 - Lab 6: Inference for categorical data

15.10.2023

Getting Started Load packages In this lab, we will explore and visualize the data using the tidyverse suite of packages, and perform statistical inference using infer. The data can be found in the companion package for OpenIntro resources, openintro. Let’s load the packages. library(tidyverse) library(openintro) library(infer) The data You will...

8931 sym Python (2677 sym/25 pcs) 1 img

Data 607 - Working with XML and JSON in R

11.10.2023

Instructions Load libary library(DT) library(dplyr) ## ## Attaching package: 'dplyr' ## The following objects are masked from 'package:stats': ## ## filter, lag ## The following objects are masked from 'package:base': ## ## intersect, setdiff, setequal, union library(htmltools) library(readr) library(aws.s3) library(jsonlite) library(RC...

98 sym R (2405 sym/18 pcs) 1 img