Publications by Shamecca Marshall
Data 607 - Project 4
Instructions It can be useful to be able to classify new “test” documents using already classified “training” documents. A common example is using a corpus of labeled spam and ham (non-spam) e-mails to predict whether or not a new document is spam. For this project, you can start with a spam/ham dataset, then predict the class of new docume...
2732 sym Python (5202 sym/14 pcs)
Data 607 - Spotify's Recommender Systems
Assignment Instructions Your task is to analyze an existing recommender system that you find interesting. You should: Perform a Scenario Design analysis as described below. Consider whether it makes sense for your selected recommender system to perform scenario design twice, once for the organization (e.g. Amazon.com) and once for the organization...
3070 sym 4 img
Lab 8 - Multiple linear regression
Grading the professor Many college courses conclude by giving students the opportunity to evaluate the course and the instructor anonymously. However, the use of these student evaluations as an indicator of course quality and teaching effectiveness is often criticized because these measures may reflect the influence of non-teaching related characte...
12007 sym 13 img
Data 607 - TidyVerse CREATE assignment
In this assignment, you’ll practice collaborating around a code project with GitHub. You could consider our collective work as building out a book of examples on how to use TidyVerse functions. GitHub repository: https://github.com/peterkowalchuk/FALL2023TIDYVERSE FiveThirtyEight.com datasets. Kaggle datasets. Your task here is to Extend an Exist...
597 sym R (1807 sym/10 pcs) 3 img
TidyVerse CREATE assignment
In this assignment, you’ll practice collaborating around a code project with GitHub. You could consider our collective work as building out a book of examples on how to use TidyVerse functions. GitHub repository: https://github.com/peterkowalchuk/FALL2023TIDYVERSE FiveThirtyEight.com datasets. Kaggle datasets. Your task here is to Create an Examp...
1133 sym R (1807 sym/10 pcs) 3 img
Data 607 - Sentiment Analysis
library(tidytext) library(textdata) library(janeaustenr) library(dplyr) ## ## Attaching package: 'dplyr' ## The following objects are masked from 'package:stats': ## ## filter, lag ## The following objects are masked from 'package:base': ## ## intersect, setdiff, setequal, union library(stringr) library(tidyr) library(ggplot2) library(t...
79 sym R (12856 sym/67 pcs) 7 img
Lab 8 - Introduction to linear regression
The Human Freedom Index is a report that attempts to summarize the idea of “freedom” through a bunch of different variables for many countries around the globe. It serves as a rough objective measure for the relationships between the different types of freedom - whether it’s political, religious, economical or personal freedom - and other soc...
10096 sym 7 img
Data 607 - Project 2 UPDATE
The goal of this assignment is to give you practice in preparing different datasets for downstream analysis work. Your task is to: (1) Choose any three of the “wide” datasets identified in the Week 5 Discussion items. (You may use your own dataset; please don’t use my Sample Post dataset, since that was used in your Week 6 assignment!) For ea...
1544 sym Python (108892 sym/20 pcs) 2 img 2 tbl
Data 606 - Lab 6: Inference for categorical data
Getting Started Load packages In this lab, we will explore and visualize the data using the tidyverse suite of packages, and perform statistical inference using infer. The data can be found in the companion package for OpenIntro resources, openintro. Let’s load the packages. library(tidyverse) library(openintro) library(infer) The data You will...
8931 sym Python (2677 sym/25 pcs) 1 img
Data 607 - Working with XML and JSON in R
Instructions Load libary library(DT) library(dplyr) ## ## Attaching package: 'dplyr' ## The following objects are masked from 'package:stats': ## ## filter, lag ## The following objects are masked from 'package:base': ## ## intersect, setdiff, setequal, union library(htmltools) library(readr) library(aws.s3) library(jsonlite) library(RC...
98 sym R (2405 sym/18 pcs) 1 img