Publications by Gabriella Martinez

DATA 607 Project 4

01.05.2021

Document Classification Assignment Overview It can be useful to be able to classify new “test” documents using already classified “training” documents. A common example is using a corpus of labeled spam and ham (non-spam) e-mails to predict whether or not a new document is spam. For this project, you can start with a spam/ham dataset, th...

15227 sym R (42768 sym/47 pcs) 7 img 2 tbl

DATA605 Week 13 Post

26.04.2021

FORUM DESCRIPTION Using R, provide the solution for any exercise in either Chapter 4 or Chapter 7 of the calculus textbook. If you are unsure of your solution, post your concerns. library(mosaic) Exercises from Chapter 4.3 Maximization Find the maximum product of two numbers (not necessarily integers) that have a sum of 100. Answer: x=50, y=50...

2953 sym R (1042 sym/23 pcs)
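The maximization exercise quoted in the entry above (two numbers with a sum of 100 and the largest possible product) can be checked by hand. The following is a minimal sketch of the standard single-variable argument, not necessarily the approach taken in the post itself; the symbol $P$ for the product is introduced here for illustration.

```latex
\begin{align*}
x + y &= 100 \quad\Longrightarrow\quad y = 100 - x \\
P(x) &= x\,(100 - x) = 100x - x^{2} \\
P'(x) &= 100 - 2x = 0 \quad\Longrightarrow\quad x = 50 \\
P''(x) &= -2 < 0 \quad\text{(so the critical point is a maximum)} \\
y &= 100 - 50 = 50, \qquad P(50) = 50 \cdot 50 = 2500
\end{align*}
```

This agrees with the answer stated in the entry, x=50 and y=50.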

DATA607 TidyVerse Extend

24.04.2021

Dplyr TidyVerse Vignette CREATE Assignment Overview Your task here is to Create an Example. Using one or more TidyVerse packages, and any dataset from fivethirtyeight.com or Kaggle, create a programming sample “vignette” that demonstrates how to use one or more of the capabilities of the selected TidyVerse package with your selected dataset...

8046 sym R (9837 sym/29 pcs)

DATA605 HW12

23.04.2021

Regression Analysis in R Packages library(RCurl) library(readr) library(ggplot2) library(dplyr) Load the Data x <- url("https://raw.githubusercontent.com/gabbypaola/DATA605/main/who.csv") who <- read_csv(x) tibble(who) ## # A tibble: 190 x 10 ## Country LifeExp InfantSurvival Under5Survival TBFree PropMD PropRN PersExp ## <chr> ...

5760 sym R (5287 sym/34 pcs) 8 img

DATA607 Data Science in Context Presentation

23.04.2021

Data Science in Context Presentation Gabriella Martinez 4/21/2021 Sentiment Analysis and Early Detection of Depression What is Sentiment Analysis? Sentiment Analysis is the most common text classification tool that analyses an incoming message and tells whether the underlying sentiment is positive, negative, or neutral. What is Depressi...

5696 sym 5 img

DATA 607 Week 11 Discussion

21.04.2021

Discussion Prompt Your task is to analyze an existing recommender system that you find interesting. You should: 1. Perform a Scenario Design analysis as described below. Consider whether it makes sense for your selected recommender system to perform scenario design twice, once for the organization (e.g. Amazon.com) and once for the organizatio...

5366 sym 2 img

DATA607 HW10 republish

19.04.2021

Assignment Overview Re-create and analyze primary code from Text Mining with R 1. Provide citation to text book, using a standard citation syntax like APA or MLA. Identify and implement a different corpus to perform sentiment analysis. Identify and implement an additional lexicon for sentiment analysis. Packages library(RCurl) library(tidyverse...

9108 sym R (9230 sym/27 pcs) 12 img

DATA605 HW11

19.04.2021

Assignment Overview Using the “cars” dataset in R, build a linear model for stopping distance as a function of speed and replicate the analysis of your textbook chapter 3 (visualization, quality evaluation of the model, and residual analysis.) Load the Data data("cars") head(cars,3) ## speed dist ## 1 4 2 ## 2 4 10 ## 3 ...

1022 sym R (1203 sym/12 pcs) 8 img

DATA605 Week 11 Post

22.04.2021

FORUM DESCRIPTION Using R, build a regression model for data that interests you. Conduct residual analysis. Was the linear model appropriate? Why or why not? This discussion uses the hsb2 dataset from the openintro library which is sourced from the UCLA Institute for Digital Research & Education - Statistical Consulting. The variables taken into...

3077 sym R (2655 sym/9 pcs) 6 img

Principal Component Analysis and BBC poll of best hip hop songs ever

10.12.2021

TidyTuesday 04-14-2020 What is TidyTuesday? Before going into the code-along to Julia Silge’s screencast, note that TidyTuesday is a weekly social data project using the R programming language. For those unfamiliar with what “tidy” data is, what it looks like, and how to produce it, the Tidy Data and tidyr YouTube video is a gentle introduction to data wrangling...

6020 sym R (18573 sym/57 pcs) 7 img