Publications by Adriana Medina

Week 10 assignment- Adriana

02.04.2024

Introduction: In Text Mining with R, Chapter 2 looks at Sentiment Analysis. In this assignment, you should start by getting the primary example code from chapter 2 working in an R Markdown document. You should provide a citation to this base code. You’re then asked to extend the code in two ways: Work with a different corpus of your choosing, and...

1346 sym R (8960 sym/50 pcs) 5 img

Data 607- Week 9 assignment- Adriana

24.03.2024

Introduction The assignment for week 9 is to choose one of the New York Times APIs, construct an interface in R to read in the JSON data, and transform it into an R DataFrame. I chose an API that retrieves the most popular article for the most recent day (24hrs). Load library library(httr) library(jsonlite) library(tidyverse) ## ── Attaching c...

429 sym R (2928 sym/12 pcs)

Data 607-Project 3-Project Bluebook

18.03.2024

Introduction: The purpose of this project was to collect the most relevant job skills needed for data science positions. Job descriptions from several job posting websites: Zip Recruiter, Linkedin, Builtin, and Monster were observed and the soft skills and technical skills were collected and put into a table. The table was split up into smaller ta...

3288 sym R (9810 sym/45 pcs) 2 img

Data606 lab 5b

16.03.2024

If you have access to data on an entire population, say the opinion of every adult in the United States on whether or not they think climate change is affecting their local community, it’s straightforward to answer questions like, “What percent of US adults think climate change is affecting their local community?”. Similarly, if you had demog...

11084 sym 4 img 1 tbl

Data 607-Week 7 assignment-Adriana

10.03.2024

Introduction For this assignment I have prepared three separate files in HTML, XML, and JSON formats, each containing the following information about my favorite books: Title Author Published Year Genre Each of the files were loaded into Github and then into R. Load necessary libraries: library(tidyverse) ## ── Attaching core tidyverse pack...

1168 sym R (3376 sym/23 pcs)

Data 607 Project 2

04.03.2024

Project 2 section 1 by Keith DeNivo Introduction: The data set for this section included population estimates based on census data as well as immigration and deaths for the United States. The data set contained population estimates from April to September of 1990 for each sex, race, and age. “Monthly Postcensal Resident Population, by single y...

9360 sym R (35947 sym/121 pcs) 7 img

Data-607-Project-2 Nobel Peace Prize Awards

02.03.2024

Introduction: This is a dataset that I’ve chosen from the Harvard Dataverse containing a list of awarded noble prizes and metadata about the winners. https://dataverse.harvard.edu/dataset.xhtml?persistentId=doi:10.7910/DVN/HYRJDX Load Libraries: library(tidyr) library(dplyr) ## ## Attaching package: 'dplyr' ## The following objects are masked...

1762 sym R (4560 sym/23 pcs) 2 img

Data 607-Assignment-5-Adriana

24.02.2024

R Markdown Load packages library(tidyr) library(dplyr) ## ## Attaching package: 'dplyr' ## The following objects are masked from 'package:stats': ## ## filter, lag ## The following objects are masked from 'package:base': ## ## intersect, setdiff, setequal, union library(ggplot2) Load raw data raw_data_path<-("https://raw.githubusercont...

1179 sym R (4783 sym/19 pcs) 2 img

Lab-2-Adriana Medina

18.02.2024

Some define statistics as the field that focuses on turning information into knowledge. The first step in that process is to summarize and describe the raw information – the data. In this lab we explore flights, specifically a random sample of domestic flights that departed from the three major New York City airports in 2013. We will generate sim...

11310 sym 8 img

Project-1-Adriana Medina

18.02.2024

Introduction In this project, you’re given a text file with chess tournament results where the information has some structure. Your job is to create an R Markdown file that generates a .CSV file (that could for example be imported into a SQL database) with the following information for all of the players: Player’s Name, Player’s State, Total ...

2372 sym R (8082 sym/28 pcs)