Publications by Andreina A

Final project

13.12.2024

Introduction The leading cause of death in NYC for 2020 seemed to be Covid-19, and I would like to make a comparsion wiht the leading cause of death in NYC with the USA, to see if the concerning dieases in NYC werwe also main concerns to the whole USA. I will use data from NYC open data on the leading causes of death in NY since 2007 “https://dat...

4234 sym Python (10902 sym/15 pcs) 1 img

Recommender_Engine_IG

17.11.2024

Recommender Intro A recommender systems is an artifical Intelligence system used to make suggest to customers based on their interactive behavior and patterns, which works by gathering the customers interactive data, storing the data, analyzing the data with machine learing to detect a pattern, filtering the data by applying certain mathematical ru...

4367 sym

Week10_607

10.11.2024

library(tidytext) library(janeaustenr) library(dplyr) ## ## Attaching package: 'dplyr' ## The following objects are masked from 'package:stats': ## ## filter, lag ## The following objects are masked from 'package:base': ## ## intersect, setdiff, setequal, union library(stringr) library(ggplot2) library(gutenbergr) library(wordcloud) ## L...

4362 sym R (10974 sym/55 pcs) 4 img

Project_3_Data_607

03.11.2024

Introduction For this assisgnment we were asked to use data to answer this question, “Which are the most valued data science skills?”. Data from Kaggel “Data Science Job Postings & Skills (2024)” :https://www.kaggle.com/datasets/asaniczka/data-science-job-postings-and-skills?select=job_postings.csv Kaggle data Liscensure Open Data Commons A...

2236 sym R (41748 sym/14 pcs) 1 img

API Assignment

03.11.2024

Assignment 9 The New York Times web site provides a rich set of APIs, as described here: https://developer.nytimes.com/apis You’ll need to start by signing up for an API key. Your task is to choose one of the New York Times APIs, construct an interface in R to read in the JSON data, and transform it into an R DataFrame. The api data I used from N...

1167 sym Python (42987 sym/5 pcs)

Tidyverse_Assignment

02.11.2024

Your task here is to Create an Example. Using one or more TidyVerse packages, and any dataset from fivethirtyeight.com or Kaggle, create a programming sample “vignette” that demonstrates how to use one or more of the capabilities of the selected TidyVerse package with your selected dataset.(25 points) Later, you’ll be asked to extend an exist...

2409 sym R (2140 sym/7 pcs) 1 img

Assignment_7_Data607

20.10.2024

Assignment 7:working with JSON, HTML, XML, and Parquet in R Introduction For this assignment I had to prepare data that was provided from the CUNYMart inventory, which is located at located at 123 Example Street,Anytown, USA. To prepare the data for analysis, I had to turn the given data into JSon, HTML, XML, and Parquet files and then read them i...

3963 sym R (30834 sym/29 pcs)

Project 2

13.10.2024

Project 2 To choose three data sets that are untidy, read it as a CSV file, tidy and transform that data. Loaded needed packages library(tidyverse) ## ── Attaching core tidyverse packages ──────────────────────── tidyverse 2.0.0 ── ## ✔ dplyr 1.1.4 ✔ readr 2.1.5 ## ✔ forcats ...

3947 sym R (16213 sym/40 pcs) 2 img

Project 1 Chess Tournament Results

06.10.2024

Introduction Given a text file with the results of a chess tournament, I had to create a R Markdown file that generates the .CSV file and it had to have the players name, player’s state, total number of points, player’s pre-Rating, and average pre chess rating of opponents. For the text file, I was able to imported into R markdown and transform...

3378 sym R (10894 sym/22 pcs)

Assignment 4 Data 607

29.09.2024

Loading packages library(tidyverse) ## ── Attaching core tidyverse packages ──────────────────────── tidyverse 2.0.0 ── ## ✔ dplyr 1.1.4 ✔ readr 2.1.5 ## ✔ forcats 1.0.0 ✔ stringr 1.5.1 ## ✔ ggplot2 3.5.1 ✔ tibble 3.2.1 ## ✔ lubridate 1.9.3 ✔ ti...

2859 sym R (5012 sym/19 pcs) 2 img