Publications by Anthony Josue Roman

Data607: Final Project

09.12.2024

Table of contents Introduction Data Acquisition Data Cleaning and Transformation Analysis Techniques The Data Exploratory Data Analysis (EDA) Effective Temperature Distribution Stellar Radius vs. Mass Logarithmic Luminosity Distribution Habitable Zone by Star Type Orbital Distance vs. Exoplanet Size Advanced Analysis Habital Zone Analysis St...

12427 sym 8 img

Data607 Project 4: Document Classification

01.12.2024

Project 4: Document Classification Author Anthony Josue Roman Introduction Document classification is a fundamental task in natural language processing, finding its applications in a number of real-world scenarios. Other than spam detection in e-mails, document classification can be used to categorize customer feedback for sentiment analysis, p...

5963 sym 3 img

Data607: TidyverseCREATE

11.11.2024

Data607: Tidyverse Create Author Anthony Josue Roman library(tidyverse) ── Attaching core tidyverse packages ──────────────────────── tidyverse 2.0.0 ── ✔ dplyr 1.1.4 ✔ readr 2.1.5 ✔ forcats 1.0.0 ✔ stringr 1.5.1 ✔ ggplot2 3.5.1 ✔ tibble 3.2.1 ...

7319 sym Python (23112 sym/56 pcs) 12 img

Data607: Scenario Design Analysis of Steam's Recommender System

11.11.2024

Data607: Scenario Design Analysis of Steam’s Recommender System Author Anthony Josue Roman 1. Introduction Steam, developed by Valve Corporation, is a leading digital distribution platform for video games. With an extensive library of titles, Steam employs sophisticated recommender systems to assist users in discovering games that align with ...

3032 sym

Data607: Sentiment Analysis

04.11.2024

Data607: Sentiment Analysis Author Anthony Josue Roman General Overview In this Assignment, we will obtain a code example from Chapter 2 of Textmining with R. The following assignment will be accessible via my GitHub Repository. The following libraries will be used in this assignment: library(stringr) library(dplyr) library(tidyr) library(t...

4529 sym Python (30766 sym/129 pcs) 13 img

Data606 Project Proposal

04.11.2024

Data606 Project Proposal Author Anthony Josue Roman Data Preperation # Load Gaia data rawgaia <- getURL("https://raw.githubusercontent.com/spacerome/Data607_Project_2/refs/heads/main/gaiadata.csv") rawgaia2 <- getURL("https://raw.githubusercontent.com/spacerome/Data607_Project_2/refs/heads/main/gaiadata2.csv") # Prepare the data gaiadf ...

2551 sym Python (10734 sym/18 pcs) 4 img

Data607: Web APIs

27.10.2024

Data607: Web APIs Author Anthony Josue Roman Introduction This assignment is done for the purpose of expounding on how to use Web APIs to get and analyze data. More precisely, the assignment is going to deal with the use of New York Times APIs to gain access to real-time, structured data using programmatic requests. By constructing an interface...

5820 sym Python (5655 sym/8 pcs)

Data607: Project 3 - Data Science Skills

21.10.2024

Data607: Project 3 - Data Science Skills Author Anthony Roman, John Ferrara, Alinzon Simon, Akeem Lawrence, Ben Wolin Introduction The aim of this project is to find out what skills in data science are mostly demanded in the labor market that is now booming. Data science has become a highly important field in all sectors, where innovation and d...

13468 sym Python (13149 sym/38 pcs) 4 img 2 tbl

Data607: Working with XML and JSON in R

14.10.2024

This assignment will use the following packages: library(bslib) library(readr) library(RCurl) library(stringr) library(dplyr) library(tidyr) library(tidyverse) library(ggplot2) library(knitr) library(kableExtra) library(xml2) library(rvest) library(jsonlite) Overview The purpose of this assignment is to work with HTML, XML, and JSON f...

1887 sym R (3062 sym/19 pcs) 3 tbl

Data607 - Project 2: Data Transformation with Astronomical Data

07.10.2024

This assignment will use the following packages: library(readr) library(RCurl) library(stringr) library(dplyr) library(tidyr) library(tidyverse) library(ggplot2) library(knitr) library(kableExtra) Introduction Astronomy has played a key role in advancement of human civilization and to understand our universe. Exoplanets, planets not apart...

12047 sym R (11294 sym/29 pcs) 7 img