Publications by Anthony Josue Roman
Data607: Final Project
Table of contents Introduction Data Acquisition Data Cleaning and Transformation Analysis Techniques The Data Exploratory Data Analysis (EDA) Effective Temperature Distribution Stellar Radius vs. Mass Logarithmic Luminosity Distribution Habitable Zone by Star Type Orbital Distance vs. Exoplanet Size Advanced Analysis Habitable Zone Analysis St...
12427 sym 8 img
Data607 Project 4: Document Classification
Project 4: Document Classification Author Anthony Josue Roman Introduction Document classification is a fundamental task in natural language processing with applications in many real-world scenarios. Beyond spam detection in email, document classification can be used to categorize customer feedback for sentiment analysis, p...
5963 sym 3 img
Data607: Tidyverse CREATE
Data607: Tidyverse Create Author Anthony Josue Roman library(tidyverse) ── Attaching core tidyverse packages ──────────────────────── tidyverse 2.0.0 ── ✔ dplyr 1.1.4 ✔ readr 2.1.5 ✔ forcats 1.0.0 ✔ stringr 1.5.1 ✔ ggplot2 3.5.1 ✔ tibble 3.2.1 ...
7319 sym Python (23112 sym/56 pcs) 12 img
Data607: Scenario Design Analysis of Steam's Recommender System
Data607: Scenario Design Analysis of Steam’s Recommender System Author Anthony Josue Roman 1. Introduction Steam, developed by Valve Corporation, is a leading digital distribution platform for video games. With an extensive library of titles, Steam employs sophisticated recommender systems to assist users in discovering games that align with ...
3032 sym
Data607: Sentiment Analysis
Data607: Sentiment Analysis Author Anthony Josue Roman General Overview In this assignment, we will obtain a code example from Chapter 2 of Text Mining with R. This assignment will be accessible via my GitHub repository. The following libraries will be used in this assignment: library(stringr) library(dplyr) library(tidyr) library(t...
4529 sym Python (30766 sym/129 pcs) 13 img
Data606 Project Proposal
Data606 Project Proposal Author Anthony Josue Roman Data Preparation # Load Gaia data rawgaia <- getURL("https://raw.githubusercontent.com/spacerome/Data607_Project_2/refs/heads/main/gaiadata.csv") rawgaia2 <- getURL("https://raw.githubusercontent.com/spacerome/Data607_Project_2/refs/heads/main/gaiadata2.csv") # Prepare the data gaiadf ...
2551 sym Python (10734 sym/18 pcs) 4 img
Data607: Web APIs
Data607: Web APIs Author Anthony Josue Roman Introduction This assignment explains how to use Web APIs to retrieve and analyze data. More precisely, it covers using the New York Times APIs to access real-time, structured data through programmatic requests. By constructing an interface...
5820 sym Python (5655 sym/8 pcs)
Data607: Project 3 - Data Science Skills
Data607: Project 3 - Data Science Skills Author Anthony Roman, John Ferrara, Alinzon Simon, Akeem Lawrence, Ben Wolin Introduction The aim of this project is to find out which data science skills are most in demand in today's booming labor market. Data science has become a highly important field in all sectors, where innovation and d...
13468 sym Python (13149 sym/38 pcs) 4 img 2 tbl
Data607: Working with XML and JSON in R
This assignment will use the following packages: library(bslib) library(readr) library(RCurl) library(stringr) library(dplyr) library(tidyr) library(tidyverse) library(ggplot2) library(knitr) library(kableExtra) library(xml2) library(rvest) library(jsonlite) Overview The purpose of this assignment is to work with HTML, XML, and JSON f...
1887 sym R (3062 sym/19 pcs) 3 tbl
Data607 - Project 2: Data Transformation with Astronomical Data
This assignment will use the following packages: library(readr) library(RCurl) library(stringr) library(dplyr) library(tidyr) library(tidyverse) library(ggplot2) library(knitr) library(kableExtra) Introduction Astronomy has played a key role in the advancement of human civilization and in our understanding of the universe. Exoplanets, planets not apart...
12047 sym R (11294 sym/29 pcs) 7 img