Publications by Dell
ELO calculations
This is a continuation of my first project of Chess tournament, the chunks below are essential for the next part of this project where the data is loaded, cleaned and tidying to move for the ELO calculations and make a comparison between the rates of chess players. # List of packages to install packages <- c("RCurl", "knitr", "kableExtra", "tid...
5182 sym R (12839 sym/21 pcs) 1 img 2 tbl
NobelPrize
##Introduction Working with the two JSON files available through the API at nobelprize.org, ask and answer 4 interesting questions, e.g. “Which country “lost” the most nobel laureates (who were born there but received their Nobel prize as a citizen of a different country)?” #Preparing the dataset Loading Required Packages: The code s...
3825 sym R (5105 sym/22 pcs)
Israel vaccination analysis
##Introduction This project analyzes the severe disease rates per 100K people by vaccination status across different age groups. The dataset includes metrics for vaccinated and unvaccinated populations, focusing on identifying trends and differences in severe case rates. The goal is to highlight the impact of vaccination on reducing severe case...
3188 sym R (8158 sym/54 pcs) 1 img
Final Project D607
Introduction The U.S. Chronic Disease Indicators (CDI) dataset is a comprehensive resource that includes data on various chronic diseases, including cardiovascular disease, diabetes, cancer, and more in a time period of 2000 to 2020. The main goal of this analysis is to explore and understand the trends and patterns of these chronic diseases ac...
5950 sym R (12553 sym/39 pcs) 3 img
Recommender System
Introduction In today’s world of digital media, recommendation systems play a crucial role in enhancing user experience by suggesting relevant content. Among various techniques, the Global Baseline Estimate (GBE) approach offers a non-personalized yet effective method for predicting user preferences. This project aims to implement a GBE recom...
3330 sym R (7668 sym/38 pcs)
Project4 D607
##Introduction In today’s digital age, email communication is ubiquitous, yet it brings with it the persistent challenge of spam. Spam emails not only clutter our inboxes but also pose significant security risks. This project aims to develop a robust spam classification system using a dataset from the SpamAssassin Public Corpus. By leveraging...
2839 sym R (5886 sym/35 pcs) 1 img
Final Project D606
#Introduction This project examines mortality trends using the New York City Leading Causes of Death dataset. The analysis aims to uncover demographic disparities, temporal trends, and potential public health insights by evaluating the relationship between leading causes of death and various demographic and temporal factors. #Research Question ...
8783 sym R (43778 sym/86 pcs) 10 img
Lab 09 - Multiple linear regression
Grading the professor Many college courses conclude by giving students the opportunity to evaluate the course and the instructor anonymously. However, the use of these student evaluations as an indicator of course quality and teaching effectiveness is often criticized because these measures may reflect the influence of non-teaching related char...
17365 sym 16 img
Week 10 Assignment
Going through Sentiment analysis This study looks at the feelings in Edgar Allan Poe’s writings using computer-based text analysis. Known for his dark and mysterious style, Poe’s works are perfect for sentiment analysis. The goal is to break down, clean, and analyze his texts with the NRC word-emotion lexicon to find emotional patterns. Usi...
2461 sym R (3120 sym/21 pcs) 2 img
Assignment Week 9 Tidyverse
For this assignment dataset used is a Dataset that is a collection of hotel bookings information that includes various details about each booking, such as the hotel type, booking dates, customer demographics, booking status, and more. This dataset is useful for analyzing trends in hotel bookings, cancellations, and customer behavior. Dataset ex...
1764 sym R (8140 sym/15 pcs) 1 img