Publications by jojoecp
Lyric's mood identifier
Music can do more than affect your mood: listening to particularly happy or sad music can even change the way we perceive the world, according to researchers from the University of Groningen. Today we can easily choose what music we want to listen to. Some music player platforms, such as Spotify, are known for their music ...
7759 sym R (22391 sym/50 pcs) 1 img
Detecting Defected Items
A classification case: anomaly detection for identifying defective manufactured semiconductors. In this project we propose machine learning techniques to automatically generate an accurate model for predicting equipment faults during the wafer fabrication process in the semiconductor industry. The aim is to construct a decision model to help det...
7945 sym R (29559 sym/94 pcs) 1 img
Is Alcohol Consumption Make You Happy?
In this article we will use regression to analyze the question “Does alcohol consumption make you happy?”. This project is based on data available on Kaggle. The dataset contains not only alcohol consumption figures for beer, wine, and spirits, but also social indicators such as GDP, HDI, region, and hemisphere for comparison. The HappinessScore itself is measure...
9164 sym R (12338 sym/32 pcs) 6 img
Wine Quality
In this article we will use regression to analyze wine quality based on this Kaggle dataset. These datasets can be treated as either classification or regression tasks. The classes are ordered and not balanced (e.g. there are many more normal wines than excellent or poor ones). Background Objective We’ll try to build linear regression ...
8271 sym R (7750 sym/30 pcs) 8 img
Dota Science #1
You know what Dota is, right? DotA stands for Defense of the Ancients. Dota 2 is a multiplayer online battle arena (MOBA) video game developed and published by Valve. Dota 2 is played in matches between two teams (called Radiant and Dire) of five players, with each team occupying and defending their own separate base on the map. Each of the ten p...
11577 sym R (82025 sym/61 pcs) 13 img
Replicating economist plot
In this article, we will try to replicate a plot from an article in The Economist using the ggplot2 package. All the packages used in this code are listed under the Libraries tab. The article and dataset can be read and downloaded in the Article tab. Thank you! 1 Background 1.1 Article This article is titled “Emissions from energy...
2533 sym R (11949 sym/43 pcs) 12 img
Dota Science #2
For this second part of DotaScience, we’ll do unsupervised learning: Clustering and Principal Component Analysis (PCA). Dota 2 is a multiplayer online battle arena (MOBA) video game developed and published by Valve. Dota 2 is played in matches between two teams (called Radiant and Dire) of five players, with each team occupying and defending th...
11345 sym R (37191 sym/39 pcs) 12 img
JetRail Passenger Forecast
A company is considering an investment in a new form of transportation: JetRail. JetRail uses jet propulsion technology to run rails and move people at high speed! While JetRail has mastered the technology and holds the patent for its product, the investment would only make sense if they can get more than 1 million monthl...
12162 sym R (21637 sym/67 pcs) 17 img
Topic Modeling with LDA
Natural Language Processing covers a wide range of knowledge and applications, one of which is topic modeling. A topic model is a type of statistical model for discovering the abstract “topics” that occur in a collection of documents. Topic modeling is a frequently used text-mining tool for discovering hidden semantic structures in a text body. For...
14629 sym R (21814 sym/88 pcs) 18 img
IMDb Web Scraping
In this markdown, we will demonstrate web scraping using the rvest package. Web scraping, also called web harvesting or web data extraction, is data scraping used for extracting data from websites. Web scraping software may access the World Wide Web directly using the Hypertext Transfer Protocol, or through a web browser. We will scrape data from IMDb top 25...
11528 sym R (17767 sym/32 pcs) 12 img