Publications by jojoecp

Lyric's mood identifier

11.05.2020

Music is not only able to affect your mood. listening to particularly happy or sad music can even change the way we perceive the world, according to researchers from the University of Groningen. In this modern world we have an ability to choose what music we want to listen easily. Some music player platform such as Spotify are known to its music ...

7759 sym R (22391 sym/50 pcs) 1 img

Detecting Defected Items

27.04.2020

Classification case, Anomaly Detection for Detecting Defected Manufactured Semi-Conductors. In this project we propose machine learning techniques to automatically generate an accurate predictive model to predict equipment faults during the wafer fabrication process of the semiconductor industries. Aim at constructing a decision model to help det...

7945 sym R (29559 sym/94 pcs) 1 img

Is Alcohol Consumption Make You Happy?

15.04.2020

In this article we will try to do regression to analyze “Is alcohol consumption makes you happy?”. This project based on data availiable in this kaggle. The dataset contain not only alcohol consumption like beer, wine, and spirit, but also social indexes like GDP, HDI, region, and hemisphere as comparison. The HappinessScore itself is measure...

9164 sym R (12338 sym/32 pcs) 6 img

Wine Quality

12.04.2020

In this article we will try to do regression to analyze the quality of wine based on this kaggle dataset. These datasets can be viewed as classification or regression tasks. The classes are ordered and not balanced (e.g. there are much more normal wines than excellent or poor ones). Background Objective We’ll try to build linear regression ...

8271 sym R (7750 sym/30 pcs) 8 img

Dota Science #1

07.04.2020

You know what’s Dota right? DotA stands for Defends of the Ancients. Dota 2 is a multiplayer online battle arena (MOBA) video game developed and published by Valve. Dota 2 is played in matches between two teams (called Radiant and Dire) of five players, with each team occupying and defending their own separate base on the map. Each of the ten p...

11577 sym R (82025 sym/61 pcs) 13 img

Replicating economist plot

22.02.2020

In this article, we will try to replicate/remaking a plot from The Economist article using ggplot2 package. You’ll see all the packages that will be used in this code under the Libraries tab. The article and dataset can be read and download in the Article tab. Thank you! 1 Background 1.1 Article This article is titled “Emissions from energy...

2533 sym R (11949 sym/43 pcs) 12 img

Dota Science #2

08.04.2020

For this second part of DotaScience, we’ll do unsupervised learning: Clustering and Principal Component Analysis (PCA). Dota 2 is a multiplayer online battle arena (MOBA) video game developed and published by Valve. Dota 2 is played in matches between two teams (called Radiant and Dire) of five players, with each team occupying and defending th...

11345 sym R (37191 sym/39 pcs) 12 img

JetRail Passenger Forecast

29.06.2020

There is a company that consider making an investment in a new form of transportation - JetRail. JetRail uses Jet propulsion technology to run rails and move people at a high speed! While JetRail has mastered the technology and they hold the patent for their product, the investment would only make sense, if they can get more than 1 Million monthl...

12162 sym R (21637 sym/67 pcs) 17 img

Topic Modeling with LDA

26.07.2020

Natural Languange Processing has a wide area of knowledge and implementation, one of them is Topic Model. Topic Model is a type of statistical model for discovering the abstract “topics” that occur in a collection of documents. Topic modeling is a frequently used text-mining tool for discovery of hidden semantic structures in a text body. For...

14629 sym R (21814 sym/88 pcs) 18 img

IMDb Web Scraping

06.07.2020

In this markdown, we will do a web scraping technique using rvest package. Web scraping or web harvesting or web data exctraction is data scraping used for extracting data from websites. Web scraping software may access the World Wide Web directly using the Hypertext Transfer Protocol, or through a web browser. We will scrap data from IMDb top 25...

11528 sym R (17767 sym/32 pcs) 12 img