Publications by git-comp
Capstone Exploratory Data Analysis
Exploratory Data Analysis: In this assignment, I investigate three files (en_US.blogs.txt, en_US.news.txt, en_US.twitter.txt) to determine how large they are, what they contain, and how often words and word combinations occur (bigrams: how often does a combination of two words appear; trigrams: how often does a combination of three words app...
1537 sym R (3766 sym/11 pcs) 3 img
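The bigram and trigram counts described in this entry can be sketched in base R. This is only an illustration on two made-up sentences, not the author's code: the helper name count_ngrams, the cleaning rule, and the sample lines are all assumptions introduced here.

```r
# Minimal n-gram counter in base R (illustrative sketch, not the author's code).
# `count_ngrams` is a hypothetical helper introduced for this example.
count_ngrams <- function(lines, n) {
  # Lower-case the text and replace everything except letters,
  # apostrophes and spaces with a space
  cleaned <- gsub("[^a-z' ]", " ", tolower(lines))
  grams <- unlist(lapply(strsplit(cleaned, " +"), function(tokens) {
    tokens <- tokens[nzchar(tokens)]          # drop empty tokens
    if (length(tokens) < n) return(character(0))
    starts <- seq_len(length(tokens) - n + 1) # start index of each n-gram
    vapply(starts,
           function(i) paste(tokens[i:(i + n - 1)], collapse = " "),
           character(1))
  }))
  # Frequency table, most frequent n-gram first
  sort(table(as.character(grams)), decreasing = TRUE)
}

# Made-up sample text standing in for lines read from the corpus files
sample_lines <- c("the cat sat on the mat", "the cat sat down")
bigrams  <- count_ngrams(sample_lines, 2)
trigrams <- count_ngrams(sample_lines, 3)
head(bigrams)
head(trigrams)
```

For the real files, the lines would come from something like readLines() on each file, and a sample of lines is usually enough for exploratory counts.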
Coursera "Developing Data Products", Week 4
24 5 2021 Context, Data, Sources. Context: For this assignment, I have used official COVID-19 data for Switzerland available as of May 21, 2021. I have generated an interactive overview of COVID-19 cases per calendar week from Feb 2020 until May 2021 per state (called canton in Switzerland). In the Shiny UI, checkboxes allow the selection of indivi...
712 sym R (1152 sym/2 pcs) 1 img
COVID-19 cases in Switzerland per region and calendar week
23 5 2021 Playground for Slides with Plotly. In this presentation, I am playing around with Plotly within generated slides. Everything is compiled with knitr from within RStudio; plots should be interactive. Context, Data, Sources. Context: For this assignment, I have used official COVID-19 data for Switzerland available as of today. I try to ge...
676 sym R (1438 sym/2 pcs)
Coursera Statistical Inference, Week 4, Project Part 2 (Basic Inferential Data Analysis)
Part 2: Basic Inferential Data Analysis. Introduction & Synopsis: In this project, I will analyze the ToothGrowth data and try to draw conclusions. Load data:
    ## load datasets and ggplot2 libraries
    library(datasets)
    library(ggplot2)
    ## Show first lines
    head(ToothGrowth)
    ##    len supp dose
    ## 1  4.2   VC  0.5
    ## 2 11.5   VC  0.5
    ## 3  7.3 ...
1084 sym R (3454 sym/24 pcs) 2 img
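One way to draw conclusions from ToothGrowth is a two-sample t-test on tooth length by supplement type. The excerpt does not show which tests the author ran, so the Welch test below is an assumption, and the variable name supp_test is introduced only for this sketch.

```r
# Illustrative sketch: Welch two-sample t-test of tooth length (len)
# between the two supplement types (supp) in the built-in ToothGrowth data.
# The choice of test is an assumption; the excerpt does not show the author's.
library(datasets)

supp_test <- t.test(len ~ supp, data = ToothGrowth)  # var.equal = FALSE by default

supp_test$estimate   # group means for OJ and VC
supp_test$conf.int   # 95% confidence interval for the difference in means
supp_test$p.value    # p-value for the null hypothesis of equal means
```

If the confidence interval contains zero, the data do not rule out equal mean tooth length for the two supplements at the 5% level. The same call on subsets of dose would compare dosage levels.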
Coursera Statistical Inference, Week 4, Project Part 1 (Simulation Exercise)
Part 1: Simulation Exercise. Introduction & Synopsis: In this project, I will investigate the exponential distribution in R and compare it with the Central Limit Theorem. I will run 1000 simulations of 40 exponentials each. Simulation:
    ## load ggplot
    library(ggplot2)
    ## set seed for reproducibility and parameters according to the assignment ...
804 sym R (1289 sym/8 pcs) 2 img
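The simulation described in this entry can be sketched as follows. The values 40 and 1000 come from the excerpt, but the rate lambda = 0.2 and the seed are assumptions made here, since the excerpt is cut off before the parameters appear.

```r
# Illustrative sketch of the simulation: means of 40 exponentials, 1000 times.
set.seed(2021)   # arbitrary seed chosen for this sketch
lambda <- 0.2    # ASSUMED rate; the excerpt does not show the actual value
n      <- 40     # exponentials per simulation (from the text)
sims   <- 1000   # number of simulations (from the text)

# One row per simulation; the row mean is that simulation's sample mean
draws        <- matrix(rexp(n * sims, rate = lambda), nrow = sims)
sample_means <- rowMeans(draws)

# The exponential distribution has mean 1/lambda and sd 1/lambda, so the
# sample mean should have mean 1/lambda and sd (1/lambda)/sqrt(n)
theoretical_mean <- 1 / lambda
theoretical_sd   <- (1 / lambda) / sqrt(n)

c(observed = mean(sample_means), theoretical = theoretical_mean)
c(observed = sd(sample_means),   theoretical = theoretical_sd)
```

A histogram of sample_means with a normal density overlaid would show how close the distribution of means is to the shape the Central Limit Theorem predicts.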
Coursera Reproducible Research, Week 4, Project 2
Synopsis: Severe weather events like storms have a significant impact on population and economy. With this analysis, I investigate two types of harmful impacts: (1) injuries and fatalities and (2) economic consequences based on property damage and crop damage. As a basis, the NOAA Storm Database will be used, with additional documentation from NOAA ...
2266 sym R (5942 sym/44 pcs) 2 img
leaflet Playground
Playing around with leaflet()
    library(leaflet)
    map = leaflet() %>%
      addTiles() %>%
      addMarkers(lat=45.97642, lng=7.65852, popup="My favorite mountain: The Swiss Matterhorn") %>%
      addMarkers(lat=46.0086953, lng=7.735334, popup="My favorite after-ski bar: Hennu Stall")
    map ...
39 sym R (239 sym/1 pcs)
Capstone Project
My word Predictor, git-comp, Sept & Oct 2021. Goal of the app: This project summarizes my efforts from the Data Science Capstone project on Coursera. An easy-to-use app is developed (based on Shiny) to showcase the results of the predictive model on sentences with up to four words (4-gram). It focuses on English, using blogs, news and Twitter data to...
1717 sym