Publications by Super User

Automate R processes

21.03.2018

Last week we updated the cronR R package and released it to CRAN, allowing you to schedule any R code at whichever point in time you like. The package was updated to comply with stricter CRAN policies on writing to folders. Along the way, the RStudio add-in of the package was also updated. It now looks as shown below and is tailor...

1370 sym 2 img

An overview of keyword extraction techniques

03.04.2018

In this blog post, we show 6 keyword extraction techniques for finding keywords in plain text. Keywords are frequently occurring words which tend to appear together in plain text. Common examples are New York, Monte Carlo, Mixed Models, Brussels Hoofdstedelijk Gewest, Public Transport, Central Station, p-values, … If you master these ...

6316 sym R (4336 sym/8 pcs) 12 img
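
The idea of keywords as words that frequently occur together can be illustrated with a minimal base R sketch. This is not one of the 6 techniques from the post, only a plain count of adjacent word pairs. The toy sentences and the helper name count_bigrams are assumptions made up for illustration.

```r
# Toy corpus; the sentences are invented for illustration only.
docs <- c(
  "the central station in new york is close to public transport",
  "public transport in new york stops at the central station",
  "new york has a large central station"
)

# Hypothetical helper: count adjacent word pairs within each document.
count_bigrams <- function(texts) {
  pairs <- unlist(lapply(texts, function(txt) {
    # Lowercase and split on anything that is not a letter.
    words <- strsplit(tolower(txt), "[^a-z]+")[[1]]
    words <- words[nzchar(words)]
    if (length(words) < 2) return(character(0))
    # Pair every word with the word that follows it.
    paste(head(words, -1), tail(words, -1))
  }))
  sort(table(pairs), decreasing = TRUE)
}

bigrams <- count_bigrams(docs)

# Keep pairs seen at least twice as keyword candidates.
candidates <- names(bigrams)[bigrams >= 2]
print(candidates)
```

On this toy corpus, raw counts keep pairs such as "new york" and "central station", but also pairs of function words such as "in new". That is one reason practical methods usually filter on part of speech or use an association measure instead of a bare frequency threshold.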

Basic R Automation

11.05.2018

Last Wednesday, a small presentation was given at the RBelgium meetup in Brussels on Basic R Automation. For those of you who could not attend, here are the slides of that presentation which showed the use of the cronR and taskscheduleR R packages for automating basic R scripts. If you are interested in setting up a project for more advanced ways...

945 sym

Upcoming public courses on Text mining with R, Statistical machine learning with R, Applied Spatial Modelling with R, Advanced R programming, Computer Vision and Image Recognition

06.09.2018

I’m happy to announce that the following list of courses for R users is ready to be booked. All courses are face-to-face courses held in Belgium. 08-09/10/2018: Text mining with R. Brussels (Belgium). http://di-academy.com/bootcamp + send mail to [email protected] 15-16/10/2018: Statistical machine learning with R. Leuven (Belgium). Subscrib...

1403 sym 2 img

How to detect hatespeech in plain text #schildnvrienden

07.09.2018

Yesterday there was a pretty controversial Pano TV documentary called ‘Wie is Schild & Vrienden echt’ on the national television channel ‘één’ (https://www.vrt.be/vrtnu/a-z/pano/2018/pano-s2018a10). The documentary revealed the internal communication of a right-wing group from Belgium, called #schildnvrienden. After that, there was a sh...

3191 sym R (1019 sym/1 pcs) 2 img

udpipe version 0.7 for Natural Language Processing (#NLP) alongside #tidytext, #quanteda, #tm

11.09.2018

This blog post announces the release of the udpipe R package version 0.7 on CRAN. udpipe is an R package which does tokenization, part-of-speech tagging, lemmatization, morphological feature tagging and dependency parsing. Its main feature is that it is a lightweight R package which works on more than 50 languages and gives you rich NLP output...

3935 sym R (2105 sym/6 pcs) 6 img

Last call for the course on text mining of next week

02.10.2018

Last call for the 2-day course on Text Mining with R, held next week (08-09 October 2018) in Brussels, Belgium. Subscribe at https://www.eventbrite.co.uk/e/dsb2018-text-mining-with-r-jan-wijffels-bnosac-session-03-04-tickets-50586501588. During the course you'll learn the following: cleaning of text data, regular expressions; string distances; G...

2202 sym 2 img

crfsuite for natural language processing

29.10.2018

A new R package called crfsuite, supported by BNOSAC, landed safely on CRAN last week. The crfsuite package (https://github.com/bnosac/crfsuite) is an R package specific to Natural Language Processing and allows you to easily build and apply models for named entity recognition, text chunking, part-of-speech tagging, intent recognition or classif...

2802 sym R (185 sym/1 pcs) 4 img

Starspace for NLP #nlproc

04.12.2018

Our latest addition to the NLP R universe is the R package ruimtehol, which is open-sourced at https://github.com/bnosac/ruimtehol. This R package is a wrapper around Starspace, which provides a neural embedding model for doing the following on text: text classification; learning word, sentence or document level embeddings; finding sentence or doc...

2876 sym R (2033 sym/1 pcs) 4 img

You did a sentiment analysis with tidytext but you forgot to do dependency parsing to answer WHY is something positive/negative

08.01.2019

A small note on the growing list of users of the udpipe R package. In the last month of 2018, we updated the package on CRAN with some noticeable changes. The default models downloaded with the function udpipe_download_model are now built on Universal Dependencies 2.3 (released on 2018-11-15). This means udpipe now has mo...

6319 sym R (2442 sym/4 pcs) 4 img