Publications by Mark Edney

Text Prediction Shiny App pt 2

07.06.2022

Description: This is the second part of a series on creating a text prediction Shiny application. In the previous post, we developed a corpus of text to start creating text prediction applications. We also explored the corpus, looking at the frequency of words in the vocabulary. It is now time to start developing n-gram models. N-gram mo...

9647 sym R (4419 sym/21 pcs) 58 img

Webscraping in R with Rvest

21.06.2022

Web scraping has become an incredibly important tool in data science, as an easy way to generate new data. The main advantage is the automation of some fairly repetitive tasks. Web scraping can also be a good way of keeping up with new data on a website, assuming its HTML structure doesn’t change significantly. Introduction: This project is...

5843 sym R (994 sym/13 pcs) 2 img

Relationship Extraction with Spacyr

03.07.2022

This is the continuation of the previous project, where we scraped the Cooper Mind website with the rvest package. Please refer to that post for the necessary steps to obtain the verified character names. As a reminder, this project was inspired by the work of Thu Vu, who created a network map of the characters in the Witcher series....

8835 sym R (1931 sym/18 pcs) 2 img

Network Graphs in R

11.07.2022

Introduction: Network graphs are an important tool for network analysis. They illustrate points, referred to as nodes, with connecting lines, referred to as edges. Because network graphs are so useful, there are many options for generating them. In this post, I will demonstrate three different techniques for developing network graphs in ...

6114 sym R (1564 sym/9 pcs) 8 img