Publications by Gary Hutson

Data.Table – everything you need to know to get you started in R

03.03.2021

I will take you through, step by step, how to use the data.table package and compare it with base R operations, to show the performance gains you get from this optimised package. Load in data.table: to load the package, follow the instructions below: #install.packages("data.table") library(data.table) You should now have everything yo...

7834 sym R (13816 sym/30 pcs) 12 img
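
As a rough illustration of the kind of comparison the data.table entry above describes, the sketch below computes a grouped mean once with base R and once with data.table, then times both. The simulated data and the names `df`, `dt`, `base_res` and `dt_res` are assumptions made for this example and are not taken from the post. Timings will vary by machine.

```r
library(data.table)

# Simulated data (an assumption for illustration, not the post's dataset)
set.seed(42)
n <- 1e5
df <- data.frame(
  group = sample(letters[1:5], n, replace = TRUE),
  value = runif(n)
)
dt <- as.data.table(df)

# Base R: mean of value per group
base_res <- aggregate(value ~ group, data = df, FUN = mean)

# data.table: the same grouped mean, sorted by group so the two results line up
dt_res <- dt[, .(value = mean(value)), by = group][order(group)]

# Compare elapsed time for each approach
system.time(aggregate(value ~ group, data = df, FUN = mean))
system.time(dt[, .(value = mean(value)), by = group])
```

Both approaches should return the same five group means; the difference is in how long each takes as `n` grows.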

A function to speed up and simplify writing to SQL Server databases in R

09.03.2021

I had a recent enquiry on our NHS-R community Slack channel about which package is best for working with larger datasets, such as 5 million plus records, with high dimensionality. I got to thinking that the DBI and odbc packages would be best for this purpose, as this is still not considered big data; it is more medium data with manageable dimensiona...

4207 sym R (2517 sym/5 pcs) 4 img
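
A minimal sketch of what a helper for writing to SQL Server with DBI and odbc might look like is given below. The function name `write_to_sql_server`, its arguments, and the connection details in the commented usage are all hypothetical; this is not the function from the post. It assumes the DBI and odbc packages are installed and that a suitable ODBC driver is available.

```r
# Hypothetical helper (name and arguments are illustrative, not the post's function).
# Writes a data frame to a table over an existing DBI connection.
write_to_sql_server <- function(con, table_name, data, overwrite = FALSE) {
  stopifnot(is.data.frame(data))
  DBI::dbWriteTable(
    con,
    name = table_name,
    value = data,
    overwrite = overwrite,
    append = !overwrite  # append unless the caller asked to overwrite
  )
  invisible(nrow(data))  # return the number of rows sent, invisibly
}

# Usage sketch: driver, server and database names are placeholders
# con <- DBI::dbConnect(odbc::odbc(),
#                       Driver = "SQL Server",
#                       Server = "my_server",
#                       Database = "my_database",
#                       Trusted_Connection = "yes")
# write_to_sql_server(con, "my_table", mtcars, overwrite = TRUE)
# DBI::dbDisconnect(con)
```

Wrapping `DBI::dbWriteTable()` this way keeps the overwrite-or-append decision in one place, so calling code only has to pass a connection, a table name and a data frame.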

NHSDataDictionaRy package has arrived on CRAN

10.03.2021

Thanks to the NHS-R community I have had time to work on another package, due to their pledge to get more packages in R funded. A big thanks to Mohammed Amin Mohammed and all the R community team. This package utilises all the excellent lookups provided by NHS Digital and the NHS Data Dictionary and allows you to access these lookups in one plac...

3734 sym R (2641 sym/1 pcs) 2 img

NHSDataDictionaRy webinar date and other brilliant webinars

17.03.2021

This week is an exciting week. Why is this? Well, an addition to the NHS-R community's packages has been launched – the NHSDataDictionaRy – now on CRAN. To learn how to use this please look below. I am so excited about the launch of this package and I really think it will help to standardise NHS lookups for R users in the NHS. Where do I fin...

3390 sym 4 img

Text Mining – Term Frequency analysis and Word Cloud creation using the tm package

17.03.2021

Analysing the pre-conference workshop sentiments: I think back to a post I put on the NHS-R community website, but never posted on my own site, as I have been doing some textual analysis recently and referenced this post again. The driver for this is the tm package, which is still one of the main packages in R, but it assumes a non-tidy forma...

7095 sym R (2045 sym/11 pcs) 14 img
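
To give a feel for the term frequency workflow the tm entry above refers to, here is a small sketch using made-up sentences. The sample text and the object names `corpus`, `tdm` and `freq` are assumptions for illustration and are not the workshop data from the post. The word cloud call is left commented out because it needs the separate wordcloud package.

```r
library(tm)

# Made-up sample text (an assumption; not the post's workshop feedback)
text <- c(
  "R is great for text mining",
  "Text mining with the tm package",
  "Word clouds show term frequency"
)

# Build a corpus and apply common cleaning steps
corpus <- VCorpus(VectorSource(text))
corpus <- tm_map(corpus, content_transformer(tolower))
corpus <- tm_map(corpus, removePunctuation)
corpus <- tm_map(corpus, removeWords, stopwords("english"))
corpus <- tm_map(corpus, stripWhitespace)

# Term-document matrix, then total frequency of each term across documents
tdm <- TermDocumentMatrix(corpus)
freq <- sort(rowSums(as.matrix(tdm)), decreasing = TRUE)
head(freq)

# Optional word cloud (requires the wordcloud package)
# wordcloud::wordcloud(names(freq), freq, min.freq = 1)
```

The term-document matrix is the non-tidy structure tm works with: terms as rows, documents as columns, and counts in the cells.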

ConfusionTableR package has a new function

06.04.2021

The ConfusionTableR package has a new function. Welcome to var_impeR, which takes a trained caret R model and produces a tibble and a supporting variable importance plot. How to use the new var_impeR function: the following code shows how to use the new function. Training a CARET model: the following steps were used on the NHSRDatasets package to t...

1592 sym R (1178 sym/2 pcs) 6 img

Deploying a CARET Machine Learning model as an API with Plumber

15.04.2021

I have deployed many models in my time working as a Senior Data Scientist and as a Principal Analyst at Nottingham University Hospitals. I have deployed into SQL, on client sites, in Node.js and in other types of deployment. This was the first time I used the Plumber library, and I loved it. How to deploy your model as an API: my new YouTube page is...

2378 sym 2 img

NHSDataDictionaRy – R package launch and how to use it

22.04.2021

The NHS-R community commissioned a package at the start of the year to address the need to pull lookups in real time from the NHS Data Dictionary website. I thought this would be a good package, but what I have included, in addition, is the ability to scrape tables and text from any website. What are the benefits of the NHSDataDictionaRy? In the pre...

2016 sym 4 img

NHSDataDictionaRy is back on CRAN

06.05.2021

The NHSDataDictionaRy package is now back on CRAN, and I am pleased as punch. This update contains the OpenSafely scraper to get data from the website for lookups developed by Ben Goldacre’s team. Why did it disappear? The package disappeared because I took it down for major script and function updates. This has now been done and it is back...

2027 sym 2 img

Deploying trained CARET Machine Learning Model with Docker and Plumber

14.05.2021

I have been busy learning Docker, as it was one of those things I had been putting off for a while. But I am so glad that I did. Essentially, I have gone through all the pain so you don’t have to. Building the ML Model and creating an API with Plumber: I can’t tell you how much I love Plumber. This tool allows you to create REST APIs so quickly...

2542 sym 4 img