Publications by Jeroen Ooms
jsonlite 0.9.22: distinguish between double and integer
Today a new version of the jsonlite package was released to CRAN. This update includes a few internal enhancements and one new feature. Doubles vs integers: The new always_decimal parameter forces doubles to be formatted in decimal notation, that is, with at least one digit to the right of the decimal point. This allows us to distinguish them from intege...
1783 sym R (537 sym/3 pcs) 2 img
New in Magick 0.3
A new version of the rOpenSci magick package has been released to CRAN. Magick is a package for advanced image processing in R. It wraps the ImageMagick STL, which is perhaps the most comprehensive open-source image processing library available today. Our original announcement has more details. New features: This new version now includes a beautifu...
1222 sym R (27 sym/1 pcs) 2 img
Hunspell 2.0: High-Performance Stemmer, Tokenizer, and Spell Checker for R
A new version of the rOpenSci hunspell package has been released to CRAN. Hunspell is the spell checker library used by LibreOffice, OpenOffice, Mozilla Firefox, Google Chrome, Mac OS-X, InDesign, Opera, RStudio and many others. It provides a system for tokenizing, stemming and spell checking in almost any language or alphabet. The R package exposes bo...
1428 sym R (75 sym/2 pcs) 2 img
New package graphql: A GraphQL Query Parser
The new rOpenSci graphql package is now on CRAN. It implements R bindings to the libgraphqlparser C++ library to parse GraphQL syntax and export the syntax tree in JSON format: graphql2json("{ field(complex: { a: { b: [ $var ] } }) }") A syntax parser is perhaps not very useful to most end users, but it can be used to validate GraphQL queries or i...
1645 sym R (130 sym/3 pcs)
Encryption and Digital Signatures in R using GPG
A new package, gpg, has appeared on CRAN. From the package description: Bindings to GnuPG for working with OpenPGP (RFC4880) cryptographic methods. Includes utilities for public key encryption, creating and verifying digital signatures, and managing your local keyring. Note that some functionality depends on the version of GnuPG that is installed ...
1958 sym R (1702 sym/4 pcs)
The new Tesseract package: High Quality OCR in R
Optical character recognition (OCR) is the process of extracting written or typed text from images such as photos and scanned documents into machine-encoded text. The new rOpenSci package tesseract brings one of the best open-source OCR engines to R. This enables researchers or journalists, for example, to search and analyze vast numbers of docum...
4754 sym R (377 sym/4 pcs) 6 img
High Performance CommonMark and GitHub Markdown Rendering in R
This week the folks at GitHub open sourced their fork of libcmark (based on the extensive PR by Mathieu Duponchelle), which they use to render markdown text within documents, issues, comments and anything else on the GitHub website. The new release of the commonmark R package incorporates this library so that we can take advantage of Github ...
3128 sym R (47 sym/1 pcs) 6 img
Tesseract Update: Options and Languages
A few weeks ago we announced the first release of the tesseract package: a high quality OCR engine in R. We have now released an update with extra features. Installing Training Data: As explained in the first post, the tesseract system is powered by language-specific training data. By default, only English training data is installed. Version 1.3 ad...
1716 sym R (412 sym/5 pcs)
Announcing pdftools 1.0
This week we released version 1.0 of the rOpenSci pdftools package to CRAN. Pdftools provides utilities for extracting text, fonts, attachments and other data from PDF files. It also supports rendering PDF files into bitmap images. This release has a few internal enhancements and fixes an annoying bug for landscape PDF pages. The version bump ...
1816 sym R (825 sym/5 pcs)
Update jsonlite 1.2
A new version of the jsonlite package has been released to CRAN. This is a maintenance release with enhancements and bug fixes. A summary of changes in v1.2 from the NEWS file: Add read_json and write_json convenience wrappers, #161; Update modp_numtoa from upstream, fixes a rounding issue in #148; Ensure asJSON.POSIXt does not use sci notation for negative values, #1...
1646 sym R (654 sym/2 pcs)