Publications by rOpenSci Blog - R

OA week – A simple use case for programmatic access to PLOS full text

21.10.2013

Open access week is here! We love open access, and think it's extremely important to publish in open access journals. One of the many benefits of open access literature is that we likely can use the text of articles in OA journals for many things, including text-mining. What's even more awesome is some OA publishers provide API (application pro...

1950 sym R (2636 sym/6 pcs) 6 img

Interactive maps with polygons using R, Geojson, and Github

22.10.2013

Previously on this blog we have discussed making geojson maps and uploading to Github for interactive visualization with USGS BISON data, and with GBIF data, and on my own personal blog. This is done using a file format called geojson, a file format based on JSON (JavaScript Object Notation) in which you can specify geographic data along with any...

2641 sym R (442 sym/3 pcs)

Species occurrence data to CartoDB

04.11.2013

We have previously written about creating interactive maps on the web from R, with the interactive maps on Github. See here, here, here, and here. A different approach is to use CartoDB, a freemium service with sql interface to your data tables that provides a map to visualize data in those tables. They released an R interace to their sql API on ...

2161 sym R (1730 sym/6 pcs)

taxize changes

19.11.2013

We are building a taxonomic toolbelt for R called taxize – which gives you programmatic access to many sources of taxonomic data on the web. We just pushed a new version to CRAN (v0.1.5) with a lot of changes (see here for a rundown). Here are a few highlights of the changes. Note: the windows binary may not be available yet… Install and loa...

2087 sym R (10726 sym/9 pcs)

rgbif changes in v0.4

21.11.2013

The Global Biodiversity Information Facility (GBIF) is a warehouse of species occurrence data – collecting data from a lot of different sources. Our package rgbif allows you to interact with GBIF from R. We interact with GBIF via their Application Programming Interface, or API. Our last version on CRAN (v0.3) interacted with the older version o...

2742 sym R (11809 sym/14 pcs)

Open Science with R

02.12.2013

Upcoming Book on Open Science with R We're pleased to announce that the rOpenSci core team has just signed a contract with CRC Press/Taylor and Francis R series to publish a new book on practical ways to implement open science into your own research using R. Given all the talk about the importance of open science, the discussion often lacks prac...

1960 sym

Highlighting text in text mining

02.12.2013

rplos is an R package to facilitate easy search and full-text retrieval from all Public Library of Science (PLOS) articles, and we have a little feature which aren't sure if is useful or not. I don't actually do any text-mining for my research, so perhaps text-mining folks can give some feedback. You can quickly get a lot of results back using r...

1783 sym R (854 sym/3 pcs) 2 img

solr – an R interface to Solr

27.01.2014

A number of the APIs we interact with (e.g., PLOS full text API, and USGS's BISON API in rplos and rbison, respectively) expose Solr endpoints. Solr is an Apache hosted project – it is a powerful search server. Given that at least two, and possibly more in the future, of the data providers we interact with provide Solr endpoints, it made sense...

3760 sym R (14599 sym/18 pcs)

Introducing the ecoengine package

30.01.2014

Natural history museums have long been valuable repositories of data on species diversity. These data have been critical for fostering and shaping the development of fields such as biogeography and systematics. The importance of these data repositories is becoming increasingly important, especially in the context of climate change, where a stron...

5783 sym R (7810 sym/13 pcs) 4 img

Caching API calls offline

02.02.2014

I've recently heard the idea of “offline first” via especially Hood.ie. We of course don't do web development, but primarily build R interfaces to data on the web. Internet availablility is increasinghly ubiqutous, but there still are times and places where you don't have internet, but need to get work done. In the R packages we write there ...

5185 sym R (2078 sym/6 pcs)