Publications by Michael J Bommarito II
Connecting R to an Oracle database with RJDBC
In many circumstances, you might want to connect R directly to a database to store and retrieve data. If the source database is an Oracle database, you have a number of options: ROracle, RODBC, and RJDBC. Using ROracle should theoretically provide you with the best performing client, as this library is a wrapper around the Oracle OCI driver. Th...
Revisiting text processing with R and Python
Back in 2011, I covered the relative performance difference of the most popular libraries for text processing in R and Python. In case you can’t guess the answer, Python and NLTK won by a significant margin over R and tm. Text processing with R seemed simple on paper, but performance and flexibility limitations have kept me away sinc...
Plotting Oracle RMAN backup durations with R
How long does your Oracle RMAN backup take to complete? How does this vary over time? Are there patterns by week, week of month, or day of week? The gist below can help you evaluate questions like these. If you’re interested in using R with Oracle more generally, check out this previous post on connecting R to an Oracle database wit...
Plotting average read and write operation size by ASM disk for Oracle
Throughput, throughput, throughput – for many databases, this is the performance measure of importance. When you are working with a fixed number of IOPS but see mixed workload types, system health can be assessed through the average read and write operation size. In an ASM environment, we can query this information by ASM disk from gv$as...
Is the Tax Code the longest Title?
Last week, I shared that Dan Katz and I had finally published a draft of our paper, Measuring the Complexity of the Law: The U.S. Code. We’d previewed this research on Computational Legal Studies years ago. Since then, we’ve received great feedback and a number of questions. The most common question, even among legal professionals...
ipython notebook for R: Quickstart for Ubuntu
If you’re like me, you love ipython notebook but often write R. RStudio’s integrated RMarkdown is nice, but for some contexts like quick demos or basic training, a browser-based interface is unbeatable. What if we could get the best of both worlds – an ipython notebook for R? The answer is rNotebook, and if you haven’t seen it yet, ta...