Publications by R on Redwall Analytics
How Does Stamford Charter School for Excellence do it?
Introduction: The State of Connecticut is required to test all of its students over the course of their academic careers. The data by year and school is disclosed as it becomes available at the State Department of Education’s Edsight website. In keeping with its explorations of open CT data, one of Redwall Analytics’s projects is to look for i...
Tracking R&D spending by 700 Listed US Pharma Companies – Part 2
# Re-load data previously stored for purposes of this blog post
pharma <- fread("~/Desktop/David/Projects/xbrl_investment/data/pharma_inc.csv")
Introduction: In A Walk Though of Accessing Financial Statements with XBRL in R - Part 1, we went through the first steps of pulling XBRL data for a single company from Edgar into R. Although an improv...
A Walk Though of Accessing Financial Statements with XBRL in R – Part 1
Introduction: As financial professionals and analytic software lovers, we have always had one key objective: to efficiently load a large number of financial statements and analyze them. In previous posts, Redwall Analytics worked with a 15-year time series of municipal Comprehensive Annual Financial Reports (CAFR) for 15 Fairfield Co...
Parsing Mass Municipal PDF CAFRs with Tabulizer, pdftools and AWS Textract – Part 1
# Libraries
packages <- c("data.table", "rlist", "stringr", "DT", "janitor", "readxl", "xlsx")
if (length(setdiff(packages, rownames(installed.packages()))) > 0) {
  install.packages(setdiff(packages, rownames(installed.packages())))
}
invisible(lapply(packages, library, character.only = TRUE))
knitr::opts_chun...
Tabulizer and pdftools Together as Super-powers – Part 2
# Libraries
packages <- c("data.table", "stringr", "rlist", "tabulizer", "pdftools", "parallel", "DT")
if (length(setdiff(packages, rownames(installed.packages()))) > 0) {
  install.packages(setdiff(packages, rownames(installed.packages())))
}
invisible(lapply(packages, library, character.only = TRUE))
knitr::o...
Evaluating Mass Muni CAFR Tabulizer Results – Part 3
# Libraries
packages <- c("data.table", "rlist", "stringr", "pdftools", "readxl")
if (length(setdiff(packages, rownames(installed.packages()))) > 0) {
  install.packages(setdiff(packages, rownames(installed.packages())))
}
invisible(lapply(packages, library, character.only = TRUE))
knitr::opts_chunk$set(comment=NA, fig...
Scraping Failed Tabulizer PDFs with AWS Textract – Part 4
# Libraries
packages <- c("data.table", "stringr", "rlist", "paws.machine.learning", "paws.storage", "paws.common", "tabulizer", "pdftools", "keyring", "listviewer")
if (length(setdiff(packages, rownames(installed.packages()))) > 0) {
  install.packages(setdiff(packages, rownames(installed.packages())))...
Evaluating Mass Muni CAFR Textract Results – Part 5
# Libraries
packages <- c("data.table", "reticulate", "paws.machine.learning", "paws.common", "keyring", "pdftools", "listviewer", "readxl")
if (length(setdiff(packages, rownames(installed.packages()))) > 0) {
  install.packages(setdiff(packages, rownames(installed.packages())))
}
invisible(lapply(packages, ...
Exploring Big MT Cars with Python datatable-Part 1
# R Libraries
library("reticulate")
library("skimr")
knitr::opts_chunk$set(fig.width = 15, fig.height = 8, out.width = '100%')
# Install Python packages
lapply(c("datatable", "pandas"), function(package) {
  conda_install("r-reticulate", package, pip = TRUE)
})
# Python libraries
from datatable import *
import numpy as np
import re...
Visualizing Big MT Cars with Python plotnine-Part 2
# R Libraries
library("reticulate")
knitr::opts_chunk$set(fig.width = 15, fig.height = 8, out.width = '100%')
# Choose Python 3.7 miniconda
reticulate::use_condaenv(condaenv = "r-reticulate", required = TRUE)
# Install Python packages
lapply(c("plotnine"), function(package) {
  conda_install("r-reticulate", package, pip = T...