Publications by randyzwitch.com

RSiteCatalyst Version 1.4.4 Release Notes

13.07.2015

It’s been about six months since the last RSiteCatalyst update, and this update is really just a single bug fix, but a big bug fix at that! Sparse Data = Opaque Error Messages Numerous people have reported receiving an error message from RSiteCatalyst similar to the following: ‘names’ attribute [1] must be the same length as the vector [0]...

3008 sym 2 img

Getting Started: Adobe Analytics Clickstream Data Feed

04.08.2015

“Well, first you need a TMS and a three-tiered data layer, then some jQuery with a node backend to inject customer data into the page asynchronously if you want to avoid cookie-based limitations with cross-domain tracking and be Internet Explorer 4 compatible…” Blah Blah Blah. There’s a whole cottage industry around jargon-ing each other...

6967 sym R (3448 sym/4 pcs) 2 img

Authenticated API Testing Using Travis CI

06.08.2015

As I’ve become more serious about contributing in the open-source community, having quality tests for my packages has been something I’ve spent much more time on than when I was just writing quick-and-dirty code for my own purposes. My most used open-sourced package is RSiteCatalyst, which accesses the Adobe Analytics (authenticated) API, ...

4366 sym R (2468 sym/3 pcs)

RSiteCatalyst Version 1.4.5 Release Notes

17.08.2015

It’s only been a month since the last RSiteCatalyst update, and this update is also a pretty minor update in terms of functionality. Set Your Own Endpoint For the overseas users (or companies with weird setups), you can now use the endpoint argument in the SCAuth() function to specify your API endpoint. For the most part, this is not recommen...

2538 sym

A Million Text Files And A Single Laptop

28.01.2016

More often that I would like, I receive datasets where the data has only been partially cleaned, such as the picture on the right: hundreds, thousands…even millions of tiny files. Usually when this happens, the data all have the same format (such as having being generated by sensors or other memory-constrained devices). The problem with data l...

6990 sym R (3922 sym/6 pcs) 2 img

RSiteCatalyst Version 1.4.7 (and 1.4.6.) Release Notes

01.02.2016

It seems as though I missed release notes for version RSiteCatalyst 1.4.6, so we’ll do those and RSiteCatalyst 1.4.7 (now on CRAN) and the same time… RSiteCatalyst 1.4.6 This release was mostly tweaking some settings, specifically: Adding a second top argument within the Queue* functions for more control on results returned. It used to be th...

3361 sym

Calling RSiteCatalyst From Python

22.02.2016

@randyzwitch Do you know if anyone has gotten RSiteCat running in a Jupyter Notebook that ran RPY2? Tired of using 2 different environments — Adam Gitzes (@FootballActuary) February 18, 2016 This will be a very short post, because the only “new” information I’m going to provide is the minimal example to answer the question. Yes, it...

2137 sym Python (360 sym/2 pcs) 2 img

Adobe Analytics Clickstream Data Feed: Loading To Relational Database

18.03.2016

In my previous post about the Adobe Analytics Clickstream Data Feed, I showed how it was possible to take a single day worth of data and build a dataframe in R. However, most likely your analysis will require using multiple days/weeks/months of data, and given the size and complexity of the feed, loading the files into a relational database makes...

4418 sym R (2877 sym/3 pcs)

RSiteCatalyst Version 1.4.8 Release Notes

04.04.2016

For being in RSiteCatalyst retirement, I’m ending up working on more functionality lately ¯_(ツ)_/¯. Here are the changes for RSiteCatalyst 1.4.8, which should be available on CRAN shortly: Segment Stacking RSiteCatalyst now has the ability to take multiple values in the segment.id keyword for the Queue* functions. This functionality was ...

2810 sym R (317 sym/1 pcs) 2 img

Travis CI: “You Have Too Many Tests LOLZ!”

05.04.2016

As part of getting RSiteCatalyst 1.4.8 ready for CRAN, I’ve managed to accumulate hundreds of testthat tests across 63 test files. Each of these tests runs on Travis CI against an authenticated API, and the API frequently queues long-running reports. Long-story-short, my builds started failing, creating the error log message quoted below: No...

4615 sym R (61 sym/1 pcs)