Publications by Randy Zwitch

A Beginner’s Look at Julia

23.07.2013

Over the past month or so, I’ve been playing with a new scientific programming language called ‘Julia’, which aims to be a high-level language with performance approaching that of C. With that goal in mind, Julia could be a solution to the ‘multi-language’ problem of needing to move between R, Python, MATLAB, C, Fortran, Scala, etc....

Tabular Data I/O in Julia

06.08.2013

Importing tabular data into Julia can be done in (at least) three ways: reading a delimited file into an array, reading a delimited file into a DataFrame, and accessing databases using ODBC. Reading a file into an array using readdlm: The most basic way to read data into Julia is the readdlm function, which creates an array:...

Anomaly Detection Using The Adobe Analytics API

15.08.2013

As digital marketers & analysts, we’re often asked to quantify when a metric goes beyond just random variation and becomes an actual “unexpected” result. In cases such as A/B..N testing, it’s easy to calculate a t-test to quantify the difference between two testing populations, but for time-series metrics, using a t-test is likely not app...

RSiteCatalyst Version 1.1 Release Notes

25.08.2013

RSiteCatalyst version 1.1 is now available on CRAN. Changes from version 1 include: support for Correlations/Subrelations in the QueueRanked function; support for Current Data in all ‘Queue’ functions; support for Anomaly Detection in the QueueOvertime and QueueTrended functions (example usage with ggplot2 graph); decrease in wait time for API calls ...

Fun With Just-In-Time Compiling: Julia, Python, R and pqR

02.09.2013

Recently I’ve been spending a lot of time trying to learn Julia by doing the problems at Project Euler. What’s great about these problems is that they get me out of my normal design patterns, since I don’t generally think about prime numbers, factorials and other number theory problems during my normal workday. These problems have also given...

Clustering Search Keywords Using K-Means Clustering

17.09.2013

One of the key tenets of impactful digital analysis is understanding what your visitors are trying to accomplish. One of the easiest ways to do this is by analyzing the words your visitors use to arrive on the site (search keywords) and the words they use while on the site (on-site search). Although Google has made it much more diffi...

RSiteCatalyst Version 1.2 Release Notes

05.11.2013

Version 1.2 of the RSiteCatalyst package to access the Adobe Analytics API is now available on CRAN! Changes include: removed RCurl package dependency; changed argument order for GetAdminConsoleLog to avoid error when date not passed; return proper numeric type for metric columns; fixed bug in GetEVars function; added validate:true flag to API to...

Adobe Analytics Implementation Documentation in 60 seconds

09.12.2013

When I was working as a digital analytics consultant, no question could cause belly laughs AND angst quite like, “Can you send me an updated copy of your implementation documentation?” I saw companies that were spending six-or-seven-figures annually on their analytics infrastructure, multi-millions in salary for employees and yet t...

Quickly Create Dummy Variables in a Data Frame

02.01.2014

On Quora, a question was asked about how to fix the error caused by the randomForest package in R not being able to handle more than 32 levels in a categorical variable. Since I’ve seen this question asked on Kaggle forums, StackOverflow and elsewhere, here’s the answer: code your own dummy variables instead of relying on Factors! Code snipp...
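The idea of coding dummy variables by hand can be sketched as follows. This is an illustrative Python sketch only, not the R snippet from the original post (which is truncated in this excerpt); the function name `add_dummy_columns` and the sample rows are hypothetical.

```python
def add_dummy_columns(rows, column):
    """Return copies of `rows` where the categorical `column` is replaced
    by one 0/1 indicator column per distinct level."""
    # Collect the distinct levels once so every row gets the same columns.
    levels = sorted({row[column] for row in rows})
    encoded = []
    for row in rows:
        # Keep every field except the categorical one being expanded.
        new_row = {key: value for key, value in row.items() if key != column}
        for level in levels:
            new_row[f"{column}_{level}"] = 1 if row[column] == level else 0
        encoded.append(new_row)
    return encoded


# Hypothetical sample data: one numeric field and one categorical field.
rows = [
    {"visits": 10, "browser": "Chrome"},
    {"visits": 7, "browser": "Firefox"},
    {"visits": 3, "browser": "Chrome"},
]
encoded = add_dummy_columns(rows, "browser")
```

Because each level becomes its own numeric column, the model never sees a single categorical variable with many levels, which is what the 32-level limit applies to.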

RSiteCatalyst Version 1.3 Release Notes

04.02.2014

Version 1.3 of the RSiteCatalyst package to access the Adobe Analytics API is now available on CRAN! Changes include: search via regex functionality in QueueRanked/QueueTrended functions; support for Realtime API reports (Overtime and one-element Ranked report); allow for variable API request timing in Queue functions; fixed validate flag in JSON re...
