Publications by Sam Weiss

Modeling "This Time is Different"

14.04.2014

Check out PDF for EquationsAnd Code to run yourselfIntroduction:“This Time is different” by Reinhart and Rogoff is an empirical history of financial crises and panics.  It describes many economic events such as; inflation, bubbles, and defaults.The theme that ties these events together is the idea that people have an unrealistic expectation...

4532 sym 2 img

Modeling "This Time is Different" in R

14.04.2014

Check out PDF for EquationsAnd Code to run yourselfIntroduction:“This Time is different” by Reinhart and Rogoff is an empirical history of financial crises and panics.  It describes many economic events such as; inflation, bubbles, and defaults.The theme that ties these events together is the idea that people have an unrealistic expecta...

4524 sym 2 img

didYouMean() Function: Using Google to correct errors in Strings

22.05.2014

A function that will take a String as an input and return the “Did you mean..” or “Showing Results for..” from google.com. Good for misspelled names or locations.Githublibrary(RCurl)didYouMean=function(input){  input=gsub(” “, “+”, input)  doc=getURL(paste(“https://www.google.com/search?q=”,input,”/”, sep=””))   �...

1860 sym

Mideast Graph 3: Slate Middle East Friendship

23.07.2014

Slate recently published a great info-graphic about Middle-East Relationships. It shows the relationships (Friend, Enemy, or Complicated) of 13 countries / organizations and the relationships between each pair. One draw back of the chart is that it doesn’t show if countries / organizations cluster among each other. This post will attempt to rec...

2415 sym 4 img

Predicting Monthly Car Sales for Brands in US: First Step

06.08.2014

I’ve set out to produce monthly forecasts of monthly car sales by brand in the US. So far I’ve made a SUTSE dynamic linear model (code on Github) and created a Shiny app (http://sweiss.shinyapps.io/carvis/) as a prototype (no predictions yet).Basically, I want to combine my interest in cars with my interest in stats.My current roadblock is...

885 sym

Predicting Monthly Car Sales: The Residuals are the Story

07.10.2014

I’ll produce predictions for US car sales by manufacture every month. There are already several blogs that describe the industry and sales that do a great job. Autoblog by the Numbers and Counting Cars are some to mention. Unlike their analysis, I’ll try to focus on the residuals (the stuff I can’t predict) to tell the story. To highli...

1464 sym 2 img 2 tbl

Western Front Battle Exchange Rates

22.03.2015

Introduction:GithubThe Western Front in World War One is considered a a war of outdated tactics combined with brutal efficiency of technology. And while the war saw the introduction of several innovations (eg. tanks and airplanes) the general routine of heavy artillery bombardments followed by men “going over the top” to meet barbed wire and ...

4928 sym 6 img

WW1 Monthly Casualties by Fronts and Belligerents

02.04.2015

GithubI’ve been reading a few books on WW1 and wanted to see a time series plot of battle casualty/pow by country to get a better understanding of how the conflict fits together. I couldn’t find any database for military casualties in WW1 but Wikipedia does include casualty statistics for each battle. I wrote some code to scrape / parse it bu...

2031 sym 4 img

Slate Middle East Graph #2

07.10.2015

I recently saw Slate.com redid its Mideast Graphic and thought it would be good to redo my previous analysis. From the original slate graphic it’s hard to make much sense of what’s happening and who’s friends with whom.From the graph below we can see 3 clearly (more or less) defined groups. Nusra and ISIS are Jihadists and have no friends....

1360 sym 4 img

Beyond Beta: Relationships between partialPlot(), ICEbox(), and predcomps()

06.12.2015

Machine Learning models generally outperform standard regression models in terms of predictive performance. However, these models tend to be poor in explaining how they achieve a particular result. This post will discuss three methods used to peak inside “black box” models and the connections between them.Ultimately the methods discussed here...

3843 sym 8 img