Publications by Francis Smart

Overwhelming Growth In National Support for Bernie Sanders Mapped

25.02.2016

The FEC just released the most recent campaign contributor data and the results show a strong continued widespread growth in support for Bernie Sanders across the country.Figure 1: A map of what counties and states support Bernie Sanders relative to that of Hillary Clinton in January 2016.As of the end of January 2016, 88% of states have more rep...

3335 sym 10 img 5 tbl

FALSE: Clinton Funded by "Grassroots"

01.03.2016

The blatant distortions of reality put forth by the Clinton campaign are so offensive as to be laughable at times. In the victory speech of Hillary Clinton in South Carolina she spent a significant portion of it talking about how her campaign is financed by “grassroots”.Well, looking at the breakdown of funding for her campaign, only about 12...

2914 sym 2 img

Calculating Average Consumption From One Week of Purchases

14.04.2016

A number of large surveys have attempted to quantify consumer consumption from a limited period of time observed. This task can be fairly complex as it is fraught with potentially large difficulties directly observing who is consuming what. Rather than this expensive method some researchers have attempted to substitute more easily observed purcha...

7143 sym 4 img 3 tbl

Calculating Average Consumption From One Week of Purchases

14.04.2016

A number of large surveys have attempted to quantify consumer consumption from a limited period of time observed. This task can be fairly complex as it is fraught with potentially large difficulties directly observing who is consuming what. Rather than this expensive method some researchers have attempted to substitute more easily observed purcha...

7118 sym 4 img 3 tbl

Efficiently Saving and Sharing Data in R

01.12.2016

After spending a day the other week struggling to make sense of a federal data set shared in an archaic format (ASCII fixed format dat file). It is essential for the effective distribution and sharing of data that it use the minimum amount of disk space and be rapidly accessible for use by potential users. In this post I test four different file ...

7397 sym 8 img

Plotting the Impact of Atlantic Hurricanes on the US

12.09.2018

With hurricane Florence bearing down with expected to be devastating force, it might be a good time to reflect on the history of Atlantic Hurricanes on the United States. As an easy if not full proof source for hurricane data I drew on public data collated through Wikipedia’s list of costliest Atlantic hurricanes as well as linked articles. On ...

3051 sym 12 img 6 tbl

Strategizing Retirement Investments (In the US)

26.09.2018

Major caveat! I have no investment training or finance training and all of my calculations are back of the envelope calculations put together from what information I can gather online. In addition, given that investment planning usually spans decades, massive uncertainties exist in tax schemes and expected rate of returns. Please consult a profes...

12303 sym 12 img 7 tbl

Sexual Assault in the 80s and Christine Blasey Ford’s Testimony

28.09.2018

Dr. Ford’s testimony alleging that Judge Kavanaugh attempted to rape her when she was 15 was extremely difficult to watch as she was and still is deeply traumatized by the event. Kavanaugh’s abrasive and highly rehearsed obstructionist response was even more difficult to watch. In response I decided to look at public data to see if the type ...

5581 sym 12 img

The importance of Graphing Your Data – Anscombe’s Clever Quartet!

19.03.2019

Francis Anscombe’s seminal paper on “Graphs in Statistical” analysis (American Statistician, 1973) effectively makes the case that looking at summary statistics of data is insufficient to identify the relationship between variables. He demonstrates this by generating four different data sets (Anscombe’s quartet) which have nearly identic...

6853 sym R (1101 sym/2 pcs) 4 img 2 tbl

Data Fun – Inspired by Darasaurus

22.03.2019

After my recent post on Anscombe’s Quartet in which I demonstrated how to efficiently adjust any data set to match mean, variance, correlation (x,y), as well as regression coefficients. Philip Waggoner tuned me onto Justin Matejka and George Fitzmaurice’s Datasaurus R package/paper in which the authors demonstrate an alternative method of mod...

3307 sym 16 img 2 tbl