Publications by Wingfeet

Project Tycho, Correlation between states

27.04.2014

In this fourth post on Measles data I want to have a look at correlation between states. As described before, the data is from Project Tycho, which contains data from all weekly notifiable disease reports for the United States dating back to 1888. These data are freely available to anybody interested.DataI discovered an error in previous co...

4504 sym 6 img

European MEP Data

04.05.2014

Pretty soon we will be having European Elections. I cannot tell when exactly, that depends on country. Over here the elections get a some attention, which is how I ran into data from votewatch.eu on voting of MEPs. That was just too interesting, so I made some plots.DataData describe how often MEPs voted what in the European Parliamen...

5374 sym 10 img

European MEP Data, Part 2

11.05.2014

Following last week’s short examination, I now wanted to drill down a bit more in the voting behaviour as given in data from votewatch.eu on voting of MEPs.Votewatch’s Data describe how often MEPs voted what in the European Parliament. For each MEP the number of votes, percentages Yes, No, Abstain, number of elections and number ...

7254 sym 6 img

European MEP data, part 3

18.05.2014

As final post on European MEPs voting I wanted to look at the individual MEP. The variables examined are how often present and how often present but not voted. The latter might be a marker of sign in and slope off. The analysis chosen is a hierarchical Bayesian analysis, which should push individual’s outcomes towards the population...

14827 sym 4 img

Significant birthdays in the weekend

25.05.2014

I am a listener to BBC‘s podcast More or Less. In the program Tim Harford looks at data with both humour and determination to find what the numbers mean. Last week he handled a listener question. Does everybody get a significant birthday (20, 30 years etc.) in a weekend. His back of the envelope answer was, yes, we think so. Given t...

2745 sym

Autocorrelation in project Tycho’s measles data

01.06.2014

Project Tycho includes data from all weekly notifiable disease reports for the United States dating back to 1888. These data are freely available to anybody interested.I have looked at Ptoject Tycho’s measles data before, general look, incidence, some high incidence data and correlation between states. After a detour, it is now time to lo...

2951 sym 2 img

stone flakes

06.06.2014

I browsed through UC Irvine Machine Learning Repository! the other day and noticed a nice data set regarding stone flakes produced by our ancestors, the prehistoric men. To quote the dataset owners:‘The data set concerns the earliest history of mankind. Prehistoric men created the desired shape of a stone tool by striking on a raw...

5589 sym 8 img

Stone flakes II

16.06.2014

Continuing from last week, the aim is now to classify the stone flakes based on their various properties. Three methods are used. LDA is an obvious standard. A classification tree is both simple and visually appealing. Random forest as a complex method, where more complex relations can easily be captured. Surprising with these data is...

8350 sym 6 img

stone flakes III

22.06.2014

Stone flakes are waste products from the tool making process in the stone age. This is the second post, first post was clustering, second linking to hominid type. The data also contains a more or less continuous age variable, which gives possibility to use regression, which is the topic of this week.DataData source see first post. Re...

8197 sym 8 img

stone flakes IV

29.06.2014

In this post I want to try something new, a causal graphical model. The aim here is just as much to get myself a feel what these things do as to understand how the stone flakes data fit together. DataData are stone flakes data which I analyzed previously. The first post was clustering, second linking to hominid type, third regression....

4028 sym 12 img