Publications by Arthur Charpentier

The odds of a cluster of airplane accidents

02.08.2014

Recently, there have been a lot of airplane accidents: July 17th, 2014, Hrabove, Ukraine, Malaysia Airlines, Boeing 777, fatalities 298 (/298); July 23rd, 2014, Magong, Taiwan, TransAsia Airways, ATR 72-500, fatalities 47 (/58); July 24th, 2014, Aguelhok, Mali, Air Algérie, McDonnell Douglas MD-83, fatalities 116 (/116). It is simple to find a lo...

3277 sym R (1290 sym/14 pcs) 22 img

Men set to live as long as women by 2030?

03.08.2014

A few months ago, in "Men set to live as long as women, figures show", it was mentioned that (in the U.K.) the gap between male and female life expectancy is closing and men could catch up by 2030, according to an adviser for the Office for National Statistics (the slides are available online, http://cass.city.ac.uk/…). I don’t really know ...

2250 sym R (3904 sym/6 pcs) 10 img

Social Media Mining and Bioinformatics (with R)

05.08.2014

In June and July, I received copies of two books: Social Media Mining with R, by Nathan Danneman and Richard Heimann, and Bioinformatics with R Cookbook, by Paurush Praveen Sinha. For the first one, two recent interesting books deal with the same topic. Reza Zafarani, Mohammad Ali Abbasi and Huan Liu published last year Social Media Mining: An Intr...

4923 sym 20 img

Computational Actuarial Science, with R

24.08.2014

The book Computational Actuarial Science, with R is officially out. In the introduction of the book, and on the website of CRC, it is mentioned that the datasets can be found “in an R package on CRAN”, which is unfortunately incorrect. Some datasets are too large, so the package cannot be uploaded to CRAN. Fortunately, Christophe hosts the pac...

1030 sym R (176 sym/2 pcs) 2 tbl

Crowded Cities, Paris, Hong Kong and Montréal

05.09.2014

Over the past years, I have lived in several cities, each completely different from the others. I lived in Paris, which is a big city in Europe, with large suburbs, too (la banlieue). Then, I lived in Hong Kong, which is a larger city, in Asia. It was crowded. I mean, it was the feelin...

3827 sym R (3290 sym/18 pcs) 24 img

Multiple Tests, an Introduction

24.09.2014

Last week, a student asked me about multiple tests. More precisely, she ran an experiment over – say – 20 weeks, with the same cohort of – say – 100 patients. And we observe some data: size=100; nb=20; set.seed(1); X=matrix(rnorm(size*nb),size,nb) (here, I just generate some fake data). I can visualize some trajectories, over the 20 weeks, library...

3809 sym R (969 sym/9 pcs) 74 img
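
The excerpt stops before any test is run, so the following is only a minimal sketch of why multiple tests need care, not the post's own analysis. It reuses the fake data from the excerpt. The choice of a one-sample t-test per week, the 5% level and the Bonferroni correction are assumptions made for illustration.

```r
# Fake data, as in the excerpt: 100 patients observed over 20 weeks
size <- 100
nb <- 20
set.seed(1)
X <- matrix(rnorm(size * nb), size, nb)

# One t-test per week of H0: mean = 0 (assumed test, for illustration)
pvalues <- apply(X, 2, function(x) t.test(x, mu = 0)$p.value)

# Number of weeks flagged at the 5% level without any adjustment
n_raw <- sum(pvalues < 0.05)

# Same count after a Bonferroni adjustment of the p-values
p_bonf <- p.adjust(pvalues, method = "bonferroni")
n_bonf <- sum(p_bonf < 0.05)

# Probability of at least one false rejection among nb independent
# tests at the 5% level, when every null hypothesis is true
fwer <- 1 - (1 - 0.05)^nb
```

With 20 independent tests at the 5% level, `fwer` is about 0.64, so rejecting at least one true null hypothesis is more likely than not.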

R package for Computational Actuarial Science

29.09.2014

The book now has a webpage, hosted on http://cas.uqam.ca/. So far, it is a very basic page, but information regarding the package can be found there. For instance, to install the package, with all the datasets, the R code is: > install.packages("CASdatasets", repos = "http://cas.uqam.ca/pub/R/"). The reference manual provides a description of all ...

697 sym R (70 sym/1 pcs)

Generating Hurricanes with a Markov Spatial Process

30.09.2014

The National Hurricane Center (NHC) collects data on all storms in the North Atlantic, in the North Atlantic Hurricane Database (HURDAT). For all storms, we have the location of the storm every six hours (at midnight, six a.m., noon and six p.m.). Note that we also have the date, the maximal wind speed – on a 6 hour window – and the pressure...

3095 sym R (3972 sym/13 pcs) 16 img

Cross Validation for Kernel Density Estimation

01.10.2014

In a post published in July, I mentioned the so-called Goldilocks principle, in the context of kernel density estimation and bandwidth selection. The bandwidth should not be too small (the variance would be too large) and it should not be too large (the bias would be too large). Another standard method to select the bandwidth, as mentioned this...

1473 sym R (521 sym/4 pcs) 8 img
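
The excerpt is cut before the method is described, so the sketch below is one possible illustration of cross-validated bandwidth selection, not necessarily the criterion used in the post. It assumes a Gaussian kernel, a simulated sample `x`, and a leave-one-out likelihood criterion evaluated on a grid. The helper `loo_loglik` and the grid `h_grid` are hypothetical names introduced here. The built-in `bw.ucv()` (least-squares cross-validation) is shown for comparison.

```r
set.seed(1)
x <- rnorm(200)  # hypothetical sample, for illustration only

# Leave-one-out log-likelihood of a Gaussian kernel density estimate
# with bandwidth h: each point is scored by the density estimated
# from the other n - 1 points
loo_loglik <- function(h, x) {
  n <- length(x)
  K <- dnorm(outer(x, x, "-") / h) / h  # pairwise kernel weights
  diag(K) <- 0                          # drop each point's own weight
  sum(log(rowSums(K) / (n - 1)))
}

# Evaluate the criterion on a grid of candidate bandwidths
h_grid <- seq(0.05, 1.5, by = 0.01)
cv <- sapply(h_grid, loo_loglik, x = x)
h_cv <- h_grid[which.max(cv)]

# Least-squares (unbiased) cross-validation from the stats package
h_ucv <- bw.ucv(x)
```

A bandwidth that is too small makes the left-out points look unlikely, while one that is too large flattens the estimate, so the criterion is maximised somewhere in between.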

What happens if we forget a trivial assumption?

04.10.2014

Last week, @dmonniaux published an interesting post entitled l’erreur n’a rien d’original on his blog. He was asking the following question: let $a$, $b$ and $c$ denote three real-valued coefficients; under which assumption on those three coefficients does $ax^2+bx+c=0$ have a real-valued root? Everyone answered $b^2-4ac\geq 0$, but no one mentioned that it is necessary to...

2709 sym R (588 sym/4 pcs) 78 img
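
Assuming the question concerns the quadratic equation $ax^2+bx+c=0$ (the excerpt is truncated, so this reading is an assumption), here is a small sketch of what goes wrong when the case $a=0$ is forgotten. The function name `has_real_root` is hypothetical.

```r
# Does a x^2 + b x + c = 0 have a real-valued root?
has_real_root <- function(a, b, c) {
  if (a != 0) return(b^2 - 4 * a * c >= 0)  # proper quadratic: discriminant
  if (b != 0) return(TRUE)                  # linear equation: x = -c / b
  c == 0                                    # constant: only 0 = 0 has roots
}

# The discriminant alone is misleading when a = 0 and b = 0:
# 0 x^2 + 0 x + 1 = 0 has no root, yet b^2 - 4ac = 0 >= 0
naive <- (0^2 - 4 * 0 * 1) >= 0
careful <- has_real_root(0, 0, 1)
```

Here `naive` is `TRUE` while `careful` is `FALSE`, which is the kind of gap a forgotten assumption leaves.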