Publications by Arthur Charpentier
The odds of a cluster of airplane accidents
Recently, there have been a lot of airplane accidents: July 17th, 2014, Hrabove, Ukraine, Malaysia Airlines, Boeing 777, fatalities 298 (/298); July 23rd, 2014, Magong, Taiwan, TransAsia Airways, ATR 72-500, fatalities 47 (/58); July 24th, 2014, Aguelhok, Mali, Air Algérie, McDonnell Douglas MD-83, fatalities 116 (/116). It is simple to find a lo...
3277 sym R (1290 sym/14 pcs) 22 img
Men set to live as long as women by 2030?
A few months ago, in "Men set to live as long as women, figures show", it was mentioned that (in the U.K.) the gap between male and female life expectancy is closing and men could catch up by 2030, according to an adviser for the Office for National Statistics (the slides are available online: http://cass.city.ac.uk/…). I don’t really know ...
2250 sym R (3904 sym/6 pcs) 10 img
Social Media Mining and Bioinformatics (with R)
In June and July, I received copies of two books: Social Media Mining with R, by Nathan Danneman and Richard Heimann, and Bioinformatics with R Cookbook, by Paurush Praveen Sinha. For the first one, two recent interesting books deal with the same topic. Reza Zafarani, Mohammad Ali Abbasi and Huan Liu published last year Social Media Mining: An Intr...
4923 sym 20 img
Computational Actuarial Science, with R
The book Computational Actuarial Science, with R is officially out. In the introduction of the book, and on the website of CRC, it is mentioned that the datasets can be found “in an R package on CRAN”, which is unfortunately incorrect. Some datasets are too large, so the package cannot be uploaded to CRAN. Fortunately, Christophe hosts the pac...
1030 sym R (176 sym/2 pcs) 2 tbl
Crowded Cities, Paris, Hong Kong and Montréal
Over the past years, I’ve lived in several cities, each completely different from the others. I lived in Paris, a big European city with large suburbs, too (la banlieue). Then I lived in Hong Kong, a larger city, in Asia. It was crowded. I mean, it was the feelin...
3827 sym R (3290 sym/18 pcs) 24 img
Multiple Tests, an Introduction
Last week, a student asked me about multiple tests. More precisely, she ran an experiment over – say – 20 weeks, with the same cohort of – say – 100 patients. And we observe some size=100; nb=20; set.seed(1); X=matrix(rnorm(size*nb),size,nb) (here, I just generate some fake data). I can visualize some trajectories, over the 20 weeks, library...
3809 sym R (969 sym/9 pcs) 74 img
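As a minimal sketch of why multiple tests need care, the fake data from the excerpt can be tested week by week. The one-sample t-test, the 5% level and the Bonferroni correction are assumptions made here for illustration; the excerpt does not say which tests or corrections the post uses.

```r
# Fake data as in the excerpt: 100 patients (rows), 20 weeks (columns)
size <- 100
nb <- 20
set.seed(1)
X <- matrix(rnorm(size * nb), size, nb)

# One t-test per week; H0 (zero mean) is true by construction
pvalues <- apply(X, 2, function(x) t.test(x, mu = 0)$p.value)

# Chance of at least one false rejection over nb independent tests at 5%
fwer <- 1 - (1 - 0.05)^nb   # about 0.64

# Bonferroni correction (assumed here, one of several options in p.adjust)
padj <- p.adjust(pvalues, method = "bonferroni")

sum(pvalues < 0.05)   # rejections without correction
sum(padj < 0.05)      # rejections after correction
```

Even though no week has a real effect, with 20 independent tests at the 5% level there is roughly a 64% chance of at least one spurious rejection, which is what the correction is meant to control.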
R package for Computational Actuarial Science
A webpage for the book is now hosted on http://cas.uqam.ca/. So far, it is a very basic page, but information regarding the package can be found there. For instance, to install the package, with all the datasets, the R code is > install.packages("CASdatasets", repos = "http://cas.uqam.ca/pub/R/") The reference manual provides a description of all ...
697 sym R (70 sym/1 pcs)
Generating Hurricanes with a Markov Spatial Process
The National Hurricane Center (NHC) collects datasets with all storms in the North Atlantic, the North Atlantic Hurricane Database (HURDAT). For all storms, we have the location of the storm, every six hours (at midnight, six a.m., noon and six p.m.). Note that we also have the date, the maximal wind speed – on a 6 hour window – and the pressure...
3095 sym R (3972 sym/13 pcs) 16 img
Cross Validation for Kernel Density Estimation
In a post published in July, I mentioned the so-called Goldilocks principle, in the context of kernel density estimation, and bandwidth selection. The bandwidth should not be too small (the variance would be too large) and it should not be too large (the bias would be too large). Another standard method to select the bandwidth, as mentioned this...
1473 sym R (521 sym/4 pcs) 8 img
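The bias/variance trade-off described in this excerpt can be sketched in a few lines. The simulated sample, the two extreme bandwidths and the use of `bw.ucv` (the unbiased cross-validation selector shipped with R's stats package) are assumptions for illustration, and not necessarily the method used in the post.

```r
set.seed(1)
x <- rnorm(200)   # simulated sample (assumption, for illustration only)

h_small <- 0.05   # too small: wiggly estimate, large variance
h_large <- 2      # too large: oversmoothed estimate, large bias
h_cv <- bw.ucv(x) # bandwidth picked by unbiased cross-validation

# Kernel density estimates for the three bandwidths
d_small <- density(x, bw = h_small)
d_large <- density(x, bw = h_large)
d_cv <- density(x, bw = h_cv)

plot(d_small, col = "grey", main = "Kernel density estimates")
lines(d_large, lty = 2)
lines(d_cv, col = "red", lwd = 2)
```

`bw.ucv` may warn that the minimum occurred at one end of its search range; widening the range through its `lower` and `upper` arguments is the usual remedy.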
What happens if we forget a trivial assumption?
Last week, @dmonniaux published an interesting post entitled l’erreur n’a rien d’original on his blog. He was asking the following question: let a, b and c denote three real-valued coefficients; under which assumption on those three coefficients does ax^2+bx+c=0 have a real-valued root? Everyone answered b^2-4ac≥0, but no one mentioned that it is necessary to...
2709 sym R (588 sym/4 pcs) 78 img
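A short sketch of the pitfall, taking the equation in question to be a*x^2 + b*x + c = 0 and assuming that the forgotten condition is a ≠ 0 (the excerpt is cut off before stating it). The function name `has_real_root` is hypothetical.

```r
# Does a*x^2 + b*x + c = 0 have a real root?
# (hypothetical helper; the degenerate cases are handled explicitly)
has_real_root <- function(a, b, c) {
  if (a != 0) return(b^2 - 4 * a * c >= 0)  # proper quadratic: discriminant
  if (b != 0) return(TRUE)                  # linear case: x = -c / b
  c == 0                                    # constant case: 0 = c
}

has_real_root(1, 0, 1)   # FALSE: x^2 + 1 = 0
has_real_root(0, 1, 1)   # TRUE:  x + 1 = 0
has_real_root(0, 0, 1)   # FALSE: 1 = 0 has no solution

# The discriminant alone would wrongly accept the last case
0^2 - 4 * 0 * 1 >= 0     # TRUE
```

The last line shows why the discriminant condition is not enough on its own: with a = b = 0 and c = 1 it holds, yet the equation has no root.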