Publications by Joseph Rickert

A gentle introduction to parallel computing in R

19.01.2016

by John Mount Ph.D.Data Scientist at Win-Vector LLC Let's talk about the use and benefits of parallel computation in R. IBM's Blue Gene/P massively parallel supercomputer (Wikipedia). Parallel computing is a type of computation in which many calculations are carried out simultaneously.” Wikipedia quoting: Gottlieb, Allan; Almasi, George S. (1...

2986 sym 2 img

Getting Started with Markov Chains: Part 2

22.01.2016

by Joseph Rickert In a previous post, I showed some elementary properties of discrete time Markov Chains could be calculated, mostly with functions from the markovchain package. In this post, I would like to show a little bit more of the functionality available in that package by fitting a Markov Chain to some data. In this first block of code, I...

6248 sym 4 img

Pipelining R and Python in Notebooks

26.01.2016

by Micheleen HarrisMicrosoft Data Scientist As a Data Scientist, I refuse to choose between R and Python, the top contenders currently fighting for the title of top Data Science programming language. I am not going to argue about which is better or pit Python and R against each other. Rather, I'm simply going to suggest to play to the strengths o...

5396 sym 4 img

R User Groups on GitHub

28.01.2016

by Joseph Rickert Quite a few times over the past few years I have highlighted presentations posted by R user groups on their websites and recommended these sites as a source for interesting material, but I have never thought to see what the user groups were doing on GitHub. As you might expect, many people who make presentations at R user group ...

2453 sym R (826 sym/1 pcs) 8 img

Using Microsoft R Open with RStudio

04.02.2016

by Joseph Rickert A frequent question that we get here at Microsoft about MRO (Microsoft R Open) is: can be used with RStudio? The short answer is absolutely yes! In fact, more than just being compatible, MRO is the perfect complement for the RStudio environment. MRO is a downstream distribution of open source R that supports multiple operating s...

6073 sym 2 img

R User Groups in Poland

09.02.2016

by Przemyslaw Biecek The first meeting of R users in Poland took place in Wroclaw in 2008. It was a one-day conference with 27 participants and 6 talks. Today, we have three large groups of R users in major Polish cities (according to meetup.com there is 640 users in SER – Warsaw, 235 in eRka – Cracow and 64 in PAZUR – Poznań). And there ...

5696 sym 2 img

Getting Started with Microsoft SQL Server on Azure

11.02.2016

by Joseph Rickert If you are an R user and work for an enterprise where Microsoft SQL server is important, it is extremely helpful to have easy access to SQL Server databases. Over the past year, we have run several posts on this topic including a comprehensive four part series from Microsoft's Gregory Vandenbrouck on using various flavors of SQL...

3558 sym R (2713 sym/2 pcs) 6 img

More R User Group Sites

16.02.2016

by Joseph Rickert Last month I wrote about how several R user groups were making use of GitHub and listed some sites that I thought had interesting material. A few readers were kind enough to point out sites that I had missed; so I would just like to give a couple of “shout outs” here. First of all, I should acknowledge LondonR as a leader in...

1531 sym 4 img

Bay Area useR Group Lightning Talks

18.02.2016

by Joseph Rickert Earlier this month the Bay Area useR Group (BARUG) held it annual lightning talk meeting. This is by far our most popular meeting format: eight, 15 minute talks (12 minutes speaking and 3 minutes Q & A while the next speaker is setting up) packed into a two hour time slot. The intensity seems to really energize the speakers and ...

3405 sym 12 img

Analysis of Xbox Usage Trends with R: Time Series Decompositions

23.02.2016

by Daniel MooreDirector of Applied Statistics Engineering, Console DevelopmentMicrosoft In Xbox Hardware, we are interested in the various ways that our hardware is used, and we are especially interested in how that usage changes over time. We employ several several time series analysis techniques that are helpful in getting a holistic view of us...

2697 sym R (502 sym/1 pcs) 2 img