Publications by RJM

DATA606 Lab6 RJM


In August of 2012, news outlets ranging from the Washington Post to the Huffington Post ran a story about the rise of atheism in America. The source for the story was a poll that asked people, “Irrespective of whether you attend a place of worship or not, would you say you are a religious person, not a religious person or a convinced atheist?”...

12730 sym R (10463 sym/84 pcs) 15 img

DATA606 Lab7 RJM


North Carolina births: In 2004, the state of North Carolina released a large data set containing information on births recorded in this state. This data set is useful to researchers studying the relation between habits and practices of expectant mothers and the birth of their children. We will work with a random sample of observations from this da...

6760 sym R (5932 sym/37 pcs) 7 img 1 tbl

DATA606 Lab5a RJM


In this lab, we investigate the ways in which the statistics from a random sample of data can serve as point estimates for population parameters. We’re interested in formulating a sampling distribution of our estimate in order to learn about the properties of the estimate, such as its distribution. The data: We consider real estate data from th...

11510 sym R (3952 sym/46 pcs) 11 img

DATA606 Lab3 RJM


Hot Hands: Basketball players who make several baskets in succession are described as having a hot hand. Fans and players have long believed in the hot hand phenomenon, which refutes the assumption that each shot is independent of the next. However, a 1985 paper by Gilovich, Vallone, and Tversky collected evidence that contradicted thi...

11462 sym R (3847 sym/27 pcs) 3 img

DATA606 Lab4 RJM


In this lab we’ll investigate the probability distribution that is most central to statistics: the normal distribution. If we are confident that our data are nearly normal, that opens the door to many powerful statistical methods. Here we’ll use the graphical tools of R to assess the normality of our data and also learn how to generate random...

10028 sym R (2048 sym/45 pcs) 20 img

DATA606 Lab5b RJM


Sampling from Ames, Iowa: If you have access to data on an entire population, say the size of every house in Ames, Iowa, it’s straightforward to answer questions like, “How big is the typical house in Ames?” and “How much variation is there in sizes of houses?” If you have access to only a sample of the population, as is often the case...

7310 sym R (1486 sym/23 pcs) 3 img



Area under the curve, Part I. (4.1, p. 142) What percent of a standard normal distribution \(N(\mu=0, \sigma=1)\) is found in each region? Be sure to draw a graph. \(Z < -1.35\); \(Z > 1.48\); \(-0.4 < Z < 1.5\); \(|Z| > 2\)...

5253 sym R (4739 sym/66 pcs) 11 img

Data 607 Project 4


Calling the appropriate libraries: library(tm); library(e1071); library(dplyr); library(wordcloud)...

694 sym R (38754 sym/48 pcs) 2 img

DATA 607 Project 1


library(dplyr); library(stringr). Regex Example 1 for extra credit in HW3: The code below is the original code with the results fro...

2343 sym R (6706 sym/61 pcs)



Baby weights, Part I. (9.1, p. 350) The Child Health and Development Studies investigate a range of topics. One study considered all pregnancies between 1960 and 1967 among women in the Kaiser Foundation Health Plan in the San Francisco East Bay area. Here, we study the relationship between smoking and weight of the baby. The variable smoke is c...

7440 sym R (2204 sym/32 pcs) 1 img