Publications by Keith Goldfeld
Late anniversary edition redux: conditional vs marginal models for clustered data
This afternoon, I was looking over some simulations I plan to use in an upcoming lecture on multilevel models. I created these examples a while ago, before I started this blog. But since it was just about a year ago that I first wrote about this topic (and started the blog), I thought I’d post this now to mark the occasion. The code below provi...
4577 sym R (4688 sym/9 pcs) 8 img
Re-referencing factor levels to estimate standard errors when there is interaction turns out to be a really simple solution
Maybe this should be filed under topics that are so obvious that it is not worth writing about. But, I hate to let a good simulation just sit on my computer. I was recently working on a paper investigating the relationship of emotion knowledge (EK) in very young kids with academic performance a year or two later. The idea is that kids who are mor...
7027 sym R (5358 sym/11 pcs) 2 img
How the odds ratio confounds: a brief study in a few colorful figures
The odds ratio always confounds: while it may be constant across different groups or clusters, the risk ratios or risk differences across those groups may vary quite substantially. This makes it really hard to interpret an effect. And then there is inconsistency between marginal and conditional odds ratios, a topic I seem to be visiting frequentl...
6282 sym R (1043 sym/2 pcs) 8 img
Randomize by, or within, cluster?
I am involved with a stepped-wedge designed study that is exploring whether we can improve care for patients with end-stage disease who show up in the emergency room. The plan is to train nurses and physicians in palliative care. (A while ago, I described what the stepped wedge design is.) Under this design, 33 sites around the country will recei...
3795 sym R (3090 sym/7 pcs) 4 img
Multivariate ordinal categorical data generation
An economist contacted me about the ability of simstudy to generate correlated ordinal categorical outcomes. He is trying to generate data as an aide to teaching cost-effectiveness analysis, and is hoping to simulate responses to a quality-of-life survey instrument, the EQ-5D. The particular instrument has five questions related to mobility, self...
4196 sym R (2708 sym/4 pcs) 8 img
The power of stepped-wedge designs
Just before heading out on vacation last month, I put up a post that purported to compare stepped-wedge study designs with more traditional cluster randomized trials. Either because I rushed or was just lazy, I didn’t exactly do what I set out to do. I did confirm that a multi-site randomized clinical trial can be more efficient than a cluster ...
8897 sym R (5624 sym/9 pcs) 22 img
Binary, beta, beta-binomial
I’ve been working on updates for the simstudy package. In the past few weeks, a couple of folks independently reached out to me about generating correlated binary data. One user was not impressed by the copula algorithm that is already implemented. I’ve added an option to use an algorithm developed by Emrich and Piedmonte in 1991, and will be...
7568 sym R (5777 sym/9 pcs) 6 img
simstudy update: improved correlated binary outcomes
An updated version of the simstudy package (0.1.10) is now available on CRAN. The impetus for this release was a series of requests about generating correlated binary outcomes. In the last post, I described a beta-binomial data generating process that uses the recently added beta distribution. In addition to that update, I’ve added functionalit...
4143 sym R (6177 sym/11 pcs) 2 img
In regression, we assume noise is independent of all measured predictors. What happens if it isn’t?
A number of key assumptions underlie the linear regression model – among them linearity and normally distributed noise (error) terms with constant variance In this post, I consider an additional assumption: the unobserved noise is uncorrelated with any covariates or predictors in the model. In this simple model: \[Y_i = \beta_0 + \beta_1X_i + e...
5653 sym R (2604 sym/7 pcs) 10 img
Cross-over study design with a major constraint
Every new study presents its own challenges. (I would have to say that one of the great things about being a biostatistician is the immense variety of research questions that I get to wrestle with.) Recently, I was approached by a group of researchers who wanted to evaluate an intervention. Actually, they had two, but the second one was a minor t...
6216 sym R (6291 sym/11 pcs) 8 img