Publications by Keith Goldfeld

Exploring the underlying theory of the chi-square test through simulation – part 2

24.03.2018

In the last post, I tried to provide a little insight into the chi-square test. In particular, I used simulation to demonstrate the relationship between the Poisson distribution of counts and the chi-squared distribution. The key point in that post was the role conditioning plays in that relationship by reducing variance. To motivate some of the ...

13627 sym R (6023 sym/21 pcs) 20 img

Testing multiple interventions in a single experiment

18.04.2018

A reader recently inquired about functions in simstudy that could generate data for a balanced multi-factorial design. I had to report that nothing really exists. A few weeks later, a colleague of mine asked if I could help estimate the appropriate sample size for a study that plans to use a multi-factorial design to choose among a set of interve...

10176 sym R (7544 sym/19 pcs) 2 img

How efficient are multifactorial experiments?

01.05.2018

I recently described why we might want to conduct a multi-factorial experiment, and I alluded to the fact that this approach can be quite efficient. It is efficient in the sense that it is possible to test simultaneously the impact of multiple interventions using an overall sample size that would be required to test a single intervention in a mor...

6715 sym R (6170 sym/16 pcs) 4 img

Is non-inferiority on par with superiority?

13.05.2018

It is grant season around here (actually, it is pretty much always grant season), which means another series of problems to tackle. Even with the most straightforward study designs, there is almost always some interesting twist, or an approach that presents a subtle issue or two. In this case, the investigator wants compare two interventions, but...

6892 sym R (2325 sym/6 pcs) 6 img

A little function to help generate ICCs in simple clustered data

23.05.2018

In health services research, experiments are often conducted at the provider or site level rather than the patient level. However, we might still be interested in the outcome at the patient level. For example, we could be interested in understanding the effect of a training program for physicians on their patients. It would be very difficult to r...

5706 sym R (5961 sym/11 pcs) 2 img

Late anniversary edition redux: conditional vs marginal models for clustered data

12.06.2018

This afternoon, I was looking over some simulations I plan to use in an upcoming lecture on multilevel models. I created these examples a while ago, before I started this blog. But since it was just about a year ago that I first wrote about this topic (and started the blog), I thought I’d post this now to mark the occasion. The code below provi...

4577 sym R (4688 sym/9 pcs) 8 img

Re-referencing factor levels to estimate standard errors when there is interaction turns out to be a really simple solution

25.06.2018

Maybe this should be filed under topics that are so obvious that it is not worth writing about. But, I hate to let a good simulation just sit on my computer. I was recently working on a paper investigating the relationship of emotion knowledge (EK) in very young kids with academic performance a year or two later. The idea is that kids who are mor...

7027 sym R (5358 sym/11 pcs) 2 img

How the odds ratio confounds: a brief study in a few colorful figures

09.07.2018

The odds ratio always confounds: while it may be constant across different groups or clusters, the risk ratios or risk differences across those groups may vary quite substantially. This makes it really hard to interpret an effect. And then there is inconsistency between marginal and conditional odds ratios, a topic I seem to be visiting frequentl...

6282 sym R (1043 sym/2 pcs) 8 img

Randomize by, or within, cluster?

18.07.2018

I am involved with a stepped-wedge designed study that is exploring whether we can improve care for patients with end-stage disease who show up in the emergency room. The plan is to train nurses and physicians in palliative care. (A while ago, I described what the stepped wedge design is.) Under this design, 33 sites around the country will recei...

3795 sym R (3090 sym/7 pcs) 4 img

Multivariate ordinal categorical data generation

14.08.2018

An economist contacted me about the ability of simstudy to generate correlated ordinal categorical outcomes. He is trying to generate data as an aide to teaching cost-effectiveness analysis, and is hoping to simulate responses to a quality-of-life survey instrument, the EQ-5D. The particular instrument has five questions related to mobility, self...

4196 sym R (2708 sym/4 pcs) 8 img