Publications by Keith Goldfeld

Yes, unbalanced randomization can improve power, in some situations

13.04.2020

Last time I provided some simulations that suggested that there might not be any efficiency-related benefits to using unbalanced randomization when the outcome is binary. This is a quick follow-up to provide a counter-example where the outcome in a two-group comparison is continuous. If the groups have different amounts of variability, intuitivel...

4018 sym R (1839 sym/7 pcs) 4 img

Simulation for power in designing cluster randomized trials

27.04.2020

As a biostatistician, I like to be involved in the design of a study as early as possible. I always like to say that I hope one of the first conversations an investigator has is with me, so that I can help clarify the research questions before getting into the design questions related to measurement, unit of randomization, and sample size. In the...

7707 sym R (2950 sym/7 pcs) 8 img

To stratify or not? It might not actually matter…

11.05.2020

Continuing with the theme of exploring small issues that come up in trial design, I recently used simulation to assess the impact of stratifying (or not) in the context of a multi-site Covid-19 trial with a binary outcome. The investigators are concerned that baseline health status will affect the probability of an outcome event, and are interest...

5528 sym R (3965 sym/5 pcs) 8 img

Considering the number of categories in an ordinal outcome

25.05.2020

In two Covid-19-related trials I’m involved with, the primary or key secondary outcome is the status of a patient at 14 days based on a World Health Organization ordered rating scale. In this particular ordinal scale, there are 11 categories ranging from 0 (uninfected) to 10 (death). In between, a patient can be infected but well enough to rema...

6053 sym R (2033 sym/8 pcs) 4 img

When proportional odds is a poor assumption, collapsing categories is probably not going to save you

08.06.2020

Continuing the discussion on cumulative odds models I started last time, I want to investigate a solution I always assumed would help mitigate a failure to meet the proportional odds assumption. I’ve believed if there is a large number of categories and the relative cumulative odds between two groups don’t appear proportional across all categ...

6153 sym R (3892 sym/14 pcs) 8 img

Consider a permutation test for a small pilot study

22.06.2020

Recently I wrote about the challenges of trying to learn too much from a small pilot study, even if it is a randomized controlled trial. There are limitations on how much you can learn about a treatment effect given the small sample size and relatively high variability of the estimate. However, the temptation for researchers is usually just too g...

9324 sym R (3488 sym/14 pcs) 4 img

Simulating multiple RCTs to simulate a meta-analysis

06.07.2020

I am currently involved with an RCT that is struggling to recruit eligible patients (by no means an unusual problem), increasing the risk that findings might be inconclusive. A possible solution to this conundrum is to find similar, ongoing trials with the aim of pooling data in a single analysis, to conduct a meta-analysis of sorts. In an ideal ...

7795 sym R (3106 sym/10 pcs) 6 img

A Bayesian model for a simulated meta-analysis

20.07.2020

This is essentially an addendum to the previous post where I simulated data from multiple RCTs to explore an analytic method to pool data across different studies. In that post, I used the nlme package to conduct a meta-analysis based on individual level data of 12 studies. Here, I am presenting an alternative hierarchical modeling approach that ...

3532 sym R (4005 sym/7 pcs) 6 img

A hurdle model for COVID-19 infections in nursing homes

03.08.2020

Late last year, I added a mixture distribution to the simstudy package, largely motivated to accommodate zero-inflated Poisson or negative binomial distributions. (I really thought I had added this two years ago – but time is moving so slowly these days.) These distributions are useful when modeling count data, but we anticipate observing more ...

8414 sym R (7582 sym/14 pcs) 6 img

Generating data from a truncated distribution

17.08.2020

A researcher reached out to me the other day to see if the simstudy package provides a quick and easy way to generate data from a truncated distribution. Other than the noZeroPoisson distribution option (which is a very specific truncated distribution), there is no way to do this directly. You can always generate data from the full distribution a...

5276 sym R (1762 sym/5 pcs) 14 img