Publications by Keith Goldfeld

A demo of power estimation by simulation for a cluster randomized trial with a time-to-event outcome

22.05.2023

A colleague reached out for help designing a cluster randomized trial to evaluate a clinical decision support tool for primary care physicians (PCPs), which aims to improve care for high-risk patients. The outcome will be a time-to-event measure, collected at the patient level. The unit of randomization will be the PCP, and one of the key design is...

11655 sym R (9573 sym/17 pcs) 10 img

Generating variable cluster sizes to assess power in cluster randomize trials

17.04.2023

In recent discussions with a number of collaborators at the NIA IMPACT Collaboratory about setting the sample size for a proposed cluster randomized trial, the question of variable cluster sizes has come up a number of times. Given a fixed overall sample size, it is generally better (in terms of statistical power) if the sample is equally distribut...

8715 sym R (2758 sym/11 pcs) 2 img

Implementing a one-step GEE algorithm for very large cluster sizes in R

20.03.2023

Very large data sets can present estimation problems for some statistical models, particularly ones that cannot avoid matrix inversion. For example, generalized estimating equations (GEE) models that are used when individual observations are correlated within groups can have severe computation challenges when the cluster sizes get too large. GEE ar...

5498 sym R (6111 sym/9 pcs)

simstudy 0.6.0 released: more flexible correlation patterns

20.02.2023

The new version (0.6.0) of simstudy is available for download from CRAN. In addition to some important bug fixes, I’ve added new functionality that should make data generation with correlated data a little more flexible. In the previous post, I described enhancements to the function genCorMat. As part of this release announcement, I’m describin...

13248 sym R (11021 sym/14 pcs)

Flexible correlation generation: an update to genCorMat in simstudy

13.02.2023

I’ve been slowly working on some updates to simstudy, focusing mostly on the functionality to generate correlation matrices (which can be used to simulate correlated data). Here, I’m briefly describing the function genCorMat, which has been updated to facilitate the generation of correlation matrices for clusters of different sizes and with pot...

7015 sym R (4904 sym/11 pcs)

A GAM for time trends in a stepped-wedge trial with a binary outcome

16.01.2023

In a previous post, I described some ways one might go about analyzing data from a stepped-wedge, cluster-randomized trial using a generalized additive model (a GAM), focusing on continuous outcomes. I have spent the past few weeks developing a similar model for a binary outcome, and have started to explore model comparison and methods to evaluat...

7521 sym R (4637 sym/11 pcs) 18 img

Modeling the secular trend in a stepped-wedge design

12.12.2022

Recently I started a discussion about modeling secular trends using flexible models in the context of cluster randomized trials. I’ve been motivated by a trial I am involved with that is using a stepped-wedge study design. The initial post focused on more standard parallel designs; here, I want to extend the discussion explicitly to the stepped...

9598 sym R (5558 sym/7 pcs) 12 img

Presenting results for multinomial logistic regression: a marginal approach using propensity scores

19.09.2022

Multinomial logistic regression modeling can provide an understanding of the factors influencing an unordered, categorical outcome. For example, if we are interested in identifying individual-level characteristics associated with political parties in the United States (Democratic, Republican, Libertarian, Green), a multinomial model would be a re...

9039 sym R (4124 sym/10 pcs) 8 img 1 tbl

Generating clustered data with marginal correlations

21.11.2022

A student is working on a project to derive an analytic solution to the problem of sample size determination in the context of cluster randomized trials and repeated individual-level measurement (something I’ve thought a little bit about before). Though the goal is an analytic solution, we do want confirmation with simulation. So, I was a littl...

12051 sym R (8368 sym/13 pcs)

Simulating data from a non-linear function by specifying a handful of points

08.08.2022

Trying to simulate data with non-linear relationships can be frustrating, since there is not always an obvious mathematical expression that will give you the shape you are looking for. I’ve come up with a relatively simple solution for somewhat complex scenarios that only requires the specification of a few points that lie on or near the desire...

4344 sym R (2012 sym/10 pcs) 14 img