Lane F. Burgette Ph.D., José J. Escarce, Susan M. Paddock Ph.D., Marjorie S. Ridgely JD, Warren G. Wilder MPhil, Dolores Yanagihara MPH, Cheryl L. Damberg Ph.D.
To sample 40 physician organizations stratified on the basis of longitudinal cost of care measures for qualitative interviews in order to describe the range of care delivery structures and processes that are being deployed to influence the total costs of caring for patients.
Three years of physician organization‐level total cost of care data (n = 156 in California) from the Integrated Healthcare Association's value‐based pay‐for‐performance program.
We fit total cost of care data using mixture and ‐means clustering algorithms to segment the population of physician organizations into sampling strata based on 3‐year cost trajectories (ie, cost curves).
A mixture of multivariate normal distributions can classify physician organization cost curves into clusters defined by total cost level, shape, and within‐cluster variation. ‐means clustering does not accommodate differing levels of within‐cluster variation and resulted in more clusters being allocated to unstable cost curves. A mixture of regressions approach focuses overly on anomalous trajectories and is sensitive to model coding.
Statistical clustering can be used to form sampling strata when longitudinal measures are of primary interest. Many clustering algorithms are available; the choice of the clustering algorithm can strongly impact the resulting strata because various algorithms focus on different aspects of the observed data.