A neutral comparison of statistical methods for time-to-event analyses under non-proportional hazards

统计比例危险模型可解释性非参数统计危险系数无效假设参数统计数学置信区间危害事件（粒子物理）统计假设检验对数秩检验计量经济学航程（航空）计算机科学人工智能工程类有机化学航空航天工程物理化学量子力学

作者

Florian Klinglmüller,Tobias Fellinger,Franz König,Tim Friede,Andrew C. Hooker,Harald Heinzl,Martina Mittlböck,Jonas Brugger,Maximilian Bardo,Cynthia Huber,Norbert Benda,Martin Posch,Robin Ristl

出处

期刊：Cornell University - arXiv 日期：2023-10-10 被引量：3

链接

arxiv.org arxiv.orgdoi.org

标识

DOI：10.48550/arxiv.2310.05622

摘要

While well-established methods for time-to-event data are available when the proportional hazards assumption holds, there is no consensus on the best inferential approach under non-proportional hazards (NPH). However, a wide range of parametric and non-parametric methods for testing and estimation in this scenario have been proposed. To provide recommendations on the statistical analysis of clinical trials where non proportional hazards are expected, we conducted a comprehensive simulation study under different scenarios of non-proportional hazards, including delayed onset of treatment effect, crossing hazard curves, subgroups with different treatment effect and changing hazards after disease progression. We assessed type I error rate control, power and confidence interval coverage, where applicable, for a wide range of methods including weighted log-rank tests, the MaxCombo test, summary measures such as the restricted mean survival time (RMST), average hazard ratios, and milestone survival probabilities as well as accelerated failure time regression models. We found a trade-off between interpretability and power when choosing an analysis strategy under NPH scenarios. While analysis methods based on weighted logrank tests typically were favorable in terms of power, they do not provide an easily interpretable treatment effect estimate. Also, depending on the weight function, they test a narrow null hypothesis of equal hazard functions and rejection of this null hypothesis may not allow for a direct conclusion of treatment benefit in terms of the survival function. In contrast, non-parametric procedures based on well interpretable measures as the RMST difference had lower power in most scenarios. Model based methods based on specific survival distributions had larger power, however often gave biased estimates and lower than nominal confidence interval coverage.

求助该文献

最长约 10秒，即可获得该文献文件

A neutral comparison of statistical methods for time-to-event analyses under non-proportional hazards

今日热心研友