Commentary--goodbye M(C)ID! Hello MID, where do you come from?

医学 计算机科学
作者
Holger J. Schünemann,Gordon Guyatt
出处
期刊:PubMed 卷期号:40 (2): 593-7 被引量:203
标识
DOI:10.1111/j.1475-6773.2005.00374.x
摘要

Health-related quality of life (HRQL) has overcome many barriers that limited its acceptance as an important outcome in health care. One of the remaining barriers relates to how one should interpret HRQL scores when they change over time within patients, or differ between patients. Indeed, because there is no gold standard methodology, interpreting HRQL scores is a challenging task. While daunting, addressing the challenge of interpretability is crucial to moving the field forward. The choice of what constitutes an important difference in a HRQL score will influence judgments about the success of a health care intervention, the required sample size of clinical studies, and the design of these studies. The issue is relevant to clinicians, payers, funding agencies, and regulatory agencies, and most relevant to patients for whose health care these groups claim responsibility. Several approaches to assessing interpretability exist. Anchor-based approaches rely on examining the relation between scores on a HRQL instrument that is under investigation and an anchor, an independent measure of HRQL that clinicians can easily interpret (Guyatt et al. 2002). Other approaches for evaluation of interpretability of HRQL scores include distribution-based or statistical methods and reliance on experts (panel-based methods) (Lassere et al. 2001). Wyrwich and colleagues utilized the last of these alternatives to determine interpretability of the SF-36 by elegantly combining outcomes research with qualitative research methods (Wyrwich et al. 2003; 2005). This approach focuses on how clinician researchers view patients in relation to their HRQL scores and changes in HRQL scores. The article by Wyrwich's et al. is ingenious for several reasons. First, despite the wealth of literature evaluating the SF-36 as an outcome measure, evidence for the interpretability of the instrument is surprisingly limited. Second, Wyrwich and colleagues focus on three different clinician groups and estimates whether interpretability differs across these groups. Third, the judgments the clinician researchers made were based on detailed patient scenarios. In this commentary we will provide two arguments that readers should consider in the context of the work by Wyrwich and colleagues. First, we argue that research on the interpretability of HRQL instruments should focus primarily on the patient's view. Early work by our group pioneered the methodology of assessing interpretability of HRQL instruments (Jaeschke, Singer, and Guyatt 1989). In that work, we described what became widely known as the minimal clinical important difference (MCID). Because this terminology focuses attention on the clinical arena rather than patients' experience in their day-to-day lives, we subsequently removed the focus on “clinical” interpretations, and the “C” from MCID to focus on the minimal important difference (MID) (Juniper et al. 1994; Schunemann et al. 2005). HRQL is a patient important outcome because it is the patients who experience their HRQL, and only they are in a position to ultimately judge whether a difference is important (Guyatt et al. 2004). We now define the MID as the smallest difference in score in the outcome of interest that informed patients or informed proxies perceive as important, either beneficial or harmful, and which would lead the patient or clinician to consider a change in the management (Schunemann et al. 2005). The revised description of the MID precludes making MID estimates for outcomes that are remote from those important, in themselves, to patients, such as spirometry or laboratory exercise capacity. Further, the definition suggests that only if one had reason to question the reliability or accuracy of data from patients would one rely on proxies to provide estimates of the MID. Investigators or clinicians may differ in the perspective or the methodology they adopt to determine the MID. Thus, readers, when they interpret the results of research on the MID, must attend to who rated the importance of an HRQL change and the specific instructions provided for making the assessment. If one accepts that HRQL measurement must be fundamentally patient-centered, the first choice for establishing the MID should be a patient-based approach. Relative to patients, clinicians may overemphasize treatment effects (Puhan et al. 2004) and agreement between patients and proxies in rating of HRQL is not perfect (Sneeuw, Sprangers, and Aaronson 2002; Ubel, Loewenstein, and Jepson 2003; von Essen 2004). Again, if one accepts that patients are at the center in HRQL measurement, then investigators, when they do use proxies, should instruct those proxies to focus on what they believe patients consider important. Wrywich and colleagues took a different approach. “Panelists were not provided with any specific definition of a CID (clinically important difference), but left to determine their own meaning for this term” (p. 580). Thus, the investigators did not specify to the participants that they were to estimate the difference that patients consider important. Further, they used the term “CID,” further permitting ambiguity about the group to whom the difference is important. Our second argument concerns the results of the study. If readers dismiss our appeal to focus on the patient, they should examine the result of the study. While the results, including the difference in the MID between heart failure clinician researchers and those working in asthma or chronic obstructive pulmonary disease (COPD) provide intriguing insights into clinicians' perspectives, the extent to which they enlighten us concerning differences in SF-36 scores that patients consider important remains questionable. At least two issues are worth considering. First, readers should ask whether the differences in the MID for the three evaluated diseases are true. Methodological issues, such as sampling bias in terms of the physician researchers selected for this exercise, or randomness around the consensus estimates could explain these MID differences. There were no statistical approaches—and perhaps there are none for this study design—to compare the magnitude of the MID differences across diseases obtained from the physician researchers. Although Wyrwich and colleagues performed a laborious study, readers should ask how large would the variation around the obtained MID estimates be if they had conducted multiple focus groups for each disease category. Could chance explain the observed differences in the MIDs from the three focus groups? Readers should not ignore this possible explanation. Second, if the MID on the SF-36 was truly greater for heart failure patients compared with patients suffering from asthma or COPD, it might also have implications for the comparability of SF-36 scores. The SF-36 is a generic HRQL instrument. To be applicable to different patient populations, similar scores across different patient population should signify similar levels or impairment of HRQL. For instance, a mean score of 50 on the 0–100 scale of the physical functioning domain should indicate a similar level of physical function for different patient populations such as patients with asthma, COPD, and heart failure. If interventions caused an improvement of HRQL to 60, the result of the study by Wyrwich and colleagues would suggest that the change is important for the former two groups but not the latter. Would the post-intervention scores of 60 still be comparable across these populations? It is conceivable that differences between patient groups in what constitutes an important change could impair the comparability of generic instruments. Thus, the implications of an MID for the SF-36 that varies by population could be far reaching. Such results could question the SF-36 as a generic instrument that allows a straightforward comparison across diseases. In summary, the MID provides an important strategy to make interpretable the results of HRQL studies. To be maximally informative, representative samples of informed patients or their proxies should provide estimates of the MID.

科研通智能强力驱动
Strongly Powered by AbleSci AI
科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
科研通AI5应助gh采纳,获得10
刚刚
刚刚
zho发布了新的文献求助10
刚刚
研友_nPPERn完成签到,获得积分10
1秒前
酷酷的树叶完成签到 ,获得积分10
1秒前
缥缈的平露完成签到,获得积分10
1秒前
2秒前
dew完成签到,获得积分10
2秒前
小蘑菇应助小陈要发一区采纳,获得10
3秒前
宇心完成签到,获得积分10
3秒前
桐桐应助tingting采纳,获得10
6秒前
6秒前
7秒前
shenmexixi完成签到,获得积分10
7秒前
阳仔完成签到 ,获得积分10
8秒前
8秒前
9秒前
10秒前
Yogita完成签到,获得积分10
11秒前
ocean发布了新的文献求助10
11秒前
12秒前
elizabeth339应助圆润润呐采纳,获得50
13秒前
Archer完成签到,获得积分10
13秒前
希望天下0贩的0应助Cactus采纳,获得10
14秒前
萨尔莫斯发布了新的文献求助10
15秒前
田様应助xiaoni采纳,获得10
15秒前
在水一方应助闪闪半芹采纳,获得10
15秒前
彪壮的刺猬完成签到 ,获得积分10
15秒前
slow发布了新的文献求助10
15秒前
cyy完成签到 ,获得积分10
15秒前
16秒前
完美世界应助鳄鱼蛋采纳,获得10
16秒前
小巧的雨柏完成签到,获得积分20
17秒前
zho发布了新的文献求助10
17秒前
脑洞疼应助和道一文字采纳,获得10
18秒前
18秒前
王炎欣完成签到,获得积分20
19秒前
ocean完成签到,获得积分10
19秒前
20秒前
21秒前
高分求助中
Applied Survey Data Analysis (第三版, 2025) 800
Narcissistic Personality Disorder 700
Assessing and Diagnosing Young Children with Neurodevelopmental Disorders (2nd Edition) 700
The Elgar Companion to Consumer Behaviour and the Sustainable Development Goals 540
The Martian climate revisited: atmosphere and environment of a desert planet 500
Images that translate 500
Transnational East Asian Studies 400
热门求助领域 (近24小时)
化学 材料科学 医学 生物 工程类 有机化学 物理 生物化学 纳米技术 计算机科学 化学工程 内科学 复合材料 物理化学 电极 遗传学 量子力学 基因 冶金 催化作用
热门帖子
关注 科研通微信公众号,转发送积分 3843549
求助须知:如何正确求助?哪些是违规求助? 3385850
关于积分的说明 10542709
捐赠科研通 3106659
什么是DOI,文献DOI怎么找? 1711004
邀请新用户注册赠送积分活动 823920
科研通“疑难数据库(出版商)”最低求助积分说明 774380