Commentary: The reliability of telomere length measurements

端粒 可靠性(半导体) 可比性 流行病学 再现性 医学 生物 可靠性工程 统计 遗传学 病理 DNA 数学 物理 功率(物理) 工程类 组合数学 量子力学
作者
Simon Verhulst,Ezra Susser,Pam Factor‐Litvak,Mirre J. P. Simons,Athanase Bénétos,Troels Steenstrup,Jeremy D. Kark,Abraham Aviv
出处
期刊:International Journal of Epidemiology [Oxford University Press]
卷期号:44 (5): 1683-1686 被引量:90
标识
DOI:10.1093/ije/dyv166
摘要

The importance of telomere biology in human disease is increasingly recognized and, in parallel, use of telomere length (TL) measures is proliferating in epidemiological and clinical studies. Such studies measure leukocyte TL (LTL) using several methodological approaches. Shorter LTL is associated with atherosclerosis1 and all-cause mortality.2 Given the increasingly recognized role of TL in human ageing and its related diseases, it is essential to know more about the reliability and validity of TL measurement methods, their comparability and which method is optimal for a specific epidemiological/clinical setting. In an effort to address this knowledge gap, Martin-Ruiz et al. (MR)3 studied the reliability of TL measurement techniques. They compared the popular qPCR method with the labour-intensive Southern blots (SBs) and single telomere length analysis (STELA). MR concluded that ‘neither technique nor laboratory had strong influence on result variation’, and that ‘Southern blotting and qPCR are similar in their reproducibility’. Unfortunately, for the following reasons we believe that for epidemiological studies neither conclusion is justified by the data. Reliability of LTL Most DNA samples (10/12) used by MR were obtained from human placenta, cell cultures and cancer cells. However, the inter-assay reliability of LTL is the pertinent parameter for epidemiological studies. MR included only two DNA samples from leukocytes and, because these were added in the second round of the study, they could not be used to measure inter-assay reliability of LTL. TL results for human placenta, cultured and cancer cells cannot be automatically generalized to LTL reliability, which is the primary concern of epidemiologists. Note also that MR used pooled leukocyte samples of multiple donors, and effects of pooling on assay reliability can therefore not be excluded. A previous comparison of LTL reliability has been done for the SB and the qPCR methods in a study4 cited by MR. The study reported a clear difference in inter-assay coefficient of variation (CV) between SB = 1.74% and qPCR = 6.54%, using 50 leukocyte DNA samples from individual donors. Moreover, Steenstrup et al.5 investigated whether LTL elongation in longitudinal studies can be attributed to measurement error vs a real biological phenomenon. They found little evidence for LTL elongation over and above the effects expected from measurement error. At the same time, the available data indicated a substantially larger proportion of individuals with an apparent LTL elongation in qPCR-based studies when compared with SB-based studies. In our view, the most parsimonious explanation for this finding is the higher measurement error of the qPCR method. MR observed that rank correlations between measurements obtained in different laboratories and with different methods were high, reflecting similar rank orders of the observations. Due to the inclusion of different cell types, the range of TLs in this study (4.7-9.2 kb) is much higher, however, than the age group-specific range (about 3 kb by direct SBs within age groups) used in most epidemiological studies of LTL. This will have inflated the rank correlation beyond what is relevant for LTL in epidemiological studies considerably, contributing to the erroneous conclusion that the SB and qPCR methods yielded similar results. Sample size and composition MR used 12 samples. These were measured by two laboratories using SBs, one laboratory using STELA and seven laboratories using qPCR. As both the number of samples and the number of laboratories using techniques other than qPCR were low, the statistical tests used by MR to infer no difference in reliability between methods are underpowered and consequently of limited value. We are thus left puzzled by the authors’ claim of > 95% power to detect the difference previously reported between inter-assay CVs for LTL using SBs and qPCR in 50 leukocyte DNA samples.4 MR provide no details of their calculation in support of this statement, nor on the exact difference between inter-assay CVs for which they calculated their statistical power. Furthermore, the authors combined the two SB and one STELA laboratories for comparisons of inter-laboratory CV across methods. We see little scientific justification for this choice, which in effect leaves one with no information specific to either the SB or STELA technique. For the two leukocyte samples, the inter-laboratory CVs were 6.2% and 6.5% for the SB/STELA laboratories vs 22.2% and 22.2% for the qPCR laboratories (samples K and L, Table 2, in erratum MR)6. These results, albeit from a tiny sample size, are consistent with higher measurement error of the qPCR over SB/STELA based-methods. This is not specific for the leukocyte samples; overall the inter-laboratory CVs were substantially higher when using qPCR (P = 0.001 according to MR). Finally, for the crucial analyses of the inter-assay and intra-assay CVs, the total number of DNA samples was restricted to 5 and 3, respectively, and none of these were from leukocytes. CV as a measure of reliability A characteristic of the CV is its dependence on the mean, and hence the implicit assumption when using the CV is heteroscedasticity, i.e. that the variance is proportional to the mean. We examined whether this assumption holds in the results presented by MR. Figure 1 suggests that it holds for SB. There is a negligible correlation between mean and CV, which is not surprising given the logarithmic nature of molecular size ladders on gels.7 By contrast, Figure 1 suggests that it does not hold for qPCR. There is a strong negative correlation between average TL and CV, which implies that the error made in qPCR-based TL measurements is not proportional to the mean, but instead is closer to a constant (assay-specific) value. Such a finding undermines the CV as a reliability measure for qPCR-based TL studies. Instead we recommend using the intra-class correlation coefficient, which yields an informative estimate, provided that the ‘test’ samples are similarly distributed as the samples in the investigated population. Figure 1. Coefficient of variation (CV%) between laboratories for SB/STELA vs qPCR plotted against telomere length. Telomere length was standardized per laboratory, dividing the results for all samples by the value obtained for sample G. The X-axis displays the ... Figure 1 also illustrates the larger range of values obtained with qPCR when compared with SB. MR suggest that the larger ‘dynamic range’ obtained with qPCR compensates for the lower precision of the method. However, when CV values are calculated for SB laboratories alone (i.e. ignoring the STELA results), the inter-laboratory CV is in fact over 40% higher for the qPCR laboratories (paired t-test, t = 2.39, df = 18, P < 0.025). Therefore, the larger range in TL values obtained using qPCR compared with SBs was more likely to be caused by a lower precision of qPCR, rather than compensating for it. DNA quality MR reported that they assessed DNA quality (purity and integrity) by ‘UV spectroscopy and agarose gel electrophoresis’, which is not typical of epidemiological studies that use the qPCR-based method. This may be critical if qPCR-based results are influenced by DNA integrity, which cannot be ruled out, as intact amplifiable target sequences are essential for reliable and valid results.8 Therefore, it is important to demonstrate in impartial studies that DNA integrity does not affect the T/S ratio results. Conclusions We see little evidence in MR that the reliabilities of SB and qPCR in measuring TL are equivalent. The number of laboratories performing SBs and STELA in their study was very small, as was the number of samples examined. Furthermore, only two of the 12 samples were from human leukocytes, the standard cell type used in epidemiological studies, and the inter-assay reliability of LTL was not measured. The qPCR does have the advantage over SB and other methods in that it costs less and requires fewer resources, but at the expense of measurement reliability. This implies that to demonstrate the same effect statistically, a larger sample size is needed when using qPCR in comparison with using SB/STELA. It is informative therefore to examine the consequences of lower reliability (higher CVs) for the actual sample sizes required. The following example might serve to contextualize the impact of inter-assay CVs on required sample sizes. On average, women’s LTL is longer by 0.15 kb than men’s LTL. As shown in Figure 2, to detect this difference with 90% power, with an increase in inter-assay CV from 2 to 20%, the required sample size increases by approximately six-fold. Figure 2. Effect of inter-assay coefficient of variation (CV%) on sample size required for a statistical power of 0.9. Shown on the left axis are the multiples of the sample size needed compared with CV = 0% (i.e. perfect reliability). The required ... The paper by MR and this commentary highlight an issue that is of great importance to the future of telomere epidemiology. As proposed in the pages of this journal 5 years ago,9 large-scale epidemiological studies, based on measurements of LTL using both SB and qPCR in laboratories experienced in these techniques, are urgently needed to resolve matters related to ‘noise’ and to assess how the two methods compare in capturing the associations of LTL with a host of human traits. Without such comparison, we fear that the claim by MR that SB and qPCR are equally reliable methods to measure LTL may result in suboptimal choices of methods, thereby wasting precious resources. Conflict of interest: None declared.
最长约 10秒,即可获得该文献文件

科研通智能强力驱动
Strongly Powered by AbleSci AI
科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
领导范儿应助yerong采纳,获得10
刚刚
代迪完成签到,获得积分20
刚刚
Owen应助老大车采纳,获得10
1秒前
1秒前
Sunny发布了新的文献求助10
1秒前
1秒前
啦啊啦啦啦完成签到,获得积分10
2秒前
科研通AI6.1应助一秒啊采纳,获得10
3秒前
轻松蘑菇完成签到,获得积分10
3秒前
3秒前
落安白完成签到,获得积分10
3秒前
岳岳岳完成签到,获得积分10
3秒前
4秒前
4秒前
WangSiwei发布了新的文献求助10
4秒前
4秒前
4秒前
star009完成签到,获得积分10
6秒前
小二郎应助qiqi采纳,获得10
6秒前
6秒前
6秒前
李爱国应助阳光的芯采纳,获得10
6秒前
CodeCraft应助lxw采纳,获得10
7秒前
Zzz完成签到,获得积分10
7秒前
冷傲小蜜蜂完成签到,获得积分10
7秒前
WenliYin发布了新的文献求助10
7秒前
speedhhh发布了新的文献求助10
8秒前
8秒前
8秒前
科研民工小叶完成签到 ,获得积分10
8秒前
orixero应助12333采纳,获得10
8秒前
8秒前
平常听枫完成签到,获得积分10
9秒前
初雪应助miemie66采纳,获得10
9秒前
9秒前
9秒前
Cullen发布了新的文献求助10
9秒前
10秒前
VV完成签到,获得积分10
10秒前
10秒前
高分求助中
(应助此贴封号)【重要!!请各用户(尤其是新用户)详细阅读】【科研通的精品贴汇总】 10000
Encyclopedia of Forensic and Legal Medicine Third Edition 5000
Introduction to strong mixing conditions volume 1-3 5000
Agyptische Geschichte der 21.30. Dynastie 3000
Aerospace Engineering Education During the First Century of Flight 2000
从k到英国情人 1700
„Semitische Wissenschaften“? 1510
热门求助领域 (近24小时)
化学 材料科学 生物 医学 工程类 计算机科学 有机化学 物理 生物化学 纳米技术 复合材料 内科学 化学工程 人工智能 催化作用 遗传学 数学 基因 量子力学 物理化学
热门帖子
关注 科研通微信公众号,转发送积分 5774388
求助须知:如何正确求助?哪些是违规求助? 5617373
关于积分的说明 15435636
捐赠科研通 4906846
什么是DOI,文献DOI怎么找? 2640456
邀请新用户注册赠送积分活动 1588251
关于科研通互助平台的介绍 1543249