|
|
||||||||
Perspective |
1
Department of Biostatistics and Epidemiology/Wb4, The Cleveland Clinic
Foundation, 9500 Euclid Ave., Cleveland, OH 44195.
2
Division of Radiology, The Cleveland Clinic Foundation, Cleveland, OH
44195.
3
Department of Biomedical Engineering, Lerner Research Institute, The Cleveland
Clinic Foundation, Cleveland, OH 44195.
Received September 26, 2000;
accepted after revision November 13, 2000.
Address correspondence to N. A. Obuchowski.
Introduction
|
|
|---|
We researched the literature on the epidemiology of screening and have formulated 10 criteria for evaluating screening programs, which we summarize in the Appendix. These criteria encompass the characteristics of the disease, the screening test, and treatment. We apply these criteria to screening for pulmonary and colorectal cancer using multislice CT. Using our findings, we compared multislice CT for the detection of lung and colorectal cancer with mammography for the detection of breast cancer (Table 1). Although mammography is clearly not an ideal screening model [1], it is the only imaging screening test in widespread use. Our evaluation of these screening programs using the proposed 10 criteria allowed us to identify strengths and weaknesses of the programs, shortcomings of our literature, and directions for future research.
|
We now discuss our 10 criteria for effective screening.
1. Disease Has Serious Consequences
|
|
|---|
Both pulmonary and colorectal cancer are serious diseases, being the first and second leading causes of cancer death in the United States [3], respectively. Breast cancer is the second leading cause of cancer death in women. Thus, all three cancers have serious consequences.
2. Screening Population Has High Prevalence of Detectable Preclinical
Phase
|
|
|---|
|
Prevalence is the proportion of patients in the detectable preclinical phase among all patients screened. If the prevalence is low, then screening will not detect many cases of disease, so it may not be cost-effective. Also, if the prevalence is low, then even if the screening test is accurate, the probability of disease after a positive test result will still be low. For example, using Bayes' theorem, if the prevalence is 1% and the test's sensitivity and specificity are both 95%, then the probability of disease after positive test results is only 16%. In contrast, if the prevalence is 5%, then the probability of disease after positive test results is 50%.
For pulmonary cancer, the target population for screening is asymptomatic men and women who are 40 years old or older and who have a history of smoking of at least 10 pack-years. The prevalence of detectable pulmonary cancer in this population is 2-4% [6,7,8]. For colorectal cancer, the target population is asymptomatic men and women who are 50 years old or older and who do not have a known risk factor for colon cancer such as familial polyposis or ulcerative colitis. The prevalence of a 1-cm or larger adenomatous polyp in a 50-year-old person is 3% and in an 80-year-old person is 3% and in an 80-year-old is 5-6% [9]. Compared with breast cancer, for which the prevalence in the screening population is only 0.6-1.0% [10,11,12], both pulmonary and colorectal cancers have a high prevalence in the detectable preclinical phase.
3. Screening Test Detects Little Pseudodisease
|
|
|---|
No direct data exist on the frequency of pseudodisease of pulmonary carcinoma. In particular, no good data exist on the cause-specific death rate for patients with pulmonary cancer. Similarly, no data exist on the incidental occurrence of pulmonary cancer in large autopsy series. Intuitively, however, the frequency must be small, because 80-100% of untreated asymptomatic patients with detected stage I pulmonary cancer die within 5-10 years [13, 14].
With colorectal cancer, not all adenomatous polyps progress to invasive carcinoma. Evidence shows that many small (<1 cm) polyps regress [15]. The rate of adenomatous polyps progressing to cancer has been estimated at about 2.5 polyps per 1000 individuals per year [16]. The presence of pseudodisease for colorectal cancer is supported by a large autopsy study [17]. In this series, colorectal cancer was present in 0.5% of 50-60 year olds, 1% of 60-70 year olds, and 1.5% of 70-80 year olds. None of these individuals was symptomatic, and the cause of death was unrelated.
Similarly, not all breast ductal carcinoma in situ progresses to invasive carcinoma. In one study of 15 patients with untreated breast ductal carcinoma in situ, eight developed invasive carcinoma [18]. In another study of 25 women with untreated breast ductal carcinoma in situ, seven developed invasive carcinoma; the remaining women were followed up for at least 8 years without evidence of invasive tumors [19]. From autopsy studies of women who died of causes other than breast cancer [20,21,22], the prevalence of breast ductal carcinoma in situ was 6-14%. Considerable work has been done in correlating the different subtypes (i.e., architectural, histologic) of breast ductal carcinoma in situ with rates of subsequent invasive carcinoma in an effort to determine the preferred treatment (i.e., lumpectomy versus mastectomy) [23]. However, the presence of pseudodisease in screening for both colorectal polyps and breast cancer limits the effectiveness of these screening programs.
4. Screening Test Has High Accuracy for Detecting the Detectable
Preclinical Phase
|
|
|---|
95%). Most screening tests do not meet this high standard,
which means that the screening program must absorb the costs of many
false-positive results. Increasing the specificity of a screening test will increase the cost-effectiveness of screening. However, it is not always cost-effective to increase a screening test's sensitivity [5]. An increase in sensitivity might mean an increase in the detection of pseudodisease or an increase in the detection of disease after the critical point in the natural history (i.e., after the primary tumor metastasizes). Both these situations are detrimental to screening.
The accuracy of a screening test is difficult to estimate [2]. One common but erroneous approach is to measure a screening test's accuracy among patients who have symptomatic disease. This approach will be misleading because the sensitivity of a test for detecting the preclinical phase will often be less than its sensitivity for detecting clinical disease. A better approach is to apply the screening test to asymptomatic people, then follow up those individuals for a sufficient length of time to determine their true disease status [2]. The thoroughness of the follow-up and the duration of follow-up are critical. If only those who test positive on the screening test are followed up, then the estimate of sensitivity will tend to be falsely high and the estimate of specificity will tend to be falsely low. If the duration of follow-up is too short, then some cases of disease will be missed; if the followup is too long, then new cases of cancer will incorrectly be assessed as false-negative.
No good studies indicate the accuracy of multislice helical CT for detecting either pulmonary cancer or colorectal polyps in asymptomatic people. For pulmonary cancer, three recent studies [8, 24, 25] compared the yield of helical CT and chest radiography, but no studies have been published addressing the sensitivity and specificity of CT in a screening population. In contrast, a number of studies have reported the accuracy of mammography in screening populations. A meta-analysis that was published in 1998 reported a sensitivity range of 83-95% for mammography and a range in the false-positive rate of 0.9-6.5% [26].
5. Screening Test Detects Disease Before Critical Point
|
|
|---|
Three recent studies [8, 24, 25] suggest that CT can detect stage I pulmonary cancer is asymptomatic people. In these three studies, 71-93% of people with pulmonary cancer detected by screening had stage I disease. These numbers sound encouraging, yet the critical point in the natural history of pulmonary cancer has often passed by the time stage I cancer is detected. The 5-year survival rate of patients diagnosed with stage I pulmonary cancer is currently only 49% at best [27]. Compare that with the 5-year survival rates of stage I breast and colorectal cancers97% and 92%, respectively [27]. With respect to all stages of pulmonary cancer, the 5-year survival is only 14% (versus 86% for breast and 62% for colorectal cancer) [27]. Furthermore, this survival rate has changed little in the past 30 years: in 1970, the 5-year survival rate among all cases of pulmonary cancer measured only 10% [27].
For colorectal cancer, the critical point is usually defined as progression of an adenomatous polyp to larger than 1 cm. The risk of carcinoma in a 1-cm adenomatous polyp is less than 1%; for polyps of 1-2 cm, the risk is 5-10%; and for polyps larger than 2 cm, the risk is 10-50% [15]. Metastases from malignant polyps smaller than 1 cm are rare.
For breast cancer screening, the survival rate decreases directly with increasing cancer stage [28]. For stages 0 and I disease, the 5-year survival rates are 92% and 87%, respectively. At stage II, the 5-year survival rate decreases considerably: 78% and 68% for stages IIA and IIB, respectively. Thus, the critical point is at stage I, before metastasis to the regional lymph nodes.
One way to increase the chance of detecting disease before the critical point is to offer screening at an optimal age and with multiple, optimally spaced, screenings [2]. The optimum interval is the one that provides the most benefit to patients relative to costs (both monetary and patient outcome costs). In general, the interval between screenings should be relatively short when screening is inexpensive, the cost of workup of false-positive cases is low, and the time from when the disease is first detectable to the critical point is relatively short.
Other important factors include the sensitivity of the test for the detectable preclinical phase, the prevalence of the disease, and the range of duration of the detectable preclinical phase before the critical point (rather than just the average duration) [2]. A number of reports address the determination of the optimum screening age [29] and the optimum interval between screenings [29,30,31,32,33].
6. Screening Test Causes Little Morbidity
|
|
|---|
For pulmonary cancer screening, the CT study is performed without IV contrast material, so short-term toxicity is not a problem. The long-term adverse effect could be related to radiation exposure. However, in screening for pulmonary cancer low-dose CT is performed, which uses only 20% of the conventional dose (i.e., 20% of 0.4 rads). The increased risk of developing fatal cancer is estimated to be one in 25,000 as a result of a single screening pulmonary CT examination, although this has not been studied directly [34,35,36,37].
For colorectal cancer screening, there is preparation discomfort and dehydration, as well as procedure discomfort (distention, cramping, abdominal pain). The risk of perforation is negligible because the rectal tube is soft and smaller than it is when an air contrast barium enema is performed. The long-term effect of screening is low-dose radiation exposure. As with low-dose chest CT, the longterm risk is negligible.
For breast cancer screening, the short-term effect is patient discomfort. A conservative estimate of the long-term effect is based on the radiation dose from annual two-view mammography beginning at age 40 and continuing to age 75; the estimated excess risk of breast cancer death is one in 7246 [38]. Thus, the risks associated with pulmonary and colorectal cancer screening are no greater than with mammography.
7. Screening Test Is Affordable and Available
|
|
|---|
At our institution, the charge for bilateral screening mammography is approximately two thirds that of screening pulmonary CT. The charge for CT colorectal cancer screening has yet to be determined but will also modestly exceed that of mammography. The charge for pulmonary cancer CT screening is paid by the patient out-of-pocket, although mammography screening is generally covered by most medical insurance. Although this may change if CT screening is found to be cost-effective, the current direct cost of CT screening may prevent many patients from undergoing the examination. CT is probably as widely available as mammography, and many radiology departments have helical CT. Multislice scanners are being placed in radiology departments throughout the United States.
|
|
|---|
For both pulmonary cancer and colorectal polyps, resection and in some cases radiation or chemotherapy is the acceptable treatment. The approach to treatment for breast cancer is similar.
9. Treatment Is More Effective When Applied Before Symptoms
Begin
|
|
|---|
It is surprisingly difficult to demonstrate the benefit of early treatment, even when the treatment is known to be effective once symptoms appear. Four problems exist when comparing survival of screened patients with that of unscreened patients: lead-time bias [39,40,41,42], length bias [42, 43], overdiagnosis bias [5], and stage migration bias [44]. We briefly describe each of these biases.
Lead-Time Bias
For unscreened patients, disease-specific survival is measured as the
length of time from clinical diagnosis of disease to death from the disease.
For screened patients, disease-specific survival is the length of time from
disease detection by the screening test to death from the disease. The lead
time is the length of time between disease detection and the first appearance
of signs or symptoms [2,
39,40,41,42]
(Fig. 1). Even if early
treatment has no benefit, the survival of screened patients is greater than
that of unscreened patients by the addition of the length of the lead
time.
Length Bias
Length bias occurs because not all patients' disease progresses at the same
rate [2,
42,
43]. Cases of slowly
progressing disease have a longer detectable preclinical phase than cases of
quickly progressing disease. The slowly progressing disease is easier to
detect; thus persons with slowly progressing disease will tend to be
overrepresented in the screening cohort. Because the disease is progressing
slowly in these patients, their survival is naturally longer, regardless of
the effectiveness of treatment. However, the longer survival is usually
wrongfully attributed to early treatment.
Overdiagnosis Bias
Overdiagnosis bias occurs when one does not adjust for the occurrence of
pseudodisease in the screened cohort. Patients with pseudodisease do not die
from the specific disease under study. The survival of these patients is often
erroneously attributed to early treatment
[5].
Stage Migration Bias
Stage migration bias, or the "Will Rogers phenomenon"
[44], can occur when the
disease-specific survival rates of the screened and unscreened patients are
compared according to the stage at which the disease was detected. Stage
migration occurs when a cancer's metastases are detected before any symptoms
of metastasis appear. The early detection of metastases shifts the TNM stage
from stage I or II to stage II or III. When this migration occurs, the
survival in the lower stages of the screened cohort appears greater than in
the unscreened cohort because the patients with the silent metastases have
been removed from the lower stages. Similarly, the survival in the higher
stages of the screened cohort appears greater than in the unscreened cohort
because early cases of metastasis, with a naturally longer survival time, have
been added. The overall survival rate of the screened and unscreened patients
could actually be identical, yet the survival rates in each stage falsely
suggest that screening is effective.
Because of these potential biases, disease-specific survival is not a useful measure when studying the effectiveness of early treatment. A better measure, although not ideal, is to compare disease-specific mortality [4]. The disease-specific mortality rate is computed as the number of deaths from the specific disease divided by the number of people at risk. This measure is not subject to the same biases; however, this measure is not especially sensitive to some types of treatment benefits. For example, if early treatment truly does increase the duration of survival but does not prevent death, then this type of benefit will be overlooked. Another problem is that even with large study sample sizes, it is difficult to observe enough disease-specific deaths for meaningful statistical comparisons among cohorts.
Two studies have shown a promising trend of increased survival from pulmonary cancer as the size of the tumor decreases [45, 46]; however, to our knowledge, all published studies are potentially flawed because of biases associated with using disease-specific survival.
In 1998, Towler et al. [47] published a meta-analysis of randomized controlled trials of colorectal cancer screening with fecal occult blood testing. These researchers used disease-specific mortality rates to assess the benefit of screening and found that early detection reduces mortality from colorectal cancer by 16%.
Many large randomized trials have assessed the benefit of early treatment for women screened by mammography. The conclusions from these studies are highly controversial [1].
10. Treatment Is Not Too Risky or Toxic
|
|
|---|
As with the detection of a lesion on mammography, detection of a pulmonary nodule on CT may prompt percutaneous biopsy before surgical resection to minimize the number of false-positive findings at surgery. The 30-day mortality rate after pulmonary resection is 3.7% [48]. In addition, both minor and major complications are associated with surgery (e.g., atelectasis, pneumothorax, bronchopleural fistula, myocardial infarction atrial arrhythmia, pulmonary embolism, prolonged pulmonary insufficiency, chylothorax, and chronic thoracic pain). For patients whose disease is unresectable or inoperable, radiation therapy is used, with complications of lethargy, fatigue, and skin and lung toxicity.
The mortality rate associated with endoscopic polypectomy is only 0.01% [49]. The complication rate (e.g., bleeding) is 1-2.3%; the perforation rate is 0.3%. These are acceptable rates. Similarly, the mortality rates from lumpectomy and mastectomy are effectively zero.
|
|
|---|
The strengths of colorectal cancer screening are that it is a serious disease with a long detectable preclinical phase before the critical point in the natural history. Good data are available on the benefit of early treatment (i.e., polypectomy). A potential shortcoming is the presence of polyps that never progress to invasive carcinoma. The literature is insufficient to evaluate the significance of pseudodisease on the cost-effectiveness of screening. Insufficient data exist as to the accuracy of CT for detecting adenomatous polyps.
|
Acknowledgments
We appreciate the helpful suggestions and direction provided by Michael T.
Modic, Richard C. White, and William A. Chilcote on early drafts of this
article.
|
|
|---|
This article has been cited by other articles:
![]() |
J. G. Ravenel, P. Costello, and G. A. Silvestri Screening for Lung Cancer Am. J. Roentgenol., March 1, 2008; 190(3): 755 - 761. [Abstract] [Full Text] [PDF] |
||||
![]() |
E J Hegedus, A Goode, S Campbell, A Morin, M Tamaddoni, C T Moorman III, and C Cook Physical examination tests of the shoulder: a systematic review with meta-analysis of individual tests Br. J. Sports Med., February 1, 2008; 42(2): 80 - 92. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. E Cook, E. Hegedus, R. Pietrobon, and A. Goode A Pragmatic Neurological Screen for Patients With Suspected Cord Compressive Myelopathy Physical Therapy, September 1, 2007; 87(9): 1233 - 1242. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. T. Kolber, G. Zipp, D. Glendinning, and J. J. Mitchell Patient Expectations of Full-Body CT Screening Am. J. Roentgenol., March 1, 2007; 188(3): W297 - W304. [Abstract] [Full Text] [PDF] |
||||
![]() |
H. Schoder and M. Gonen Screening for Cancer with PET and PET/CT: Potential and Limitations J. Nucl. Med., January 1, 2007; 48(1_suppl): 4S - 18S. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Macari and E. J. Bini CT Colonography: Where Have We Been and Where Are We Going? Radiology, December 1, 2005; 237(3): 819 - 833. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. D. Furtado, D. A. Aguirre, C. B. Sirlin, D. Dang, S. K. Stamato, P. Lee, F. Sani, M. A. Brown, D. L. Levin, and G. Casola Whole-Body CT Screening: Spectrum of Findings and Recommendations in 1192 Patients Radiology, November 1, 2005; 237(2): 385 - 394. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. J. Swensen, J. R. Jett, T. E. Hartman, D. E. Midthun, S. J. Mandrekar, S. L. Hillman, A.-M. Sykes, G. L. Aughenbaugh, A. O. Bungum, and K. L. Allen CT Screening for Lung Cancer: Five-year Prospective Experience Radiology, April 1, 2005; 235(1): 259 - 265. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. A. Meyer, R. T. Shipley, K. S. Lee, T. Johkoh, and R. Kakinuma Invited Commentary * Authors' Response RadioGraphics, November 1, 2004; 24(6): 1632 - 1636. [Full Text] [PDF] |
||||
![]() |
R. Iannaccone, A. Laghi, C. Catalano, J. A. Brink, F. Mangiapane, S. Trenna, F. Piacentini, and R. Passariello Detection of Colorectal Lesions: Lower-Dose Multi-Detector Row Helical CT Colonography Compared with Conventional Colonoscopy Radiology, December 1, 2003; 229(3): 775 - 781. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. B. Kopans, B. Monsees, and S. A. Feig Screening for "Cancer": When is it Valid?--Lessons from the Mammography Experience Radiology, November 1, 2003; 229(2): 319 - 327. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. J. Swensen, J. R. Jett, T. E. Hartman, D. E. Midthun, J. A. Sloan, A.-M. Sykes, G. L. Aughenbaugh, and M. A. Clemens Lung Cancer Screening with CT: Mayo Clinic Experience Radiology, March 1, 2003; 226(3): 756 - 761. [Abstract] [Full Text] [PDF] |
||||
![]() |
L. Berlin Potential Legal Ramifications of Whole-Body CT Screening: Taking a Peek into Pandora's Box Am. J. Roentgenol., February 1, 2003; 180(2): 317 - 322. [Full Text] [PDF] |
||||
![]() |
K. Garg, R. L. Keith, T. Byers, K. Kelly, A. L. Kerzner, D. A. Lynch, and Y. E. Miller Randomized Controlled Trial with Low-Dose Spiral CT for Lung Cancer Screening: Feasibility Study and Preliminary Results Radiology, November 1, 2002; 225(2): 506 - 510. [Abstract] [Full Text] [PDF] |
||||
|
|