Original Research
Cardiothoracic Imaging
March 23, 2022

Strategies for Reducing False-Positive Screening Results for Intermediate-Size Nodules Evaluated Using Lung-RADS: A Secondary Analysis of National Lung Screening Trial Data

Abstract

Please see the Editorial Comment by Darragh Halpenny discussing this article.
BACKGROUND. Lung-RADS version 1.1 (v1.1) classifies all solid nodules less than 6 mm as category 2. Lung-RADS v1.1 also classifies solid intermediate-size (6 to < 10 mm) nodules as category 2 if they are perifissural and have a triangular, polygonal, or ovoid shape (indicative of intrapulmonary lymph nodes). Additional category 2 criteria could reduce false-positive results of screening examinations.
OBJECTIVE. The purpose of this study was to evaluate the impact of proposed strategies for reducing false-positive results for intermediate-size nodules on lung cancer screening CT evaluated using Lung-RADS v1.1.
METHODS. This retrospective study entailed secondary analysis of National Lung Screening Trial (NLST) data. Of 1387 solid nodules measuring 6.0–9.5 mm on baseline screening CT examinations in the NLST, all 38 nodules in patients who developed cancer and a random sample of 200 nodules in patients who did not develop cancer were selected for further evaluation. Cancers were required to correspond with the baseline nodule on manual review. After exclusions, the sample included 223 patients (median age, 62 years; 143 men, 80 women; 196 benign nodules, 27 malignant nodules). Two thoracic radiologists independently reviewed baseline examinations to record nodule diameter and volume using semiautomated software and to determine whether nodules had perifissural location; other subpleural location; and triangular, polygonal, or ovoid shape. Different schemes for category 2 assignment were compared.
RESULTS. Across readers, standard Lung-RADS v1.1 had sensitivity of 89–93% and specificity of 26–31%. A modification assigning nodules less than 10 mm with triangular, polygonal, or ovoid shape in other subpleural locations (vs only perifissural location) as category 2 had sensitivity of 85–93% and specificity of 47–51%. Lung-RADS v1.1 using volume cutoffs had sensitivity of 89–93% and specificity of 37% (both readers). The sensitivity of both modified Lung-RADS v1.1 and Lung-RADS v1.1 with volume cutoffs was not significantly different from standard Lung-RADS v1.1 (all p > .05). However, both schemes' specificity was significantly better than standard Lung-RADS v1.1 (all p < .05). Combining the two strategies yielded sensitivity of 85–93% and specificity of 58–59%.
CONCLUSION. Classifying intermediate-size nodules with triangular, polygonal, or ovoid shape in any subpleural (not just perifissural) location as category 2 and using volume- rather than diameter-based measurements improves Lung-RADS specificity without decreased sensitivity.
CLINICAL IMPACT. The findings can help reduce false-positive results, decreasing 6-month follow-up examinations for benign findings.

HIGHLIGHTS

Key Finding
In comparison with standard Lung-RADS v1.1, classifying intermediate-size nodules with triangular, polygonal, or ovoid shape in any subpleural (not just perifissural) location as category 2 and using volume- rather than diameter-based measurements resulted in a significant increase in specificity from 26–31% to 58–59% without significant change in sensitivity.
Importance
Classifying nodules in any subpleural location as category 2 based on characteristics indicative of intrapulmonary lymph nodes would reduce false-positive results of screening CT examinations.
The National Lung Screening Trial (NLST) proved that lung cancer screening CT is effective in reducing mortality from lung cancer [1]. Further analyses of the NLST results and of other data show that lung cancer screening CT is cost-effective in improving patient life expectancy [2]. However, to be truly cost-effective, lung cancer screening programs must limit the number of false-positive scans, which can lead to additional cost, anxiety, radiation, and adverse events from biopsy of benign nodules. To establish a standardized framework for nodule reporting and follow-up, the American College of Radiology developed Lung-RADS [3]. Lung-RADS provides criteria for nodules to be deemed either benign (category 2), which allows a patient to return for an annual screening examination, or suspicious (categories 3 and 4), which requires the patient to undergo earlier follow-up.
In the initial version of Lung-RADS, only solid nodules measuring less than 6 mm in diameter were classified as category 2. Since the development of Lung-RADS, evidence has accumulated showing that certain pulmonary nodules can be confidently diagnosed as benign intrapulmonary lymph nodes [48] and, therefore, should also be classified as category 2. The typical features of intrapulmonary lymph nodes are solid attenuation; triangular, polygonal, or oval shape; and location along pleural surfaces [6]. The current version of Lung-RADS (version 1.1 [v1.1]) applies these features for characterizing a nodule as an intrapulmonary lymph node, and thus classifying the nodule as category 2, only to perifissural nodules measuring less than 10 mm [9]. Because all solid nodules less than 6 mm are classified as category 2, this recognition of intrapulmonary lymph nodes in Lung-RADS v1.1 specifically reduces false-positive results for nodules measuring at least 6 mm but under 10 mm (hereafter, intermediate-size nodules). Intermediate-size solid nodules with a typical shape are also likely to represent intrapulmonary lymph nodes when present in subpleural locations other than that currently reflected in Lung-RADS v1.1 [7, 10, 11] (Chelala L, presented at the Radiological Society of North America [RSNA] 2021 annual meeting).
Although Lung-RADS is the dominant scheme used to determine follow-up of nodules detected on lung cancer screening in the United States, other countries have adopted different approaches. Notably, the Dutch-Belgian Randomized Lung Cancer Screening (NELSON) trial followed a scheme that primarily relies on nodule volume rather than the linear diameter measurements that are central in Lung-RADS [12]. Though Lung-RADS v1.1 added nodule volume measurements as an alternative measure, standard radiology practice in the United States continues to rely on diameter measurements given the specialized software and additional time required for volumetric measurement. To date, comparisons between diameter- and volume-based nodule classification have yielded mixed results; some studies show slight advantages for volume measurement [13, 14], whereas other studies show superiority of diameter measurements [15, 16]. In the study by Silva et al. [13], volumetric measurements allowed improved risk stratification and reduction in follow-up examinations compared with standard diameter-based Lung-RADS evaluation.
This background suggests a couple of strategies to potentially reduce false-positive results for lung cancer screening examinations: expanding the criteria to consider intermediate-size nodules as representing benign intrapulmonary lymph nodes and using volume-based, instead of diameter-based, measurements. To our knowledge, these strategies have not been evaluated in terms of their impact on the frequency of false-positive lung cancer screening results, and they have not been compared with each other. We therefore conducted this study to evaluate the impact of the proposed strategies for reducing false-positive results for intermediate-size nodules on lung cancer screening CT evaluated using Lung-RADS v1.1.

Methods

Patient Population

This retrospective HIPAA-compliant study solely used nonidentifiable patient data from the NLST that were accessed for purposes of secondary data analysis. Consent to access NLST data was obtained from the national Cancer Data Access System of the National Cancer Institute through a data transfer agreement with the National Cancer Institute. The study was approved by the institutional review board at our institution. The requirement for written informed patient consent was waived for this post hoc analysis.
In brief, the NLST was a randomized controlled trial of current and former smokers who were screened for lung cancer by either chest radiography or low-dose chest CT. The CT protocol included one baseline examination and two subsequent annual screening rounds. Patients were followed after the three annual examinations to evaluate for the development of lung cancer and the occurrence of death [1]. For the purposes of the present study, only baseline CT examinations were evaluated.
The NLST study readers annotated nodules on each CT examination, recording nodule features including location (lobe and slice number), attenuation, margins, and long- and short-axis diameters. Only the dominant (i.e., largest) annotated nodule on each patient's baseline CT was considered for purposes of the current study. Patients were eligible for the current study if the dominant annotated nodule had a mean diameter (mean of the short- and long-axis diameters) of 6.0–9.5 mm, was noncalcified with solid attenuation, and had nonspiculated margins, based on the NLST study reader's annotations. Of the 25,844 patients in the CT arm of the NLST study, 1387 patients (with 1387 dominant nodules) met these criteria (Fig. 1).
Fig. 1 —Flowchart shows patient selection in secondary analysis of CT screening arm of National Lung Screening Trial. Dashed arrows represent nodules reassigned between study groups.
The NLST data indicate cancers that developed during follow-up but do not directly link nodules and cancer diagnoses. During follow-up, lung cancer developed in the same lobe as the dominant nodule in 38 of the 1387 patients. All 38 of these patients were selected for further evaluation in the present study. Of the remaining 1349 patients in whom cancer did not develop in the same lobe as the dominant nodule, 200 were randomly selected for further evaluation. The baseline examinations of the 238 patients (38 with and 200 without cancer in the same lobe as the dominant nodule) were initially reviewed by a board-certified subspecialty-trained thoracic radiologist (M.M.H., with 6 years of posttraining experience). Nodules were excluded based on this review for the following reasons: images not available or not viewable (malignant group, n = 0; benign group, n = 4); no nodule found corresponding to the NLST-annotated nodule (malignant group, n = 1; benign group, n = 3); nodule deemed to represent atelectasis (malignant group, n = 0; benign group, n = 1); nodule was not solid (malignant group, n = 5 [part-solid in three, cystic in two]; benign group, n = 1 [ground-glass]) (Fig. 1). At the time of the initial review, 11 nodules (all in the benign group) were found to have been annotated with the incorrect lobe by the NLST study reader; for these cases, the correct lobe was recorded for the purpose of further analysis. The exclusions resulted in 223 remaining nodules, which underwent independent evaluation by two readers, as described in the Image Review section.
After completion of the independent imaging review, the earlier noted investigator (M.M.H.) performed a subsequent review of certain patients, unblinded to the details of the patients' follow-up results. In this final session, the radiologist reviewed the nodules in patients who developed cancer to assess whether the cancer corresponded with the dominant nodule versus a different nodule in that lobe (e.g., a nodule that developed later). The nodule and the subsequent cancer corresponded in 26 patients. In six patients, the cancer was attributed to a different nodule in the lobe with the dominant nodule; these six nodules were reassigned to the benign nodule group. Also in this final session, the radiologist reviewed the nodules in the benign group that had been assigned the incorrect lobe by the NLST study reader to assess whether a cancer later developed in the correct lobe. In one patient, a cancer later developed in the correct lobe, which corresponded with the dominant nodule; this nodule was reassigned to the malignant nodule group. During this final session, nodules were only reassigned between groups; no additional nodules were excluded.
This process resulted in a final study sample of 223 dominant nodules in 223 patients (median age, 62 years; 143 men, 80 women; 196 with benign nodules, 27 with malignant nodules).

Image Review

Two investigators (the previously noted investigator [M.M.H.] and a board-certified, subspecialty-trained thoracic radiologist [A.R.H.] with 28 years of posttraining experience) independently reviewed the baseline CT examinations in 223 patients in random order, blinded to whether nodules were benign or malignant, using advanced visualization software (syngo.via, version VB40, Siemens Healthineers). The readers identified the dominant nodule in each patient based on the nodule locations in the NLST annotations. The readers measured each nodule using a semiautomated segmentation tool available in the advanced visualization software. After the radiologist marks anywhere within a nodule, the software automatically segments the nodule's margins, which the radiologist can manually adjust. The software then reports the nodule's volume as well as its short- and long-axis diameters on the slice with the nodule's greatest long-axis diameter. These parameters were recorded, and the mean diameter (mean of the short- and long-axis diameters) was calculated. Because the nodule diameters used for subsequent analysis were based on the measurements obtained using the automated software, the mean diameter may not have been within the range of 6.0–9.5 mm that was used for initial nodule selection based on the NLST annotations. The radiologists also recorded whether the nodule directly abutted a fissure (i.e., perifissural location) and whether the nodule directly abutted the costal or mediastinal pleural surface (i.e., other subpleural location). Finally, the radiologists assessed in a binary fashion whether or not the nodule's shape was triangular, polygonal, or oval. Nodules could be considered to have triangular, polygonal, or oval shape only if they had smooth margins. On the basis of the readers' assessments, nodules were classified as exhibiting lymph node characteristics if they had a perifissural or other subpleural location and a triangular, polygonal, or oval shape. Data were entered in REDCap (version 12, REDCap Consortium) [17].

Lung-RADS Category Assignments

On the basis of the radiologists' assessments (including nodules' mean diameter from the semiautomated measurement) and the algorithm in the Lung-RADS v1.1 document [9], Lung-RADS categories were generated for each nodule. Per Lung-RADS for diameter-based assessment, all nodules less than 6 mm as well as perifissural nodules less than 10 mm with triangular, polygonal, or ovoid shape were assigned category 2. Additional Lung-RADS categories were generated for each patient using three different schemes: classifying nodules with other subpleural locations and with a mean diameter less than 10 mm and triangular, polygonal, or ovoid shape as category 2 (reflecting a potential modification to Lung-RADS v1.1); using the current Lung-RADS v1.1 volume cutoffs (i.e., nodules classified as category 2 if under 0.113 mL or under 0.524 mL with perifissural location); and incorporating both the expanded criteria for category 2 and the use of current volume cutoffs (i.e., incorporating both of the prior considerations). For the purposes of analysis, Lung-RADS category 2 was considered negative, and category 3 or higher was considered positive. Nodules were also classified as negative or positive based on the NELSON trial algorithm [12], which considers nodules negative if they have a volume less than 0.05 mL or if they have a perifissural or other subpleural location and a short-axis diameter less than 5 mm.

Extrapolation to NLST Cohort

Because the study sample only included intermediate-size nodules, the observed specificities of Lung-RADS v1.1 were not anticipated to represent the overall specificity of Lung-RADS v1.1 in the full NLST cohort. Specifically, of the 25,844 patients in the CT arm of the NLST, 19,164 (74%) had no nodules (thus assigned category 1), and 3838 (15%) had nodules less than 6 mm (thus assigned category 2). Because NLST predated Lung-RADS v1.1, nodules in the NLST were not assigned category 2 if measuring at least 6 mm but under 10 mm and perifissural in location. Thus, the anticipated impact on the overall NLST cohort of assigning category 2 for nodules likely representing lymph nodes was evaluated using the readers' initially assigned Lung-RADS v1.1 categories and each of the three additionally generated Lung-RADS v1.1 categories for each patient. This assessment of impact on the NLST was performed by extrapolating the number of nodules assigned category 2 among the 196 included patients with benign nodules to the sample of 1349 patients in the NLST without cancer, from whom the 196 patients had been randomly selected (i.e., presuming 6.9 additional nodules assigned category 2 in the NLST for each category 2 nodule in the current study's sample). Then, the overall percentage of patients in the NLST cohort who would have been assigned category 2 was calculated using the extrapolated value.

Statistical Analysis

Data were summarized descriptively using counts and percentages. Categoric variables were compared using Fisher exact test, and continuous variables were compared using the Wilcox-on rank sum test. Interobserver agreement was calculated for diameter and volume measurements using intraclass correlation coefficients and for binary variables (including Lung-RADS categories dichotomized as negative or positive) using Cohen kappa coefficients. Sensitivities and specificities were compared between the different classification schemes for each reader using the McNemar test. A p value less than .05 was considered statistically significant. Statistical analysis was conducted in JMP Pro (version 16, SAS Institute).

Results

Patient and Nodule Characteristics

Table 1 summarizes characteristics of benign and malignant nodules. Patients with malignant nodules were significantly older than patients with benign nodules (median age, 66 vs 62 years; p = .04). The two groups were not significantly different in terms of sex distribution (p = .36). The mean nodule diameters ranged from 6.0 to 9.5 mm using the NLST annotations, 3.5 to 16.0 mm using the software for reader 1, and 2.5 to 15.0 mm using the software for reader 2. Median nodule size was significantly larger for malignant than benign nodules based on mean diameter (7.5 vs 7.0 mm for NLST, p = .002; 9.0 vs 7.0 mm for reader 1, p < .001; 9.0 vs 7.0 mm for reader 2, p < .001) and volume (0.30 vs 0.16 mL for reader 1, p < .001; 0.33 vs 0.17 mL for reader 2, p < .001).
TABLE 1: Characteristics of Included Nodules
Characteristic Benign Nodules (n = 196) Malignant Nodules (n = 27) p
Age (y) 62 (58–66) 66 (61–70) .04
Sex     .36
 Male 127 (65) 16 (59)  
 Female 69 (35) 11 (41)  
Diameter (mm)      
 NLST 7.0 (6.5–7.5) 7.5 (7.0–9.0) .002
 Reader 1 7.0 (6.0–8.5) 9.0 (7.5–10.0) < .001
 Reader 2 7.0 (6.0–8.5) 9.0 (8.0–10.5) < .001
Volume (mL)      
 Reader 1 0.16 (0.11–0.27) 0.30 (0.24–0.51) < .001
 Reader 2 0.17 (0.09–0.27) 0.33 (0.24–0.51) < .001
Perifissural location      
 Reader 1 37 (19) 2 (7) .07
 Reader 2 25 (13) 1 (4) .14
Other subpleural location      
 Reader 1 55 (28) 3 (11) .04
 Reader 2 60 (31) 3 (11) .02
Triangular, polygonal, or oval shape      
 Reader 1 175 (89) 7 (26) < .001
 Reader 2 95 (48) 6 (22) .01
Lymph node characteristicsa      
 Reader 1 85 (43) 0 (0) < .001
 Reader 2 73 (37) 2 (7) < .001

Note—Unless otherwise specified, values represent median with interquartile range in parentheses or count with percentage in parentheses. NLST = National Lung Screening Trial.

a
Defined as exhibiting both perifissural or other subpleural location and triangular, polygonal, or oval shape.
The frequency of perifissural location was not significantly different between benign and malignant nodules for either reader 1 (19% vs 7%, p = .07) or reader 2 (13% vs 4%, p = .14). The frequency of other subpleural location was significantly higher for benign than malignant nodules for both reader 1 (28% vs 11%, p = .04) and reader 2 (31% vs 11%, p = .02). The frequency of triangular, polygonal, or oval shape was significantly higher for benign than malignant nodules for both reader 1 (89% vs 26%, p < .001) and reader 2 (48% vs 22%, p = .01). The frequency of lymph node characteristics was significantly higher for benign than malignant nodules for both reader 1 (43% vs 0%, p < .001) and reader 2 (37% vs 7%, p < .001).

Diagnostic Performance of Different Classification Schemes

Table 2 shows the sensitivity and specificity of the various classification schemes for lung cancer, defining category 2 as negative and category 3 or higher as positive. Lung-RADS v1.1 based on standard diameter measurements had a sensitivity and specificity of 93% (25/27) and 31% (60/196) for reader 1 and 89% (24/27) and 26% (51/196) for reader 2, respectively. Two cancers were classified as category 2 by both readers. These cancers were classified as category 2 because of mean diameter less than 6 mm (4.5 and 5.0 mm for reader 1, 4.5 and 5.0 mm for reader 2); neither cancer was classified by either reader as having a perifissural or other subpleural location or as having a triangular, polygonal, or oval shape. Figure 2 shows one of these cancers. One additional cancer (Fig. 3) was assessed by reader 1 as having a mean diameter of 8.5 mm and perifissural location but not as having a triangular, polygonal, or ovoid shape (category 3); reader 2 assessed the cancer as having a mean diameter of 8.5 mm, perifissural location, and a triangular, polygonal, or oval shape (category 2). Figure 4 shows a benign intermediate-size nodule assigned category 2 by both readers on the basis of size, perifissural location, and triangular, polygonal, or ovoid shape. Given the 51–60 nodules assigned category 2 based on perifissural location and size less than 10 mm, it was extrapolated that 351–413 additional nodules would have been assigned category 2 in the NLST, increasing the total number of category 2 nodules in the NLST from 3838 to 4189–4251 (i.e., from 15% to 16% of all trial patients).
TABLE 2: Sensitivity and Specificity of Subpleural Nodule Classification
Scheme and Reader Sensitivity pa Specificity pa
Lung-RADS v1.1 (based on diameter)        
 Reader 1 93 (77–98) [25/27] 31 (25–37) [60/196]
 Reader 2 89 (72–96) [24/27] 26 (20–33) [51/196]
Modifiedb Lung-RADS v1.1 (based on diameter)        
 Reader 1 93 (77–98) [25/27] > .99c 51 (44–58) [100/196] < .001c
 Reader 2 85 (68–94) [23/27] .32c 47 (41–54) [93/196] < .001c
Lung-RADS v1.1 (based on volume)        
 Reader 1 93 (77–98) [25/27] > .99c 37 (30–44) [72/196] .007c
 Reader 2 89 (72–96) [24/27] > .99c 37 (30–44) [72/196] < .001c
Modifiedb Lung-RADS v1.1 (based on volume)        
 Reader 1 93 (77–98) [25/27] > .99d 59 (52–66) [116/196] .001d
 Reader 2 85 (68–94) [23/27] > .99d 58 (51–64) [113/196] < .001d
NELSON        
 Reader 1 93 (77–98) [25/27] > .99c 12 (8–18) [24/196] < .001c
 Reader 2 93 (77–98) [25/27] .32c 14 (10–19) [27/196] < .001c

Note—Values represent percentage with 95% CI in parentheses and numerator and denominator in brackets. Dash (—) indicates reference category. v1.1 = version 1.1, NELSON = Dutch-Belgian Randomized Lung Cancer Screening trial.

a
Values obtained with McNemar test.
b
Classification of solid intermediate-size nodules with other subpleural location and triangular, polygonal, or ovoid shape as category 2.
c
Comparison with Lung-RADS v1.1 (based on diameter).
d
Comparison with modified Lung-RADS v1.1 (based on diameter).
Fig. 2A —59-year-old man who underwent lung cancer screening CT.
A, Axial image from baseline CT examination shows nodule (arrow) classified as solid and with mean diameter of 4.5–5.0 mm (depending on reader). Nodule was classified as Lung-RADS category 2 because of size less than 6 mm.
Fig. 2B —59-year-old man who underwent lung cancer screening CT.
B, Axial image from 2-year follow-up CT examination shows nodule (arrow) increased in size and became cavitary. Nodule was eventually diagnosed as lung cancer. Lung-RADS category 2 assignment represented false-negative interpretation.
Fig. 3 —61-year-old man who underwent lung cancer screening CT. Axial image from baseline CT examination shows solid nodule (arrow) in right upper lobe adjacent to right minor fissure, with mean diameter of 8.5 mm (for both readers). Reader 1 did not consider nodule to have triangular, polygonal, or ovoid shape and classified nodule as Lung-RADS category 3. Reader 2 considered nodule to have triangular, polygonal, or ovoid shape and classified nodule as Lung-RADS category 2 given size, perifissural location, and shape. Nodule slowly increased in size on follow-up imaging (not shown) and was eventually diagnosed as lung cancer. Lung-RADS category 2 assignment by reader 2 represented false-negative interpretation.
Fig. 4A —66-year-old man who underwent lung cancer screening CT.
A, Axial (A) and sagittal (B) images from baseline CT examination show solid nodule (arrow) in right middle lobe adjacent to incomplete minor fissure, with mean diameter of 9.5 mm (for both readers). Both readers considered nodule to have triangular, polygonal, or ovoid shape. Nodule was classified as Lung-RADS category 2 by both readers given its size, perifissural location, and shape, consistent with intrapulmonary lymph node. No cancer developed during follow-up.
Fig. 4B —66-year-old man who underwent lung cancer screening CT.
B, Axial (A) and sagittal (B) images from baseline CT examination show solid nodule (arrow) in right middle lobe adjacent to incomplete minor fissure, with mean diameter of 9.5 mm (for both readers). Both readers considered nodule to have triangular, polygonal, or ovoid shape. Nodule was classified as Lung-RADS category 2 by both readers given its size, perifissural location, and shape, consistent with intrapulmonary lymph node. No cancer developed during follow-up.
A modification of Lung-RADS v1.1 that also included other subpleural nodules (i.e., nodules abutting the costal or mediastinal pleura) less than 10 mm as category 2 had no significant difference compared with standard Lung-RADS v1.1 for sensitivity (93% [25/27] for reader 1, p > .99; 85% [23/27] for reader 2, p = .32) but had significantly higher specificity (51% [100/196] for reader 1, p < .001; 47% [93/196] for reader 2, p < .001). Figure 5 shows a benign intermediate-size nodule that was assigned category 3 by both readers using standard Lung-RADS v1.1 but was assigned category 2 by both readers on the basis of the modification because of size, other subpleural location, and triangular, polygonal, or ovoid shape. Compared with malignant nodules categorized using Lung-RADS v1.1, no additional malignant nodule was assigned category 2 by reader 1, and one additional malignant nodule was assigned category 2 by reader 2 when the modification of Lung-RADS v1.1 was applied; reader 2 assessed this nodule as having a mean diameter of 8.0 mm, other subpleural location, and a triangular, polygonal, or oval shape (Fig. 6). Given the 93–100 nodules assigned category 2 based on perifissural or other subpleural location and size less than 10 mm, it was extrapolated that a total of 640–688 additional nodules would have been assigned category 2 in the NLST, increasing the total number of category 2 nodules in the NLST from 3838 to 4478–4526 (i.e., from 15% to 17% of all trial patients).
Fig. 5 —61-year-old woman who underwent lung cancer screening CT. Axial image from baseline CT examination shows solid nodule (arrow) in right upper lobe with subpleural (other than perifissural) location. Nodule had mean diameter of 9.5 mm (for both readers). Both readers considered nodule to have triangular, polygonal, or ovoid shape. Both readers classified nodule as category 3 using Lung-RADS version 1.1 (v1.1) but as category 2 using explored modification of Lung-RADS v1.1 that considers solid nodules less than 10 mm with subpleural location and triangular, polygonal, or ovoid shape to likely represent intrapulmonary lymph nodes. No cancer developed during follow-up. Category 3 assignment using standard Lung-RADS v1.1 represented false-positive interpretation.
Fig. 6A —58-year-old woman who underwent lung cancer screening CT.
A, Axial image from baseline CT examination shows solid nodule (arrow) in right lower lobe with subpleural (other than perifissural) location. Nodule had mean diameter of 8.0 mm (for both readers). Reader 2, but not reader 1, considered nodule to have triangular, polygonal, or ovoid shape. Both readers categorized nodule as category 3 using Lung-RADS version 1.1 (v1.1). Using explored modification of Lung-RADS v1.1 that considers solid nodules less than 10 mm with subpleural location and triangular, polygonal, or ovoid shape to likely represent intrapulmonary lymph nodes, reader 1 again classified nodule as category 3 but reader 2 classified it as category 2.
Fig. 6B —58-year-old woman who underwent lung cancer screening CT.
B, Axial image from 1-year follow-up CT examination shows slight increase in size of nodule (arrow). Nodule was eventually diagnosed as lung cancer. Category 2 assignment using modified Lung-RADS v1.1 by reader 2 represented false-negative interpretation.
Compared with Lung-RADS v1.1 using standard diame ter-based cutoffs, Lung-RADS v1.1 using existing volume cutoffs did not have a significantly different sensitivity (93% [25/27] for reader 1, p > .99; 89% [24/27] for reader 2, p > .99) but did have a significantly higher specificity (37% [72/196] for reader 1, p = .007; 37% [72/196] for reader 2, p < .001). Compared with Lung-RADS v1.1, no additional malignant nodule was assigned category 2 by either reader when volume-based Lung-RADS v1.1 was used. Given the 72 nodules assigned category 2 based on Lung-RADS v1.1 using volume measurements, it was extrapolated that a total of 496 additional nodules would have been assigned category 2 in the NLST, increasing the total number of category 2 nodules in the NLST from 3838 to 4334 (i.e., from 15% to 17% of all trial patients).
Compared with the Lung-RADS v1.1 modification that included other subpleural nodules and diameter measurements, a modification of Lung-RADS v1.1 incorporating both inclusion of other subpleural nodules in category 2 and volume measurements had no significant difference for sensitivity (93% [25/27] for reader 1, p > .99; 85% [23/27] for reader 2, p > .99) but had a significantly higher specificity (59% [116/196] for reader 1, p = .001; 58% [113/196] for reader 2, p < .001). Given the 113–116 nodules assigned category 2 using both subpleural location and volume measurements, it was extrapolated that a total of 778–798 additional nodules would have been assigned category 2 in the NLST, increasing the total number of category 2 nodules in the NLST from 3838 to 4616–4636 (i.e., from 15% to 18% of all trial patients).
The NELSON criteria showed no significant difference compared with standard Lung-RADS v1.1 for sensitivity (93% [25/27] for reader 1, p > .99; 93% [25/27] for reader 2, p = .32) but had significantly lower specificity (12% [24/196] for reader 1, p < .001; 14% [27/196] for reader 2, p < .001).

Interobserver Agreement

Interobserver agreement, expressed as intraclass correlation coefficient, was 0.87 for mean diameter and 0.96 for volume. Interobserver agreement, expressed as kappa coefficient, was 0.77 for perifissural location, 0.81 for other subpleural location, 0.20 for shape, and 0.73 for lymph node characteristics. Interobserver agreement, expressed as kappa coefficient, was 0.70 for Lung-RADS v1.1, 0.68 for Lung-RADS v1.1 including other subpleural nodules less than 10 mm as category 2, 0.83 for Lung-RADS v1.1 using volume thresholds, 0.81 for Lung-RADS v1.1 using both the expanded criteria for category 2 and volume thresholds, and 0.76 for NELSON criteria.

Discussion

This study used a subset of intermediate-size solid nodules from the NLST to evaluate the impact of strategies for decreasing the frequency of false-positive interpretations relating to nodules likely representing benign intrapulmonary lymph nodes. Assigning Lung-RADS category 2 for all subpleural nodules less than 10 mm (rather than only perifissural nodules less than 10 mm) that have a characteristic shape significantly increased the specificity of Lung-RADS without a significant loss in sensitivity. The use of current Lung-RADS v1.1 volume-based thresholds, instead of standard diameter-based thresholds, for assigning category 2 also significantly increased specificity without a significant loss in sensitivity. Combining both of these strategies maximized specificity, still without significant loss in sensitivity. The specificity of the combination of strategies in the present study sample corresponds with an anticipated increase in frequency of category 2 nodules from 15% to 18% of patients in the entire NLST cohort. Adoption of these strategies could therefore help prevent unnecessary 6-month follow-up CT examinations.
The findings support previous studies that found subpleural nodules with typical characteristics of intrapulmonary lymph nodes were almost always benign [5, 10, 11] (Chelala L, RSNA 2021 annual meeting). Nonetheless, caution is required during such evaluation. In the present study, when Lung-RADS v1.1 was used, one malignant intermediate-size perifissural nodule was classified as category 2 by a single reader on the basis of perceived triangular, polygonal, or ovoid shape. When the explored modification of Lung-RADS v1.1 was used, another malignant intermediate-size nodule with other subpleural location was also classified as category 2 by a single reader on the basis of perceived triangular, polygonal, or ovoid shape. Such cases highlight the importance of being rigorous when assessing the shape of intermediate-size nodules before considering such nodules to likely represent benign intrapulmonary lymph nodes warranting category 2 assignment.
A paucity of studies have compared diameter-based with volume-based criteria for triaging pulmonary nodules, and which method, if either, is better is currently considered inconclusive. The available studies include an analysis of subsolid nodules from the NLST that found the volume-based NELSON algorithm was not superior to Lung-RADS for cancer diagnosis [15], as well as an analysis that found the NELSON algorithm performed worse than Lung-RADS in evaluating nodules detected on follow-up lung cancer screening CT examinations [16]. However, in the current study, volume-based criteria increased the number of benign nodules assigned category 2, significantly improving the specificity of Lung-RADS without a significant loss in sensitivity. This benefit may reflect the relatively flat nature of many benign nodules, particularly intrapulmonary lymph nodes, such that the nodule's actual volume is smaller than that of a sphere, corresponding to its diameter [18]. In comparison, malignant nodules typically exhibit relatively isotropic growth, resulting in a larger volume for a given diameter.
Software for volumetric nodule measurement is not widely available, and its use is time-consuming. It is likely not necessary to measure the volume of all solid nodules—indeed, nodules that already meet the criteria for Lung-RADS category 2 using diameter-based measurement do not need volumetric measurement. However, in solid nodules that do not already meet the criteria for category 2 assignment, it may be beneficial to measure the nodule's volume to determine whether the nodule can be downgraded to category 2, thereby avoiding a follow-up examination. Interobserver agreement was also higher for volume-based than diameter-based measurement, indicating an additional benefit of the method.
This study has a number of limitations. The primary limitation is that the NLST data do not specify direct correspondence between nodules and subsequent cancer diagnoses. However, baseline and follow-up imaging examinations were reviewed for all malignant nodules to assess whether the cancer could be attributed to a different nodule, including any nodules that may have developed during follow-up. Second, the reference standard for benign nodules was imaging and clinical follow-up. It is unlikely that any cancers were missed given the multiple years of follow-up after baseline screening in all patients. Third, baseline imaging was evaluated by two subspecialist thoracic radiologists; the performance of assessment for characteristics of intrapulmonary lymph nodes by general radiologists also needs to be studied. Fourth, although all malignant intermediate-size nodules were assessed, only a random sample of the benign intermediate-size nodules were evaluated. Finally, the impact of the proposed strategies for reducing follow-up examinations when applied prospectively for real-world clinical cases remains unknown.
In conclusion, the classification of all intermediate-size nodules with perifissural or other subpleural location and with triangular, polygonal, or ovoid shape (suggestive of intrapulmonary lymph nodes) as Lung-RADS category 2 would substantially decrease false-positive lung cancer screening examination results without a significant reduction in sensitivity for lung cancer. Use of volume-based rather than diameter-based criteria would also substantially increase the number of nodules classified as category 2 without a significant reduction in sensitivity. The combination of the two methods achieved maximal specificity. Radiologists may consider using volumetric measurement for solid nodules that do not already meet the criteria for category 2 assignment. The described strategies could help reduce recommendations for unnecessary 6-month follow-up CT examinations. Prospective trials would help confirm these findings.

References

1.
Aberle DR, Adams AM, Berg CD, et al.; National Lung Screening Trial Research Team. Reduced lung-cancer mortality with low-dose computed tomographic screening. N Engl J Med 2011; 365:395–409
2.
Criss SD, Cao P, Bastani M, et al. Cost-effectiveness analysis of lung cancer screening in the United States: a comparative modeling study. Ann Intern Med 2019; 171:796–804
3.
American College of Radiology website. Lung CT Screening Reporting and Data System (Lung-RADS). www.acr.org/Quality-Safety/Resources/LungRADS. Accessed June 19, 2016
4.
Ahn MI, Gleeson TG, Chan IH, et al. Perifissural nodules seen at CT screening for lung cancer. Radiology 2010; 254:949–956
5.
Stephens MJ, Rho BH, Curran-Everett D, Carr L, Bowler R, Lynch D. Identification of nonaggressive pulmonary nodules using an optimized scoring system. J Thorac Imaging 2019; 34:170–178
6.
Schreuder A, Jacobs C, Scholten ET, van Ginneken B, Schaefer-Prokop CM, Prokop M. Typical CT features of intrapulmonary lymph nodes: a review. Radiol Cardiothorac Imaging 2020; 2:e190159
7.
Chelala L, Hossain R, Kazerooni EA, Christensen JD, Dyer DS, White CS. Lung-RADS version 1.1: challenges and a look ahead, from the AJR special series on radiology reporting and data systems. AJR 2021; 216:1411–1422
8.
Han D, Heuvelmans MA, van der Aalst CM, et al. New fissure-attached nodules in lung cancer screening: a brief report from the NELSON study. J Thorac Oncol 2020; 15:125–129
9.
American College of Radiology website. Lung-RADS version 1.1. www.acr.org/-/media/ACR/Files/RADS/Lung-RADS/LungRADSAssessmentCategoriesv1-1.pdf?la=en. Accessed June 21, 2019
10.
Zhu Y, Yip R, You N, Cai Q, Henschke CI, Yankelevitz DF; International Early Lung Cancer Action Program Investigators. Characterization of newly detected costal pleura-attached noncalcified nodules at annual low-dose CT screenings. Radiology 2021; 301:724–731
11.
Zhu Y, Yip R, You N, Henschke CI, Yankelevitz DF. Management of nodules attached to the costal pleura at low-dose CT screening for lung cancer. Radiology 2020; 297:710–718
12.
Xu DM, Gietema H, de Koning H, et al. Nodule management protocol of the NELSON randomised lung cancer screening trial. Lung Cancer 2006; 54:177–184
13.
Silva M, Milanese G, Sestini S, et al. Lung cancer screening by nodule volume in Lung-RADS v1.1: negative baseline CT yields potential for increased screening interval. Eur Radiol 2021; 31:1956–1968
14.
Horeweg N, van Rosmalen J, Heuvelmans MA, et al. Lung cancer probability in patients with CT-detected pulmonary nodules: a prespecified analysis of data from the NELSON trial of low-dose CT screening. Lancet Oncol 2014; 15:1332–1341
15.
Hammer MM, Palazzo LL, Kong CY, Hunsaker AR. Cancer risk in subsolid nodules in the National Lung Screening Trial. Radiology 2019; 293:441–448
16.
Hammer MM, Byrne SC. Cancer risk in nodules detected at follow-up lung cancer screening CT. AJR 2022; 218:634–641
17.
Harris PA, Taylor R, Thielke R, Payne J, Gonzalez N, Conde JG. Research electronic data capture (REDCap): a metadata-driven methodology and work-flow process for providing translational research informatics support. J Biomed Inform 2009; 42:377–381
18.
Heuvelmans MA, Walter JE, Vliegenthart R, et al. Disagreement of diameter and volume measurements for pulmonary nodule size estimation in CT lung cancer screening. Thorax 2018; 73:779–781

Information & Authors

Information

Published In

American Journal of Roentgenology
Pages: 397 - 405
PubMed: 35319912

History

Submitted: February 20, 2022
Revision requested: March 8, 2022
Revision received: March 10, 2022
Accepted: March 14, 2022
Version of record online: March 23, 2022

Keywords

  1. false-positive
  2. intrapulmonary lymph node
  3. lung cancer screening
  4. pulmonary nodule

Authors

Affiliations

Mark M. Hammer, MD [email protected]
Department of Radiology, Brigham and Women's Hospital, Harvard Medical School, 75 Francis St, Boston, MA 02115.
Andetta R. Hunsaker, MD
Department of Radiology, Brigham and Women's Hospital, Harvard Medical School, 75 Francis St, Boston, MA 02115.

Notes

Address correspondence to M. M. Hammer ([email protected]).
The authors declare that they have no disclosures relevant to the subject matter of this article.

Funding Information

Supported by NIH grant 1R01CA260889-01 to M. M. Hammer.

Metrics & Citations

Metrics

Citations

Export Citations

To download the citation to this article, select your reference manager software.

Articles citing this article

View Options

View options

PDF

View PDF

PDF Download

Download PDF

Media

Figures

Other

Tables

Share

Share

Copy the content Link

Share on social media