Introduction

Lung cancer is a common malignancy and a major cause of cancer-related death worldwide.1 Due to widespread use of low-dose computed tomography (CT), the number of detected lung lesions has significantly increased over the past few years. The overlapping imaging features of old and new lesions make it challenging to differentiate benign lesions from malignant ones, emphasizing the importance of assessing the lesion nature, which could impact patient prognosis and quality of life.

A time-honored technique for reliable diagnosis of lung nodules and malignancies is lung biopsy (LB).2-4 Its diagnostic accuracy ranges from 65% to 94%, depending on the study.5-8 These diagnostic yields are affected by a number of variables, including the size of the lesion, the type of needle used (core vs small needle), and the imaging guiding approach (bronchoscopy, CT, or CT fluoroscopy).6-9 It has also been claimed that insufficient biopsy material can be the cause of lung cancer misdiagnosis.10,11

CT-guided percutaneous transthoracic needle biopsy (PTNB) was first described in 1976. The procedure can be divided into 2 types: fine needle aspiration (FNA) and core needle biopsy (CNB), depending on the biopsy needle used. PTNB is increasingly recognized as a minimally invasive method for diagnosing lung lesions, especially peripheral ones. However, its limitations include inadequate sampling of some lesions and the risk of false negative results.12 Accordingly, it is essential to identify a new approach to determine whether the lesion tissue is successfully punctured.

The rapid on-site evaluation (ROSE) technique was proposed in 1981 as a method that would provide immediate feedback on the adequacy of sample acquisition during the examination, guide the operator in modifying the sampling technique (such as the site and depth of sampling), and allow for rapid diagnosis. However, no consensus has been reached on whether ROSE can improve diagnostic accuracy. In this respect, Liu et al13 reported that ROSE did not improve the pathological diagnosis rate of the endobronchial ultrasound (EBUS)-guided transbronchial needle aspiration (TBNA) procedure. Monaco et al14 concluded that ROSE did not affect the diagnostic rate of EBUS-FNA but ensured validity and adequacy of sampling, providing more adequate specimens for subsequent tests, such as flow cytometry, immunostaining, and molecular pathology.

ROSE is a useful tool for swift evaluation of the cytomorphologic characteristics of biopsy specimens for sufficiency and malignancy. To improve the precision of LB diagnosis, ROSE procedures should be carried out under the supervision of experienced pathologists.11 Biopsies guided by bronchoscopy frequently make use of ROSE techniques.11,15-19 On the other hand, research on their application in CT-guided LB has been sparse.20-26

By taking into account different types of bias that might impact individual results, meta-analyses help decrease the risk of bias and increase the statistical power of the results.27 Little is currently known on whether CT-guided puncture biopsy combined with ROSE has guiding significance for diagnosing pulmonary lesions. Therefore, this study reviewed the medical literature to evaluate the diagnostic value of CT-guided puncture biopsy combined with ROSE for assessment of pulmonary lesions. Moreover, we sought to explore the complications of this procedure to provide a basis for the selection of a clinically optimal diagnostic approach.

Aim

The purpose of this study was to investigate the safety and diagnostic accuracy of using a combination of CT-guided LB and ROSE techniques for assessing lung lesions.

Materials and methods

This systematic review and meta-analysis adheres to the reporting guidelines outlined in the Preferred Reporting Items for Systematic Reviews and Meta-analysis (PRISMA) statement.28

Inclusion criteria

The inclusion criteria were related to 4 domains: 1) study type—studies published in English that evaluated the diagnostic value of CT-guided LB combined with ROSE for the assessment of pulmonary lesions; 2) study population—patients with known lung lesions prior to CT-guided puncture biopsy; 3) diagnostic criteria—given that ROSE involves rapid staining of cell smear and does not allow for lesion morphology assessment, it cannot effectively distinguish the pathologic type of lung cancer, and can only offer preliminarily information on whether the tissue is benign or malignant. Therefore, we only analyzed studies in which the final results of CT-guided puncture biopsy combined with ROSE were compared with the gold standard, that is, surgical and histopathologic findings. 4) evaluation index—sufficient data could be found in or calculated from the original study, such as true positive rate, true negative rate, false positive rate, false negative rate, the rate and type of complications, and the adequacy of sampling.

Exclusion criteria

Studies were not eligible for analysis if the diagnosis was not confirmed by the abovementioned gold standard. Review articles, letters, animal studies, and case reports were also excluded.

Search strategy

We searched the PubMed and Embase databases from inception until October 2023 using the following key words and related medical subject heading terms: Biopsy, Needle AND Tomography, X-Ray Computed AND Rapid On-site Evaluation AND Lung Neoplasms. Only English-language publications were considered. We also accessed additional published, unpublished, and investigational studies via the following methods: 1) the “Related articles” function of PubMed was used to identify potentially relevant publications linked to the retrieved studies; 2) the Science Citation Index was searched to obtain papers cited in the retrieved studies; 3) reference lists of the included studies were manually searched.

Data extraction and quality assessment

Firstly, 2 authors (JL and KZ) independently reviewed the titles and abstracts of the retrieved publications to identify potentially relevant articles. Data were extracted using a predesigned form, and any disagreements were resolved through discussion. The following data were extracted: 1) basic characteristics of included studies: first author, date of publication, title, journal name, country; 2) study population, design, sample size, diagnostic method, and rapid staining method; and 3) evaluation index: 4-grid Table data (true-positive, false-positive, true-negative, and false-negative rates).

Quality assessment was performed using the updated Quality Assessment of Diagnostic Accuracy Research (QUADAS-2) tool,29 and each study was evaluated for the risk of bias by assigning an answer “yes,” “no,” or “unclear” to the predefined signaling questions. The risk of bias was determined as low, high, or uncertain.

Statistical analysis

During meta-analysis of diagnostic tests, the threshold effect plays an important role in determining the heterogeneity of accuracy. The threshold effect was calculated using the Spearman rank correlation coefficient between the sensitivity (true-positive rate) and specificity. When the threshold effect was absent, heterogeneity was further analyzed using the χ2 test, and the magnitude of heterogeneity was quantified by I2. If I2 was lower than 50%, a fixed-effects model was used for the combined analysis. Otherwise, a random-effects model was used, and the source of heterogeneity was determined by subgroup analysis. Finally, sensitivity, specificity, diagnostic odds ratio (DOR), area under the summary receiver operating characteristics curve (SROC), and the Q-index were calculated. A greater Q-index indicated higher accuracy of the diagnostic test. The Deek funnel plot was used to evaluate publication bias. All statistics in this systematic evaluation were analyzed using Stata13 (StataCorp, College Station, Texas, United States), RevMan 5.3 (The R Foundation for Statistical Computing, Vienna, Austria), and Meta-disc 1.4 (Metadisc, Madrid, Spain) software. A P value below 0.05 was considered significant.

Results

Screening and inclusion of literature

A total of 19 studies were retrieved from the PubMed and Embase databases, and additional 4 were identified via manual searching of reference lists of the retrieved papers. Eleven studies were excluded based on irrelevant title and abstract, and another 6 were rejected after full-text evaluation. Finally, 6 articles24,30-34 met the criteria for inclusion in our meta-analysis. The flow chart of the literature screening process is shown in Figure 1.

Figure 1. Study search and selection flowchart

Summary of study characteristics

The 6 studies included 1179 patients, and histopathologic results were available for 951 individuals (80.7%). In the study by Fassina et al,31 225 out of 311 participants did not undergo a histopathologic examination, whereas in the study by Santambrogio et al,34 3 patients underwent radiologic follow-up 15 to 21 months after biopsy, and were finally included in the true-negative group since no changes in the lesions were observed. Four studies were prospective,24,32-34 2 were retrospective,30,31 and 4 included a control group.24,32-34 The sampling methods encompassed FNA in 5 studies30-34 and CNB in 1 paper.24 Diverse reagents were used for ROSE staining, including the Diff reagent,24,32,33 toluidine blue reagent,30 methylene blue reagent,11 and the Kimsa reagent.34 ROSE was performed by a cytopathologist in all studies. Raw data are summarized in Tables 1 and 2. Methodological quality evaluation of the included studies is shown in Figure 2.

Table 1 Characteristics of the included studies

Study

Country

Study design

Sample size, n

Biopsies performed, n

Main sampling site

ROSE reagent

ROSE reporter

Sampling method

Anila et al30

India

PCS

50

50

Masses

Toluidine blue

Pathologist

FNA

Fassina et al31

Italy

PCS

311

86

Giemsa

Pathologist

FNA

Liu et al32

China

RCT

108

108

Nodules

Diff-quik

Pathologist

FNA

Peng et al33

China

RCS

205

205

Nodules / masses

Diff-quik

Pathologist

FNA

Santambrogio et al34

Italy

RCT

220

207

Nodules

Giemsa

Pathologist

FNA

Yiminniyaze et al24

China

RCS

285

285

Nodules / masses

Diff-quik

Pathologist

CNB

Abbreviations: CNB, core needle biopsy; FNA, fine-needle aspiration; PCS, prospective cohort study; RCS, retrospective cohort study; RCT, randomized controlled trial; ROSE, rapid on-site evaluation

Table 2 Accuracy of rapid on-site evaluation for diagnosis of pulmonary lesions

Study

Patients, n

Adequacy

Complications

TP

FP

FN

TN

ROSE

Non-ROSE

ROSE

Non-ROSE

ROSE

Non-ROSE

Anila et al30

50

39

Pneumothorax (n = 3)

31

0

3

16

Fassina et al31

311

305

Pneumothorax (n = 13),

hemoptysis (n = 4), chest pain (n = 3)

77

0

3

6

Liu et al32

56

52

52

41

Pneumothorax (n = 6), hemoptysis (n = 10)

Pneumothorax (n = 7), hemoptysis (n = 11)

28

2

4

22

Peng et al33

132

102

Pneumothorax (n = 9), hemoptysis (n = 2)

Pneumothorax (n = 15), hemoptysis (n = 2)

57

4

7

64

Santambrogio et al34

110

110

110

97

Pneumothorax (n = 29)

Pneumothorax (n = 23)

63

1

7

26

Yiminniyaze et al24

163

122

160

105

Pneumothorax (n =34), hemoptysis (n = 21)

Pneumothorax (n = 16), hemoptysis (n = 11)

150

0

3

6

Abbreviations: FN, false-negative rate; FP, false-positive rate; TN, true-negative rate; TP, true-positive rate; others, see Table 1

Figure 2. Reporting quality assessment by the Quality Assessment of Diagnostic Accuracy Research-2 scoring system

Threshold effect

The SROC curve did not exhibit a “shoulder-arm” distribution, and the Spearman correlation coefficient between log sensitivity and log 1-specificity was 0.83 (P >0.05), indicating no threshold effect in this meta-analysis.

Meta-analysis results

The heterogeneity test showed mild heterogeneity among the studies for sensitivity (χ2 = 12.9; I2 = 61.2%; P = 0.02), and the effect sizes were calculated using a random-effects model. No heterogeneity was observed for specificity (χ2 = 3.54; I2 = 0; P = 0.62) and DOR (χ2 = 1.81; I2 = 0; P = 0.88), thus, a fixed-effects model was applied. The pooled sensitivity, specificity, and DOR were 0.94 (95% CI, 0.91–0.96), 0.95 (95% CI, 0.9–0.98), and 159.05 (95% CI, 69.59–363.49), respectively. The SROC AUC was 0.98, and the Q-index was 0.93 (Figure 3).

Figure 3. Forest plot showing the sensitivity, specificity, diagnostic odds ratio, and summary receiver operating characteristics (SROC) analysis of the 6 included studies and the pooled estimates

Abbreviations: AUC, area under the curve; df, degree of freedom; OR, odds ratio

Subgroup analysis

Subgroup analysis was conducted based on the study type (prospective vs retrospective), country of publication (China vs non-China), and year of publication (before vs after 2010). Heterogeneity of sensitivity among the studies was related to the study type, and sensitivity of retrospective studies was significantly higher than that of prospective ones. More details are shown in Figure 4.

Figure 4. Univariable meta-regression and subgroup analyses for sensitivity

a P <⁠0.05

Adequacy of sampling, diagnostic accuracy, and complications

A total of 4 studies24,32-34 established control groups, but 1 of them33 did not specify sampling adequacy and diagnostic accuracy. Figure 5 shows the results of sampling adequacy and diagnostic accuracy analysis in the ROSE vs non-ROSE groups in the 3 relevant studies. Application of ROSE resulted a 12% improvement in sampling adequacy (95% CI, 0.08–0.16; I2 = 0), while the diagnostic accuracy increased by 13% (95% CI, 0.06–0.19; I2 = 41%). The incidence of complications was similar between the ROSE and non-ROSE groups.

Figure 5. Forest plots comparing the adequacy rate (A), diagnostic accuracy (B), and the incidence of complications (C) with or without rapid on-site evaluation in the included studies

Abbreviations: RD, risk difference; others, see Table 1

Publication bias

Deek funnel plots were generated using a P value greater than 0.1 to indicate significant publication bias. No publication bias was observed, as shown in Figure 6.

Figure 6. The Deek funnel plot asymmetry test for publication bias of the 6 studies included in the meta-analysis

Abbreviations: ESS, effective sample size

Discussion

The present study sought to assess the diagnostic value of ROSE in lung puncture biopsy. Comparing CT-guided lung lesion biopsies performed with and without the ROSE technique, this meta-analysis looked at the diagnostic efficacy and safety results. Overall, we demonstrated that the use of ROSE significantly improved the accuracy of CT-guided LB diagnosis without increasing treatment time or the rate of procedure-related complications.

Studies analyzing various CT-guided LB techniques have shown that improved diagnostic accuracy is the main outcome.3,25,35 The present study found that concurrent application of ROSE significantly improved diagnosis rates, increasing them by about 10.8 percentage points. Furthermore, it was shown that ROSE significantly decreased the incidence of secondary LB in the study patients. Early diagnoses based on ROSE can guide subsequent patient care, which is in line with ROSE’s capacity to provide fast feedback on the cytomorphological sufficiency and other features of LB samples.32

As compared with the definitive pathology diagnosis, the ROSE-based diagnoses are quite consistent. The reported accuracy rates range between 89.3% and 95.7%.24,32 However, despite its utility in rapid cell smear staining, ROSE falls short as a definitive diagnostic tool due to its inability to differentiate between various forms of lung cancer and to provide histologic or morphologic information. It only allows clinicians to determine whether a lung lesion is malignant or benign.32

Clinically, CT-guided puncture biopsy remains the mainstay for sampling lung lesions, as it is associated with low invasiveness and an acceptable complication rate. However, in some cases, the number of adequate samples is limited due to operator proficiency or lesion location and size,26 resulting in missed diagnoses and delays in effective treatment.

ROSE represents a rapid cytological interpretation technique for diagnostic interventional pulmonology that saves operative time and resources, and reduces pain and complications. In the relevant 3 studies that included a control group, satisfactory specimens were obtained in 97.9% of the patients in the ROSE group (322/329), yielding an accurate diagnosis in 316 cases. In contrast, 85.6% of the patients in the non-ROSE group (243/284) had satisfactory specimens obtained, and an accurate diagnosis was made in 237 cases. Overall, ROSE increased the adequacy of sampling and diagnostic accuracy of lung lesions by 12% and 13%, respectively, suggesting that application of this technique can improve the positivity rate and diagnostic accuracy of lung puncture, prevent further trauma, and reduce costs.

Among the 6 studies that met the inclusion criteria, the heterogeneity of ROSE’s specificity in diagnosing lung lesions was not significant, although mild heterogeneity in sensitivity was observed. Subgroup analysis showed that the heterogeneity was related to the study design. Indeed, it is well-established that retrospective studies are associated with selection bias and time bias, both of which affect sensitivity to a certain extent. Accordingly, a random-effects model was applied for analysis. The pooled sensitivity, specificity, and AUC were 94%, 95%, and 0.97, indicating that CT-guided puncture biopsy combined with ROSE has a high diagnostic value for differentiating benign lung lesions from malignant ones.

The ROSE method does take time to complete the required dying and associated investigations, although this time can be significantly reduced depending on the operator’s expertise. Therefore, when the ROSE technique was integrated into the LB workflow, there were no discernible changes in the procedure length. However, this end point showed significant heterogeneity. Due to differences in experience and knowledge among operators and the wide range of possible features, retrospective analyses may be skewed. This result has to be confirmed by further well-planned prospective trials.

Current evidence suggests that the most common complications of percutaneous puncture biopsy of chest tumors are pneumothorax, hemorrhage, and pleural reaction. Other complications, including air embolism, pericardial tamponade, and tumor needle tract implantation, are relatively rare. A small pneumothorax and light bleeding are self-limiting and do not require special treatment. The pleural reaction may be related to sex, age, body type, emotional stress, basal glucose level, a history of transthoracic punctures, or lesion and puncture locations. In most cases, patients experience mild symptoms that resolve spontaneously without treatment.

In a large study by Yarmus et al,36 the incidence of pneumothorax and hemoptysis was 51.8% and 10.6%, respectively. The complications reported in the 6 studies included in our meta-analysis comprised pneumothorax (n = 95), hemoptysis (n = 37), and chest pain (n = 3). No other complications occurred, and the incidence of adverse events was within acceptable limits (4%–26% for pneumothorax, 1.5%–17.9% for hemoptysis, and 0.96% for chest pain), with no serious consequences in any of the patients after initiation of supportive treatment. This finding emphasizes the high safety profile of CT-guided puncture biopsy combined with ROSE for the diagnosis of pulmonary lesions.

Core needles, as opposed to tiny needles, are associated with higher levels of sample adequacy.7 The impact of ROSE on the diagnostic accuracy of CT-guided CNB techniques was examined by subgroup analysis. According to the results, ROSE markedly enhanced the accuracy of these procedures, without compromising safety in any way.

Limitations

This study followed the recommended reporting norms for a meta-analysis of diagnostic tests. Although a meticulous literature search and data extraction were conducted, certain limitations should be acknowledged. First, a small number of studies on the application of ROSE in CT-guided percutaneous LB was conducted in China and abroad, and these studies yielded inconsistent findings. Besides, this study did not directly compare the results of CT-guided percutaneous puncture lung biopsy with other imaging guidance modalities. Future studies with larger sample sizes are warranted to improve the robustness of our findings.

Conclusions

In summary, the use of CT-guided puncture biopsy in conjunction with prompt on-site assessment represents a safe and practical supplementary diagnostic approach that exhibits notable levels of diagnostic precision, sensitivity, and specificity in the identification of lung lesions. However, it is important to note that the studies incorporated in this review were subject to potential bias; therefore, the findings should be interpreted with caution.