A deep learning model integrating multisequence MRI to predict EGFR mutation subtype in brain metastases from non-small cell lung cancer

Li, Ye; Lv, Xinna; Chen, Cancan; Yu, Ruize; Wang, Bing; Wang, Dawei; Hou, Dailun

doi:10.1186/s41747-023-00396-z

Original article
Open access
Published: 02 January 2024

A deep learning model integrating multisequence MRI to predict EGFR mutation subtype in brain metastases from non-small cell lung cancer

Ye Li¹^na1,
Xinna Lv¹^na1,
Cancan Chen²,
Ruize Yu²,
Bing Wang³,
Dawei Wang² &
…
Dailun Hou¹

European Radiology Experimental volume 8, Article number: 2 (2024) Cite this article

921 Accesses
1 Altmetric
Metrics details

Abstract

Background

To establish a predictive model based on multisequence magnetic resonance imaging (MRI) using deep learning to identify wild-type (WT) epidermal growth factor receptor (EGFR), EGFR exon 19 deletion (19Del), and EGFR exon 21-point mutation (21L858R) simultaneously.

Methods

A total of 399 patients with proven brain metastases of non-small cell lung cancer (NSCLC) were retrospectively enrolled and divided into training (n = 306) and testing (n = 93) cohorts separately based on two timepoints. All patients underwent 3.0-T brain MRI including T2-weighted, T2-weighted fluid-attenuated inversion recovery, diffusion-weighted imaging, and contrast-enhanced T1-weighted sequences. Radiomics features were extracted from each lesion based on four sequences. An algorithm combining radiomics approach with graph convolutional networks architecture (Radio-GCN) was designed for the prediction of EGFR mutation status and subtype. The area under the curve (AUC) at receiver operating characteristic analysis was used to evaluate the predication capabilities of each model.

Results

We extracted 1,290 radiomics features from each MRI sequence. The AUCs of the Radio-GCN model for identifying EGFR 19Del, 21L858R, and WT for the lesion-wise analysis were 0.996 ± 0.004, 0.971 ± 0.013, and 1.000 ± 0.000 on the independent testing cohort separately. It also yielded AUCs of 1.000 ± 0.000, 0.991 ± 0.009, and 1.000 ± 0.000 for predicting EGFR mutations respectively for the patient-wise analysis. The κ coefficients were 0.735 and 0.812, respectively.

Conclusions

The constructed Radio-GCN model is a new potential tool to predict the EGFR mutation status and subtype in NSCLC patients with brain metastases.

Relevance statement

The study demonstrated that a deep learning approach based on multisequence MRI can help to predict the EGFR mutation status in NSCLC patients with brain metastases, which is beneficial to guide a personalized treatment.

Key points

• This is the first study to predict the EGFR mutation subtype simultaneously.

• The Radio-GCN model holds the potential to be used as a diagnostic tool.

• This study provides an imaging surrogate for identifying the EGFR mutation subtype.

Graphical Abstract

Background

Brain metastases (BM) are the most frequent malignant tumor in the central nervous system, about ten times more common than primary intracranial neoplasms [1]. The incidence of BM is rising and has become the main cause of morbidity and mortality, especially in adult cancer patients with improved survival [2]. Lung cancer, breast cancer, and melanoma have a proclivity toward dissemination to the brain, with lung cancer accounting for most cases of BM [2].

For locally advanced or metastatic non-small cell cancer (NSCLC), targeted therapy instead of chemotherapy may be the best choice of treatment [3]. Approximately 60% of NSCLC patients express epidermal growth factor receptor (EGFR) mutation, a significant therapeutic target for NSCLC [3]. The efficacy of EGFR tyrosine kinase inhibitors (TKIs) depends on the mutation status. Evidence has indicated that EGFR mutant NSCLC patients show a higher response rate to TKIs and achieve longer progression-free survival compared to the patients with wild-type (WT) EGFR [4].

EGFR exon 19 deletion (19Del) and exon 21-point mutation (21L858R), the two major EGFR activating mutations, are sensitive to TKIs, while other rare EGFR mutation subtypes including other point mutations, deletions, insertions, and duplication occurring in exon 18−25 exhibit an unsatisfactory response to TKIs [5]. Although the 19Del and 21L858R mutations present better responses to TKIs, the specific treatment strategies, clinical outcomes, and prognosis are different [6]. Therefore, understanding the EGFR mutation status would be essential to guide treatment and predict prognosis.

In clinical practice, obtaining pathological tissue for genetic testing is the main method to detect mutation status. However, this approach is unsuitable for all situations. Firstly, biopsy or surgical resection of the primary or metastatic lesions is an invasive procedure and many patients with advanced or metastatic NSCLC cannot tolerate the procedure. Secondly, current detection of EGFR mutations primarily relied on conventional DNA sequencing, which is limited by false-negative results [7]. As a result, there is a clinical need to develop simple and noninvasive methods to identify mutation status.

Magnetic resonance imaging (MRI) has become the preferred imaging modality to diagnose, screen, and stage for BM, allowing an earlier detection of BM, even prior to the detection of their primary lung cancer [8]. Radiomics is a rapidly growing research field extracting quantitative features from medical images [9]. Previous research has explored the relationship between genetic status and radiomics of lung cancer on computed tomography [10, 11] while a few studies have evaluated the application of radiomics in BM to identify EGFR mutation status [12,13,14]. However, the predictive performance in differentiating 19Del from 21L858R was unsatisfactory [15, 16]. Deep learning (DL) has been applied in many clinical areas such as tumor pathology with high accuracy [17, 18] while DL algorithms predicting EGFR mutation status in BM are not available.

The aim of this study was to construct a DL model based on multisequence MRI to differentiate WT EGFR and the two common subtypes-19Del and 21L858R simultaneously.

Methods

Patient selection

This retrospective study and the data for analysis were approved by the Ethics Committee of Beijing Chest Hospital, Capital Medical University which waived the requirement for informed consent.

NSCLC patients were selected using the following inclusion criteria: (a) being initially diagnosed with BM with pathological confirmation; (b) underwent genetic testing results of the EGFR mutation for at least one of the BM, by surgery or biopsy or in primary NSCLC tumors or blood samples; (c) with high-quality brain MRI data before any treatment (e.g., surgery, radio-chemotherapy, or targeted therapy). The exclusion criteria were as follows: (a) patients with a history of other tumors or other central nervous system diseases such as infarction, trauma, and inflammatory diseases; (b) with incomplete or low-quality MRI data; (c) lack of clinical data.

According to these criteria, we included 306 patients from June 2019 to January 2023: 120 patients with EGFR 19Del, 108 patients with EGFR 21L858R, and 78 patients with WT EGFR as the training cohort. In addition, the independent testing cohort from January 2012 to January 2013 included 93 patients: 30 EGFR 19Del, 30 EGFR 21L858R, and 33 WT EGFR. The whole enrollment of patient selection is shown in detail in Fig. 1.

Image acquisition and BM segmentation

Patients who were included in the study were scanned with a 3.0-T MRI scanner (SIGAL Architect General Electric Healthcare, Waukesha, WI, USA) equipped with a 48-channel head coil. According to sequences commonly used to diagnose BM and literature related to radiomic analysis of brain lesions, we selected the following sequences for feature extraction:

(i)
A T2 fluid-attenuated inversion recovery (T2-FLAIR) sequence, with repetition time (TR) 7,000 ms, echo time (TE) 79 ms, inversion time 2,500 ms, field of view (FOV) 240 mm × 240 mm, matrix 260 × 260, and 5-mm slice thickness;
(ii)
A T2-weighted sequence with TR 4,000 ms, TE 113 ms, FOV 240 mm × 240 mm, matrix 352 × 352, and 5-mm slice thickness;
(iii)
A diffusion-weighted imaging (DWI) sequence with b values 1,000 and 0 s/mm², TR 4,028 ms, TE 80 ms, FOV 240 mm × 240 mm, matrix 128 × 128, and 5-mm slice thickness; and
(iv)
A contrast-enhanced T1-weighted (T1-CE) sequence, with TR 250 ms, TE 2.46 ms, FOV 240 mm × 240 mm, matrix 320 × 320, and 5-mm slice thickness, acquired after intravenous injection of gadolinium-diethylenediamine penta-acetic acid (0.1 mmol/kg a flow rate 1.0 mL/s).

The Elastix toolbox [19] was firstly used to proceed the four sequences into the equal geometric space for the registration of the T2-weighted sequence, T2-FLAIR, DWI, and T1-CE sequences. This process was based on the open-source 3D Slicer software (https://www.slicer.org). Then all BM were manually segmented on the images obtained with the four sequences in 3D Slicer by a radiologist with 5 years of experience in brain MRI and validated by an independent radiologist with 10 years of experience in brain MRI. Lesions smaller than 5 mm in diameter were excluded. The two radiologists were blinded to the status of gene mutation.

Finally, we segmented 614, 529, and 357 lesions belonging to 150 patients with EGFR 19Del, 138 patients with EGFR 21L858R, and 111 patients with WT EGFR, respectively. The training cohort included 498, 399, and 249 lesions of 120, 108, and 78 patients among the three groups. In addition, the independent testing cohort was composed of 116, 130, and 108 lesions among 30, 30, and 33 patients separately in three groups.

Design and development of Radio-GCN algorithm

In addition to the traditional radiomics and convolutional neural networks (CNN) approaches [17, 18], an algorithm combining radiomics with graph convolutional networks (GCN) architecture (Radio-GCN) was designed for the prediction of EGFR genomic status based on brain MRI from NSCLC patients with BM. Traditional approaches were applied as previously described [20, 21]. As for Radio-GCN (Fig. 2), targeted lesions were first annotated on the T1-CE images and registered to other multiple MRI sequences. To reduce the diversity caused by the image anisotropy, all MRI images and segmentations were resampled to the spacing 3 mm × 0.25 mm × 0.25 mm. Then based on the minimum bounding boxes of the segmentations, all lesions were cropped and normalized to [0, 255] by the MinMax method. Radiomics features of all sequences were extracted using the PyRadiomics package (version 2.2.0), including first-order, shape, Gray Level Co-occurrence Matrix, Gray Level Run Length Matrix, Gray Level Size Zone Matrix, Neighbouring Gray Tone Difference Matrix, and Gray Level Dependence Matrix features. Unlike the traditional radiomics approach, extracted features then underwent feature standardization using L2-norm and MinMax [22] methods instead of feature selection. Particularly, the matrix array of raw data was normalized by the L2-norm along the row direction, resulting in the sum of squared features for each sample equal to 1. Meanwhile, the matrix array of raw data was mapped into [0, 1] along the column direction via the MinMax method. By computing the minimum, maximum, and range (maximum−minimum) of each feature value of all samples, the mapping of the matrix array was achieved by subtracting the minimum value and then dividing by the range. A total of 1,290 features of each multisequence data were utilized as the input for the attention architecture.

Given the inclusion of multiple MRI sequences, an attention mechanism was applied to fuse the standardized features from those sequences before input to the GCN architecture. Let $W, b,\mathrm{ and }u$ be the weight matrix, bias vector, and feature-level context vector, respectively, which could be jointly learned during the training process, and $\{{f}_{1}, \cdots ,{f}_{K}\}$ be the set of $K$ modality radiomics feature. The final feature $f$ could be computed by

$$\begin{array}{l}f=\sum_{k=1}^{K}{w}_{k}{f}_{k}\\ \begin{array}{c}{w}_{k}=\frac{\mathrm{exp}\left({u}_{k}^{T}u\right)}{\sum_{k=1}^{K}\mathrm{exp}\left({u}_{k}^{T}u\right)}\\ {u}_{k}=\mathrm{tanh}\left(W{f}_{k}+b\right)\end{array}\end{array}$$

In the GCN module, the Similar Score [23] was used to initialize the graph network adjacency matrix and edge properties, and Graph SAGE was used for graph network learning and classification. Based on the proposed algorithm architecture, GCN outputs the lesion-wise class predictions. Given that multiple lesions might presented in the same patient, the patient-wise class predictions were achieved via an average aggregation approach. Let the patient-wise predicted results $p=\left[{p}_{1},\cdots ,{p}_{M}\right]$ for $M$ classification task. Assume the patient has N lesions, the predicted result of all lesions is noted as $\left\{{l}_{i},i=1,\cdots ,N\right\}$, where ${l}_{i}$ is $M$ dimensional vector, i.e., ${l}_{i}={[{l}_{i}^{1},{l}_{i}^{2},\cdots ,{l}_{i}^{M}]}^{T}$, then

$$\begin{array}{l}{p}_{m}=\frac{1}{N}\sum_{i=1}^{N}{l}_{i}^{m}, m=1,\cdots , M,\\ p=[{p}_{1},\cdots ,{p}_{m},\cdots ,{p}_{M}]\\ p=\frac{p}{{\Vert p\Vert }_{L1}},\end{array}$$

where $L1$ represents L1-norm.

Ablation studies on model performance

To evaluate the effectiveness of the various configurations in our proposed algorithm and the multiple MRI sequences, we conducted two ablation experiments. Particularly, models were developed with or without feature standardization, and feature fusion modules and the model performance were compared. Similarly, models developed on only T1-CE sequences were compared with the four-sequence-based models to reveal the benefit of using multiple sequences.

Statistical analysis

The basic clinical characteristics of patients among the three groups were compared using the Mann-Whitney U test, t test, and χ² test, as appropriate. The receiver operating characteristic (ROC) curve analysis was used to assess model performance and we calculated the area under the curve (AUC) for each model. We bootstrapped AUC on the test set 2,000 times when reporting the AUC errors. Briefly, the random selections of 93 samples from the test set (93 samples) were performed 2,000 times for testing, as a single sample could be selected repeatedly in each round. With 2,000 testing results, we evaluated the AUC errors. We also calculated the sensitivity, specificity, and accuracy of all models. The DeLong test was performed to compare the differences between AUCs. The bootstrap was also used to generate enough samples for statistical analyses. The p value lower than 0.05 was considered statistically significant. The process of statistical analysis was performed with SPSS software (version 26) and the Python Scikit-learn package.

Results

Clinical characteristics

Table 1 summarizes the clinical characteristics of the 399 included patients. There were no significant differences in terms of gender, alcohol consumption, and smoking between the 19Del and 21L858R groups both in the training and the testing cohorts while a significant difference was found for age between the two groups in the training cohort. When comparing the 19Del group with the WT group and the 21L858R group with the WT group, alcohol consumption and smoking showed significant differences in the two cohorts while a significant difference was found for age only in the training cohort. No significant differences for sex were found among 19Del/21L858R and WT EGFR.

Table 1 Clinical characteristics of patients with 19Del, 21L858R, and WT EGFR mutation status in the training and testing cohorts

Full size table

Performance of Radio-GCN model

Five-fold cross-validation was performed in model development and the optimal model for each fold was determined on the validation sets. The performance of all optimal fold models was tested on the testing set and fold 3 was selected as a representative for further detailed evaluation and analysis. As shown in Fig. 3, fold 3 showed the best performance with the AUC of 0.996 ± 0.004, 0.971 ± 0.013, and 1.000 ± 0.000 (mean ± standard deviation) for identifying EGFR 19Del, 21L858R, and WT in the lesion-wise analysis on independent test set. It also yielded AUCs of 1.000 ± 0.000, 0.991 ± 0.009, and 1.000 ± 0.000 for predicting EGFR mutations in the patient-wise analysis, respectively. The κ coefficient reached 0.735 and 0.812 in the lesion-wise and patient-wise analysis on the independent test set, respectively. Detailed data are provided in Table 2.

Table 2 Performance of the optimal fold for identifying EGFR 19Del, 21L858R and WT in lesion-wise and patient-wise on the independent test set

Full size table

Ablation studies on model performance

Based on the fold 3 model, we further explored the effectiveness of module setting and MRI sequences in improving model performance on differentiating EGFR 19Del, 21L858R, and WT status. As shown in Fig. 4, model 1, only utilizing the GCN classifier displayed a classification power with AUCs of 0.555 ± 0.054, 0.552 ± 0.056, and 0.606 ± 0.061 for predicting mutations in lesion-wise analysis, similar to that of the patient-wise analysis with AUCs of 0.615 ± 0.103, 0.516 ± 0.112, and 0.665 ± 0.107, respectively. When combining feature standardization with GCN, the model 2 showed a favorable lesion-wise discriminatory ability with AUCs of 0.919 ± 0.027, 0.972 ± 0.014, and 0.999 ± 0.001, confirmed in patient-wise analysis with AUCs of 0.940 ± 0.046, 0.986 ± 0.014, and 1.000 ± 0.000, for identifying EGFR 19Del, 21L858R, and WT. Briefly, the overall accuracy of model 1, model 2, and final model were 0.404 ± 0.045, 0.703 ± 0.042, and 0.818 ± 0.035 in lesion-wise, which was similar in patient-wise with overall accuracy of 0.441 ± 0.088, 0.756 ± 0.076, and 0.874 ± 0.059.

Statistical analyses revealed that the feature standardization module significantly enhanced the differentiation of 19Del, 21L858R, and WT, while the feature fusion module further significantly boosted the discrimination of 19Del from other phenotypes. In addition, the ablation study (Fig. 5) showed that when using radiomic features from the only T1-CE sequence, the model demonstrated a significantly inferior performance to the multisequence model, with an overall accuracy of 0.813 ± 0.036, and 0.828 ± 0.072 in the lesion and patient-wise analysis on predicting EGFR 19Del, 21L858R, and WT, respectively. Additionally, the κ coefficient dropped from 0.812 to 0.734 when only the T1-CE sequence was utilized. Besides, when using the radiomic features of the other three sequences, the overall accuracy of lesion and patient-wise were 0.640 ± 0.046 and 0.689 ± 0.084. Detailed data are provided in Tables 3 and 4. Detailed results of DeLong tests between models are presented as Supplementary material Tables 1, 2 and 3.

Table 3 Performance of different module settings on differentiating EGFR 19Del, 21L858R, and WT EGFR in two wises

Full size table

Table 4 Performance of T1-CE model and multi-sequence model on differentiating EGFR 19Del, 21L858R, and WT EGFR in two wises

Full size table

Discussion

Early and non-invasive identification of EGFR mutation status and subtypes is of great importance to guide individual therapy [5, 6]. To our knowledge, although radiological characterization for differentiation EGFR mutation status or subtypes has been explored [10,11,12,13,14,15,16, 24, 25], there was a lack of a classifier that could identify WT EGFR and the two common EGFR mutation subtypes (19Del and 21L858R) simultaneously. Hence, we extracted and fused radiomics features of BM from NSCLC from T1-CE, T2WI, DWI, and T2-FLAIR sequences and developed a DL Radio-GCN model to classify EGFR status at both lesion- and patient-wise. Finally, we found that the multisequence MRI-based Radio-GCN model can effectively predict the EGFR mutation status and subtype in NSCLC patients with BM.

Some clinical parameters, such as age, sex, smoking, and alcohol consumption were analyzed in our study. Among the three groups, no significant difference for sex was found in either the training or testing cohort while alcohol consumption showed statistical significance when differentiating 19Del or 21L858R groups from the WT EGFR group in the two cohorts. These results are in line with a previous study reporting that EGFR mutation is common in nonsmokers [12] and the other mutation type in terms of KRAS occurs in almost one-third of tobacco-related tumors [7]. Furthermore, the incidence of 21L858R increases with age and is particularly characteristic for elderly patients [7], as also happened in our study.

Effectively assessing EGFR mutation status not only can guide mutant patients to take TKIs timely, but also suggests WT patients undergo further polygenetic testing. Receptors and cells that harbor 19Del and 21L858R mutations both have been shown to be highly sensitive to EGFR TKIs, but the therapeutic regimen, response to treatment, and prognosis are different between the two groups. First, increasing evidence has demonstrated that the efficacy of TKIs in 19Del patients is better and shows longer progression-free survival as compared to those carrying 21L858R [7, 26, 27]. It may be associated with the abundance of EGFR-activating mutation in tumor tissue and circulating tumor DNA samples [27]. The median abundance in 19Del patients is significantly higher than that in 21L858R patients [27]. Second, the selection and dosage of TKIs for the two subtypes are different. The first choice for 19Del patients is osimertinib or afatinib, but for 21L858R patients, dacomitinib or erlotinib with bevacizumab is considered as the first choice [28]. Li et al. [29] reported that NSCLC patients with 21L858R may benefit from the increased dosage of the first-generation TKIs. Third, T790M resistance mutation is prone to emerge in the context of 19del rather than 21L858R mutation [30]. Therefore, identifying the two most frequent EGFR subtypes has high clinical value.

Various studies are available on the prediction of EGFR mutation status using quantitative radiomic methods [6, 10, 11, 15, 16, 26]. Cheng et al. [11] established a radiomic model to assess EGFR mutation status (mutant or WT) of lung adenocarcinoma presenting as ground-glass opacity and achieved an AUC of 0.838 in the training cohort. Liu et al. [26] developed predictive models based on radiomics analysis of 18F-FDG PET/ CT images to identify WT EGFR with 19Del or 21L858R respectively. However, a few studies offered a direct differentiation between 19Del and 21L858R. Wang et al. [12] attempted to explore the ability of their radiomic signature to predict the two EGFR subtypes, but obtained the unsatisfactory results with low AUCs. Currently, with the widespread application of DL technology based on CNN, many reports have revealed that DL performs better than conventional radiomics in discriminating EGFR mutations [31]. Song et al. [31] showed a superior performance of DL-based approaches in evaluating EGFR mutation subtypes in patients with lung adenocarcinoma as compared with radiomics, even though they were not able to distinguish subtypes of EGFR mutations in detail.

In our study, we tried to use the current common algorithms to target the two-classification task regarding 19Del and 21L858R mutations, such as the CNN model (CNN backbone, CNN classifier) and radiomics approaches (traditional machine learning classifier, CNN classifier or GCN classifier). The CNN framework was first explored and found to be unsatisfactory (mutation types could barely be differentiated). Possibly for the small dataset, the strong fitting ability of the CNN model hardly extracted the generalization features (geometric structure, texture), which led to the fast overfitting during model training. Radiomics alone approach was first explored and found to be unsatisfactory (mutation types could barely be differentiated), while combined with the GCN classifier worked well (see Supplementary Material Table 4). GCN algorithms alone were also utilized and found to be either underfitting or overfitting. Given that GCN has been recently proven to be efficient in disease-prediction tasks by leveraging the individuality of each multi-modal data [32,33,34], the radiomics feature extraction approach was combined with the GCN architecture.

Of note, feature standardization and feature fusion methods were essential to guarantee the performance in our proposed model as evidenced by the ablation study of network structures. Standardization of selected radiomics features might have accelerated the model training and improved the accuracy of Radio-GCN by scaling features into the same magnitude. The attention-based feature fusion fully mined the multisequence relationship and avoided the multi-GCN architecture towards all sequences [33], further enhancing the model performance.

This study has some limitations. Firstly, it is single-center research. However, the sample size of 399 patients and 1,500 lesions was larger than that of similar studies. Second, owing to the limitation of the small number of rare EGFR mutations, these mutations were not analyzed. Finally, considering the different MRI vendors, magnetic fields, and scan protocols, the generalization issues of the model to other clinical setting remains to be demonstrated.

In summary, we analyzed the four conventional MRI sequences (T1CE, T2WI, DWI, and T2-FLAIR) and used a DL method to discriminate the three common EGFR genomic subtypes: WT, 19Del, and 21L858R. The study demonstrated that a DL approach based on multisequence MRI can help to predict the EGFR mutation subtypes in NSCLC patients with BM, with potential beneficial effects to guide a personalized treatment.

Availability of data and materials

The datasets used and/or analyzed during the current study are available from the corresponding author on reasonable request.

Abbreviations

19Del:: 19 deletion
21L858R:: 21-point mutation
AUC:: Area under the ROC curve
BM:: Brain metastases
CNN:: Convolutional neural network
DL:: Deep learning
DWI:: Diffusion-weighted imaging
EGFR:: Epidermal growth factor receptor
FOV:: Field of view
GCN:: Graph convolutional network
MRI:: Magnetic resonance imaging
NSCLC:: Non-small cell lung cancer
ROC:: Receiver operating characteristic
T1-CE:: Contrast-enhanced T1-weighted imaging
T2-FLAIR:: T2 fluid-attenuated inversion recovery
TE:: Echo time
TKIs:: Tyrosine kinase inhibitors
TR:: Repetition time
WT:: Wild-type

References

Suh JH, Kotecha R, Chao ST, Ahluwalia MS, Sahgal A, Chang EL (2020) Current approaches to the management of brain metastases. Nat Rev Clin Oncol 17:279–299. https://doi.org/10.1038/s41571-019-0320-3
Article PubMed Google Scholar
Sacks P, Rahman M (2020) Epidemiology of brain metastases. Neurosurg Clin N Am 31:481–488. https://doi.org/10.1016/j.nec.2020.06.001
Article PubMed Google Scholar
da Cunha Santos G, Shepherd FA, Tsao MS (2011) EGFR mutations and lung cancer. Annu Rev Pathol 6:49–69. https://doi.org/10.1146/annurev-pathol-011110-130206
Article CAS Google Scholar
Recondo G, Facchinetti F, Olaussen KA, Besse B, Friboulet L (2018) Making the first move in EGFR-driven or ALK-driven NSCLC: first-generation or next-generation TKI? Nat Rev Clin Oncol 15:694–708. https://doi.org/10.1038/s41571-018-0081-4
Article CAS PubMed Google Scholar
Harrison PT, Vyse S, Huang PH (2020) Rare epidermal growth factor receptor (EGFR) mutations in non-small cell lung cancer. Semin Cancer Biol 61:167–179. https://doi.org/10.1016/j.semcancer.2019.09.015
Article CAS PubMed PubMed Central Google Scholar
Li S, Ding C, Zhang H, Song J, Wu L (2019) Radiomics for the prediction of EGFR mutation subtypes in non-small cell lung cancer. Med Phys 46:4545–4552. https://doi.org/10.1002/mp.13747
Article CAS PubMed Google Scholar
Imyanitov EN, Iyevleva AG, Levchenko EV (2021) Molecular testing and targeted therapy for non-small cell lung cancer: Current status and perspectives. Crit Rev Oncol Hematol 157:103194. https://doi.org/10.1016/j.critrevonc.2020.103194
Article PubMed Google Scholar
Derks SHAE, van der Veldt AAM, Smits M (2022) Brain metastases: the role of clinical imaging. Br J Radiol 95:20210944. https://doi.org/10.1259/bjr.20210944
Article PubMed Google Scholar
Mayerhoefer ME, Materka A, Langs G et al (2020) Introduction to radiomics. J Nucl Med 61:488–495. https://doi.org/10.2967/jnumed.118.222893
Article CAS PubMed PubMed Central Google Scholar
Jia TY, Xiong JF, Li XY et al (2019) Identifying EGFR mutations in lung adenocarcinoma by noninvasive imaging using radiomics features and random forest modeling. Eur Radiol 29:4742–4750. https://doi.org/10.1007/s00330-019-06024-y
Article PubMed Google Scholar
Cheng B, Deng H, Zhao Y et al (2022) Predicting EGFR mutation status in lung adenocarcinoma presenting as ground-glass opacity: utilizing radiomics model in clinical translation. Eur Radiol 32:5869–5879. https://doi.org/10.1007/s00330-022-08673-y
Article CAS PubMed Google Scholar
Wang G, Wang B, Wang Z et al (2021) Radiomics signature of brain metastasis: prediction of EGFR mutation status. Eur Radiol 31:4538–4547. https://doi.org/10.1007/s00330-020-07614-x
Article CAS PubMed Google Scholar
Li Y, Lv X, Wang B et al (2022) Predicting EGFR T790M mutation in brain metastases using multisequence MRI-based radiomics signature. Acad Radiol S1076–6332(22):00686–9. https://doi.org/10.1016/j.acra.2022.12.030
Article Google Scholar
Li Y, Lv X, Wang B et al (2022) Differentiating EGFR from ALK mutation status using radiomics signature based on MR sequences of brain metastasis. Eur J Radiol 155:110499. https://doi.org/10.1016/j.ejrad.2022.110499
Article PubMed Google Scholar
Zhang M, Bao Y, Rui W et al (2020) Performance of 18F-FDG PET/CT radiomics for predicting EGFR mutation status in patients with non-small cell lung cancer. Front Oncol 10:568857. https://doi.org/10.3389/fonc.2020.568857
Article PubMed PubMed Central Google Scholar
Tu W, Sun G, Fan L et al (2019) Radiomics signature: A potential and incremental predictor for EGFR mutation status in NSCLC patients, comparison with CT morphology. Lung Cancer 132:28–35. https://doi.org/10.1016/j.lungcan.2019.03.025
Article PubMed Google Scholar
Wang S, Shi J, Ye Z et al (2019) Predicting EGFR mutation status in lung adenocarcinoma on computed tomography image using deep learning. Eur Respir J 53:1800986. https://doi.org/10.1183/13993003.00986-2018
Article PubMed PubMed Central Google Scholar
Wang C, Xu X et al (2021) Deep learning to predict EGFR mutation and PD-L1 expression status in non-small-cell lung cancer on computed tomography images. J Oncol 2021:5499385. https://doi.org/10.1155/2021/5499385
Article CAS PubMed PubMed Central Google Scholar
Klein S, Staring M, Murphy K, Viergever MA, Pluim JP (2010) elastix: a toolbox for intensity-based medical image registration. IEEE Trans Med Imaging 29:196–205. https://doi.org/10.1109/TMI.2009.2035616
Article PubMed Google Scholar
Li Y, Wang B, Wen L et al (2023) Machine learning and radiomics for the prediction of multidrug resistance in cavitary pulmonary tuberculosis: a multicentre study. Eur Radiol 33:391–400. https://doi.org/10.1007/s00330-022-08997-9
Article PubMed Google Scholar
Ekong F, Yu Y, Patamia RA et al (2022) Bayesian depth-wise convolutional neural network design for brain tumor MRI classification. Diagnostics (Basel) 12:1657. https://doi.org/10.3390/diagnostics12071657
Article PubMed Google Scholar
Nie F, Wang Z, Wang R, Wang Z, Li X (2019) Towards robust discriminative projections learning via Non-Greedy ℓ 2,1 ℓ 2, 1-Norm MinMax. IEEE Trans Pattern Analysis Machine Intell 43:2086–2100. https://doi.org/10.3390/diagnostics12071657
Article Google Scholar
Hamilton W, Ying Z, Leskovec J (2017) Inductive representation learning on large graphs. Adv Neural Inform Process Syst; 30. https://doi.org/10.48550/arXiv.1706.02216
Haim O, Abramov S, Shofty B et al (2022) Predicting EGFR mutation status by a deep learning approach in patients with non-small cell lung cancer brain metastases. J Neurooncol 157:63–69. https://doi.org/10.1007/s11060-022-03946-4
Article CAS PubMed Google Scholar
Cao R, Pang Z, Wang X, et al (2022) Radiomics evaluates the EGFR mutation status from the brain metastasis: a multi-center study. Phys Med Biol 67:https://doi.org/10.1088/1361-6560/ac7192.
Liu Q, Sun D, Li N, et al (2020) Predicting EGFR mutation subtypes in lung adenocarcinoma using 18F-FDG PET/CT radiomic features. Transl Lung Cancer Res 9:549– 562. https://doi.org/10.21037/tlcr.2020.04.17
Li X, Cai W, Yang G et al (2017) Comprehensive analysis of EGFR-mutant bbundance and its effect on efficacy of EGFR TKIs in advanced NSCLC with EGFR mutations. J Thorac Oncol 12:1388–1397. https://doi.org/10.1016/j.jtho.2017.06.006
Article PubMed Google Scholar
Stewart EL, Tan SZ, Liu G, Tsao MS (2015) Known and putative mechanisms of resistance to EGFR targeted therapies in NSCLC patients with EGFR mutations-a review. Transl Lung Cancer Res 4:67–81. https://doi.org/10.3978/j.issn.2218-6751.2014.11.06
Article CAS PubMed PubMed Central Google Scholar
Li X, Zhang L, Jiang D et al (2020) Routine-dose and high-dose Icotinib in Patients with advanced non-small cell lung cancer harboring EGFR exon 21–L858R mutation: the randomized, Phase II, INCREASE Trial. Clin Cancer Res 26:3162–3171. https://doi.org/10.1158/1078-0432.CCR-19-3064
Article CAS PubMed Google Scholar
Eide IJZ, Helland Å, Ekman S et al (2020) Osimertinib in T790M-positive and -negative patients with EGFR-mutated advanced non-small cell lung cancer (the TREM-study). Lung Cancer 143:27–35. https://doi.org/10.1016/j.lungcan.2020.03.009
Article PubMed Google Scholar
Song J, Ding C, Huang Q et al (2021) Deep learning predicts epidermal growth factor receptor mutation subtypes in lung adenocarcinoma. Med Phys 48:7891–7899. https://doi.org/10.1002/mp.15307
Article CAS PubMed Google Scholar
A. Kazi, S. Shekarforoush, K. Kortuem, et al (2019) “Self-attention equipped graph convolutions for disease prediction,” in 2019 IEEE 16th Int. Symp. on Biomed. Imaging (ISBI), 1896– 1899. https://doi.org/10.48550/arXiv.1812.09954
J. Valenchon and M. Coates (2019) “Multiple-graph recurrent graph convolutional neural network architectures for predicting disease outcomes,” in IEEE Int. Conf. Acoust. Speech Signal Process. (ICASSP), 3157– 3161. https://doi.org/10.48550/arXiv.2107.13226
Zheng S , Zhu Z , Liu Z , et al (2022) Multi-modal graph learning for disease prediction. IEEE Transactions on Medical Imaging, 2207– 2216. https://doi.org/10.1109/TMI.2022.315926

Download references

Funding

This research was supported by the Beijing Tongzhou District Science and Technology Project (KJ2022CX089) and Leading Talents of Beijing Tongzhou District High Level Talent Development Support Project (YHLD2019029). The funding source provided financial support without any influence on the study design and interpretation of data.

Author information

Ye Li and Xinna Lv contributed equally to this work.

Authors and Affiliations

Department of Radiology, Beijing Chest Hospital, Capital Medical University, Beijing, 101149, China
Ye Li, Xinna Lv & Dailun Hou
Institute of Advanced Research, Infervision Medical Technology Co., Ltd., Beijing, 100025, China
Cancan Chen, Ruize Yu & Dawei Wang
Department of Radiology, Beijing Tuberculosis and Thoracic Tumor Research Institute, Beijing, 101149, China
Bing Wang

Authors

Ye Li
View author publications
You can also search for this author in PubMed Google Scholar
Xinna Lv
View author publications
You can also search for this author in PubMed Google Scholar
Cancan Chen
View author publications
You can also search for this author in PubMed Google Scholar
Ruize Yu
View author publications
You can also search for this author in PubMed Google Scholar
Bing Wang
View author publications
You can also search for this author in PubMed Google Scholar
Dawei Wang
View author publications
You can also search for this author in PubMed Google Scholar
Dailun Hou
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

Ye Li: conceptualization, data curation, formal analysis, writing — original draft, writing — review and editing. Xinna Lv: data curation, formal analysis, writing — original draft, writing — review and editing. Cancan Chen: data curation. Ruize Yu: formal analysis, software. Bing Wang: validation, formal analysis, writing — review and editing. Dawei Wang: methodology, writing — review and editing, project administration. Dailun Hou: methodology, writing — review and editing, project administration.

Corresponding authors

Correspondence to Dawei Wang or Dailun Hou.

Ethics declarations

Ethics approval and consent to participate

This study was approved by the Ethics Committee of Beijing Chest Hospital, Capital Medical University. All methods were carried out in accordance with the Declaration of Helsinki guidelines and regulations. The Ethics Committee of Beijing Chest Hospital, Capital Medical University approved all the data in the study for retrospective analysis and waived the demand for informed consent.

Consent for publication

Not applicable.

Competing interests

The authors declare that they have no competing interests.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Additional file 1:

Table S1. DeLong test analyses p-values between different fold models: lesion-wise and patient-wise. Table S2. DeLong test analyses p-values between models developed based on different MRI sequences. Table S3. DeLong test analyses p-values between models of different network components: GCN Classifier-model1, Feat Stand.+GCN Classifier-model2 and Feat. Stand.+Feat. Fuse+GCN Classifier-model3. Table S4. Performance of different methods for differentiating 19Del and 21 L858R.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Li, Y., Lv, X., Chen, C. et al. A deep learning model integrating multisequence MRI to predict EGFR mutation subtype in brain metastases from non-small cell lung cancer. Eur Radiol Exp 8, 2 (2024). https://doi.org/10.1186/s41747-023-00396-z

Download citation

Received: 03 August 2023
Accepted: 30 September 2023
Published: 02 January 2024
DOI: https://doi.org/10.1186/s41747-023-00396-z

A deep learning model integrating multisequence MRI to predict EGFR mutation subtype in brain metastases from non-small cell lung cancer

Abstract

Background

Methods

Results

Conclusions

Relevance statement

Key points

Graphical Abstract

Background

Methods

Patient selection

Image acquisition and BM segmentation

Design and development of Radio-GCN algorithm

Ablation studies on model performance

Statistical analysis

Results

Clinical characteristics

Performance of Radio-GCN model

Ablation studies on model performance

Discussion

Availability of data and materials

Abbreviations

References

Funding

Author information

Authors and Affiliations

Contributions

Corresponding authors

Ethics declarations

Ethics approval and consent to participate

Consent for publication

Competing interests

Additional information

Publisher's Note

Supplementary Information

Additional file 1:

Rights and permissions

About this article

Cite this article

Share this article

Keywords