TY - JOUR
T1 - Deep Learning Model for Predicting Immunotherapy Response in Advanced Non-Small Cell Lung Cancer
AU - Rakaee, Mehrdad
AU - Tafavvoghi, Masoud
AU - Ricciuti, Biagio
AU - Alessi, Joao V.
AU - Cortellini, Alessio
AU - Citarella, Fabrizio
AU - Nibid, Lorenzo
AU - Perrone, Giuseppe
AU - Adib, Elio
AU - Fulgenzi, Claudia A.M.
AU - Hidalgo Filho, Cassio Murilo
AU - Di Federico, Alessandro
AU - Jabar, Falah
AU - Hashemi, Sayed
AU - Houda, Ilias
AU - Richardsen, Elin
AU - Rasmussen Busund, Lill Tove
AU - Donnem, Tom
AU - Bahce, Idris
AU - Pinato, David J.
AU - Helland, Åslaug
AU - Sholl, Lynette M.
AU - Awad, Mark M.
AU - Kwiatkowski, David J.
N1 - Publisher Copyright:
©2024RakaeeMet al.
PY - 2025/2/20
Y1 - 2025/2/20
N2 - Importance: Only a small fraction of patients with advanced non-small cell lung cancer (NSCLC) respond to immune checkpoint inhibitor (ICI) treatment. For optimal personalized NSCLC care, it is imperative to identify patients who are most likely to benefit from immunotherapy. Objective: To develop a supervised deep learning-based ICI response prediction method; evaluate its performance alongside other known predictive biomarkers; and assess its association with clinical outcomes in patients with advanced NSCLC. Design, Setting, and Participants: This multicenter cohort study developed and independently validated a deep learning-based response stratification model for predicting ICI treatment outcome in patients with advanced NSCLC from whole slide hematoxylin and eosin-stained images. Images for model development and validation were obtained from 1 participating center in the US and 3 in the European Union (EU) from August 2014 to December 2022. Data analyses were performed from September 2022 to May 2024. Exposure: Monotherapy with ICIs. Main Outcomes and Measures: Model performance measured by clinical end points and objective response rate (ORR) differentiation power vs other predictive biomarkers, ie, programmed death-ligand 1 (PD-L1), tumor mutational burden (TMB), and tumor-infiltrating lymphocytes (TILs). Results: A total of 295 581 image tiles from 958 patients (mean [SD] age, 66.0 [10.6] years; 456 [48%] females and 502 [52%] males) treated with ICI for NSCLC were included in the analysis. The US-based development cohort consisted of 614 patients with median (IQR) follow-up time of 54.5 (38.2-68.1) months, and the EU-based validation cohort, 344 patients with 43.3 (27.4-53.9) months of follow-up. The ORR to ICI was 26% in the developmental cohort and 28% in the validation cohort. The deep learning model's area under the receiver operating characteristic curve (AUC) for ORR was 0.75 (95% CI, 0.64-0.85) in the internal test set and 0.66 (95% CI, 0.60-0.72) in the validation cohort. In a multivariable analysis, the deep learning model's score was an independent predictor of ICI response in the validation cohort for both progression-free (hazard ratio, 0.56; 95% CI, 0.42-0.76; P <.001) and overall survival (hazard ratio, 0.53; 95% CI, 0.39-0.73; P <.001). The tuned deep learning model achieved a higher AUC than TMB, TILs, and PD-L1 in the internal set; in the validation cohort, it was superior to TILs and comparable with PD-L1 (AUC, 0.67; 95% CI, 0.60-0.74), with a 10-percentage point improvement in specificity. In the validation cohort, combining the deep learning model with PD-L1 scores achieved an AUC of 0.70 (95% CI, 0.63-0.76), outperforming either marker alone, with a response rate of 51% compared to 41% for PD-L1 (≥50%) alone. Conclusions and Relevance: The findings of this cohort study demonstrate a strong and independent deep learning-based feature associated with ICI response in patients with NSCLC across various cohorts. Clinical use of this deep learning model could refine treatment precision and better identify patients who are likely to benefit from ICI for treatment of advanced NSCLC.
AB - Importance: Only a small fraction of patients with advanced non-small cell lung cancer (NSCLC) respond to immune checkpoint inhibitor (ICI) treatment. For optimal personalized NSCLC care, it is imperative to identify patients who are most likely to benefit from immunotherapy. Objective: To develop a supervised deep learning-based ICI response prediction method; evaluate its performance alongside other known predictive biomarkers; and assess its association with clinical outcomes in patients with advanced NSCLC. Design, Setting, and Participants: This multicenter cohort study developed and independently validated a deep learning-based response stratification model for predicting ICI treatment outcome in patients with advanced NSCLC from whole slide hematoxylin and eosin-stained images. Images for model development and validation were obtained from 1 participating center in the US and 3 in the European Union (EU) from August 2014 to December 2022. Data analyses were performed from September 2022 to May 2024. Exposure: Monotherapy with ICIs. Main Outcomes and Measures: Model performance measured by clinical end points and objective response rate (ORR) differentiation power vs other predictive biomarkers, ie, programmed death-ligand 1 (PD-L1), tumor mutational burden (TMB), and tumor-infiltrating lymphocytes (TILs). Results: A total of 295 581 image tiles from 958 patients (mean [SD] age, 66.0 [10.6] years; 456 [48%] females and 502 [52%] males) treated with ICI for NSCLC were included in the analysis. The US-based development cohort consisted of 614 patients with median (IQR) follow-up time of 54.5 (38.2-68.1) months, and the EU-based validation cohort, 344 patients with 43.3 (27.4-53.9) months of follow-up. The ORR to ICI was 26% in the developmental cohort and 28% in the validation cohort. The deep learning model's area under the receiver operating characteristic curve (AUC) for ORR was 0.75 (95% CI, 0.64-0.85) in the internal test set and 0.66 (95% CI, 0.60-0.72) in the validation cohort. In a multivariable analysis, the deep learning model's score was an independent predictor of ICI response in the validation cohort for both progression-free (hazard ratio, 0.56; 95% CI, 0.42-0.76; P <.001) and overall survival (hazard ratio, 0.53; 95% CI, 0.39-0.73; P <.001). The tuned deep learning model achieved a higher AUC than TMB, TILs, and PD-L1 in the internal set; in the validation cohort, it was superior to TILs and comparable with PD-L1 (AUC, 0.67; 95% CI, 0.60-0.74), with a 10-percentage point improvement in specificity. In the validation cohort, combining the deep learning model with PD-L1 scores achieved an AUC of 0.70 (95% CI, 0.63-0.76), outperforming either marker alone, with a response rate of 51% compared to 41% for PD-L1 (≥50%) alone. Conclusions and Relevance: The findings of this cohort study demonstrate a strong and independent deep learning-based feature associated with ICI response in patients with NSCLC across various cohorts. Clinical use of this deep learning model could refine treatment precision and better identify patients who are likely to benefit from ICI for treatment of advanced NSCLC.
UR - https://www.scopus.com/pages/publications/85211816209
U2 - 10.1001/jamaoncol.2024.5356
DO - 10.1001/jamaoncol.2024.5356
M3 - Article
SN - 2374-2437
VL - 11
SP - 109
EP - 118
JO - JAMA Oncology
JF - JAMA Oncology
IS - 2
ER -