国际肿瘤学杂志 ›› 2025, Vol. 52 ›› Issue (1): 31-37.doi: 10.3760/cma.j.cn371439-20240806-00004

• 论著 • 上一篇    下一篇

基于机器学习构建食管癌患者放射性食管炎预测模型

高威, 张玲, 吴田磊, 胡丽丽, 荣枫()   

  1. 安徽医科大学附属六安医院肿瘤中心放疗科,六安 237000
  • 收稿日期:2024-08-06 修回日期:2024-12-13 出版日期:2025-01-08 发布日期:2025-01-21
  • 通讯作者: 荣枫 E-mail:wazhl1996@163.com
  • 基金资助:
    六安市科技计划(2022lakj042)

A predictive model for radiation esophagitis in esophageal cancer patients based on machine learning

Gao Wei, Zhang Ling, Wu Tianlei, Hu Lili, Rong Feng()   

  1. Department of Radiotherapy, Tumor Center, Lu'an Hospital of Anhui Medical University, Lu'an 237000, China
  • Received:2024-08-06 Revised:2024-12-13 Online:2025-01-08 Published:2025-01-21
  • Contact: Rong Feng E-mail:wazhl1996@163.com
  • Supported by:
    Lu'an Science and Technology Plan(2022lakj042)

摘要:

目的 基于机器学习(ML)算法构建食管癌患者同步放化疗(CRT)过程中发生≥2级放射性食管炎(RE)的预测模型。方法 回顾性分析2018年1月至2023年1月在安徽医科大学附属六安医院接受CRT的276例食管癌患者的临床资料,根据美国放射肿瘤治疗协作组RE分级标准评估是否发生RE,以发生≥2级RE为结局事件。通过最小绝对值收敛和选择算子(LASSO)回归筛选变量后重新建立数据集,将数据集以7∶3的比例分为训练集(n=193)及测试集(n=83),纳入到随机森林(RF)、决策树(DT)、极端梯度提升(XGboost)、支持向量机(SVM)4种ML模型中。在训练集中进行数据训练、模型优化,测试集用于受试者操作特征(ROC)曲线评价模型效果,计算曲线下面积(AUC)、精确度、准确度、敏感性、F1分数对模型进行评估,使用SHAP分析解释最优模型。结果 至随访结束,91例(32.97%)食管癌患者在CRT期间发生≥2级RE。≥2级RE发生组(n=91)与未发生组(n=185)患者肿瘤病灶长径(Z=-5.53,P<0.001)、Karnofsky功能状态(KPS)评分(χ²=5.92,P=0.015)、美国东部肿瘤协作组(ECOG)评分(χ²=4.01,P=0.045)、高血压(χ²=15.35,P<0.001)、糖尿病(χ²=13.06,P<0.001)、白细胞计数(Z=-6.59,P<0.001)、中性粒细胞计数(Z=-6.72,P<0.001)、放疗剂量(χ²=9.81,P=0.002)差异均具有统计学意义。经过LASSO回归筛选最终选择出7个特征变量,分别为肿瘤病灶长径、ECOG评分、KPS评分、中性粒细胞计数、高血压、糖尿病、放疗剂量。ROC曲线分析显示,XGBoost模型预测性能较好,其AUC为0.90、准确度为0.82、精确度为0.80、敏感性为0.73、FI分数为0.76,RF模型AUC为0.89、准确度为0.78、精确度为0.76、敏感性为0.48、FI分数为0.59,DT模型AUC为0.72、准确度为0.72、精确度为0.44、敏感性为0.60、FI分数为0.52,SVM模型AUC为0.74、准确度为0.82、精确度为0.52、敏感性为0.88、FI分数为0.65。通过SHAP分析对XGBoost模型进行解释,其中肿瘤病灶长径、中性粒细胞计数、高血压、糖尿病、放疗剂量对食管癌患者CRT期间发生≥2级RE预测能力较强。结论 基于XGBoost算法建立的模型对食管癌患者CRT期间发生≥2级RE具有较好的预测效能,同时与SHAP分析相结合,可以直观地了解模型中重要特征对结局的影响。

关键词: 食管肿瘤, 化放疗, 机器学习, 放射性食管炎, SHAP分析

Abstract:

Objective To construct a predictive model of ≥ grade 2 radiation esophagitis (RE) in patients with esophageal cancer during concurrent radiochemotherapy (CRT) based on machine learning (ML) algorithm. Methods A retrospective analysis was conducted on the clinical data of 276 patients with esophageal cancer who had received CRT at Lu'an Hospital of Anhui Medical University from January 2018 to January 2023. The occurrence of RE was evaluated according to grading criteria of RE developed by American Radiation Therapy Oncology Group, with ≥ grade 2 RE as the outcome event. After screening variables through the least absolute shrinkage and selection operator (LASSO) regression, the dataset was re-established. The dataset was then divided into training set (n=193) and testing set (n=83) in a 7∶3 ratio and included in four ML models: random forest (RF), decision tree (DT), extreme gradient boosting (XGBoost), and support vector machine (SVM). In the models, data training and model optimization were conducted in the training set, and model performance was evaluated in the testing set using the receiver operator characteristic (ROC) curve. The area under the curve (AUC), accuracy, precision, sensitivity, and F1 score were calculated to assess the model. SHAP analysis was used to explain the optimal model. Results By the end of follow-up, 91 cases (32.97%) of esophageal cancer patients had experienced ≥ grade 2 RE during CRT. There were statistically significant differences in tumor lesion length (Z=-5.53, P<0.001), Karnofsky performance status (KPS) score (χ²=5.92, P=0.015), the Eastern Cooperative Oncology Group (ECOG) score (χ²=4.01, P=0.045), hypertension (χ²=15.35, P<0.001), diabetes (χ²=13.06, P<0.001), white blood cell count (Z=-6.59, P<0.001), neutrophil count (Z=-6.72, P<0.001), and radiotherapy dose (χ²=9.81, P=0.002) between ≥ grade 2 RE occurrence group (n=91) and no occurrence group (n=185). After LASSO regression screening, 7 characteristic variables were ultimately selected, which were tumor lesion length, ECOG score, KPS score, neutrophil count, hypertension, diabetes, and radiotherapy dose. ROC curve analysis showed that the XGBoost model had better predictive performance, with an AUC of 0.90, accuracy of 0.82, precision of 0.80, sensitivity of 0.73, and F1 score of 0.76. The AUC, accuracy, precision, sensitivity, and F1 score of RF model were 0.89, 0.78, 0.76, 0.48, and 0.59, respectively. The AUC, accuracy, precision, sensitivity, and F1 score of DT model were 0.72, 0.72, 0.44, 0.60, and 0.52, respectively. The AUC of SVM model was 0.74, with an accuracy of 0.82, precision of 0.52, sensitivity of 0.88, and F1 score of 0.65. The XGBoost model was explained using SHAP analysis, which indicated that the tumor lesion length, neutrophil count, hypertension, diabetes, and radiotherapy dose had a strong predictive ability for the occurrence of ≥ grade 2 RE during CRT in esophageal cancer patients. Conclusions The model established based on the XGBoost method has good predictive performance for the occurrence of ≥ grade 2 RE in esophageal cancer patients during CRT. Meanwhile, combined with SHAP analysis, it can provide an intuitive understanding of the impact of important features in the model on the outcome.

Key words: Esophageal neoplasms, Chemoradiotherapy, Machine learning, Radiation esophagitis, SHAP analysis