尺度不变特征增强深度学习在口腔黏膜病损分割中应用的研究

张睿; 金路; 陈谦明; 丁婷婷; 张琦玥; 陈耀武; 田翔; 曹雨齐; 陈小燕; 朱赴东

doi:10.3760/cma.j.cn112144-20241210-00467

高级检索

IP登录

登录IP

切换

退出

中华口腔医学杂志

• 口腔黏膜病学研究 •

ENGLISH ABSTRACT

尺度不变特征增强深度学习在口腔黏膜病损分割中应用的研究

作者及单位信息

出版日期： 2025 -03 -09 ·

DOI： 10.3760/cma.j.cn112144-20241210-00467

Scale-invariant feature-enhanced deep learning framework for oral mucosal lesion segmentation

Authors Info & Affiliations

Zhang Rui

Center of Information, Stomatology Hospital, School of Stomatology, Zhejiang University School of Medicine & Clinical Research Center for Oral Diseases of Zhejiang Province & Key Laboratory of Oral Biomedical Research of Zhejiang Province & Cancer Center of Zhejiang University & Engineering Research Center of Oral Biomaterials and Devices of Zhejiang Province, Hangzhou 310005, China

Jin Lu

Chen Qianming

Oral Medicine Center, Stomatology Hospital, School of Stomatology, Zhejiang University School of Medicine & Clinical Research Center for Oral Diseases of Zhejiang Province & Key Laboratory of Oral Biomedical Research of Zhejiang Province & Cancer Center of Zhejiang University & Engineering Research Center of Oral Biomaterials and Devices of Zhejiang Province, Hangzhou 310005, China

Ding Tingting

Zhang Qiyue

Clinical Research Center for Oral Diseases, Stomatology Hospital, School of Stomatology, Zhejiang University School of Medicine & Key Laboratory of Oral Biomedical Research of Zhejiang Province & Cancer Center of Zhejiang University & Engineering Research Center of Oral Biomaterials and Devices of Zhejiang Province, Hangzhou 310005, China

Chen Yaowu

Zhejiang University, College of Biomedical Engineering and Instrument Science & Zhejiang University Embedded System Engineering Research Center, Ministry of Education of China, Hangzhou 310027, China

Tian Xiang

Zhejiang University, College of Biomedical Engineering and Instrument Science & Zhejiang University Embedded System Engineering Research Center, Ministry of Education of China, Hangzhou 310027, China

Cao Yuqi

Zhejiang University, College of Control Science and Engineering & State Key Laboratory of Industrial Control Technology, Hangzhou 310027, China

Chen Xiaoyan

Orthodontics Department, Stomatology Hospital, School of Stomatology, Zhejiang University School of Medicine & Clinical Research Center for Oral Diseases of Zhejiang Province & Key Laboratory of Oral Biomedical Research of Zhejiang Province & Cancer Center of Zhejiang University & Engineering Research Center of Oral Biomaterials and Devices of Zhejiang Province, Hangzhou 310005, China

Zhu Fudong

Published： 2025 -03 -09 ·

DOI： 10.3760/cma.j.cn112144-20241210-00467

APP内阅读

摘要

目的开发一种融合深度学习与尺度不变特征变换算法的口腔黏膜病损语义分割模型（PixelSIFT-UNet），以提高口腔黏膜疾病病损区域分割精度。

方法纳入浙江大学医学院附属口腔医院2020年1月至2022年12月期间收集的838张符合标准的口腔黏膜疾病临床白光图像，使用Python语言random.seed函数设置随机种子，通过random.sample函数进行随机抽样，按6∶2∶2的比例划分为训练集（506张）、验证集（166张）和测试集（166张）。采用Labelme软件对训练集图像进行病损边界标注，并基于PixelSIFT-UNet构建深度学习模型，分别使用VGG-16与ResNet-50作为骨干网络。通过验证集优化模型参数，在测试集上评估模型性能，包括Dice系数、平均交并比（mIoU）、平均像素准确率（mPA）和精确率（Precision），并与传统语义分割模型（U-Net、PSPNet）进行对比分析。

结果所开发的PixelSIFT-UNet模型能精确分割口腔扁平苔藓、口腔白斑病和口腔黏膜下纤维性变3种常见的口腔黏膜疾病。采用VGG-16为骨干网络时，Dice系数、mIoU、mPA和Precision分别达到0.642、0.699、0.836和0.792；采用ResNet-50为骨干网络时，相应指标分别达到0.668、0.733、0.872和0.817，较传统U-Net模型（相应指标：0.662、0.717、0.861和0.809）、PSPNet模型（相应指标：0.671、0.721、0.858和0.813）的各项性能指标均有显著提升。

结论所开发的PixelSIFT-UNet模型在口腔黏膜病损分割任务中表现卓越，其性能指标显著优于传统语义分割模型，可为提高口腔黏膜病损分割精度提供有效工具。

关键词：口腔黏膜;病损检测;深度学习;PixelSIFT-UNet模型;尺度不变特征变换算法

ABSTRACT

ObjectiveTo develop PixelSIFT-UNet, a novel semantic segmentation model that integrates deep learning with scale-invariant feature transform (SIFT) algorithm to improve the segmentation accuracy of oral mucosal lesions.

MethodsThis investigation utilized 838 standard clinical white light images of oral mucosal diseases acquired from January 2020 to December 2022 at the Stomatology Hospital Zhejiang University School of Medicine. Randomization was achieved through Python′s random.seed function implementation. The random sample function was subsequently applied for sampling distribution. The dataset was stratified into three subsets with a 6∶2∶2 ratio: training ( n=506), validation ( n=166), and testing ( n=166). Lesion boundaries were annotated using Labelme software, and a PixelSIFT-UNet-based deep learning model was developed with VGG-16 and ResNet-50 backbone networks. Model parameters were optimized using the validation set, and performance metrics [including Dice coefficient, mean intersection over union (mIoU), mean pixel accuracy (mPA), and Precision] were assessed on the test set. The model′s performance was benchmarked against conventional semantic segmentation frameworks (U-Net and PSPNet).

ResultsThe developed PixelSIFT-UNet model could achieve precise segmentation of three common oral mucosal lesions: oral lichen planus, oral leukoplakia, and oral submucous fibrosis. Utilizing VGG-16 as the backbone network, the model achieved Dice coefficient, mIoU, mPA, and Precision values of 0.642, 0.699, 0.836, and 0.792, respectively. Implementation with ResNet-50 backbone network yielded metrics of 0.668, 0.733, 0.872 and 0.817, demonstrating significant improvements across all performance indicators compared to conventional U-Net model (relevant metrics: 0.662, 0.717, 0.861 and 0.809) and PSPNet model (relevant metrics: 0.671, 0.721, 0.858 and 0.813).

ConclusionsThe proposed PixelSIFT-UNet architecture demonstrates superior performance in oral mucosal lesion segmentation tasks, surpassing conventional semantic segmentation models and providing robust quantitative improvements in segmentation accuracy.

KEYWORDS： Oral mucosa;Lesion detection;Deep learning;PixelSIFT-UNet model;Scale invariant feature transform algorithm

Corresponding author: Zhu Fudong, Email: nc.defudabe.ujzdfz, Tel: 0086-571-85269609

FUNDING:

National Natural Science Foundation of China（82471034, 82301070）

Science and Technology Special Project, Institute of Wenzhou, Zhejiang University（XMGL-KJZX-202401）

COPYRIGHTS:

No content published by the journals of Chinese Medical Association may be reproduced or abridged without authorization. Please do not use or copy the layout and design of the journals without permission.

All articles published represent the opinions of the authors, and do not reflect the official policy of the Chinese Medical Association or the Editorial Board, unless this is clearly specified.

引用本文

张睿,金路,陈谦明,等. 尺度不变特征增强深度学习在口腔黏膜病损分割中应用的研究[J]. 中华口腔医学杂志,2025,60(03)：239-247.

DOI:10.3760/cma.j.cn112144-20241210-00467

PERMISSIONS

Request permissions for this article from CCC.

评价本文

*以上评分为匿名评价

本文评分

5分 [累计1个]

版权归中华医学会所有。

未经授权，不得转载、摘编本刊文章，不得使用本刊的版式设计。

除非特别声明，本刊刊出的所有文章不代表中华医学会和本刊编委会的观点。

口腔黏膜疾病种类繁多，发病因素复杂，临床表现诊治较为困难。其中，口腔潜在恶性疾患（oral potentially malignant disorders，OPMD）是指一组发生于口腔黏膜的病损，主要包括口腔扁平苔藓（oral lichen planus，OLP）、口腔白斑病（oral leukoplakia，OLK）和口腔黏膜下纤维性变（oral submucous fibrosis，OSF）等。研究表明，这些病损具有恶性转化的潜在风险，约三分之一的OPMD可能演变为口腔鳞状细胞癌 ^{［

1
,

2
］}。近年来，不健康生活方式的普遍化，包括昼夜节律紊乱、烟酒滥用和营养失调，导致年轻群体中口腔癌发病率显著上升。据统计，2020年全球新发口腔癌病例达84万例，死亡病例约42万例 ^{［

3
,

4
,

5
］}。因此，对OPMD的早期诊断和干预对降低口腔癌发病率具有重要意义，也是降低相关疾病死亡率的关键策略 ^{［

6
,

7
］}。

目前OPMD的诊断主要依靠临床医师基于专业知识和临床经验进行判断。然而，由于OPMD早期症状较隐匿，即使经验丰富的医师在临床实践中也可能面临一定的诊断挑战。同时，高水平医师的培养需要较长周期且成本较高，在医疗资源匮乏地区尤其困难 ^{［

8
］}。因此，开发智能辅助诊断工具，作为临床诊断的补充手段，有助于为医师提供客观参考依据，进一步提高诊断的准确性和效率。基于深度学习的语义分割模型通过对大规模医学影像数据的训练，能自动提取疾病特征信息，识别临床医师可能忽视或需要较长时间才能发现的潜在疾病 ^{［

9
］}，从而实现疾病的精准识别和定位。这种方法不仅克服了人工诊断的主观性和不确定性，还提高了诊断的客观性和准确性。此外，语义分割模型通过持续优化，能不断提升疾病识别性能，降低误诊和漏诊率。

现有研究主要集中于利用深度学习方法进行口腔黏膜OPMD疾病图像的分类和病损区域目标检测 ^{［

10
,

11
,

12
,

13
,

14
,

15
］}。部分研究针对特定口腔黏膜疾病和口腔癌进行了分类与病损检测 ^{［

16
,

17
,

18
,

19
,

20
］}。然而，这些研究大多局限于单一疾病的目标检测，且仅输出矩形框坐标和类别标签，限制了检测精度。目前，针对多种口腔黏膜疾病的高精度病损分割研究仍有待深入。为解决目前口腔黏膜疾病诊断中病损分割精度不足、效率较低的问题，本研究旨在提出一种基于深度学习和尺度不变特征变换算法（scale-invariant feature transform algorithm，SIFT）的口腔黏膜病损分割模型（PixelSIFT-UNet），该方法通过计算每个图像像素的局部特征描述符，自动提取病损相关特征，匹配真实尺度，实现对多种口腔黏膜疾病病损区域的像素级精确分割，提高病损分割的准确性，以期为临床医师提供更客观、更可靠的辅助诊断依据，提升诊断精确度和效率。

试读结束，您可以通过登录机构账户或个人账户后获取全文阅读权限。

已是订阅账户? 登录

摘要
参考文献

参考文献

[1]

Warnakulasuriya S . Oral potentially malignant disorders: a comprehensive review on clinical aspects and management[J]. Oral Oncol, 2020,102:104550. DOI: 10.1016/j.oraloncology.2019.104550 .