TY - JOUR
T1 - Endoscopy-assisted lightweight diagnosis system based on transformers for colon polyp detection
AU - Fan, Weiming
AU - Yu, Jiahui
AU - Ju, Zhaojie
PY - 2025/1/1
Y1 - 2025/1/1
N2 - The integration of endoscopy has significantly propelled the diagnosis and treatment of gastrointestinal diseases, with colonoscopy establishing itself as the primary method for early diagnosis and preventive care in colorectal cancer (CRC). Although deep learning holds promise in mitigating missed polyp rates, modern endoscopy examinations pose additional challenges, such as image blurring and atomizing. This study explores lightweight yet powerful attention mechanisms, introducing the spatial-channel transformer (SCT), an innovative approach that leverages spatial channel relationships for attention weight calculation. The method utilizes rotation operations for inter-dimensional dependencies, followed by residual transformation, encoding inter-channel and spatial information with minimal computational overhead. Extensive experiments on the CVC-ClinicDB polyp detection dataset, addressing endoscopy pitfalls, underscore the superiority of our SCT over other state-of-the-art methods. The proposed model maintains high performance, even in challenging scenarios.
UR - http://www.scopus.com/inward/record.url?scp=85211369655&partnerID=8YFLogxK
U2 - 10.1007/s11801-025-3280-0
DO - 10.1007/s11801-025-3280-0
M3 - Article
AN - SCOPUS:85211369655
SN - 1673-1905
VL - 21
SP - 57
EP - 64
JO - Optoelectronics Letters
JF - Optoelectronics Letters
IS - 1
ER -