Proposal of an Ensemble Method for Temporal Action Segmentation in Laparoscopic Surgery Videos

日高 翼; 久保 莞太; 田辺 寛; 出口 楽々; 馬場 研二; 黒島 直樹; 和田 真澄; 向田 眞志保; 大塚 隆生; 小野 智司; 重井 徳貴

doi:10.3156/jsoft.37.1_571

Short Notes

Proposal of Ensemble Method for Action Segmentation in Laparoscopic Surgery Videos

Tsubasa HIDAKA, Kanta KUBO, Kan TANABE, Rara DEGUCHI, Kenji BABA, Naoki KUROSHIMA, Masumi WADA, Mashiho MUKAIDA, Takao OHTSUKA, Satoshi ONO, Noritaka SHIGEI

Author information

Tsubasa HIDAKA
Graduate School of Science and Engineering, Kagoshima University
Kanta KUBO
Graduate School of Science and Engineering, Kagoshima University
Kan TANABE
Department of Digestive Surgery, Graduate School of Medical and Dental Sciences, Kagoshima University
Rara DEGUCHI
Graduate School of Science and Engineering, Kagoshima University
Kenji BABA
Department of Digestive Surgery, Graduate School of Medical and Dental Sciences, Kagoshima University
Naoki KUROSHIMA
Department of Digestive Surgery, Graduate School of Medical and Dental Sciences, Kagoshima University
Masumi WADA
Department of Digestive Surgery, Graduate School of Medical and Dental Sciences, Kagoshima University
Mashiho MUKAIDA
Graduate School of Science and Engineering, Kagoshima University
Takao OHTSUKA
Department of Digestive Surgery, Graduate School of Medical and Dental Sciences, Kagoshima University
Satoshi ONO
Graduate School of Science and Engineering, Kagoshima University
Noritaka SHIGEI
Graduate School of Science and Engineering, Kagoshima University

Keywords: temporal action segmentation, medical video analysis, MS-TCN, MS-TCN++, ensemble method


2025 Volume 37 Issue 1 Pages 571-576

DOI https://doi.org/10.3156/jsoft.37.1_571


Abstract

This paper investigates effective action segmentation for identifying a surgeon’s technique from surgical videos, with the aim of automatically evaluating surgical skill in laparoscopic surgery. In particular, we focus on classifying gesture-level actions. We adopt MS-TCN and its enhanced version, MS-TCN++, as action classification models and search for the optimal number of prediction layers and refinement layers for each model. Evaluation experiments after this optimization show that the two models achieve nearly equivalent classification accuracy but exhibit different prediction tendencies. Based on this observation, we propose ensemble methods that integrate the classification results of both models. We demonstrate that a method based on the sum of the two models’ prediction probabilities improves accuracy.
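
The abstract does not spell out how the prediction probabilities are combined, so the following is only a minimal sketch of one plausible reading: each model is assumed to output a per-frame probability distribution over gesture classes, the two distributions are added element-wise, and the class with the largest sum is taken as the ensemble label. The function name, variable names, and sample values are hypothetical and are not taken from the paper.

```python
from typing import List, Sequence


def ensemble_by_probability_sum(
    probs_a: Sequence[Sequence[float]],
    probs_b: Sequence[Sequence[float]],
) -> List[int]:
    """Return one class index per frame from the summed probabilities of two models.

    Assumption (not from the paper): probs_a[t][c] and probs_b[t][c] are the
    probabilities each model assigns to class c at frame t.
    """
    if len(probs_a) != len(probs_b):
        raise ValueError("both models must score the same number of frames")
    labels = []
    for frame_a, frame_b in zip(probs_a, probs_b):
        if len(frame_a) != len(frame_b):
            raise ValueError("both models must score the same classes")
        # Element-wise sum of the two per-frame distributions.
        summed = [pa + pb for pa, pb in zip(frame_a, frame_b)]
        # Pick the class whose summed probability is largest.
        labels.append(max(range(len(summed)), key=summed.__getitem__))
    return labels


# Hypothetical outputs for 3 frames and 3 gesture classes.
mstcn_probs = [[0.6, 0.3, 0.1], [0.4, 0.5, 0.1], [0.2, 0.3, 0.5]]
mstcnpp_probs = [[0.5, 0.4, 0.1], [0.7, 0.2, 0.1], [0.1, 0.6, 0.3]]
labels = ensemble_by_probability_sum(mstcn_probs, mstcnpp_probs)
print(labels)
```

In this toy input the two models disagree on the second and third frames, and the summed score follows whichever model is more confident. That is one way models with similar accuracy but different prediction tendencies could complement each other.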

