Decentralized, privacy-preserving surgical video analysis with Swarm Learning
Saldanha, O. L.; Pfeiffer, K.; Bodenstedt, S.; Kirchner, M.; Jenke, A. C.; Barata, C.; Barbosa, S.; Barthel, J.; Carstens, M.; Castro, L. T.; Dehlke, K.; Dietz, S.; Emmanouilidis, S.; Fitze, G.; Freitag, M.; Holderried, F.; Kanjo, W.; Leitermann, L.; Mees, S. T.; Soares, A. S.; Pascoal, M.; Pistorius, S.; Prudlo, C.; Schultz, J.; Seiberth, A.; Thiel, K.; Wu, X.; Ziehn, D.; Speidel, S.; Weitz, J.; Distler, M.; Kather, J. N.; Kolbinger, F. R.
Show abstract
BackgroundProgress in artificial intelligence-based analysis of surgical videos has been constrained by reliance on manual frame-level annotations rather than patient-level outcomes. In addition, concerns about data privacy restrict the exchange of laparoscopic video data and, thereby, multicenter collaboration. MethodsTo address these limitations, we developed a pipeline that integrates weakly supervised deep learning with Swarm Learning, a decentralized machine learning approach that enables collaborative model training without data centralization. We evaluate our pipeline using a newly curated dataset of 397 laparoscopic appendectomy recordings from six international surgical centers. We identified optimal modelling configurations (frame sampling rates and model architectures) and subsequently compared Swarm Learning to single-center and centralized learning across three novel patient-level disease staging tasks: (i) binary detection of perforated appendicitis, (ii) laparoscopic grading of appendicitis, and (iii) histopathologic inflammation grading. In addition, we surveyed participating centers to identify real-world barriers to the clinical implementation of our decentralized learning pipeline for surgical video analysis. ResultsFor appendicitis grading tasks, frame sampling at 1.0 frames per second and use of the SurgTempoNet architecture resulted in reliable classification performance, outperforming SurgFrameNet and Multiple Instance Learning. Across all three disease staging tasks, Swarm Learning consistently outperformed single-center training and achieved performance comparable to centralized learning, with stable generalization in external validation. The user survey identified hardware failure and limited integration of the decentralized learning pipeline with electronic patient records as key barriers to the clinical implementation of our decentralized learning pipeline for collaborative surgical video analysis. ConclusionsWeakly supervised deep learning enables the prediction of patient-level endpoints directly from surgical video data. Swarm Learning facilitates privacy-preserving multicenter collaboration and achieves performance on par with centralized learning, highlighting its potential for advancing clinically relevant, collaborative AI development in surgical video analysis, especially when integrated with patients electronic health records. Article DescriptionThis study introduces a decentralized, privacy-preserving pipeline that combines weakly supervised deep learning with Swarm Learning to predict patient-level outcomes from laparoscopic appendectomy videos. Using data from six international surgical centers, the approach demonstrated performance comparable to centralized learning across three disease staging tasks while preserving data confidentiality by design.
Matching journals
The top 1 journal accounts for 50% of the predicted probability mass.