Introduction: Acute appendicitis (AA) is a common cause of abdominal pain that often requires emergency surgery. Appendectomy is performed in up to 95% of uncomplicated cases, while complications such as perforation and intra-abdominal abscesses increase morbidity and mortality. The current study compares the accuracy of GPT-4.5, DeepSeek R1, and a machine-learning model in assisting with surgical decision-making for patients presenting with lower abdominal pain at the Emergency Department.
Methods: In this multicenter retrospective study, 63 patients with histopathologically confirmed appendicitis and 50 control patients with right abdominal pain, all presenting to the Emergency Department of two German hospitals between October 2022 and October 2023, were included. Using each patient’s clinical, laboratory, and radiological findings, DeepSeek (with and without Retrieval-Augmented Generation using the 2020 Jerusalem guidelines) was compared in terms of accuracy with GPT-4.5 and a random forest-based machine-learning model, using the decision of a board-certified surgeon as the reference standard, to determine the optimal treatment approach (laparoscopic exploration/appendectomy versus conservative antibiotic therapy).
Results: Accuracy of agreement with board-certified surgeons in the decision between appendectomy and conservative therapy increased non-significantly from 80.5% to 83.2% with DeepSeek and from 70.8% to 76.1% with GPT-4.5 when the World Journal of Emergency Surgery 2020 Jerusalem guidelines on the diagnosis and treatment of acute appendicitis were provided. The estimated training accuracy of the machine-learning model was 84.3%, while its validation accuracy was 85.0%.
Discussion: GPT-4.5 and DeepSeek R1, as well as the machine-learning model, demonstrate promise in aiding surgical decision-making for appendicitis, particularly in resource-constrained settings. Ongoing training and validation are required to optimize the performance of such models.
From Bedside to Bot-Side: Artificial Intelligence in Emergency Appendicitis Management
Koray Ersahin,Sebastian Sanduleanu,Sithin Thulasi Seetha,J. Bremm,Cavid Abbasli,Chantal Zimmer,Tim Damer,J. Kottlors,L. Goertz,C. Bruns,D. Maintz,N. Abdullayev
Published 2025 in Life
PUBLICATION RECORD
- Publication year: 2025
- Venue: Life
- Publication date: 2025-09-01
- Fields of study: Medicine, Computer Science
- Source metadata: Semantic Scholar, PubMed