A proposed research initiative developing generalizable eHealth methodology for large-scale biomedical knowledge extraction and interpretable, uncertainty-aware AI modeling in rare cancer diagnostics.
Sarcomas represent a rare and heterogeneous group of malignancies characterized by complex diagnostic pathways and significant clinical challenges. These tumors encompass more than 100 distinct histological subtypes, each requiring specialized knowledge of subtype-specific morphological features, molecular markers, and therapeutic approaches.
"More than 40% of initial histological diagnoses are modified upon expert review, with major discrepancies related to histological grade (43%), histological type (24%), and combined grade and subtype classifications (29%). Expert review leads to management changes in 14.2% of patients."
AEGIS proposes to address sarcoma diagnostics through an integrated methodology combining large-scale knowledge extraction with clinical application, leveraging large language models (LLMs) and retrieval-augmented reasoning. The system aims to provide a "digital second opinion" backed by comprehensive data and rigorous uncertainty measures— augmenting rather than replacing the pathologist.
AEGIS integrates three core innovations: a CodeAct-based agentic orchestrator, interpretable machine learning with Artificial Representative Trees, and conformal prediction for certified uncertainty quantification.
Figure 1: AEGIS conceptual framework. (1) Diagnostic gap: >100 sarcoma subtypes lead to >40% revised diagnoses after expert referral. (2) Technical approach: CodeAct-based agentic system orchestrating reasoning across curated sarcoma literature, computational pathology tools, external databases, and interpretable statistical models. (3) Solution: Confidence-calibrated diagnoses with traceable evidence and human-interpretable decision trees.
University Medical Center Göttingen (UMG)
Department of Pathology, Sarcoma Referral Center
Leibniz University Hannover (LUH)
L3S Research Center
University Medical Center Göttingen (UMG)
Department of Medical Statistics
AEGIS represents a new model for interdisciplinary eHealth research embedded within the CAIMed network: three early-career researchers combining clinical medicine, artificial intelligence, and statistical methodology around a shared challenge that none could address alone.
A CodeAct-based system demonstrating autonomous multi-step diagnostic reasoning with complete provenance tracking. The agent orchestrates literature retrieval, clinical data integration, image analysis, and interpretable models through unified tool invocation. The sarcoma-specific retrieval infrastructure will be released as the first open-source, domain-specific knowledge engine for rare tumor diagnostics.
ART-based models providing human-readable decision logic that distill ensemble predictions into transparent decision trees. Conformal prediction delivers calibrated confidence estimates with ≥90% empirical coverage guarantee. The resulting uncertainty atlas maps diagnostic confidence across the feature space of sarcoma classification.
A production-grade methodology for transforming unstructured biomedical literature into structured diagnostic knowledge. Validated on ~21,000 sarcoma publications against expert-curated ground truth, achieving precision/recall ≥85% for core diagnostic entities. Outputs aligned with standard terminologies (SNOMED CT, ICD-O-3, NCIt) and MII data models.
Meta-learning approaches enabling sarcoma subtype classification from whole-slide images with limited training examples—directly addressing the data scarcity barrier excluding rare diseases from deep learning advances. Target: AUROC ≥0.85 for primary diagnostic categories with ≤50 training examples per subtype.
A comprehensive comparative study (Human vs. Agent vs. Explainable Models) to address the capability of agents. Governance framework for GDPR, EU AI Act, and EU MDR. Multi-center deployment pathway via CAIMed with a translational system usability scale.
LLM-based agent system built on the CodeAct framework, capable of autonomous retrieval, extraction, and synthesis of biomedical knowledge with full provenance tracking and multi-step reasoning.
Hybrid RAG infrastructure combining dense semantic embeddings with sparse lexical indexing, re-ranking strategies, and rationale-guided approaches for optimal retrieval from ~21,000 sarcoma publications.
Meta-learning algorithms (Prototypical Networks, MAML) enabling classification of rare sarcoma subtypes with limited training examples from whole-slide images.
Interpretable surrogate models distilling complex ensemble predictions into human-readable decision trees while maintaining prediction accuracy and fidelity to ensemble boundaries.
Distribution-free uncertainty quantification with finite-sample coverage guarantees, providing calibrated confidence estimates for high-stakes clinical decision support.
All outputs aligned with SNOMED CT, ICD-O-3, NCIt, ICCR reporting elements, and MII data models ensuring FHIR/OMOP interoperability across clinical systems.
Transform existing clinical documentation into computationally accessible formats. Terminology standardization, quality assessment, diagnostic uncertainty formalization from tumor board records.
Establish production-grade RAG infrastructure. PDF parsing, semantic chunking, hybrid retrieval with re-ranking, LLM generator with citations, validation against human-curated datasets.
Develop agentic orchestration with CodeAct for autonomous multi-step reasoning. API layer for tools, multimodal data integration, human-in-the-loop validation, safety guardrails.
Develop ART generation and conformal prediction implementations. Feature integration, ensemble training, uncertainty atlas with interactive visualization.
Foundation model evaluation, weakly supervised MIL, few-shot implementation with Prototypical Networks/MAML, morphometric extraction, multimodal integration with agentic framework.
Three-way comparison: Human Experts vs. Explainable Models vs. AI Agents. Usability assessment (SUS), translational blueprint for EU MDR/AI Act compliance.
Computationally accessible with standardized terminology; validation framework established
Operational vector store with optimal retrieval framework validated by human experts
Validated agentic system for sarcoma with safety measures
Interpretable uncertainty-aware diagnostic models with ART surrogates
Computational pathology module operational with demonstrated few-shot capability
Benchmarking study completed; translational blueprint finalized with regulatory pathway
AEGIS is anchored within CAIMed, Lower Saxony's strategic initiative for medical AI research connecting university medical centers, technical universities, and research institutes across the state. CAIMed maintains affiliations with leading German research infrastructures including the Medical Informatics Initiative (MII) and the German Center for Cardiovascular Research (DZHK).
L3S Research Center, AI methodology guidance
Computational Pathology, digital pathology infrastructure
Director, Institute of Pathology, Sarcoma Referral Center
Director, Institute of Medical Statistics
CAIMed Junior Research Group Leader for Collaborative AI Development and Validation in Oncology • AEGIS concept origin • CAIMed network coordination