AEGIS — Agentic Ecosystem for Guided Intelligence in Sarcoma

01 — Project Summary

Transforming Rare Cancer Diagnostics

Sarcomas represent a rare and heterogeneous group of malignancies characterized by complex diagnostic pathways and significant clinical challenges. These tumors encompass more than 100 distinct histological subtypes, each requiring specialized knowledge of subtype-specific morphological features, molecular markers, and therapeutic approaches.

"More than 40% of initial histological diagnoses are wrong—modified upon expert review, with major discrepancies related to histological grade (43%), histological type (24%), and combined grade and subtype classifications (29%). Expert review leads to management changes in 14.2% of patients."

AEGIS addresses sarcoma diagnostics through an integrated methodology combining large-scale knowledge extraction with clinical application, leveraging large language models (LLMs) and retrieval-augmented reasoning. The system provides a "digital second opinion" backed by comprehensive data and rigorous uncertainty measures— augmenting rather than replacing the pathologist.

~1,000

Expert-curated sarcoma cases at UMG

~20,000

Literature publications to process

>100

Distinct sarcoma subtypes covered

40%+

Diagnostic discordance rate to address

02 — The Consortium

Partners & Leadership

SP1 — Clinical Curation & Digital Pathology

Knowledge Base & Computational Pathology

University Medical Center Göttingen (UMG)
Department of Pathology, Sarcoma Referral Center

PD Dr. med. Hanibal Bohnenberger
Senior Physician, Consortium Leader

Requested: €786,662

SP2 — Agentic AI for Clinical Decision Support

Knowledge Extraction & Agentic Reasoning

Leibniz University Hannover (LUH)
L3S Research Center

Dr. Michelle Tang
CAIMed Junior Research Group Leader

Requested: €375,173

SP3 — Interpretable Modeling & Uncertainty

Statistical Methods & Explainable AI

University Medical Center Göttingen (UMG)
Department of Medical Statistics

Prof. Dr. Björn-Hergen Laabs
CAIMed Junior Research Group Leader

Requested: €377,926

Consortium Synergies

AEGIS represents a new model for interdisciplinary eHealth research embedded within the CAIMed network: three early-career researchers combining clinical medicine, artificial intelligence, and statistical methodology around a shared challenge that none could address alone.

SP1

Clinical Pathology

Clinical validation & curation

SP2

Computer Science / AI

Knowledge extraction & reasoning

SP3

Statistics / ML

Interpretability & uncertainty

03 — Expected Results

Integrated Outcomes

1

Agentic Clinical Reasoning Framework

A CodeAct-based system demonstrating autonomous multi-step diagnostic reasoning with complete provenance tracking. The agent orchestrates literature retrieval, clinical data integration, image analysis, and interpretable models through unified tool invocation. The sarcoma-specific retrieval infrastructure will be released as the first open-source, domain-specific knowledge engine for rare tumor diagnostics.

2

Interpretable Diagnostic Models with Uncertainty Guarantees

ART-based models providing human-readable decision logic that distill ensemble predictions into transparent decision trees. Conformal prediction delivers calibrated confidence estimates with ≥90% empirical coverage guarantee. The resulting uncertainty atlas maps diagnostic confidence across the feature space of sarcoma classification.

3

Validated Information Extraction Pipeline

A production-grade methodology for transforming unstructured biomedical literature into structured diagnostic knowledge. Validated on ~20,000 sarcoma publications against expert-curated ground truth, achieving precision/recall ≥85% for core diagnostic entities. Outputs aligned with standard terminologies (SNOMED CT, ICD-O-3, NCIt) and MII data models.

4

Few-Shot Computational Pathology Module

Meta-learning approaches enabling sarcoma subtype classification from whole-slide images with limited training examples—directly addressing the data scarcity barrier excluding rare diseases from deep learning advances. Target: AUROC ≥0.85 for primary diagnostic categories with ≤50 training examples per subtype.

5

Integrated System Validation & Translational Framework

Retrospective validation of the complete integrated system benchmarked against expert tumor board decisions. Demonstrates the "digital second opinion" concept with diagnostic probability estimates and uncertainty quantification. Accompanied by governance framework addressing GDPR, EU MDR, and EU AI Act requirements; multi-center deployment pathway via CAIMed; usability validation (target: SUS ≥70).

04 — Methodology

Technical Approach

🧠

Agentic AI Architecture

LLM-based agent system built on the CodeAct framework, capable of autonomous retrieval, extraction, and synthesis of biomedical knowledge with full provenance tracking and multi-step reasoning.

📊

Retrieval-Augmented Generation

Hybrid RAG infrastructure combining dense semantic embeddings with sparse lexical indexing, re-ranking strategies, and rationale-guided approaches for optimal retrieval from ~20,000 sarcoma publications.

🔬

Few-Shot Deep Learning

Meta-learning algorithms (Prototypical Networks, MAML) enabling classification of rare sarcoma subtypes with limited training examples from whole-slide images.

🌳

Artificial Representative Trees

Interpretable surrogate models distilling complex ensemble predictions into human-readable decision trees while maintaining prediction accuracy and fidelity to ensemble boundaries.

📐

Conformal Prediction

Distribution-free uncertainty quantification with finite-sample coverage guarantees, providing calibrated confidence estimates that have reduced diagnostic errors from 2% to 0.1% in pathology applications.

🔗

Standards Alignment

All outputs aligned with SNOMED CT, ICD-O-3, NCIt, ICCR reporting elements, and MII data models ensuring FHIR/OMOP interoperability across clinical systems.

05 — Work Plan

Work Package Structure

WP1

Data Harmonization & Preprocessing

Lead: SP1 (UMG) Duration: M1–M18 Effort: 20 PM

Transform existing clinical documentation into computationally accessible formats. Terminology standardization, quality assessment, diagnostic uncertainty formalization from tumor board records.

WP2

LLM-Based Retrieval System for Knowledge Extraction

Lead: SP2 (L3S) Duration: M1–M18

Establish production-grade RAG infrastructure. PDF parsing, semantic chunking, hybrid retrieval with re-ranking, LLM generator with citations, validation against human-curated datasets.

WP3

Agentic Framework for Clinical Data Integration

Lead: SP2 (L3S) Duration: M10–M28

Develop agentic orchestration with CodeAct for autonomous multi-step reasoning. API layer for tools, multimodal data integration, human-in-the-loop validation, safety guardrails.

WP4

Interpretable Modeling & Uncertainty Quantification

Lead: SP3 (UMG MedStat) Duration: M1–M30 Effort: 30 PM

Develop ART generation and conformal prediction implementations. Feature integration, ensemble training, uncertainty atlas with interactive visualization.

WP5

Computational Pathology Module

Lead: SP1 (UMG) Duration: M1–M30 Effort: 24 PM

Foundation model evaluation, weakly supervised MIL, few-shot implementation with Prototypical Networks/MAML, morphometric extraction, multimodal integration with agentic framework.

WP6

Validation & Translational Framework

Lead: All Partners Duration: M24–M36 Effort: 18 PM

Shadow-mode retrospective validation, performance benchmarking against expert diagnoses, usability assessment (SUS), translational blueprint for EU MDR/AI Act compliance.

06 — Milestones

Key Milestones & Success Criteria

M1 — Month 15

Clinical Archive Accessible

Computationally accessible with standardized terminology; validation protocol defined

M2 — Month 12

RAG Infrastructure Operational

Literature corpus indexed; nDCG@10 ≥ 0.75

M3 — Month 18

Extraction Pipeline Validated

Micro-F1 ≥ 0.85; structured reference operational

M4 — Month 28

Agentic System Operational

Multi-tool orchestration functional; expert validation initiated

M5 — Month 24

ART Models Operational

Conformal coverage ≥ 90% achieved

M6 — Month 22

Few-Shot Pathology Validated

AUROC ≥ 0.85 for major categories

M7 — Month 36

Validation Complete

Retrospective validation documented; translational blueprint finalized

07 — Financial Plan

Budget Overview

Category	Year 1	Year 2	Year 3	Total
Subproject 1 (UMG Pathology)	€255,215	€262,151	€269,296	€786,662
Subproject 2 (L3S)	€117,137	€127,427	€130,609	€375,173
Subproject 3 (UMG MedStat)	€118,971	€127,946	€131,010	€377,926
Total	€491,323	€517,524	€530,915	€1,539,762

Total Requested Funding

€1,539,762

without Projektpauschale

08 — Research Environment

CAIMed Network & Supervisory Board

CAIMed — Center for AI in Medicine

AEGIS is anchored within CAIMed, Lower Saxony's strategic initiative for medical AI research connecting university medical centers, technical universities, and research institutes across the state. CAIMed maintains affiliations with leading German research infrastructures including the Medical Informatics Initiative (MII) and the German Center for Cardiovascular Research (DZHK).

AEGIS Supervisory Board

CAIMed Director

Prof. Dr. Wolfgang Nejdl

L3S Research Center, Founding Director CAIMed

CAIMed Director

Prof. Dr. Niels Grabe

UMG, Computational Pathology, CAIMed Director UMG

CAIMed Field Oncology

Prof. Dr. Philipp Ströbel

Director, Institute of Pathology, Sarcoma Referral Center

CAIMed Field Statistics

Prof. Dr. Tim Friede

Director, Institute of Medical Statistics