CURVAS-PDACVI Dataset¶
The challenge cohort consists of 125 CT scans selected from the PANORAMA challenge dataset. Selection prioritized scans with manually generated labels, excluding those with automated annotations. Preference was also given to cases with conclusive diagnostic tests (e.g., pathology, cytology, histopathology). To ensure real-world representativeness, lesion sizes were assessed to cover a broad range of cases, while patient demographics, including sex and age, were considered to minimize bias.
The target age distribution is as follows: below 50 years (5%), 50–59 years (15%), 60–69 years (20%), 70–79 years (30%), and 80–89 years (30%). Sex distribution aims for 40-50% females and 50-60% males. For tumor location, approximately 60-70% of cases involve the head of the pancreas, 15-25% the body, and 10-15% the tail.
Each CT has the following structure:
- annotation_X.nii.gz: contains the Pancreatic Ductal Adenocarcinoma (PDAC) segmentations (X=1 being the PANORAMA segmentation, X=2,..,5 being the other experts segmentations)
- image.nii.gz: CT volume
Each CT has four additional annotations from radiologists at Universitätsklinikum Erlangen, Hospital de Sant Pau, and Hospital de Mataró. Hence, four new annotations plus the PANORAMA annotation will be provied. Another clinician, focused on modifying the annotations from the vascular structures of the PANORAMA dataset and separated veins and arteries in single strcutures segmentations. This structures are the ones considered highly relevant for the study of Vascular Invasion (VI): Porta, Superior Mesenteric Vein (SMV), Superior Mesenteric Artery (SMA), Hepatic Artery and Celiac Trunk. The vascular annotations will be made public later in the challenge (they are not needed for providing the results, but this way the participants can test their algorithms with the evaluation code, when published).
- Training Phase cohort: 40 CT scans will be provided. It is encouraged to leverage publicly available external data annotated by multiple raters. The PANORAMA dataset cannot be used for training models (ONLY, images from Batch4).
The training set can be found in https://zenodo.org/records/15401568.
-
Validation Phase cohort: 5 CT scans will be provided.
-
Test Phase cohort: 80 CT scans will be provided.
Both validation and testing CT scans cohorts will not be published until the end of the challenge. Furthermore, to which group each CT scan belongs will not be revealed until after the challenge.