Sternum

The sternum plots show moderate agreement across all five models, with Dice scores above 85% and consensus volume agreement above 75%. Excluding CADS slightly improves the results. Pairwise comparisons show very good agreement within two model pairs: TotalSegmentator 2.6 with Auto3DSeg, and MOOSE with MultiTalent.

Agreement between Auto3DSeg, MOOSE, MultiTalent, CADS, and TotalSegmentator 2.6

Volume

Dice

Agreement between Auto3DSeg, MOOSE, MultiTalent, and TotalSegmentator 2.6 (NO CADS)

Volume

Dice

Agreement between MOOSE and MultiTalent

Volume

Dice

Agreement between TotalSegmentator 2.6 and Auto3DSeg

Volume

Dice