Diffusion Based Ambiguous Image Segmentation

Research output: Chapter in Book/Report/Conference proceedingArticle in proceedingsResearchpeer-review

Abstract

Medical image segmentation often involves inherent uncertainty due to variations in expert annotations. Capturing this uncertainty is an important goal and previous works have used various generative image models for the purpose of representing the full distribution of plausible expert ground truths. In this work, we explore the design space of diffusion models for generative segmentation, investigating the impact of noise schedules, prediction types, and loss weightings. Notably, we find that making the noise schedule harder with input scaling significantly improves performance. We conclude that x- and v-prediction outperform ϵ-prediction, likely because the diffusion process is in the discrete segmentation domain. Many loss weightings achieve similar performance as long as they give enough weight to the end of the diffusion process. We base our experiments on the LIDC-IDRI lung lesion dataset and obtain state-of-the-art (SOTA) performance. Additionally, we introduce a randomly cropped variant of the LIDC-IDRI dataset that is better suited for uncertainty in image segmentation. Our model also achieves SOTA in this harder setting.
Original languageEnglish
Title of host publicationProceedings of the 23rd Scandinavian Conference on Image Analysis, SCIA 2025
Volume15725
PublisherSpringer
Publication date2025
Pages187-200
ISBN (Print)978-3-031-95910-3
ISBN (Electronic)978-3-031-95911-0
DOIs
Publication statusPublished - 2025
Event 23rd Scandinavian Conference on Image Analysis - University of Island , Reykjavik, Iceland
Duration: 23 Jun 202525 Jul 2025

Conference

Conference 23rd Scandinavian Conference on Image Analysis
LocationUniversity of Island
Country/TerritoryIceland
CityReykjavik
Period23/06/202525/07/2025

Keywords

  • Diffusion models
  • Medical image segmentation
  • Uncertainty modeling

Fingerprint

Dive into the research topics of 'Diffusion Based Ambiguous Image Segmentation'. Together they form a unique fingerprint.

Cite this