Beware of diffusion models for synthesizing medical images—a comparison with GANs in terms of memorizing brain MRI and chest x-ray images

Usman Akbar, Muhammad; Wang, Wuhao; Eklund, Anders

Usman Akbar, Muhammad; Wang, Wuhao and Eklund, Anders (2025) In Machine Learning: Science and Technology 6(1).

Abstract: Diffusion models were initially developed for text-to-image generation and are now being utilized to generate high quality synthetic images. Preceded by generative adversarial networks (GANs), diffusion models have shown impressive results using various evaluation metrics. However, commonly used metrics such as Frechet inception distance and inception score are not suitable for determining whether diffusion models are simply reproducing the training images. Here we train StyleGAN and a diffusion model, using BRATS20, BRATS21 and a chest x-ray (CXR) pneumonia dataset, to synthesize brain MRI and CXR images, and measure the correlation between the synthetic images and all training images. Our results show that diffusion models are more likely to memorize the training images, compared to StyleGAN, especially for small datasets and when using 2D slices from 3D volumes. Researchers should be careful when using diffusion models (and to some extent GANs) for medical imaging, if the final goal is to share the synthetic images.
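The abstract describes measuring the correlation between each synthetic image and all training images. The record does not give the authors' exact procedure, so the following is only a minimal sketch of one way such a check might look, assuming Pearson correlation on flattened pixel intensities. The function name `max_train_correlation` and the random stand-in arrays are hypothetical and not taken from the paper.

```python
import numpy as np


def max_train_correlation(synthetic, training):
    """For each synthetic image, return the highest Pearson correlation
    with any training image and the index of that training image.

    Assumption: images are arrays of identical shape, stacked along axis 0.
    """
    # Flatten every image to a vector: (n_images, n_pixels)
    s = synthetic.reshape(len(synthetic), -1).astype(np.float64)
    t = training.reshape(len(training), -1).astype(np.float64)

    # Center and scale each image to unit norm, so that a dot product
    # between two rows equals their Pearson correlation coefficient.
    s -= s.mean(axis=1, keepdims=True)
    t -= t.mean(axis=1, keepdims=True)
    s /= np.linalg.norm(s, axis=1, keepdims=True) + 1e-12
    t /= np.linalg.norm(t, axis=1, keepdims=True) + 1e-12

    corr = s @ t.T  # shape: (n_synthetic, n_training)
    return corr.max(axis=1), corr.argmax(axis=1)


# Toy data (random arrays, not medical images): one near-copy of a
# training image and one unrelated image.
rng = np.random.default_rng(0)
training = rng.random((5, 8, 8))
near_copy = training[2] + 0.01 * rng.standard_normal((8, 8))
unrelated = rng.random((8, 8))
synthetic = np.stack([near_copy, unrelated])

best_corr, best_idx = max_train_correlation(synthetic, training)
print(best_corr, best_idx)
```

A synthetic image whose maximum correlation is close to 1 is a candidate memorized copy of the matching training image. Any threshold for flagging such cases would be a choice of the person running the check and is not specified in this record.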

Please use this url to cite or link to this publication: https://lup.lub.lu.se/record/6122d883-cb2e-4ea6-8d31-be1a84588003

author

Usman Akbar, Muhammad; Wang, Wuhao and Eklund, Anders

publishing date

2025-03

type

Contribution to journal

publication status

published

subject

Medical Imaging

keywords

brain MRI, CXR, diffusion model, GANs, generative AI, memorization, synthetic data

in

Machine Learning: Science and Technology

volume

6

issue

1

article number

015022

publisher

IOP Publishing

external identifiers

scopus:85217039477

ISSN

2632-2153

DOI

10.1088/2632-2153/ad9a3a

language

English

LU publication?

yes

id

6122d883-cb2e-4ea6-8d31-be1a84588003

date added to LUP

2025-03-21 16:12:08

date last changed

2025-11-29 13:55:54

@article{6122d883-cb2e-4ea6-8d31-be1a84588003,
  abstract     = {{Diffusion models were initially developed for text-to-image generation and are now being utilized to generate high quality synthetic images. Preceded by generative adversarial networks (GANs), diffusion models have shown impressive results using various evaluation metrics. However, commonly used metrics such as Frechet inception distance and inception score are not suitable for determining whether diffusion models are simply reproducing the training images. Here we train StyleGAN and a diffusion model, using BRATS20, BRATS21 and a chest x-ray (CXR) pneumonia dataset, to synthesize brain MRI and CXR images, and measure the correlation between the synthetic images and all training images. Our results show that diffusion models are more likely to memorize the training images, compared to StyleGAN, especially for small datasets and when using 2D slices from 3D volumes. Researchers should be careful when using diffusion models (and to some extent GANs) for medical imaging, if the final goal is to share the synthetic images.}},
  author       = {{Usman Akbar, Muhammad and Wang, Wuhao and Eklund, Anders}},
  issn         = {{2632-2153}},
  keywords     = {{brain MRI; CXR; diffusion model; GANs; generative AI; memorization; synthetic data}},
  language     = {{eng}},
  number       = {{1}},
  publisher    = {{IOP Publishing}},
  journal      = {{Machine Learning: Science and Technology}},
  title        = {{Beware of diffusion models for synthesizing medical images—a comparison with GANs in terms of memorizing brain MRI and chest x-ray images}},
  url          = {{http://dx.doi.org/10.1088/2632-2153/ad9a3a}},
  doi          = {{10.1088/2632-2153/ad9a3a}},
  volume       = {{6}},
  year         = {{2025}},
}
