Prompting Medical Vision-Language Models to Mitigate Diagnosis Bias by Generating Realistic Dermoscopic Images

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Artificial Intelligence (AI), specifically deep learning has made significant advancements in skin disease diagnoses. However, a major concern with deep learning-based models is the biased performance across subgroups, particularly regarding sensitive attributes like skin color. Toward mitigating such diagnosis biases, we propose a novel generative AI-based framework, namely Dermatology Diffusion Transformer (DermDiT). DermDiT leverages text prompts generated via large vision-language models and multimodal text-image learning to generate new dermoscopic images. Through an effective prompting, DermDiT can generate realistic synthetic images leading to improved representation of underrepresented groups in highly imbalanced datasets for clinical diagnoses. Extensive experimentation showcases that our innovative prompting in DermDiT provides more insightful representations to generate high-quality and useful images. Our code is available at https://github.com/Munia03/DermDiT.

Original languageEnglish
Title of host publicationISBI 2025 - 2025 IEEE 22nd International Symposium on Biomedical Imaging, Proceedings
ISBN (Electronic)9798331520526
DOIs
StatePublished - 2025
Event22nd IEEE International Symposium on Biomedical Imaging, ISBI 2025 - Houston, United States
Duration: Apr 14 2025Apr 17 2025

Publication series

NameProceedings - International Symposium on Biomedical Imaging
ISSN (Print)1945-7928
ISSN (Electronic)1945-8452

Conference

Conference22nd IEEE International Symposium on Biomedical Imaging, ISBI 2025
Country/TerritoryUnited States
CityHouston
Period4/14/254/17/25

Bibliographical note

Publisher Copyright:
© 2025 IEEE.

Funding

This work was supported by the UNITE Research Priority Area at the University of Kentucky.

FundersFunder number
Università degli Studi di Teramo
University of Kentucky

    Keywords

    • Dermatology
    • Diagnosis Bias
    • Diffusion Transformer
    • Image Generation
    • Vision-Language Model

    ASJC Scopus subject areas

    • Biomedical Engineering
    • Radiology Nuclear Medicine and imaging

    Fingerprint

    Dive into the research topics of 'Prompting Medical Vision-Language Models to Mitigate Diagnosis Bias by Generating Realistic Dermoscopic Images'. Together they form a unique fingerprint.

    Cite this