Simultaneous Dual-View Mammogram Synthesis Using Denoising Diffusion Probabilistic Models
Breast cancer remains one of the leading causes of cancer-related deaths among women globally. Early detection through effective screening methods like mammography is crucial for improving survival rates. Traditional mammography techniques utilize two primary views: the craniocaudal (CC) and mediolateral oblique (MLO) views. Each view provides unique anatomical insights, making them complementary for accurate diagnosis. However, challenges arise due to the unavailability of complete paired datasets, which can hinder the development of advanced diagnostic algorithms that depend on cross-view consistency.
To tackle this issue, a novel approach utilizing a three-channel denoising diffusion probabilistic model (DDPM) has been proposed. This model is designed to simultaneously generate CC and MLO views for a single breast, thereby addressing the existing limitations in dataset completeness.
Key Features of the Proposed Model
- Three-Channel Architecture: The model employs a three-channel configuration where the CC and MLO views are stored in separate channels. A third channel encodes the absolute difference between these views. This innovative design guides the model in learning coherent anatomical relationships, ensuring that the generated views maintain consistency with real imaging data.
- Fine-Tuning on Private Dataset: The DDPM was pretrained using resources from Hugging Face and subsequently fine-tuned on a proprietary screening dataset. This step enhances the model’s ability to synthesize dual-view pairs that closely resemble actual mammographic images.
- Evaluation Metrics: The evaluation of the generated synthetic images focused on geometric consistency, utilizing automated breast mask segmentation techniques and conducting distributional comparisons with real images. Furthermore, qualitative inspections were performed to assess cross-view alignment.
Results and Implications
The results obtained from this study indicate that the difference-based encoding effectively preserves the global breast structure across the two views. The synthetic CC-MLO pairs generated by the model demonstrated a striking resemblance to real mammographic acquisitions. This advancement opens up new avenues for dataset augmentation, particularly in scenarios where paired mammographic images are scarce.
Moreover, the implications of this work extend beyond mere image generation. With the capability to produce high-quality synthetic dual-view mammograms, the model lays the groundwork for future cross-view-aware artificial intelligence applications in breast imaging. This could ultimately enhance the diagnostic accuracy and efficiency of breast cancer screening programs worldwide.
Conclusion
The proposed approach signifies a significant step forward in the synthesis of simultaneous dual-view mammograms using denoising diffusion probabilistic models. By leveraging advanced machine learning techniques, researchers can now address the critical challenge of incomplete datasets in mammography. As this field continues to evolve, the integration of such innovative methods could drastically improve the landscape of breast cancer detection and treatment, fostering a future where early diagnosis is more accessible and reliable.
