FetalCLIP: Novel Foundation Model for Fetal Ultrasound Image Analysis

📌 Diğer 📰 World 🕐 3 saat önce

We are providing an unedited version of this manuscript to give early access to its findings. Before final publication, the manuscript will undergo further editing. Please note there may be errors present which affect the content, and all legal disclaimers apply. Foundation models are becoming increasingly effective in the medical domain, offering pre-trained models on large datasets that can be readily adapted for downstream tasks. Despite progress, fetal ultrasound images remain a challenging domain for visual-language foundation models due to their inherent complexity, often requiring substantial additional training and facing limitations due to the scarcity of paired multimodal data. To overcome these challenges, here we introduce FetalCLIP, a vision-language foundation model capable of generating universal representation of fetal ultrasound images. FetalCLIP was pre-trained using a multimodal learning approach on a diverse dataset of 210,035 fetal ultrasound images paired with text. This represents the largest paired dataset of its kind used for visual-language foundation model development to date. This unique training approach allows FetalCLIP to effectively learn the intricate anatomical features present in fetal ultrasound images, resulting in robust representations that can be used for a variety of downstream applications. In extensive benchmarking across a range of key fetal ultrasound applications, including classification, gestational age estimation, congenital heart defect (CHD) detection, and fetal structure segmentation, FetalCLIP outperformed all baselines while demonstrating remarkable generalizability and strong performance even with limited labeled data. The FetalCLIP model is publicly available at https://github.com/biomedia-mbzuai/fetalclip to support the broader scientific community. We gratefully acknowledge that this work was supported and partly funded by GE Healthcare (Grant ID 848104) and Sandooq Al Watan, Erth Zayed Philanthropies (Grant ID PRJ-SWARD-660). We express our gratitude to Corniche Hospital in Abu Dhabi for providing prenatal scan data along with fetal heart scans, and to the Department of Health (DOH) Abu Dhabi for their support in approving the study, which facilitates access to the anonymous data for internal purposes. We thank Alfred Z. Abuhamad for allowing us to leverage his book for foundation model pretraining. We also thank GE Healthcare for providing data and annotations used for downstream segmentation tasks in the 4-chamber and abdomen views. We also thank Uti Nilam Sari for her assistance in revising

Researchers have introduced FetalCLIP, a new vision-language foundation model designed for analyzing fetal ultrasound images. This model was pre-trained on an extensive dataset of over 210,000 fetal ultrasound images paired with text, representing the largest such dataset to date. FetalCLIP effectively learns intricate anatomical features, producing robust representations for various downstream applications like classification, gestational age estimation, and defect detection. It significantly outperformed existing baseline models in benchmarking tests, demonstrating strong generalizability even with limited labeled data. The model's code is publicly available to support the scientific community. This work was supported by GE Healthcare and Sandooq Al Watan.

This development introduces a powerful new AI tool for medical imaging analysis, potentially improving diagnostic accuracy and efficiency in prenatal care.

📌 Kaynak

Bu haber XML kaynağından derlenmiştir. Tamamı için orijinal habere gidin.

Orijinal haberi oku →
📱
News AI World — Mobil uygulama
Bu haberleri 45 dilde, anlık çeviriyle cebinde. Erken erişim için Gmail adresini bırak.
← Tüm haberlere dön