Radiology Datasets

Open-access datasets across imaging modalities and radiological subspecialties.

Introduction

This page lists open-access MRI, CT, X-ray, and other imaging datasets for developing diagnostic imaging models. Most include annotations alongside the imaging data, which makes them useful for training and validating AI models in radiology.

Multi-Specialty

| Dataset | Description | Modality |
| --- | --- | --- |
| UK Biobank | A comprehensive dataset containing imaging data (MRI, CT, and X-ray) along with genetic, lifestyle, and health information from over 500,000 participants. Includes over 100,000 imaging scans. | MRI, CT, X-ray |
| The Cancer Imaging Archive (TCIA) | A large archive of medical images from various cancer types, including CT, MRI, and PET scans, with detailed annotations. Contains hundreds of thousands of images across multiple cancer types. | CT, MRI, PET |
| NIH Clinical Center's DeepLesion Dataset | A dataset of 32,735 annotated lesions from 10,594 CT images, covering a wide range of organs and conditions. | CT |
| TCGA (The Cancer Genome Atlas) | A dataset comprising imaging data, genetic data, and clinical outcomes for multiple cancer types. Imaging data includes thousands of CT, MRI, and PET scans. | CT, MRI, PET |
| Medical Segmentation Decathlon (MSD) | A large-scale dataset for medical image segmentation, including data from ten different anatomical sites, covering a variety of modalities and conditions. Contains thousands of images across ten different tasks. | CT, MRI |
| RadImageNet | A large-scale dataset similar to ImageNet but designed specifically for medical imaging. It includes over 1 million images across a wide variety of imaging modalities and anatomical regions. | X-ray, CT, MRI, Ultrasound |
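Because every entry on this page pairs a dataset name with a modality list, the catalog is easy to query in code. The sketch below encodes the Multi-Specialty entries above and filters them by modality. The `DatasetEntry` class and `with_modality` helper are hypothetical names introduced for illustration; they are not part of any dataset's tooling.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class DatasetEntry:
    """One catalog row: a dataset name and the modalities listed for it."""
    name: str
    modalities: tuple


# Names and modalities copied from the Multi-Specialty listing above.
CATALOG = [
    DatasetEntry("UK Biobank", ("MRI", "CT", "X-ray")),
    DatasetEntry("The Cancer Imaging Archive (TCIA)", ("CT", "MRI", "PET")),
    DatasetEntry("NIH Clinical Center's DeepLesion Dataset", ("CT",)),
    DatasetEntry("TCGA (The Cancer Genome Atlas)", ("CT", "MRI", "PET")),
    DatasetEntry("Medical Segmentation Decathlon (MSD)", ("CT", "MRI")),
    DatasetEntry("RadImageNet", ("X-ray", "CT", "MRI", "Ultrasound")),
]


def with_modality(catalog, modality):
    """Return the names of entries that list the given modality (case-insensitive)."""
    wanted = modality.casefold()
    return [
        entry.name
        for entry in catalog
        if any(m.casefold() == wanted for m in entry.modalities)
    ]


pet_datasets = with_modality(CATALOG, "PET")
print(pet_datasets)
```

The same structure extends to the subspecialty sections below by appending their rows to `CATALOG`.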

Breast

| Dataset | Description | Modality |
| --- | --- | --- |
| Digital Database for Screening Mammography (DDSM) | A large dataset of mammography images used for screening breast cancer. | Mammography |
| Curated Breast Imaging Subset of DDSM | A curated subset of the DDSM with improved annotations and standardized formats. | Mammography |
| INbreast | A full-field digital mammography dataset with detailed annotations. | Mammography |
| Breast Cancer Digital Repository | A repository of breast cancer cases with mammography and ultrasound images. | Mammography, Ultrasound |
| Mammographic Image Analysis Society Database | A collection of mammography images with annotations for normal, benign, and malignant cases. | Mammography |
| VinDr-Mammo | A large-scale dataset of mammography images for breast cancer detection. | Mammography |
| Breast Ultrasound Images Dataset | Dataset containing breast ultrasound images for classifying benign and malignant tumors. | Ultrasound |
| Optical Coherence Tomography & Intravascular Ultrasound of Tumours | Imaging data using OCT and IVUS for the characterization of breast tumors. | OCT, IVUS |
| Tomosynthesis Mammographic Imaging Screening Trial | A dataset from a large-scale clinical trial comparing 3D tomosynthesis mammography with standard digital mammography. | Tomosynthesis Mammography |

Cardiac

| Dataset | Description | Modality |
| --- | --- | --- |
| UK Biobank Cardiac MRI Dataset | A large-scale dataset containing over 45,000 cardiac MRI scans and associated clinical data from participants. | Cardiac MRI |
| Cardiac Atlas Project (CAP) | A dataset of over 3,000 cardiac MRI images, including detailed annotations and segmentations. | Cardiac MRI |
| Automated Cardiac Diagnosis Challenge | A dataset used for the MICCAI 2017 challenge, containing annotated cardiac MRI images for 150 patients for automated diagnosis. | Cardiac MRI |
| Multi-Ethnic Study of Atherosclerosis | A dataset containing cardiac MRI and CT images from over 6,000 participants, along with extensive clinical data. | Cardiac MRI, Cardiac CT |
| Stanford Cardiac MRI Dataset | A collection of over 1,000 cardiac MRI scans from patients with various cardiac conditions, used for developing AI models. | Cardiac MRI |
| Sunnybrook Cardiac Data | A dataset for the 2009 Cardiac MR Left Ventricle Segmentation Challenge, containing annotated cardiac MRI images from 45 studies. | Cardiac MRI |
| EchoNet-Dynamic | A large dataset of over 10,000 echocardiogram videos for automated cardiac disease diagnosis. | Echocardiography |
| DEMAND (Dynamic Imaging of the Heart) | A dataset containing dynamic cardiac MRI images from 82 patients, with annotations for ventricular function analysis. | Cardiac MRI |
| LVSC (Left Ventricle Segmentation Challenge) | A dataset used for the MICCAI 2011 challenge, containing annotated cardiac MRI images from 200 patients. | Cardiac MRI |
| Atrium Segmentation Challenge Database | A dataset for the segmentation of the left atrium in cardiac MRI images from 154 patients, used in the MICCAI 2018 challenge. | Cardiac MRI |

Chest

| Dataset | Description | Modality |
| --- | --- | --- |
| ChestX-ray14 | A dataset of over 112,000 chest X-ray images from 30,805 unique patients, annotated for 14 different pathologies. | X-ray |
| CheXpert | A large dataset of 224,316 chest X-ray images from 65,240 patients, labeled for 14 observations including pleural effusion and pneumonia. | X-ray |
| MIMIC-CXR | A dataset of 377,110 chest X-rays from 227,827 radiographic studies, associated with the MIMIC-III clinical database. | X-ray |
| VinDr-CXR | A dataset of 18,000 chest X-ray images with annotations for 22 disease labels, collected from two major hospitals in Vietnam. | X-ray |
| PadChest | A large dataset containing 160,868 chest X-ray images labeled with 174 radiographic findings, including chest conditions and lung diseases. | X-ray |
| RSNA Pneumonia Detection Challenge | A dataset of 26,684 chest X-ray images annotated for pneumonia detection, used in the RSNA 2018 challenge. | X-ray |
| SIIM-ACR Pneumothorax Segmentation | A dataset containing 12,047 chest X-ray images for pneumothorax segmentation, used in the SIIM-ACR 2019 challenge. | X-ray |
| COVID-19 Radiography Database | A dataset of 21,165 chest X-ray images categorized into COVID-19, normal, and pneumonia classes. | X-ray |
| Montgomery County X-ray Set | A dataset containing 138 chest X-ray images, annotated for tuberculosis. | X-ray |
| Shenzhen Hospital X-ray Set | A dataset containing 662 chest X-ray images, annotated for tuberculosis. | X-ray |
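Several chest datasets above contain more images than patients (ChestX-ray14 lists over 112,000 images from 30,805 patients, roughly 3.6 images per patient). When such data is used for training and validation, a common precaution is to split by patient rather than by image, so the same person never appears on both sides. The sketch below is one minimal way to do that with the standard library. The `(image_id, patient_id)` record layout, the function names, and the 20% validation fraction are illustrative assumptions, not the format or protocol of any listed dataset.

```python
import hashlib


def assign_split(patient_id, val_fraction=0.2):
    """Map a patient ID to 'train' or 'val' deterministically via a hash."""
    digest = hashlib.sha256(patient_id.encode("utf-8")).digest()
    # Interpret the first 8 bytes as a number in [0, 1).
    bucket = int.from_bytes(digest[:8], "big") / 2**64
    return "val" if bucket < val_fraction else "train"


def split_by_patient(records, val_fraction=0.2):
    """Group (image_id, patient_id) records so each patient lands in one split."""
    splits = {"train": [], "val": []}
    for image_id, patient_id in records:
        splits[assign_split(patient_id, val_fraction)].append((image_id, patient_id))
    return splits


# Hypothetical records: 50 patients with 3 images each.
records = [
    (f"img_{p:03d}_{k}.png", f"patient_{p:03d}")
    for p in range(50)
    for k in range(3)
]

splits = split_by_patient(records)
train_patients = {pid for _, pid in splits["train"]}
val_patients = {pid for _, pid in splits["val"]}
print(len(splits["train"]), len(splits["val"]), train_patients & val_patients)
```

Hashing the patient ID keeps the assignment stable across runs and across newly added images, at the cost of the validation share only approximating `val_fraction`.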

Gastrointestinal

| Dataset | Description | Modality |
| --- | --- | --- |
| TCIA Pancreas-CT | A dataset containing 82 contrast-enhanced abdominal CT images for pancreatic cancer research, with detailed annotations. | CT |
| CT Colonography (CTC) Screening Reference Case Library | A dataset of 825 CT colonography cases with annotated polyps, providing a reference for colorectal cancer screening. | CT |
| Liver Tumor Segmentation (LiTS) Challenge | A dataset of 201 contrast-enhanced CT scans with liver and tumor annotations, used for the LiTS challenge. | CT |
| Medical Segmentation Decathlon (MSD) - Liver | Part of the MSD, this dataset contains 131 liver CT scans with segmentations for liver and tumors. | CT |
| TCIA Pancreas-CT-SEG | A dataset of 282 CT scans with segmentations of the pancreas, spleen, and other abdominal organs. | CT |
| NIH Clinical Center's DeepLesion Dataset | A large dataset of 32,735 annotated lesions from 10,594 CT images, covering a wide range of organs including gastrointestinal structures. | CT |
| TCIA Colorectal Cancer (CRC) | A dataset of 48 CT scans from colorectal cancer patients, with tumor annotations. | CT |
| CHAOS - Combined (CT-MR) Healthy Abdominal Organ Segmentation | A dataset with both CT and MR images for liver, spleen, and kidney segmentation challenges. 40 CT and 02 MR scans. | CT, MRI |

Genitourinary

| Dataset | Description | Modality |
| --- | --- | --- |
| PROSTATEx Challenge | A dataset of prostate multiparametric MRI scans from 204 patients, including detailed annotations for prostate cancer detection. | MRI |
| TCIA Kidney Tumor Segmentation (KiTS) Challenge | A dataset of 300 CT scans with annotated kidney tumors, used for the KiTS 2019 and KiTS 2021 challenges. | CT |
| Decathlon Prostate | Part of the Medical Segmentation Decathlon, this dataset includes 48 prostate MRI scans with annotations for prostate segmentation. | MRI |
| PROMISE12 Challenge | A dataset containing 50 prostate MRI scans, used for the PROMISE12 segmentation challenge. | MRI |
| QIN PROSTATE | A dataset of multiparametric prostate MRI scans from 28 patients, with detailed clinical annotations. | MRI |
| TCIA RCC (Renal Cell Carcinoma) | A dataset of 105 CT scans of patients with renal cell carcinoma, including annotations for kidney tumor segmentation. | CT |
| PI-CAI (Prostate Imaging - Cancer AI) Challenge | A dataset of multiparametric MRI scans from 1,000 patients, used for the PI-CAI challenge focusing on prostate cancer detection. | MRI |
| MRI Kidney Dataset | A dataset of 120 MRI scans with annotations for kidney and tumor segmentation, supporting renal mass research. | MRI |

Head & Neck

| Dataset | Description | Modality |
| --- | --- | --- |
| TCIA HNSCC (Head and Neck Squamous Cell Carcinoma) | A comprehensive dataset of 200 CT and MRI scans from patients with head and neck squamous cell carcinoma, including detailed annotations. | CT, MRI |
| QIN-HEADNECK | A dataset containing annotated CT images from 52 patients with head and neck cancer, including gross tumor volumes and lymph nodes. | CT |
| Cancer Imaging Archive (TCIA) - Head-Neck-PET-CT | A dataset of 298 patients with head and neck cancer, containing PET/CT scans and radiation therapy structure sets. | PET, CT |
| TCIA HNSCC Radiomics | A dataset of CT images from 298 patients with head and neck squamous cell carcinoma, used for radiomics analysis. | CT |
| Head and Neck Cetuximab Dataset | A dataset of 139 patients treated with Cetuximab for head and neck cancer, containing CT images and clinical outcomes. | CT |
| TCGA-HNSC (The Cancer Genome Atlas - Head and Neck Squamous Cell Carcinoma) | A dataset comprising imaging data, genetic data, and clinical outcomes for head and neck squamous cell carcinoma patients. Imaging data includes CT and MRI scans. | CT, MRI |
| RECUR Database | A dataset of 256 head and neck CT scans used for recurrence prediction in head and neck cancer patients. | CT |
| MICCAI Head and Neck Auto Segmentation Challenge (2015) | A dataset of 48 CT scans used for the 2015 MICCAI challenge on head and neck auto-segmentation. | CT |
| PDDCA (Public Domain Database for Computational Anatomy) | A dataset containing 48 CT scans with segmentations of anatomical structures in the head and neck region. | CT |

Musculoskeletal

| Dataset | Description | Modality |
| --- | --- | --- |
| MURA (Musculoskeletal Radiographs) | A large dataset of 40,561 musculoskeletal radiographs from 14,863 studies, annotated for abnormalities in upper-extremity regions: the shoulder, humerus, elbow, forearm, wrist, hand, and finger. | X-ray |
| Osteoarthritis Initiative (OAI) | A comprehensive dataset containing MRI, X-ray, and clinical data from 4,796 participants aimed at understanding knee osteoarthritis. | MRI, X-ray |
| RISE (Radiographs of Individuals with Suspected Extremity fractures) | A dataset containing over 26,000 X-ray images of various extremity fractures, used for developing fracture detection models. | X-ray |
| FAST-MRI | A dataset of knee MRI scans aimed at accelerating MRI scans using AI, including over 1,600 volumes and more than 1 million images. | MRI |
| MHSP (Military Health System Polytrauma) | A dataset of over 10,000 X-ray and CT scans from polytrauma patients, including musculoskeletal injuries. | X-ray, CT |
| BoneXpert Hand X-rays | A dataset of 14,236 hand X-rays annotated for bone age assessment. | X-ray |
| AIIMS-UP Orthopedic Dataset | A dataset of 8,000 orthopedic X-rays, including images of the spine, hip, and other regions. | X-ray |
| NHANES II (National Health and Nutrition Examination Survey) | A dataset including hand and wrist X-rays from over 5,000 participants for studying bone health. | X-ray |
|  | A dataset of 14,000 orthopedic X-ray images used for developing models for bone fracture detection. | X-ray |
|  | A dataset of over 2,000 CT and MRI scans of bone tumors, including detailed annotations. | CT, MRI |

Neuroradiology

| Dataset | Description | Modality |
| --- | --- | --- |
| RSNA Intracranial Hemorrhage Detection | A large dataset of 874,035 head CT images (752,803 of them in the training set), annotated for intracranial hemorrhage. | CT |
| RSNA-ASNR-MICCAI BraTS 2021 | A comprehensive dataset of 125,000 MRI scans used for the RSNA-ASNR-MICCAI BraTS 2021 challenge, focused on brain tumor segmentation. | MRI |
| ADNI (Alzheimer's Disease Neuroimaging Initiative) | A dataset containing MRI and PET scans, along with clinical data from over 2,400 participants aimed at studying Alzheimer's disease progression. | MRI, PET |
| OASIS (Open Access Series of Imaging Studies) | A dataset of MRI scans from 1,109 subjects, including individuals with Alzheimer's disease and healthy controls. | MRI |
| HCP (Human Connectome Project) | A comprehensive dataset including MRI scans from 1,200 healthy adults, aimed at mapping human brain connectivity. | MRI |
| IXI Dataset | A dataset of brain MRI scans from 600 healthy subjects, including T1, T2, and PD-weighted images. | MRI |
| ABIDE (Autism Brain Imaging Data Exchange) | A dataset of brain MRI scans from over 1,000 individuals, including those with autism spectrum disorders and healthy controls. | MRI |
| TCIA Glioblastoma Multiforme (TCGA-GBM) | A dataset of MRI scans from 262 patients with glioblastoma multiforme, including genetic and clinical data. | MRI |
| TCIA Lower Grade Glioma (TCGA-LGG) | A dataset of MRI scans from 199 patients with lower-grade glioma, including genetic and clinical data. | MRI |
| BraTS (Brain Tumor Segmentation) | A dataset of multimodal MRI scans from over 2,000 patients with brain tumors, used for the BraTS challenges. | MRI |

Paediatrics

| Dataset | Description | Modality |
| --- | --- | --- |
| RSNA Pediatric Bone Age Challenge | A dataset of 12,611 hand X-ray images used for bone age assessment in pediatric patients. | X-ray |
| Pediatric Chest X-ray Dataset | A dataset containing 5,856 pediatric chest X-ray images, annotated for 14 common thoracic diseases. | X-ray |
| Children's Hospital Los Angeles (CHLA) Pediatric Pneumonia Dataset | A dataset of 5,863 pediatric chest X-ray images, categorized as normal, bacterial pneumonia, and viral pneumonia. | X-ray |
| Shenzhen Pediatric X-ray Dataset | A dataset of 662 chest X-ray images from pediatric patients, annotated for tuberculosis. | X-ray |
| Pediatric Radiology Database | A comprehensive dataset of over 10,000 pediatric radiographs from various body parts, annotated for different conditions. | X-ray |
| Pediatric Bone Marrow MRI Dataset | A dataset of 100 pediatric bone marrow MRI scans, used for the evaluation of bone marrow pathology. | MRI |
| Pediatric Abdominal Ultrasound Dataset | A dataset of 500 pediatric abdominal ultrasound images, annotated for various abdominal pathologies. | Ultrasound |
| Pediatric Brain MRI Dataset | A dataset of 300 brain MRI scans from pediatric patients, including various neurological conditions. | MRI |
| Pediatric Trauma CT Dataset | A dataset of 1,000 pediatric trauma CT scans, used for research in traumatic injury detection. | CT |
| Pediatric Bone Tumor Dataset | A dataset of 200 pediatric bone tumor MRI scans, annotated for tumor type and stage. | MRI |

Vascular

| Dataset | Description | Modality |
| --- | --- | --- |
| Vascular Imaging of the Carotid Artery (VIP) | A dataset containing 1,000 carotid artery ultrasound images, annotated for atherosclerotic plaque and stenosis. | Ultrasound |
| TCIA Aortic Dissection Imaging Archive | A dataset of 750 CT angiography images from patients with aortic dissection, including detailed annotations. | CT Angiography |
| UK Biobank Aortic MRI Dataset | A dataset containing over 10,000 aortic MRI scans, with detailed measurements of aortic dimensions and function. | MRI |
| Stanford Vascular Dataset | A dataset of 1,500 CT and MRI scans focused on various vascular conditions, including aneurysms and vascular malformations. | CT, MRI |
| Vascular Ultrasound (VASCU) | A dataset containing 2,000 vascular ultrasound images, annotated for various vascular pathologies. | Ultrasound |
| TCIA Pulmonary Embolism Detection Dataset | A dataset of 1,000 CT pulmonary angiography (CTPA) images, annotated for the presence of pulmonary embolism. | CTPA |
| Retinal Vessel Segmentation Dataset | A dataset of 400 retinal images with manually segmented blood vessels, used for research in retinal vascular conditions. | Fundus Imaging |
| Coronary Artery Disease (CAD) Dataset | A dataset of 3,000 coronary CT angiography (CCTA) images, annotated for coronary artery disease and stenosis. | CT Angiography |
| Abdominal Aortic Aneurysm (AAA) Dataset | A dataset containing 800 CT scans of patients with abdominal aortic aneurysm, annotated for aneurysm size and morphology. | CT |
| Cerebrovascular Disease Dataset | A dataset of 1,200 MRI and CT scans from patients with various cerebrovascular diseases, including stroke and aneurysms. | CT, MRI |

Radionuclide

| Dataset | Description | Modality |
| --- | --- | --- |
| ADNI (Alzheimer's Disease Neuroimaging Initiative) | A dataset containing over 1,200 PET scans, along with clinical data and MRI scans, used to study Alzheimer's disease progression. | PET |
| TCIA NSCLC Radiogenomics | A dataset containing PET/CT scans from 211 patients with non-small cell lung cancer, including radiogenomic data. | PET, CT |
| TCIA Head-Neck-PET-CT | A dataset of PET/CT scans from 298 patients with head and neck cancer, including annotations and radiation therapy structure sets. | PET, CT |
| NIH Clinical Center's DeepLesion Dataset | A dataset of 32,735 annotated lesions from 10,594 CT images, including a subset with PET/CT scans. | PET, CT |
| Parkinson's Progression Markers Initiative (PPMI) | A dataset containing over 3,000 DAT-SPECT scans, along with clinical and imaging data to study Parkinson's disease. | SPECT |
| ABIDE (Autism Brain Imaging Data Exchange) | A dataset of brain PET and MRI scans from over 1,000 individuals, including those with autism spectrum disorders and healthy controls. | PET, MRI |
| CAMD (Coalition Against Major Diseases) | A dataset of over 2,000 FDG-PET scans from patients with Alzheimer's disease and other dementias, used for biomarker research. | FDG-PET |
| OASIS-3 (Open Access Series of Imaging Studies) | A dataset containing over 2,000 PET and MRI scans from individuals with Alzheimer's disease, mild cognitive impairment, and healthy controls. | PET, MRI |
| TCIA Glioblastoma Multiforme (TCGA-GBM) | A dataset of PET and MRI scans from 262 patients with glioblastoma multiforme, including genetic and clinical data. | PET, MRI |
| TCIA Brain Tumor Segmentation (BraTS) | A dataset of multimodal MRI and PET scans from over 2,000 patients with brain tumors, used for the BraTS challenges. | PET, MRI |