This repository provides a curated list of publicly available histopathology datasets, accompanied by relevant metadata to facilitate research, analysis, and model development in medical imaging and pathology.
The aim of this repository is to centralize information on diverse histopathology datasets for researchers, data scientists, and healthcare professionals working in fields such as machine learning, computer vision, and biomedical research. Each dataset entry includes detailed information such as:
- Dataset Name: Identifying title for easy reference
- Tissue Type/Organ: Type of tissue or organ examined
- Staining Method: Type of staining used in images
- Link: Direct URL to the dataset source for quick access
- Magnification and Scanners: Magnification levels and imaging equipment specifications
- Dataset Size: Number of images, patches, or slides included
- Resolution: Resolution or pixel density of the images
- Collection Method: Type of surgical or imaging procedure used to gather the data
- Patient Information: Number of patients included in the dataset
- Year of Publication: Year when the dataset was made publicly available
| Dataset Name | Tissue Type/Organ | Staining Method | Link | Magnification and Scanners | Dataset Size | Resolution | Collection Method | Patient Information | Year of Publication |
|---|---|---|---|---|---|---|---|---|---|
| BreakHis | Breast | H&E | Link | (Magnification, Benign, Malignant): (40X, 652, 1,370) (100X, 644, 1,437) (200X, 623, 1,390) (400X, 588, 1,232) |
(Benign,Malignant,Total) (2480, 5429, 7909) |
700 x 460 pixels | SOB method | 82 | 2017 |
| LC25000 | Lung and colon | Link | ./lung_image_sets/lung_aca : 5000 ./lung_image_sets/lung_n : 5000 ./lung_image_sets/lung_scc : 5000 ./colon_image_sets/colon_n : 5000 ./colon_image_sets/colon_aca : 5000 |
768 x 768 pixels | 2019 | ||||
| Ovarian Bevacizumab Response | Ovary | H&E | Link | (Maginification, lense) (20X, Leica AT Turbo, Leica, Germany) |
288 H&E stained WSIs (including 162 effective and 126 invalid WSIs) |
54342 × 41048 in pixels on average | debulking surgery | 78 | 2022 |
| PLISM(PLISM-original,PLISM-WSI, PLISM- sm) | Various | H&E | Link Link Link |
400X WSI Scanner : NanoZoomer-S360 C13220-01 scanner, NanoZoomer-S210 C13239-01 scanner, NanoZoomer-SQ C13140-D03, NanoZoomer-S60 C13210-01, Aperio AT2, Aperio GT450, Ultrafast Scanner Smartphone cameras: Galaxy S20 5G SC-51A, moto g8, Redmi Note9 Pro, iPhone 6, iPhone 13 mini, iTel P33 |
PLISM-orginal subset consists of 91 original WSIs before image registration PLISM-WSI: 310,947 (3,417 Aligned Image Groups × 91 WSIs) PLISM-sm: 57,902 (4,454 Aligned Image Groups × 13 devices) |
PLISM-original: 0.22 to 0.26 µm/pixel PLISM-WSI: 1024 x 1024 pixels PLISM-sm: 512 x 512 pixels |
46 | 2024 | |
| Chaoyang | Link | 20X | Training samples: 1111 normal, 842 serrated, 1404 adenocarcinoma, 664 adenoma. Testing samples: 705 normal, 321 serrated, 840 adenocarcinoma, 273 adenoma. |
512×512 pixels | 2021 | ||||
| DiagSet | Prostate | H&E | Link | 40X, 20X, 10X and 5X Hamamatsu C12000-22 digital slide scanner |
Over 2.6 million tissue patches extracted from 430 fully annotated scans 4675 scans with assigned binary diagnoses 46 scans with diagnoses independently provided by a group of histopathologists |
0.25 μm/pixel, 0.50 μm/pixel, 1.0 μm/pixel, and 2.0 μm/pixel | 2024 | ||
| BCNB | Breast (specifically for early breast cancer) |
H&E | Link | Iscan Coreo pathologic scanner 200x magnification |
Core-needle biopsy | 1058 | 2021 | ||
| MHIST | Colorectal polyps | H&E | Link | 40x Aperio AT2 scanner |
3152 | 224 x 224 pixels | Images were extracted from 328 Formalin Fixed Paraffin-Embedded (FFPE) whole-slide images of colorectal polyps | 2021 | |
| A histopathological image dataset for grading breast invasive ductal carcinomas | Breast | H&E | Link | 4x, 10x, 20x, and 40x Apple iPhone 7 Plus camera |
922 | 2100 × 1574 and 1276 × 956 pixels | 124 | 2020 | |
| Histopathological Image based Skin Cancer Classification Using CNN | Skin | H&E | Link | ×40, ×100, ×200, and ×400 Olympus BX63 Digital Motorized Upright Advanced microscope |
16,099 histology images with data augmentation 4,357 histology images without data augmentation |
1600 × 1200 pixels | 354 | 2021 | |
| Histopathological image patches from colorectal cancer with three classes: tumor, stroma, and other | Colorectal | H&E | Link | 0.5 MPP Hamamatsu NanoZoomer-XR |
2770 image patches | 224 x 224 pixels | 17 | 2023 | |
| Histopathological imaging database for Oral Cancer analysis | Oral cavity | H&E | Link | Normal Epithelium: 89 images at 100x magnification 201 images at 400x magnification Oral Squamous Cell Carcinoma (OSCC): 439 images at 100x magnification 495 images at 400x magnification Leica ICC50 HD microscope |
1224 | 230 | V1:2019 V2:2023 |
||
| A histopathological image dataset for endometrial disease diagnosis | Endometrium | H&E | Link | 10x, 20x Mixotic scanner |
3500 | 640x480 pixels | Hysteroscopic surgery or hysterectomy | 498 | 2017-2018 |
| Multi-class texture analysis in colorectal cancer histology | Colorectal | H&E | Link | 5000 | 150 px x 150 px | 2016 | |||
| Invasive Ductal Carcinoma (IDC) Histology Image Dataset | Breast | Link | 40x | 277,524 patches | 50x50 pixels per patch | 162 | 2015 | ||
| OvarianCancer&SubtypesDatasetHistopathology | Ovary | Link | 2021 | ||||||
| ViCE Histopathology Images | Intestine | H&E | Link | 2024 | |||||
| HEPASS Algorithm Dataset | Liver | H&E | Link | 385 | 77 | 2020 | |||
| RINGS algorithm dataset | Prostate | H&E | Link | 40x | train: 1000 , test: 500 | 1500X1500 | 2021 | ||
| NDB-UFES | Oral | H&E | Link | 10X, 40 X Olympus DP73 microscope, Olympus Standard cellLens |
237 | 2048 x 1536 | Biopsy | 2023 | |
| RENFAST algorithm dataset | Kidneys | PAS, TRIC | Link | Hamamatsu NanoZoomer S210, 10x magnification |
650 | 512x512 pixels | Biopsy | 65 | 2020 |
| DeepHP Dataset | Gastric Mucosa | H&E | Link Link |
ZEISS Microscope Axio imager.M2, ×20 magnification |
394,926 patches | 256x256 pixels | Biopsy | 19 | 2022 |
| HistMNIST | Bone Marrow | H&E | Link | 40× magnification | 10,800 images | 28x28 pixels | Biopsy | 16 | 2018 |
| Histopathology Imagery Dataset of Ph-negative Myeloproliferative Neoplasm | Bone Marrow | Link | Olympus BX41 Dual head microscope, x10, x20, x40 lens types |
300 images | Biopsy | 2023 | |||
| NCT-CRC-HE-100K | colon and rectum | H&E | Link | 0.5 microns per pixel | 100,000 images | 224x224 pixels | Biopsy | 86 | 2017 |
| NCT-CRC-HE-100K-NONORM | colon and rectum | H&E | Link | 0.5 microns per pixel | 100,000 images | 224x224 pixels | Biopsy | 86 | 2017 |
| CRC-VAL-HE-7K | Colorectal Adenocarcinoma | H&E | Link | 0.5 microns per pixel | 7,180 images | 224x224 pixels | Biopsy | 50 | 2017 |
| Histological Image Processing Features Induce a Quantitative Characterization of Chronic Tumor Hypoxia | colon | H&E, anti-pimonidazole | Link | 10x, 20x magnification | Biopsy | 2016 | |||
| UNITOPATHO | Colorectal polyps | H&E | Link | 20x magnification | 9536 patches (292 whole-slide images) | 224x224 pixels | Biopsy | 292 | 2021 |
| HistologyHSI-GB | Brain | H&E | Link | 20× magnification, Olympus BX-53 microscope, Hyperspec VNIR A-Series camera | 482 | 800 × 1004 pixels | 13 | 2024 | |
| H&E-stained oral squamous cell carcinoma histological images dataset | Oral Cavity | H&E | Link | 20x magnification | 1,020 images | 640 X 640 px | 2022 | ||
| Gastric Cancer Histopathology Tissue Image Dataset (GCHTID) | stomach | H&E | Link | 31096 non-overlapping images | 224x224 pixels | 2024 | |||
| Enteroscope Biopsy Histopathological H&E Image Dataset (EBHI-Seg) | Colorectal | H&E | Link | 400× magnification, Olympus microscope, NewUsbCamera |
4,456 images (2,228 histopathology images and 2,228 ground truth images) |
224x224 pixels | Intestinal biopsy | 2023 | |
| Automatic Registration Of Breast Cancer Tissue | Breast tissue | IHC, H&E | Link | 750 cases for development, 100 cases for validation, 200 cases for testing |
2023 | ||||
| Automatic Non-rigid Histological Image Registration (ANHIR) Dataset | Various(lesions, lung-lobes, mammary-glands) | Different dyes | Link | High-resolution (up to 40x magnification) |
Up to 100k x 200k pixels | 2019 | |||
| ICIAR 2018 BACH Dataset | Breast tissue | H&E | Link | 400+ labeled microscopy images, 10 pixel-wise labeled, 20 non-labeled whole-slide images |
2018 | ||||
| BRACS | Breast tissue | Link | 40x magnification | WSIs: 578 (Training: 395, Validation: 65, Testing: 118) RoIs: 6,758 (Training: 3,657, Validation: 312, Testing: 474) |
WSIs: 100,000 x 100,000 pixels, RoIs: 4,000 x 4,000 pixels |
2020 | |||
| CAMELYON16 | Sentinel lymph node | Link | 400 WSIs (Training: 270, Testing: 130) | 2016 | |||||
| CAMELYON17 | Sentinel lymph node | H&E | Link | 1399 WSIs (Training: 1000+, Testing: 399) | 2017 | ||||
| CPTAC-BRCA | Breast | Link | 642 | 134 | 2021 | ||||
| DRYAD dataset (hashi) | Breast | Link | 40x magnification, Aperio and Ventana scanners |
Various (scaled images) | 500 | 2018 | |||
| GTEx Portal | multiple | H&E | Link | Various | Dissected from the central breast subareolar region of the right breast | 948 patients (multiple slides per patients) | |||
| HER2-Warwick dataset | Breast | H&E | Link | 100 WSIs | 100,000 x 80,000 pixels | 2016 | |||
| HEROHE | Breast | H&E | Link | 3D Histech Pannoramic 1000 | 510 | 2022 | |||
| IMPRESS dataset | Breast | Hematoxylin and Eosin (HE), IHC (PD-L1, CD8+, CD163+) | Link | 20x magnification, Hamamatsu scanner | 186 (HER2+ and TNBC) |
2023 | |||
| Post-NAT-BRCA | Breast | Link | |||||||
| SLN-Breast | Breast, Axillary Lymph Nodes | H&E | Link | Leica Aperio AT2 scanners at 20x magnification | 130 | 78 | 2019 | ||
| TCGA-BRCA Dataset | Breast | Link | 139 | 2020 | |||||
| TIGER Dataset | Breast | H&E | Link | ||||||
| GasHisSDB | Stomach | H&E | Link | Magnification: 20x, Microscopes: Nikon (Japan) and Olympus (Japan) |
245,196 images | Sub-database A: 160×160 pixels, Sub-database B: 120×120 pixels, Sub-database C: 80×80 pixels |
2021 | ||
| MIHIC | Lung | IHC, H&E | Link | Magnification: 40x | 309,698 image patches | 128x128 pixels | Tissue Microarray (TMA) | 114 | 2024 |
| BMIRDS Dartmouth Lung Cancer Histology Dataset | Lung adenocarcinoma | H&E | Link | 20x or 40x magnification; Aperio AT2 whole-slide scanner |
143 | Formalin-Fixed Paraffin-Embedded (FFPE) | 2019 | ||
| BMIRDS Dartmouth Kidney Cancer Histology Dataset | Renal Cell Carcinoma | H&E | Link | 20x magnification; Aperio AT2 whole-slide scanner |
563 | Formalin-Fixed Paraffin-Embedded (FFPE) | 2021 | ||
| Gland Segmentation in Colon Histology Images (GlaS) | Colon | H&E | Link | Zeiss MIRAX MIDI Slide Scanner, 20× objective magnification |
165 images | Formalin-Fixed Paraffin-Embedded (FFPE) | 16 | 2015 | |
| ACDC-LungHP | Lung | H&E | Link | 3DHISTECH Pannoramic 250, 20x objective magnification |
200 slides, 150 training, 50 test | 2019 | |||
| Adipocyte(Count-ception) | skin | H&E | Link | 40x | 200 slides | VGG: 256x256, Adipocyte: 150x150 |
2017 | ||
| MBM(countception) | bone | H&E | Link | 40x | 44 patches | MBM: 600x600 | 2017 | ||
| ADP | multiple | H&E Mostly | Link | 40x, Huron TissueScope LE1.2 |
17,668 patches | 1088x1088 pixels | |||
| AGGC | prostate | H&E | Link | 20x Subset1 and Subset2: Akoya Biosciences Scanner, Subset3: each specimen is scanned by multiple scanners |
187 prostatectomy and 156 biopsy specimens | 2022 | |||
| Multi-Class Cell Detection Using Spatial Context Representation (BRCA-M2C ) | breast | H&E | Link | 20x | 120 patches (Breast), 57 patches (Lung), 41 patches (Colorectal) |
2022 | |||
| HER2 tumor ROIs | breast | H&E | Link | 20x Aperio ScanScope |
512x512 | 273 | 2022 | ||
| CryoNuSeg | Multiple | H&E | Link | 40x magnification, various scanning centers | 30 WSIs from 10 organs, 7596 nuclei (Annotator 1) | 512 x 512 pixels | 2021 | ||
| Lizard | Colon | H&E | Link | 20x magnification | 495,179 nuclei in 291 image regions | 1016 x 917 pixels | 2021 | ||
| Pan-tumor T-lymphocyte dataset | Multiple | IHC | Link | 40x magnification, NanoZoomer 2.0-HT scanner (Hamamatsu) |
92 tumor samples from 4 tumor indications | 2150 x 2150 | Formalin-Fixed Paraffin-Embedded (FFPE) | 92 | 2023 |
| CoNSeP - HoVer-Net | Colorectal adenocarcinoma | H&E | Link | 40x Omnyx VL120 scanner |
41 image tiles | 1000 x 1000 pixels | Surgical resection | 16 | 2019 |
| The PANDA challenge | Prostate | H&E | Link | Various scanners including 3DHISTECH, Hamamatsu Photonics, and Leica Biosystems | 12,625 biopsies | Retrospective collection | 2022 | ||
| PanNuke | multiple | H&E | Link | 40x | 2019 | ||||
| Janowczyk et al. | breast | H&E | Link | 40x | 143 | 2000X2000 | 2015 | ||
| NuCLS | breast | H&E | Link | 40x | 125 | 2021 | |||
| DigestPath2019 - colonoscopy tissue segment | Colon | H&E | Link | 20X | Train: 660, Test: 212 | 5000X5000 | 2019 | ||
| Bone-Marrow-Cytomorphology | Marrow | May-Grünwald-Giemsa/Pappenheim | Link | 40X | 250X250 | 945 | 2021 | ||
| PAIP2021 | Multiple (Colon, Prostate, Pancreas) |
H&E | Link | 20x Aperio AT2 |
Train: 150, Valid: 30, Test: 60 | 2021 | |||
| WSSS4LUAD | Lung | H&E | Link | Magnification: 40X, Scanner: Leica GT 450 |
87 (Train: 53, valid: 12, Test: 12) | 2022 | |||
| Cellseg | multiple | multiple | Link | Various scanners | 1,000 patches | ||||
| PAIP2023 | multiple organ | H&E | Link | 2023 | |||||
| NuClick | Lymphocyte | H&E, IHC | Link | Train: 671, Valid: 200 | patch (256x256) | 2020 | |||
| MIDOG 2022 | H&E | Link | Train: 405 cases, 9501 mitotic annotation | 2022 | |||||
| MIDOG 2021 | Breast | H&E | Link | Scanner 1: Hamamatsu XR nanozoomer 2.0 Scanner 2: Hamamatsu S360 (0.5 NA) Scanner 3: Aperio ScanScope CS2 Scanner 4: Leica GT450 |
200 wsi: 50 wsi / scanners - 4 scanners | 2021 | |||
| MIDOG++ | multiple (Breast, Lung, Lymph nodes, Skin) |
H&E | Link | 503 images | 0.25 µm/px or 0.23 µm/px | Archived tissue blocks, routine processing steps | 2023 | ||
| CAMEL | Lymph nodes, Colorectal tissue | H&E | Link | Magnification: 20x, Scanner: Various | 177 | 0.25 µm/px | 2019 | ||
| OCELOT | Multiple (Bladder, Endometrium, Head-and-neck, Kidney, Prostate, Stomach) |
H&E | Link | 304 | 1024X1024 | 2023 | |||
| CAMELYON | Breast (Lymph node) | H&E | Link | Magnification: 240 nm/px, Scanners: 3DHistech Pannoramic Flash II 250, Hamamatsu NanoZoomer-XR C12000-01, Philips Ultrafast Scanner |
1399 | 2018 | |||
| BCSS | Breast | H&E | Link | 151 | 2019 | ||||
| SegPath | multiple | H&E | Link | Magnification: 40×, Slide scanner: Hamamatsu Nanozoomer S60 |
158,687 patches | Tissue microarray (TMA), Pathologist annotations | 1583 | 2023 | |
| SPIE-AAPM_NCI BreastPathQ | Breast | H&E | Link | Magnification: 20×, Slide scanner: Aperio AT Turbo 1757 |
96 WSI scans, 3698 patches | 0.5 μm/pixel | 55 (WSI) 64 (original) | 2018 | |
| NADT-Prostate | Prostate | Multiple (H&E, IHC) | Link | 141 tumor foci, 110 biopsies | 37 | 2021 | |||
| TCGA-TIL-WSI | Multiple (13) | H&E | Link | 5,455 WSIs | 50-micron | ~4,759 | 2018 | ||
| TUPAC16 - aux | Breast - mitoses | H&E | Link | 40x magnification, Leica SCN400 | 500 WSIs (Training), 73 (Mitosis), 148 (ROIs) | 50,000 x 50,000 pixels (WSI) | 2017 | ||
| UniToPatho | Colon | H&E | Link | 20x - Hamamatsu Nanozoomer S210 | 9536 patches, 292 WSIs | 224x224 pixels | 292 | 2021 | |
| Artificial intelligence for tumor tissue detection and histological regression grading in esophageal adenocarcinomas (Tolkach Y. et al.) | oesophageal adenocarcinomas | H&E | Link | 40x - Nanozoomer S360, Leica Aperio series histoscanners | UKK1: 34,704 patches from 22 wsi (20 patients); WNS: 121,642 patches from 62 wsi (15 patients); CHA: 32,796 patches from 214 wsi (69 patients); TCGA:178,187 patches from 22 wsi (22 patients) |
256x256 | 2023 | ||
| VisioMel | Melanoma | H&E | Link | train: 1342 wsi, test: 600, valid: 1200, 16 WSIs annotated | biopsy | 2023 | |||
| BreCaHAD | Breast | H&E | Link | 40x - Zeiss | 162 | 1360 × 1024 pixels | biopsy | 2019 | |
| UCSB Bio-Segmentation Benchmark dataset(Gelasca et al.) | Breast | H&E | Link | 50 | 2008 | ||||
| PATHVQA | Multiple | Multiple | Link | 4,998 images | Extracted from pathology textbooks, online | 2020 | |||
| CoNIC 2022 | Colon | H&E | Link | 20× magnification | 4,981 patches | 256×256 pixels | 2022 | ||
| MoNuSAC 2020 | multiple (Lung, Prostate, Kidney, Breast) |
H&E | Link | 40x scanner magnification | 31,000 nuclear annotations | 2020 | |||
| MoNuSeg | multiple (7) | H&E | Link | 40x scanner magnification | 30 images, 22,000 annotations | 2018 | |||
| ARCH | Multiple | H&E, IHC | Link | multiple magnification | 4270 | 2021 | |||
| Osteosarcoma-Tumor-Assessment | Bone | H&E | Link | 10X resolution | 1144 | 1024x1024 | 50 | 2019 | |
| CoCaHis | Colon | H&E | Link | 10X resolution | 1024x1024 | Intraoperative collection | 19 | 2019 | |
| Naylor et al. | Breast | H&E | Link | 50 | 11 | ||||
| SICAPv2 | Prostate | H&E | Link | 40x scanner magnification Ventana iScan Coreo |
155 | biopsy | 95 | 2021 | |
| CPTAC-COAD | Colon | Link | biopsy | 106 | 2021 | ||||
| PAIP2020 | Colon | H&E | Link | 40X magnification Aperio AT2 |
Train: 47, Valid: 31, Test: 40 | 118 | 2020 | ||
| KIMIA Path24C | multiple | multiple (IHC, H&E, Masson's trichrome) | Link | 20x -TissueScope LE 1.0. | 28380 | 1000x1000 pixels | 24 | 2021 | |
| UPENN-GBM | glioblastoma | H&E | Link | 40x | 71 | 34 | 2022 | ||
| Prostate Fused-MRI-Pathology | Prostate | H&E | Link | 20x magnification Aperio scanner |
32508 | Radical prostatectomy specimens | 28 | 2023 | |
| PAIP2019 | Liver | H&E | Link | 20x - Aperio AT2 | Train: 50, Valid: 10, Test: 40 | Resection specimens | 100 | 2019 | |
| LYON19 | Multiple (Breast, Colon, Protate) | IHC | Link | Pannoramic 250Flash II scanner | 441 | 0.24μm/px | 2019 | ||
| DigestPath2019 - signet ring cell | multiple (Gallbladder, Gastric mucosa, Lymph, Breast, Ovary, Pancreas, Lung, Urinary bladder, Abdominal wall nodule, Intestine) | H&E | Link | 40x | 127 WSIs (21 positive, 106 negative) | 2000x2000 pixels | 2019 | ||
| UPENN-GBM | glioblastoma | H&E | Link | 40x | 71 | 34 | 2022 | ||
| Multi-Scanner SCC | Skin (Canine) | H&E | Link | Aperio ScanScope CS2, NanoZoomer S210, NanoZoomer 2.0-HT, Pannoramic 1000, Aperio GT 450 |
220 | 2023 | |||
| Gleason_CNN | Prostate | H&E | Link | 40x - NanoZoomer-XR Digital slide scanner, Hamamatsu | 3100x3100 | 2018 | |||
| MITOS_WSI_CMC | Breast (Canine) | H&E | Link | 40x Aperio ScanScope CS2 | 2020 | ||||
| IMP-CRS 2024 | Colorectal | H&E | Link | 40X 2 Leica GT450 WSI scanners |
Train 4433 wsi, Test: 900 wsi | 2024 | |||
| CRCHisto | Colon | H&E | Link | 20x - Omnyx VL120 (UHCW) | 500x500 | 2016 | |||
| CPTAC-OV | Ovary | Link | 40x | 222 | 102 | 2021 | |||
| ENDO-AID | Endometrial Carcinoma | H&E | Link | 0.5um/px - 3DHistech P1000 | 91 | Pipelle biopsies | 2022 | ||
| TNBC | Breast | H&E | Link Link |
40x - Philips Ultra Fast Scanner | 50 | 512x512 | 11 | 2019 | |
| Histology images from uniform tumor regions in TCGA Whole Slide Images(Komura et al.) | Various | H&E | Link | 8736 | 128 x 128 μm to 256 x 256 μm | 7951 | 2021 | ||
| CRAG | Large intestine | H&E | Link | 20x, Omnyx VL120 Scanner | 213 | around 1500x1500 | 38 | 2019 | |
| DiagSeg | Prostate | H&E | Link | 5x, 10x, 20x, 40x - Hamamatsu C12000-22 | >2.6M patches (from 430 scans) 430 fully annotated scans, 4675 scans with binary diagnosis, and 46 scans with diagnosis given independently by a group of 9 histopathologists |
2021 | |||
| TUPAC16 | brain | H&E | Link | 40x | 500 | 2019 | |||
| TCGA | Multiple | H&E | Link | > 11k | |||||
| HunCRC | Colorectal | H&E | Link | 40x - 3DHistech Pannoramic 1000 | 101,389 patches - 200 wsi | 512x512 | Retrospective collection | 200 | 2022 |
| TissueNet | Uterine cervix | H&E | Link | MIRAX, Aperio, Hamamatsu | 1,016 WSIs; 5,926 patches | 1200x1200 px | 2020 | ||
| AML-Cytomorphology_LMU | Blood | Wright's stain | Link Link |
100x - M8 digital microscope/scanner | 18365 | 200 | 2019 | ||
| CPTAC-AML | Marrow, Blood | Link | 40x | 122 | 88 | 2020 | |||
| CRC-TP | CRC | H&E | Link Link |
280k patches (from 20 wsi) | 2020 | ||||
| DLBCL-morphology | Lymph Node | H&E and immunohistochemical stain | Link Link |
40x magnification, Aperio AT2 scanner | 52194 patches - 246 images | 224x224 | 209 | 2022 | |
| Kumar | multiple | H&E | Link Link |
40x | Train: 16 (13372 nuclei), test same organ (4130 nuclei): 8, test diff organ (4121 nuclei): 6 |
1000x1000 | 2017 | ||
| SegPC-2021 | Blood | Jenner-Giemsa | Link Link Link |
775 images, Train: 298, Valid: 200, Test: 277 | 2021 | ||||
| UMMC ER-IHC Breast Histopathology Whole Slide Image and Allred Score | Breast | ER-IHC | Link | 20x, 3DHistech Pannoramic DESK | 37 WSIs | ~80,000 x 200,000 pixels | 2023 | ||
| Benign Breast Tumor Dataset | Breast | Link | 83 | 2021 | |||||
| Cytological and histological images of breast cancer | Breast | H&E and immunohistochemical stain | Link | 40x | Cytology: 3264x2448 px, Histology: 2048x1536 px |
2023 | |||
| A dataset for nuclei segmentation based on Breast Cancer patients | Breast | H&E | Link | 50 | 11 | 2018 | |||
| 2 million histological images of breast cancer tumors with her2 labels | Breast | H&E, HER2 IHC | Link | 40x, Leica Aperio AT2 Scanner | 2,051,877 image patches | 256 x 256 | 504 | 2023 | |
| Data from: Computational pathology to discriminate benign from malignant intraductal proliferations of the breast | Breast | Link | 116 | 2015 | |||||
| Biological Classification of Breast Cancer by Real-Time Quantitative RTPCR: Comparisons to Microarray and Histopathology | Breast | Link | 123 | 123 | 2006 | ||||
| The Single-Cell Pathology Landscape of Breast Cancer | Breast | Link | 352 / 281 (Total / Survival Data) | 720x720 | 2020 | ||||
| BIOGRID CURATED DATA FOR PUBLICATION: Type and duration of exogenous hormone use affects breast cancer histology. | Link | ||||||||
| Automated Nuclear Pleomorphism Scoring in Breast Cancer | Breast | H&E | Link | 3DHistech P1000 | 118 | 2020 | |||
| unnormalised breast cancer histopathology | Link | ||||||||
| Multi-omic machine learning predictor of breast cancer therapy response | Breast | H&E | Link | 168 | 168 | 2021 | |||
| Curated Breast Imaging Subset of Digital Database for Screening Mammography | Breast | Link | DBA, HOWTEK, LUMISYS | 10239 | 6671 (but there are only 1,566 actual participants in the cohort) |
2017 | |||
| Lung Adenocarcinoma Evolution H&E Pathomic Feature Analysis Dataset | Lung | H&E | Link | 162 slides, 669 ROIs | 98 | 2023 | |||
| Fused Radiology-Pathology Lung Dataset | Lung | Link | 11210 | 6 | 2018 | ||||
| Colsanitas dataset | Breast | H&E | Link | 40X, Roche iScan HT | 2250(600 normal tissues, 250 benign lesions, 250 in situ carcinoma, and 1150 invasive carcinoma) |
2048X1536 pixels | 80 | 2022 | |
| CPM-17 | brain | H&E | Link Link |
20x, 40x | Train: 32, test: 32 | 500x500 to 600x600 | 2019 | ||
| CPM-15 | brain | H&E | Link | 20x, 40x | 15 (2905 nuclei) | 400x400, 600x1000 | |||
| International Cancer Genome Consortium Breast Cancer Histology Images | Breast | H&E | Link | 151 | 2016 | ||||
| CRC-ICM | Colon, Rectum | IHC, H&E | Link 1 Link 2 |
200x, 20x magnification | 1,756 images | 4140 x 3096 2070 x 1548 1280 x 960 |
136 | 2023 |