Skip to content

Latest commit

 

History

History
63 lines (53 loc) · 3.96 KB

data_preparation_instruction.md

File metadata and controls

63 lines (53 loc) · 3.96 KB

Data preparation instruction

For dicom datasets you will need pydicom and gdcmconv libraries that are only available in Anaconda.

1. CT

  • CT-ORG

    1. Download CT-ORG dataset from cancerImagingarchive
    2. Use convertCT_ORG.py, insert path to your dataset in path (line 11)
  • MosMedData

    1. Download Chest CT Scans with COVID-19 dataset from mosmed.ai
    2. Use converterMosMedAiCOVID.py , insert path to your dataset in path (line 11)
  • KITS-19

    1. Download KITS-19 dataset according to instruction included in Kits repository
    2. Use converterKITS19.py, insert path to your dataset in path (line 11), in (line 89) add path to kits.json file
  • LIDC-IDRI

    1. Download LIDC-IDRI dataset from cancerImagingarchive
    2. delete files that don't match the others (XRAYS)
    3. use retrieve_data_from_xml.py to create nodule masks from xmls, add path to directory (line 150)
    4. use convertLIDC.py, insert path to your pictures in path (line 16), and path to masks (line 18)

2. MRI

  • QIN-BRAIN-DSC-MRI

    1. Download QIN-BRAIN-DSC-MRI dataset from cancerImagingarchive
    2. Use converterQIN_BRAIN_MRI.py, insert path to your dataset in path (line 11)
  • Brain Tumor Classification (MRI)

    1. Download Brain Tumor Classification (MRI) dataset from kaggle.com
    2. Use converterBrain_Tumor_Classification_MRI.py, insert path to your dataset in path (line 6)
  • Brain-Tumor-Progression

    1. Download Brain-Tumor-Progression dataset from cancerImagingarchive
    2. Use converterBrainTumorProgression.py, insert path to your dataset in path (line 10)

3. X-Ray