This dataset consists of 25,000 color images, categorized into five distinct classes with 5,000 images per class. Each image is 768 x 768 pixels in size and stored in JPEG format. The main directory, lung_colon_image_set, contains two subdirectories: colon_image_sets and lung_image_sets.
-
The
colon_image_setsfolder includes:colon_aca: 5,000 images of colon adenocarcinoma tissues.colon_n: 5,000 images of benign colon tissues.
-
The
lung_image_setsfolder includes:lung_aca: 5,000 images of lung adenocarcinoma tissues.lung_scc: 5,000 images of lung squamous cell carcinoma tissues.lung_n: 5,000 images of benign lung tissues.
We processed each cancer type individually and used the splitfolders function to split the dataset into training (70%), testing (20%), and validation (10%) sets.
Dataset source: Kaggle - Lung and Colon Cancer Histopathological Images
-
Install the required packages by running the following command:
pip install -r requirements.txt
-
Navigate to the directory where the
server.pyfile is located:cd /path/to/server.py -
Start the server:
python server.py
-
Access the API documentation in your browser:


