Create Semantic Segmentation Table¶

Create a 3LC Table for semantic segmentation tasks using paired images and grayscale mask files from the ADE20k dataset.

Semantic segmentation requires pixel-level classification where each pixel belongs to a specific class. This format is essential for tasks like scene parsing, medical imaging, or autonomous driving where precise spatial understanding is needed.

This notebook demonstrates creating a table from paired image and mask files. We download the ADE20k dataset from HuggingFace Hub and structure it into a 3LC table with properly formatted segmentation annotations using grayscale PNG masks.

Project Setup¶

[ ]:

PROJECT_NAME = "3LC Tutorials - Semantic Segmentation ADE20k"
DATASET_NAME = "ADE20k_toy_dataset"
TABLE_NAME = "ADE20K-semantic-segmentation"
DOWNLOAD_PATH = "../../transient_data"

Install dependencies¶

[ ]:

%pip install 3lc
%pip install git+https://github.com/3lc-ai/3lc-examples.git
%pip install huggingface-hub

Imports¶

[ ]:

import json
from pathlib import Path

import tlc
from huggingface_hub import hf_hub_download

from tlc_tools.common import download_and_extract_zipfile

Download the dataset¶

[ ]:

DATASET_ROOT = (Path(DOWNLOAD_PATH) / "ADE20k_toy_dataset").resolve()

if not DATASET_ROOT.exists():
    print("Downloading data...")
    download_and_extract_zipfile(
        url="https://www.dropbox.com/s/l1e45oht447053f/ADE20k_toy_dataset.zip?dl=1",
        location=DOWNLOAD_PATH,
    )

Fetch the label map from the Hugging Face Hub¶

[ ]:

# load id2label mapping from a JSON on the hub
with open(
    hf_hub_download(
        repo_id="huggingface/label-files",
        filename="ade20k-id2label.json",
        repo_type="dataset",
    )
) as f:
    id2label = json.load(f)

categories = list(id2label.values())

[ ]:

value_map = {i + 1: tlc.MapElement(category) for i, category in enumerate(categories)}
value_map[0] = tlc.MapElement("background", display_color="#00000000")  # Set transparent background

Load the images and segmentation maps¶

[ ]:

image_paths = list(sorted(DATASET_ROOT.glob("**/images/training/*.jpg")))
segmentation_map_paths = list(sorted(DATASET_ROOT.glob("**/annotations/training/*.png")))

[ ]:

# Call .to_relative() to ensure aliases are applied
image_paths = [tlc.Url(p).to_relative().to_str() for p in image_paths]
mask_paths = [tlc.Url(p).to_relative().to_str() for p in segmentation_map_paths]
print(image_paths[0])

Write the instance segmentation masks to a table¶

[ ]:

table = tlc.Table.from_dict(
    data={
        "image": image_paths,
        "mask": mask_paths,
    },
    structure=(tlc.PILImage("image"), tlc.SegmentationPILImage("mask", classes=value_map)),
    table_name=TABLE_NAME,
    dataset_name=DATASET_NAME,
    project_name=PROJECT_NAME,
)

[ ]:

image, mask = table[0]

[ ]:

image

[ ]:

mask