Note

This page was generated from prechipped_datasets.ipynb.

Note

If running outside of the Docker image, you may need to set some environment variables manually. You can do it like so:

import os
from subprocess import check_output

os.environ['GDAL_DATA'] = check_output('pip show rasterio | grep Location | awk \'{print $NF"/rasterio/gdal_data/"}\'', shell=True).decode().strip()

Working with pre-chipped datasets#

It is not uncommon for geospatial datasets to be released in a “pre-chipped” form; i.e., not as large GeoTIFFs, but as non-georeferenced chips/tiles/patches in ordinary image formats such as PNG or JPEG.

In such scenarios, you do not necessarily need to use Raster Vision to read this data and train a model. In fact, you can train a model outside of Raster Vision and then use Raster Vision to run it over GeoTIFFs, as shown in the “Using Raster Vision with Lightning” tutorial.

Nevertheless, Raster Vision is capable of training with ordinary images and this tutorial notebook will walk you through examples for each of the three supported computer vision tasks. Note that this notebook only demonstrates how to read these datasets, but that is all that is needed; once you have instantiated these dataset classes, the rest of the training and prediction procedure is identical to that for non-chipped geo-referenced data.

The `ImageDataset` class#

The ImageDataset is a PyTorch-compatible Dataset implementation that allows reading non-geospatial image datasets.

Just like GeoDataset, it is based on AlbumentationsDataset.

Supported image formats#

Each ImageDataset subclass is capable of reading all image formats supported by pillow (.png, .jp[e]g, .tif[f], .bmp, and more) plus the numpy format, .npy.

Semantic segmentation – `SemanticSegmentationImageDataset`#

The SemanticSegmentationImageDataset (which internally uses a SemanticSegmentationDataReader) expects a path to a directory containing images and a path to a directory containing segmentation masks. Images are matched to their respective label-masks based on their filename (excluding the extension). The masks can be in any supported image format.

Dataset#

For this example, we will use the Extended Optical Remote Sensing Saliency Detection (EORSSD) Dataset.

Zhang, Qijian, Runmin Cong, Chongyi Li, Ming-Ming Cheng, Yuming Fang, Xiaochun Cao, Yao Zhao, and Sam Kwong. “Dense attention fluid network for salient object detection in optical remote sensing images.” IEEE Transactions on Image Processing 30 (2020): 1305-1317.

Below we download (63 MB) and unzip the data.

[4]:

!wget "https://github.com/rmcong/EORSSD-dataset/raw/master/EORSSD.zip" -P "data/"
!apt install unzip -y
!unzip -q "data/EORSSD.zip" -d "data/EORSSD"

[5]:

!ls "data/EORSSD"

test-images  test-labels  train-images  train-labels

Usage#

[7]:

import albumentations as A

from rastervision.pytorch_learner import SemanticSegmentationImageDataset

ds = SemanticSegmentationImageDataset(
    img_dir='data/EORSSD/train-images/',
    label_dir='data/EORSSD/train-labels/',
    transform=A.Resize(256, 256),
)
len(ds)

[7]:

We can read a data sample and the corresponding ground truth from the Dataset like so:

[8]:

x, y = ds[0]
x.shape, y.shape

[8]:

(torch.Size([3, 256, 256]), torch.Size([256, 256]))

And then plot it using the SemanticSegmentationVisualizer:

[9]:

from rastervision.pytorch_learner import SemanticSegmentationVisualizer

viz = SemanticSegmentationVisualizer(
    class_names=['background', 'foreground'], class_colors=['black', 'white'])
viz.plot_batch(x.unsqueeze(0), y.unsqueeze(0), show=True)

../../_images/usage_tutorials_prechipped_datasets_20_0.png

Object detection – `ObjectDetectionImageDataset`#

The ObjectDetectionImageDataset (which internally uses a CocoDataset) expects a path to a directory containing images and a URI to a JSON file containing annotations in the COCO format.

Dataset#

For this example, we will use the Airbus Aircraft Detection dataset.

You will need to manually download the dataset (92 MB) from here: https://www.kaggle.com/datasets/airbusgeo/airbus-aircrafts-sample-dataset.

The cells below assume that the data has been unzipped into the data/airbus/ directory.

[12]:

!ls "data/airbus/"

LICENSE.txt  README.md  annotations.csv  extras  images

Note

This dataset cannot, strictly speaking, be called “pre-chipped” since the individual images are still pretty large and warrant further chipping. Nevertheless, the images are provided as non-georeferenced JPEGs and thus serve the purpose of this tutorial.

Transform annotations into COCO format#

Add a bbox column representing bounding boxes in xywh format and delete the old geometry column.
Add a category_id column representing class IDs.

[13]:

import pandas as pd
from shapely.geometry import Polygon

from rastervision.core.box import Box
from rastervision.pipeline.file_system.utils import json_to_file

[15]:

class_names = ['Airplane', 'Truncated_airplane']
ann_df = pd.read_csv('data/airbus/annotations.csv')
ann_df.loc[:, 'bbox'] = [Box.from_shapely(Polygon(eval(g))).to_xywh() for g in ann_df.geometry]
ann_df = ann_df.drop(columns='geometry')
ann_df.loc[:, 'category_id'] = [class_names.index(c) for c in ann_df['class']]

Convert to JSON and save to file:

[16]:

ann_json = {
    'images': [dict(id=image_id, file_name=image_id) for image_id in ann_df.image_id.unique()],
    'annotations': ann_df.to_dict(orient='records'),
}
json_to_file(ann_json, 'data/airbus/annotations.json')

Usage#

The annotations are now ready to be used:

[18]:

import albumentations as A

from rastervision.pytorch_learner import ObjectDetectionImageDataset

ds = ObjectDetectionImageDataset(
    img_dir='data/airbus/images/',
    annotation_uri='data/airbus/annotations.json',
    transform=A.Resize(1024, 1024),
)
len(ds)

[18]:

We can read a data sample and the corresponding ground truth from the Dataset like so:

[19]:

x, y = ds[0]
x.shape, y

[19]:

(torch.Size([3, 1024, 1024]),
 {'boxes': tensor([[ 54.0000, 208.8000,  98.0000, 240.0000],
         [410.0000, 113.6000, 450.0000, 153.6000],
         [423.2000, 601.2000, 452.0000, 627.2000],
         [325.2000, 607.2000, 354.0000, 641.6000],
         [237.6000, 375.2000, 262.8000, 404.8000],
         [180.4000, 290.0000, 209.6000, 319.2000],
         [617.2000, 574.8000, 645.6000, 598.8000],
         [594.0000, 548.0000, 624.4000, 574.8000],
         [787.2000, 649.6000, 821.2000, 674.8000],
         [663.6000, 669.6000, 693.2000, 698.4000],
         [746.0000, 836.4000, 780.4000, 866.0000],
         [820.8000, 708.8000, 854.0000, 733.6000],
         [858.4000, 812.4000, 890.0000, 840.8000],
         [402.4000, 938.0000, 435.2000, 964.8000],
         [772.4000, 865.2000, 803.6000, 898.8000],
         [492.4000, 612.8000, 518.0000, 642.8000],
         [296.0000, 507.2000, 321.6000, 542.4000],
         [258.0000, 559.6000, 292.0000, 583.2000]]),
  'class_ids': tensor([0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0])})

Note that y is a BoxList.

And then plot it using the ObjectDetectionVisualizer:

[20]:

from rastervision.pytorch_learner import ObjectDetectionVisualizer

viz = ObjectDetectionVisualizer(
    class_names=['Airplane', 'Truncated_airplane'],
    class_colors=['red', 'green'])
viz.scale = 8
viz.plot_batch(x.unsqueeze(0), [y], show=True)

../../_images/usage_tutorials_prechipped_datasets_41_0.png

Classification – `ClassificationImageDataset`#

The ClassificationImageDataset subclasses torchvision’s DatasetFolder and expects a path to a directory containing images in the following structure:

<data_dir>/
    class_1/
        <images>
    class_2/
        <images>
    ...
    class_N/
        <images>

Dataset#

For this example, we will use the EuroSat Dataset.

Helber, Patrick, Benjamin Bischke, Andreas Dengel, and Damian Borth. “Eurosat: A novel dataset and deep learning benchmark for land use and land cover classification.” IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing 12, no. 7 (2019): 2217-2226.

Below we download (94 MB) and unzip the data.

[ ]:

!wget "https://madm.dfki.de/files/sentinel/EuroSAT.zip" -P "data/" --no-check-certificate
!apt install unzip -y
!unzip -q "data/EuroSAT.zip" -d "data/EuroSAT"
!mv "data/EuroSAT/2750/"* "data/EuroSAT/" && rm -rf "data/EuroSAT/2750"

[25]:

!ls "data/EuroSAT"

AnnualCrop  HerbaceousVegetation  Industrial  PermanentCrop  River
Forest      Highway               Pasture     Residential    SeaLake

Usage#

[27]:

import albumentations as A

from rastervision.pytorch_learner import ClassificationImageDataset

ds = ClassificationImageDataset(
    data_dir='data/EuroSAT',
    # You can pass in a list explicitly if you want to enforce a specific
    # class-name to class-ID mapping.
    class_names=None,
    transform=A.Resize(256, 256),
)
len(ds)

[27]:

We can read a data sample and the corresponding ground truth from the Dataset like so:

[28]:

x, y = ds[10_000]
x.shape, y

[28]:

(torch.Size([3, 256, 256]), tensor(3))

And then plot it using the ClassificationVisualizer:

[29]:

from rastervision.pytorch_learner import ClassificationVisualizer

viz = ClassificationVisualizer(class_names=ds.orig_dataset.classes)
viz.plot_batch(x.unsqueeze(0), y.unsqueeze(0), show=True)

../../_images/usage_tutorials_prechipped_datasets_55_0.png

Working with pre-chipped datasets#

The ImageDataset class#

Supported image formats#

Semantic segmentation – SemanticSegmentationImageDataset#

Dataset#

Usage#

Object detection – ObjectDetectionImageDataset#

Dataset#

Transform annotations into COCO format#

Usage#

Classification – ClassificationImageDataset#

Dataset#

Usage#

The `ImageDataset` class#

Semantic segmentation – `SemanticSegmentationImageDataset`#

Object detection – `ObjectDetectionImageDataset`#

Classification – `ClassificationImageDataset`#