Image Settings

Enable Image Transformer for Processing of Image Data

Specify whether to use pretrained deep learning models for processing of image data as part of the feature engineering pipeline. When this is enabled, a column of Uniform Resource Identifiers (URIs) to images is converted to a numeric representation using ImageNet-pretrained deep learning models. This is enabled by default.

Supported ImageNet Pretrained Architectures for Image Transformer

Specify the supported ImageNet pretrained architectures for image transformer. Select from the following:

  • densenet121

  • efficientnetb0

  • efficientnetb2

  • inception_v3

  • mobilenetv2

  • resnet34

  • resnet50

  • seresnet50

  • seresnext50

  • xception (Selected by default)

Notes:

  • If an internet connection is available, non-default models are downloaded automatically. If an internet connection is not available, non-default models must be downloaded from http://s3.amazonaws.com/artifacts.h2o.ai/releases/ai/h2o/pretrained/image_models.zip and extracted into ./tmp or tensorflow_image_pretrained_models_dir (specified in the config.toml file).

  • Multiple transformers can be activated at the same time to allow the selection of multiple options. In this case, embeddings from the different architectures are concatenated together (in a single embedding).

Dimensionality of Feature Space Created by Image Transformer

Specify the dimensionality of the feature (embedding) space created by Image Transformer. Select from the following:

  • 10

  • 25

  • 50

  • 100 (Default)

  • 200

  • 300

Note: Multiple transformers can be activated at the same time to allow the selection of multiple options.

Enable Fine-Tuning of the Pretrained Models Used for the Image Transformer

Specify whether to enable fine-tuning of the ImageNet pretrained models used for the Image Transformer. This is disabled by default.

Number of Epochs for Fine-Tuning Used for the Image Transformer

Specify the number of epochs for fine-tuning ImageNet pretrained models used for the Image Transformer. This value defaults to 2.

List of Augmentations for Fine-Tuning Used for the Image Transformer

Specify the list of possible image augmentations to apply while fine-tuning the ImageNet pretrained models used for the Image Transformer. Select from the following:

  • Blur

  • CLAHE

  • Downscale

  • GaussNoise

  • GridDropout

  • HorizontalFlip (Default)

  • HueSaturationValue

  • ImageCompression

  • OpticalDistortion

  • RandomBrightnessContrast

  • RandomRotate90

  • ShiftScaleRotate

  • VerticalFlip

Note: For more information on individual augmentations, see https://albumentations.ai/docs/.

Batch Size for the Image Transformer

Specify the batch size for the Image Transformer. By default, the batch size is set to -1 (selected automatically).

Note: Larger architectures and batch sizes use more memory.

Image Download Timeout in Seconds

When providing images through URLs, specify the maximum number of seconds to wait for an image to download. This value defaults to 60.

Maximum Allowed Fraction of Missing Values for Image Column

Specify the maximum allowed fraction of missing elements in a string column for it to be considered as a potential image path. This value defaults to 0.1.

Minimum Fraction of Images That Need to Be of Valid Types for Image Column to Be Used

Specify the fraction of unique image URIs that need to have valid endings (as defined by string_col_as_image_valid_types) for a string column to be considered as image data. This value defaults to 0.8.

Enable GPU(s) for Faster Transformations With the Image Transformer

Specify whether to use any available GPUs to transform images into embeddings with the Image Transformer. Enabling this setting can lead to significantly faster transformation speeds. This is enabled by default.

Note: This setting only applies when scoring inside Driverless AI or with Py Scoring.