DataRobot Automodel Tool

The DataRobot Automodel Tool allows you to create projects and train models as part of your workflow in Alteryx.

Overview

Configuration

DataRobot Connection Options

DataRobot Project Options

Advanced Options

Notes

Project Types

DataRobot creates different types of projects based on the target column. Those three project types are:

Note: If your target column looks numeric but you want DataRobot to treat it as categorical, we recommend creating a new column with the Alteryx Formula tool that prepends or appends a string to each value. This situation often arises with zip code data, for example.
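The idea behind this note can be sketched in a few lines of Python. This is only an illustration of the transformation, not the Formula tool itself; the function name `as_categorical` and the `zip_` prefix are hypothetical choices.

```python
def as_categorical(values, prefix="zip_"):
    """Prefix every value with a string so it no longer parses as a number.

    The prefix "zip_" is an arbitrary, hypothetical choice; any
    non-numeric string serves the same purpose.
    """
    return [f"{prefix}{value}" for value in values]


def looks_numeric(value):
    """Return True if the value can be read as a number."""
    try:
        float(value)
        return True
    except ValueError:
        return False


zip_codes = ["02139", "94105", "10001"]
labels = as_categorical(zip_codes)
```

After the transformation, none of the values can be interpreted as numbers, so a tool that infers column types from the data has no reason to treat the column as numeric.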

Feature Lists

All columns sent to the tool as input are used in DataRobot model generation and are required as input when requesting predictions. Use the Alteryx Select tool to remove any columns that should not be used in modeling.
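As a rough illustration of what removing columns before modeling amounts to, the following Python sketch drops unwanted fields from a list of records. In a workflow this is the job of the Alteryx Select tool; the function name `drop_columns` and the sample field names are hypothetical.

```python
def drop_columns(rows, unwanted):
    """Return copies of the rows without the unwanted columns."""
    unwanted = set(unwanted)
    return [
        {name: value for name, value in row.items() if name not in unwanted}
        for row in rows
    ]


# Hypothetical sample data: "customer_id" is an identifier, not a predictor.
rows = [
    {"customer_id": 1, "age": 34, "churned": "no"},
    {"customer_id": 2, "age": 51, "churned": "yes"},
]
model_input = drop_columns(rows, ["customer_id"])
```

Whatever columns remain at this point become part of the model and must be supplied again at prediction time.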

Dataset Size

DataRobot requires at least 100 rows of data to start modeling. Advanced modeling analytics (such as reason codes) require even more rows. There is also an upper limit on the amount of data you can use in a DataRobot project, which depends on your license restrictions.
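A simple pre-flight check for the lower bound might look like the sketch below. The 100-row minimum comes from the text above; the function name is hypothetical, and the upper limit is not checked because it depends on your license.

```python
MIN_ROWS = 100  # minimum row count stated in this section


def has_enough_rows(rows, minimum=MIN_ROWS):
    """Return True if the dataset meets the minimum row count."""
    return len(rows) >= minimum


too_small = [{"x": i} for i in range(99)]
just_enough = [{"x": i} for i in range(100)]
```

Running such a check before the data reaches the tool can avoid waiting on an upload that cannot start modeling.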

Immutability

Projects and models in DataRobot are immutable, meaning that few of their attributes can be changed after they are created. The recommended workflow is to create a new project whenever the modeling data needs to change. You can create projects with the same name to simplify automation. Note, however, that project and model reference IDs are unique to each project.

Column Names

In DataRobot, column order does not matter during modeling or when computing predictions. DataRobot does not support a few special characters in column names, but as an Alteryx user you need not worry about them: both the Automodel and Prediction tools silently sanitize the names for you. Keep this in mind when browsing features in the DataRobot Web UI, where some feature names may not match the names you see in Alteryx.
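To make the effect of name sanitization concrete, here is a minimal Python sketch. It is not the tools' actual rule: this page does not list the unsupported characters, so the pattern below (replace anything other than letters, digits, underscores, and spaces with an underscore) is purely an assumption, and the function names are hypothetical.

```python
import re


def sanitize(name):
    """Replace characters outside an assumed safe set with underscores.

    ASSUMPTION: the real set of unsupported characters is not documented
    here; this pattern is only a stand-in for illustration.
    """
    return re.sub(r"[^0-9A-Za-z_ ]", "_", name)


def name_mapping(columns):
    """Map each original column name to its sanitized form."""
    return {column: sanitize(column) for column in columns}


mapping = name_mapping(["price ($)", "zip_code"])
```

Keeping a mapping like this at hand makes it easier to match a feature name seen in the DataRobot Web UI back to the column name used in Alteryx.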

Number of Workers

One of the project settings available in the DataRobot platform is the number of workers to make available to each project. The DataRobot Automodel tool will always use the maximum number of workers allowed for your account.

Inputs

The tool requires a single input. All of the data will be sent to DataRobot to create a single new project and the automated modeling process will begin. One of the fields in the input will be the target of the modeling process, and the rest will be considered valid predictors to use to model the target. The tool will not finish executing until all of the models recommended by DataRobot have been trained and evaluated.

Outputs

An optional output will send the Project ID and Model ID of the best model created in DataRobot to another tool. This can be consumed by the DataRobot Predict tool in lieu of having to specify these parameters manually.

v4.1.5