实验设置¶

本节介绍可用于自定义实验的设置，例如总运行时、再现性级别、管道构建、特征大脑控制、添加 config.toml 设置等。

`max_runtime_minutes`¶

`max_runtime_minutes_until_abort`¶

`pipeline-building-recipe`¶

Pipeline Building Recipe

指定“管道构建”插件类型（覆盖 GUI 设置）。从以下类型中选择：

自动：指定所有模型和特征均由实验设置、config.toml 设置和特征工程工作量自动确定。（默认）
服从：与自动类型相似，除了以下设置：
- 将可解释性设置为 10。
- 仅使用 GLM 或增强器作为“giblinear”
- 将 Fixed ensemble level 设置为 0。
- 将 Feature brain level 设置为 0。
- 将最大特征交互深度设置为 1，即没有交互。
- 将目标转换器设置为 ’identity’ 以进行回归。
- 不使用 distribution shift 检测。
- 将 monotonicity_constraints_correlation_threshold 设置为 0。
Monotonic_gbm ：与自动类型相似，除了以下设置：
- 启用单调性约束
- 仅使用 LightGBM 模型。
- 删除与目标的不相关性至少为 0.01 的特征。请参阅 monotonicity-constraints-drop-low-correlation-features 和 monotonicity-constraints-correlation-threshold.
- 不构建集成模型，即设置 fixed_ensemble_level=0
- 不使用 feature brain 来确保每次重启均相同。
- 将 Interaction depth 设置为 1，即不进行多特征交互以避免复杂性。
- 不将目标转换应用于回归问题，即将 target_transformer 设置为 ‘identity’. 等效的 config.toml 参数为 recipe=['monotonic_gbm'].
- 禁用 num_as_cat 特征转换。
- 所包含的转换器列表
“OriginalTransformer”，#数值（无聚类、无交互、无 num->cat）

‘CatOriginalTransformer’, ‘RawTransformer’,’CVTargetEncodeTransformer’, ‘FrequentTransformer’,’WeightOfEvidenceTransformer’,’OneHotEncodingTransformer’, #分类（但是无 num-cat）

‘CatTransformer’,’StringConcatTransformer’, # 仅适用于大数据

‘DateOriginalTransformer’, ‘DateTimeOriginalTransformer’, ‘DatesTransformer’, ‘DateTimeDiffTransformer’, ‘IsHolidayTransformer’, ‘LagsTransformer’, ‘EwmaLagsTransformer’, ‘LagsInteractionTransformer’, ‘LagsAggregatesTransformer’,#日期/时间

‘TextOriginalTransformer’, ‘TextTransformer’, ‘StrFeatureTransformer’, ‘TextCNNTransformer’, ‘TextBiGRUTransformer’, ‘TextCharCNNTransformer’, ‘BERTTransformer’,#文本

‘ImageOriginalTransformer’, ‘ImageVectorizerTransformer’] #映像

相关参考，请参阅 Monotonicity Constraints in Driverless AI.

Kaggle：与自动类型相似，除了以下设置：
- 任何外部验证集均将与训练集串联，并且目标被标记为缺失。
- 测试集与训练集串联，并且目标被标记为缺失。
- 不使用此目标的转换器将被允许对整个训练集、验证集和测试集进行 fit_transform`.
- 有几项 config.toml 专家选项启用限制。
nlp_model：仅启用基于 Pytorch 的 NLP BERT 模型，以处理纯文本：
- included_models = bert_models [‘TextBERTModel’, ‘TextXLNETModel’, ‘TextXLMModel’,’TextRoBERTaModel’, ‘TextDistilBERTModel’, ‘TextALBERTModel’, ‘TextCamemBERTModel’, ‘TextXLMRobertaModel’]
- enable_pytorch_nlp = ‘on’

更多信息，请参阅 Driverless AI 中的 NLP

nlp_transformer：仅启用基于 Pytorch、用于处理纯文本的 BERT 转换器：
- included_transformers = [‘BERTTransformer’]
- excluded_models = bert_models
- enable_pytorch_nlp = ‘on’

更多信息，请参阅 Driverless AI 中的 NLP

image_model：仅启用用于处理纯映像的映像模型 (ImageAutoModel)。更多信息，请参阅自动图像模型.
请注意：
- 此选项禁用遗传算法 (GA)。
- 仅在选择此选项时可使用映像见解。
Image_transformer：仅启用用于处理纯映像的 ImageVectorizer 转换器。更多信息，请参阅嵌入向量转换器 (Image Vectorizer).

`enable_genetic_algorithm`¶

`tournament_style`¶

`make_python_scoring_pipeline`¶

`make_mojo_scoring_pipeline`¶

`reduce_mojo_size`¶

`benchmark_mojo_latency`¶

`mojo_building_timeout`¶

`mojo_building_parallelism`¶

`make_pipeline_visualization`¶

`make_autoreport`¶

`min_num_rows`¶

`kaggle_username`¶

`kaggle_key`¶

`kaggle_timeout`¶

`reproducibility_level`¶

`seed`¶

`allow_different_classes_across_fold_splits`¶

`max_num_classes`¶

`max_num_classes_compute_roc`¶

`max_num_classes_client_and_gui`¶

`roc_reduce_type`¶

`feature_brain1`¶

`feature_brain2`¶

`feature_brain3`¶

`feature_brain4`¶

`feature_brain5`¶

`force_model_restart_to_defaults`¶

`min_dai_iterations`¶

`target_transformer`¶

`fixed_num_folds_evolution`¶

`fixed_num_folds`¶

`fixed_only_first_fold_model`¶

`feature_evolution_data_size`¶

`final_pipeline_data_size`¶

`max_validation_to_training_size_ratio_for_final_ensemble`¶

`force_stratified_splits_for_imbalanced_threshold_binary`¶

`mli_custom`¶

`last_recipe`¶

`time_abort`¶

实验设置¶

max_runtime_minutes¶

max_runtime_minutes_until_abort¶

pipeline-building-recipe¶

enable_genetic_algorithm¶

tournament_style¶

make_python_scoring_pipeline¶

make_mojo_scoring_pipeline¶

reduce_mojo_size¶

benchmark_mojo_latency¶

mojo_building_timeout¶

mojo_building_parallelism¶

make_pipeline_visualization¶

make_autoreport¶

min_num_rows¶

kaggle_username¶

kaggle_key¶

kaggle_timeout¶

reproducibility_level¶

seed¶

allow_different_classes_across_fold_splits¶

max_num_classes¶

max_num_classes_compute_roc¶

max_num_classes_client_and_gui¶

roc_reduce_type¶

feature_brain1¶

feature_brain2¶

feature_brain3¶

feature_brain4¶

feature_brain5¶

force_model_restart_to_defaults¶

min_dai_iterations¶

target_transformer¶

fixed_num_folds_evolution¶

fixed_num_folds¶

fixed_only_first_fold_model¶

feature_evolution_data_size¶

final_pipeline_data_size¶

max_validation_to_training_size_ratio_for_final_ensemble¶

force_stratified_splits_for_imbalanced_threshold_binary¶

mli_custom¶

last_recipe¶

time_abort¶

`max_runtime_minutes`¶

`max_runtime_minutes_until_abort`¶

`pipeline-building-recipe`¶

`enable_genetic_algorithm`¶

`tournament_style`¶

`make_python_scoring_pipeline`¶

`make_mojo_scoring_pipeline`¶

`reduce_mojo_size`¶

`benchmark_mojo_latency`¶

`mojo_building_timeout`¶

`mojo_building_parallelism`¶

`make_pipeline_visualization`¶

`make_autoreport`¶

`min_num_rows`¶

`kaggle_username`¶

`kaggle_key`¶

`kaggle_timeout`¶

`reproducibility_level`¶

`seed`¶

`allow_different_classes_across_fold_splits`¶

`max_num_classes`¶

`max_num_classes_compute_roc`¶

`max_num_classes_client_and_gui`¶

`roc_reduce_type`¶

`feature_brain1`¶

`feature_brain2`¶

`feature_brain3`¶

`feature_brain4`¶

`feature_brain5`¶

`force_model_restart_to_defaults`¶

`min_dai_iterations`¶

`target_transformer`¶

`fixed_num_folds_evolution`¶

`fixed_num_folds`¶

`fixed_only_first_fold_model`¶

`feature_evolution_data_size`¶

`final_pipeline_data_size`¶

`max_validation_to_training_size_ratio_for_final_ensemble`¶

`force_stratified_splits_for_imbalanced_threshold_binary`¶

`mli_custom`¶

`last_recipe`¶

`time_abort`¶