AutoML Feature Store integration
AutoML can augment the original input dataset with features from feature tables in Unity Catalog or in the legacy Workspace Feature Store.
Requirements
- Classification and regression experiments require Databricks Runtime 11.3 LTS ML and above.
- Forecasting experiments require Databricks Runtime 12.2 LTS ML and above.
Select a feature table using the AutoML UI
After configuring your AutoML experiment, you can select a features table using the following steps:
Click Join features (optional).
On the Join additional features page, select a feature table in the Feature Table field.
For each Feature table primary key, select the corresponding lookup key. The lookup key should be a column in the training dataset you provided for your AutoML experiment.
For time series feature tables, select the corresponding timestamp lookup key. Similarly, the timestamp lookup key should be a column in the training dataset you provided for your AutoML experiment.
To add more feature tables, click Add another feature table and repeat the above steps.
Use feature tables with the AutoML API
To use existing feature tables set the feature_store_lookups
parameter in your AutoML run specification.
feature_store_lookups = [
{
"table_name": "example.trip_pickup_features",
"lookup_key": ["pickup_zip", "rounded_pickup_datetime"],
},
{
"table_name": "example.trip_dropoff_features",
"lookup_key": ["dropoff_zip", "rounded_dropoff_datetime"],
}
]
The following notebook shows how to join feature tables to your training dataset for use with AutoML.