CLI(v2) 기능 집합 YAML 스키마

아티클
09/01/2024

참고 항목

이 문서에 자세히 설명된 YAML 구문은 최신 버전의 ML CLI v2 확장에 대한 JSON 스키마를 기반으로 합니다. 이 구문은 최신 버전의 ML CLI v2 확장에서만 작동하도록 보장됩니다. https://azuremlschemasprod.azureedge.net/에서 이전 확장 버전에 대한 스키마를 찾을 수 있습니다.

YAML 구문

키	형식	설명	허용된 값	Default value
$schema	string	YAML 스키마입니다. Azure Machine Learning VS Code 확장을 사용하여 YAML 파일을 제작하는 경우 파일 맨 위에 $schema를 포함하여 스키마 및 리소스 완료를 호출할 수 있습니다.
name	string	필수입니다. 기능 집합 이름입니다.
version	string	필수입니다. 기능 집합 버전입니다.
description	string	기능 집합 설명입니다.
specification	개체	필수입니다. 기능 집합 사양입니다.
specification.path	string	필수 로컬 기능 집합 사양 폴더의 경로입니다.
엔터티	개체(문자열 목록)	필수입니다. 이 기능 집합이 연결된 엔터티입니다.
stage(단계)	string	기능 집합 단계입니다.	개발, 프로덕션, 보관	개발
tags	개체	기능 집합에 대한 태그 사전입니다.
materialization_settings	개체	기능 집합 구체화 설정입니다.
materialization_settings.offline_enabled	부울 값	기능값을 오프라인 스토리지로 구체화하는 기능이 사용하도록 설정되어 있는지 여부를 나타냅니다.	True, False
materialization_settings.schedule	개체	구체화 일정입니다. CLI(v2) 일정 YAML 스키마를 참조하세요.
materialization_settings.schedule.frequency	string	일정이 구성된 경우 필수입니다. 되풀이 일정의 빈도를 설명하는 열거형입니다.	일, 시, 분, 주, 월	Day
materialization_settings.schedule.interval	정수	일정이 구성된 경우 필수입니다. 되풀이 작업 사이의 간격입니다.
materialization_settings.schedule.time_zone	string	일정 트리거 표준 시간대입니다.		UTC
materialization_settings.schedule.start_time	string	일정 트리거 시간입니다.
materialization_settings.notification	개체	구체화 알림 설정입니다.
materialization_settings.notification.email_on	개체(문자열 목록)	알림이 구성된 경우 필수입니다. 작업 상태가 이 설정과 일치하면 이메일 알림이 전송됩니다.	JobFailed, JobCompleted, JobCancelled.
materialization_settings.notification.emails	개체(문자열 목록)	알림이 구성된 경우 필수입니다. 알림이 전송되는 이메일 주소입니다.
materialization_settings.resource	개체	구체화 작업에 사용되는 Azure Machine Learning Spark 컴퓨팅 리소스입니다.
materialization_settings.resource.instance_type	string	Azure Machine Learning Spark 컴퓨팅 인스턴스 형식입니다.	Standard_E4s_v3, Standard_E8s_v3, Standard_E16s_v3, Standard_E32s_v3, Standard_E64s_v3. 지원되는 형식의 업데이트된 목록을 보려면 Azure Machine Learning에서 Apache Spark를 사용한 대화형 데이터 랭글링(미리 보기)을 참조하세요.
materialization_settings.spark_configuration	사전	Spark 구성 사전

설명

기능 집합을 관리하는 데 az ml feature-set 명령을 사용할 수 있습니다.

예제

예제는 예제 GitHub 리포지토리에서 사용할 수 있습니다. 몇 가지는 다음과 같습니다.

YAML: 기본

$schema: http://azureml/sdk-2-0/Featureset.json

name: transactions
version: "1"
description: 7-day and 3-day rolling aggregation of transactions featureset
specification:
  path: ./spec # path to feature set specification folder. Can be local (absolute path or relative path to current location) or cloud uri. Contains FeatureSetSpec.yaml + transformation code
entities: # entities associated with this feature-set
  - azureml:account:1
stage: Development

YAML: 구체화 구성 포함

name: transactions
version: "1"
description: 7-day and 3-day rolling aggregation of transactions featureset
specification:
  path: ./spec # path to feature set specification folder. Can be local (absolute path or relative path to current location) or cloud uri. Contains FeatureSetSpec.yaml + transformation code
entities: # entities associated with this feature-set
  - azureml:account:1
stage: Development
materialization_settings:
    offline_enabled: True
    schedule: # we use existing definition of schedule under job with some constraints. Recurrence pattern will not be supported.
        type: recurrence  # Only recurrence type would be supported
        frequency: Day # Only support Day and Hour
        interval: 1 #every day
        time_zone: "Pacific Standard Time"
    notification: 
        email_on:
        - JobFailed
        emails:
        - alice@microsoft.com

    resource:
        instance_type: Standard_E8S_V3
    spark_configuration:
        spark.driver.cores: 4
        spark.driver.memory: 36g
        spark.executor.cores: 4
        spark.executor.memory: 36g
        spark.executor.instances: 2

다음을 통해 공유

CLI(v2) 기능 집합 YAML 스키마

YAML 구문

설명

예제

YAML: 기본

YAML: 구체화 구성 포함

다음 단계

피드백

추가 리소스