Projects - List Datasets

Gets the list of datasets for specified project.

GET {endpoint}/speechtotext/v3.2-preview.2/projects/{id}/datasets
GET {endpoint}/speechtotext/v3.2-preview.2/projects/{id}/datasets?skip={skip}&top={top}&filter={filter}

URI Parameters

Name In Required Type Description
endpoint
path True

string

Supported Cognitive Services endpoints (protocol and hostname, for example: https://westus.api.cognitive.microsoft.com).

id
path True

string

uuid

The identifier of the project.

filter
query

string

A filtering expression for selecting a subset of the available datasets.

        - Supported properties: displayName, description, createdDateTime, lastActionDateTime, status, locale, kind.

        - Operators:

          - eq, ne are supported for all properties.

          - gt, ge, lt, le are supported for createdDateTime and lastActionDateTime.

          - and, or, not are supported.

        - Example:

          filter=createdDateTime gt 2022-02-01T11:00:00Z
skip
query

integer

int32

Number of datasets that will be skipped.

top
query

integer

int32

Number of datasets that will be included after skipping.

Responses

Name Type Description
200 OK

PaginatedDatasets

OK

Headers

Retry-After: integer

Other Status Codes

Error

An error occurred.

Security

Ocp-Apim-Subscription-Key

Provide your cognitive services account key here.

Type: apiKey
In: header

Authorization

Provide an access token from the JWT returned by the STS of this region. Make sure to add the management scope to the token by adding the following query string to the STS URL: ?scope=speechservicesmanagement

Type: apiKey
In: header

Examples

Get all datasets

Sample request

GET {endpoint}/speechtotext/v3.2-preview.2/projects/abc46f33-90b8-443d-adad-4870517356e0/datasets?skip=0&top=2&filter=createdDateTime%20ge%202018-01-24T09:54:39Z

Sample response

{
  "values": [
    {
      "self": "https://westus.api.cognitive.microsoft.com/speechtotext/v3.2-preview.2/datasets/9d5f4100-5f8e-4dd6-bd83-9bbbf50d57f1",
      "kind": "Acoustic",
      "contentUrl": "https://www.contoso.com/acousticdata/sourcelocation",
      "links": {
        "files": "https://westus.api.cognitive.microsoft.com/speechtotext/v3.2-preview.2/datasets/9d5f4100-5f8e-4dd6-bd83-9bbbf50d57f1/files"
      },
      "properties": {
        "textNormalizationKind": "Default",
        "acceptedLineCount": 11,
        "rejectedLineCount": 2,
        "duration": "PT4M12S"
      },
      "lastActionDateTime": "2019-01-07T11:36:07Z",
      "status": "Succeeded",
      "createdDateTime": "2019-01-07T11:34:12Z",
      "locale": "en-US",
      "displayName": "Acoustic dataset"
    },
    {
      "self": "https://westus.api.cognitive.microsoft.com/speechtotext/v3.2-preview.2/datasets/cdc91c24-3972-496d-a12f-209c35f4cc6c",
      "kind": "Language",
      "contentUrl": "https://www.contoso.com/LanguageData/SourceLocation",
      "links": {
        "files": "https://westus.api.cognitive.microsoft.com/speechtotext/v3.2-preview.2/datasets/cdc91c24-3972-496d-a12f-209c35f4cc6c/files"
      },
      "properties": {
        "textNormalizationKind": "Default",
        "acceptedLineCount": 11,
        "rejectedLineCount": 2
      },
      "lastActionDateTime": "2019-01-07T11:36:07Z",
      "status": "Succeeded",
      "createdDateTime": "2019-01-07T11:34:12Z",
      "locale": "en-US",
      "displayName": "Language dataset"
    }
  ],
  "@nextLink": "https://westus.api.cognitive.microsoft.com/speechtotext/v3.2-preview.2/datasets?skip=2&top=2&filter=createdDateTime%20ge%202018-01-24T09:54:39Z"
}

Definitions

Name Description
Dataset

Dataset

DatasetKind

DatasetKind

DatasetLinks

DatasetLinks

DatasetProperties

DatasetProperties

DetailedErrorCode

DetailedErrorCode

EntityError

EntityError

EntityReference

EntityReference

Error

Error

ErrorCode

ErrorCode

InnerError

InnerError

PaginatedDatasets

PaginatedDatasets

Status

Status

TextNormalizationKind

TextNormalizationKind

Dataset

Dataset

Name Type Description
contentUrl

string

The URL of the data for the dataset.

createdDateTime

string

The time-stamp when the object was created. The time stamp is encoded as ISO 8601 date and time format ("YYYY-MM-DDThh:mm:ssZ", see https://en.wikipedia.org/wiki/ISO_8601#Combined_date_and_time_representations).

customProperties

object

The custom properties of this entity. The maximum allowed key length is 64 characters, the maximum allowed value length is 256 characters and the count of allowed entries is 10.

description

string

The description of the object.

displayName

string

The display name of the object.

kind

DatasetKind

DatasetKind
Type of data import.

lastActionDateTime

string

The time-stamp when the current status was entered. The time stamp is encoded as ISO 8601 date and time format ("YYYY-MM-DDThh:mm:ssZ", see https://en.wikipedia.org/wiki/ISO_8601#Combined_date_and_time_representations).

links

DatasetLinks

DatasetLinks

locale

string

The locale of the contained data.

project

EntityReference

EntityReference

properties

DatasetProperties

DatasetProperties

self

string

The location of this entity.

status

Status

Status
Describe the current state of the API

DatasetKind

DatasetKind

Value Description
Acoustic

An acoustic dataset.

AudioFiles

An audio files dataset.

Language

A language dataset.

LanguageMarkdown

A language markdown dataset.

OutputFormatting

Dataset that contains rules to customize inverse text normalization, capitalization, reformulation, profanity and also defines tests for dataset validation

Pronunciation

A pronunciation dataset.

DatasetLinks

Name Type Description
commitBlocks

string

The location to commit the list of blocks when uploading a dataset using blocks. See operation "Datasets_CommitBlocks" for more details.

files

string

The location to get all files of this entity. See operation "Datasets_ListFiles" for more details.

listBlocks

string

The location to list the already uploaded blocks of this entity when uploading a dataset using blocks. See operation "Datasets_GetBlocks" for more details.

uploadBlocks

string

The location to upload blocks to when uploading a dataset using blocks. See operation "Datasets_UploadBlock" for more details.

DatasetProperties

DatasetProperties

Name Type Description
acceptedLineCount

integer

The number of lines accepted for this data set.

duration

string

The total duration of the datasets if it contains audio files. The duration is encoded as ISO 8601 duration ("PnYnMnDTnHnMnS", see https://en.wikipedia.org/wiki/ISO_8601#Durations).

email

string

The email address to send email notifications to in case the operation completes. The value will be removed after successfully sending the email.

error

EntityError

EntityError

rejectedLineCount

integer

The number of lines rejected for this data set.

textNormalizationKind

TextNormalizationKind

TextNormalizationKind
The kind of text normalization.

DetailedErrorCode

DetailedErrorCode

Value Description
DataImportFailed

Data import failed.

DeleteNotAllowed

Delete not allowed.

DeployNotAllowed

Deploy not allowed.

DeployingFailedModel

Deploying failed model.

EmptyRequest

Empty Request.

EndpointCannotBeDefault

Endpoint cannot be default.

EndpointNotUpdatable

Endpoint not updatable.

EndpointWithoutLogging

Endpoint without logging.

ExceededNumberOfRecordingsUris

Exceeded number of recordings uris.

FailedDataset

Failed dataset.

Forbidden

Forbidden.

InUseViolation

In use violation.

InaccessibleCustomerStorage

Inaccessible customer storage.

InvalidAdaptationMapping

Invalid adaptation mapping.

InvalidBaseModel

Invalid base model.

InvalidCallbackUri

Invalid callback uri.

InvalidCollection

Invalid collection.

InvalidDataset

Invalid dataset.

InvalidDocument

Invalid Document.

InvalidDocumentBatch

Invalid Document Batch.

InvalidLocale

Invalid locale.

InvalidLogDate

Invalid log date.

InvalidLogEndTime

Invalid log end time.

InvalidLogId

Invalid log id.

InvalidLogStartTime

Invalid log start time.

InvalidModel

Invalid model.

InvalidModelUri

Invalid model uri.

InvalidParameter

Invalid parameter.

InvalidParameterValue

Invalid parameter value.

InvalidPayload

Invalid payload.

InvalidPermissions

Invalid permissions.

InvalidPrerequisite

Invalid prerequisite.

InvalidProductId

Invalid product id.

InvalidProject

Invalid project.

InvalidProjectKind

Invalid project kind.

InvalidRecordingsUri

Invalid recordings uri.

InvalidRequestBodyFormat

Invalid request body format.

InvalidSasValidityDuration

Invalid sas validity duration.

InvalidSkipTokenForLogs

Invalid skip token for logs.

InvalidSourceAzureResourceId

Invalid source Azure resource ID.

InvalidSubscription

Invalid subscription.

InvalidTest

Invalid test.

InvalidTimeToLive

Invalid time to live.

InvalidTopForLogs

Invalid top for logs.

InvalidTranscription

Invalid transcription.

InvalidWebHookEventKind

Invalid web hook event kind.

MissingInputRecords

Missing Input Records.

ModelCopyOperationExists

Model copy operation exists.

ModelDeploymentNotCompleteState

Model deployment not complete state.

ModelDeprecated

Model deprecated.

ModelExists

Model exists.

ModelMismatch

Model mismatch.

ModelNotDeployable

Model not deployable.

ModelVersionIncorrect

Model Version Incorrect.

NoUtf8WithBom

No utf8 with bom.

OnlyOneOfUrlsOrContainerOrDataset

Only one of urls or container or dataset.

ProjectGenderMismatch

Project gender mismatch.

QuotaViolation

Quota violation.

SingleDefaultEndpoint

Single default endpoint.

SkuLimitsExist

Sku limits exist.

SubscriptionNotFound

Subscription not found.

UnexpectedError

Unexpected error.

UnsupportedClassBasedAdaptation

Unsupported class based adaptation.

UnsupportedDelta

Unsupported delta.

UnsupportedDynamicConfiguration

Unsupported dynamic configuration.

UnsupportedFilter

Unsupported filter.

UnsupportedLanguageCode

Unsupported language code.

UnsupportedOrderBy

Unsupported order by.

UnsupportedPagination

Unsupported pagination.

UnsupportedTimeRange

Unsupported time range.

EntityError

EntityError

Name Type Description
code

string

The code of this error.

message

string

The message for this error.

EntityReference

EntityReference

Name Type Description
self

string

The location of the referenced entity.

Error

Error

Name Type Description
code

ErrorCode

ErrorCode
High level error codes.

details

Error[]

Additional supportive details regarding the error and/or expected policies.

innerError

InnerError

InnerError
New Inner Error format which conforms to Cognitive Services API Guidelines which is available at https://microsoft.sharepoint.com/%3Aw%3A/t/CognitiveServicesPMO/EUoytcrjuJdKpeOKIK_QRC8BPtUYQpKBi8JsWyeDMRsWlQ?e=CPq8ow. This contains required properties ErrorCode, message and optional properties target, details(key value pair), inner error(this can be nested).

message

string

High level error message.

target

string

The source of the error. For example it would be "documents" or "document id" in case of invalid document.

ErrorCode

ErrorCode

Value Description
Conflict

Representing the conflict error code.

Forbidden

Representing the forbidden error code.

InternalCommunicationFailed

Representing the internal communication failed error code.

InternalServerError

Representing the internal server error error code.

InvalidArgument

Representing the invalid argument error code.

InvalidRequest

Representing the invalid request error code.

NotAllowed

Representing the not allowed error code.

NotFound

Representing the not found error code.

PipelineError

Representing the pipeline error error code.

ServiceUnavailable

Representing the service unavailable error code.

TooManyRequests

Representing the too many requests error code.

Unauthorized

Representing the unauthorized error code.

UnprocessableEntity

Representing the unprocessable entity error code.

UnsupportedMediaType

Representing the unsupported media type error code.

InnerError

InnerError

Name Type Description
code

DetailedErrorCode

DetailedErrorCode
Detailed error code enum.

details

object

Additional supportive details regarding the error and/or expected policies.

innerError

InnerError

InnerError
New Inner Error format which conforms to Cognitive Services API Guidelines which is available at https://microsoft.sharepoint.com/%3Aw%3A/t/CognitiveServicesPMO/EUoytcrjuJdKpeOKIK_QRC8BPtUYQpKBi8JsWyeDMRsWlQ?e=CPq8ow. This contains required properties ErrorCode, message and optional properties target, details(key value pair), inner error(this can be nested).

message

string

High level error message.

target

string

The source of the error. For example it would be "documents" or "document id" in case of invalid document.

PaginatedDatasets

PaginatedDatasets

Name Type Description
@nextLink

string

A link to the next set of paginated results if there are more entities available; otherwise null.

values

Dataset[]

A list of entities limited by either the passed query parameters 'skip' and 'top' or their default values.

When iterating through a list using pagination and deleting entities in parallel, some entities will be skipped in the results. It's recommended to build a list on the client and delete after the fetching of the complete list.

Status

Status

Value Description
Failed

The long running operation has failed.

NotStarted

The long running operation has not yet started.

Running

The long running operation is currently processing.

Succeeded

The long running operation has successfully completed.

TextNormalizationKind

TextNormalizationKind

Value Description
Default

Default text normalization (e.g. '2 to 3' is replaced by 'two to three' in en-US).

None

No text normalization will be applied to the input text. This is an override option that should only be used when text is normalized before the upload.