StartAwsTranscribeJob 2.0.0

Bundle
org.apache.nifi | nifi-aws-nar
Description
Triggers an AWS Transcribe job. It should be followed by the GetAwsTranscribeJobStatus processor in order to monitor the job status.
Tags
AWS, Amazon, ML, Machine Learning, Transcribe
Input Requirement
Supports Sensitive Dynamic Properties
false
  • Additional Details for StartAwsTranscribeJob 2.0.0

    Amazon Transcribe

    Automatically convert speech to text

    Usage

    The Amazon ML processors call AWS ML services as described in the official AWS API Reference. Example JSON payloads can be found in the Request Syntax sections of that documentation. For more details, please check the official Transcribe API reference.

    This processor triggers an asynchronous startTranscriptionJob call to the AWS Transcribe service. The JSON payload can be defined as a property or provided as the FlowFile content; if both are present, the property takes precedence. After the job is triggered, the serialized JSON response is written to the output FlowFile. The awsTaskId attribute is also populated, which makes it easier to query the job status with the corresponding GetAwsTranscribeJobStatus processor.
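
The precedence rule described above (property first, FlowFile content second) can be pictured with a short Python sketch. This is only an illustration of the documented behavior, not the processor's actual code: the function name `resolve_payload` and its parameters are hypothetical, and treating a blank property as "not defined" is an assumption.

```python
import json
from typing import Optional


def resolve_payload(property_value: Optional[str], flowfile_content: Optional[str]) -> dict:
    """Pick the JSON payload: the property wins, FlowFile content is the fallback.

    Hypothetical helper for illustration; it is not part of NiFi.
    """
    # Assumption: a blank or unset property counts as "not defined".
    if property_value is not None and property_value.strip():
        source = property_value
    elif flowfile_content is not None and flowfile_content.strip():
        source = flowfile_content
    else:
        raise ValueError("no JSON payload in the property or the FlowFile content")
    return json.loads(source)


# Both sources are present, so the property value is chosen.
from_property = '{"TranscriptionJobName": "from-property"}'
from_content = '{"TranscriptionJobName": "from-content"}'
chosen = resolve_payload(from_property, from_content)
print(chosen["TranscriptionJobName"])
```

When the property is unset, the same call falls back to the FlowFile content, and it raises an error if neither source carries a payload.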

    JSON payload template. Optional fields can be omitted to simplify the payload; check the AWS documentation for more details:

    {
      "ContentRedaction": {
        "PiiEntityTypes": [
          "string"
        ],
        "RedactionOutput": "string",
        "RedactionType": "string"
      },
      "IdentifyLanguage": boolean,
      "IdentifyMultipleLanguages": boolean,
      "JobExecutionSettings": {
        "AllowDeferredExecution": boolean,
        "DataAccessRoleArn": "string"
      },
      "KMSEncryptionContext": {
        "string": "string"
      },
      "LanguageCode": "string",
      "LanguageIdSettings": {
        "string": {
          "LanguageModelName": "string",
          "VocabularyFilterName": "string",
          "VocabularyName": "string"
        }
      },
      "LanguageOptions": [
        "string"
      ],
      "Media": {
        "MediaFileUri": "string",
        "RedactedMediaFileUri": "string"
      },
      "MediaFormat": "string",
      "MediaSampleRateHertz": number,
      "ModelSettings": {
        "LanguageModelName": "string"
      },
      "OutputBucketName": "string",
      "OutputEncryptionKMSKeyId": "string",
      "OutputKey": "string",
      "Settings": {
        "ChannelIdentification": boolean,
        "MaxAlternatives": number,
        "MaxSpeakerLabels": number,
        "ShowAlternatives": boolean,
        "ShowSpeakerLabels": boolean,
        "VocabularyFilterMethod": "string",
        "VocabularyFilterName": "string",
        "VocabularyName": "string"
      },
      "Subtitles": {
        "Formats": [
          "string"
        ],
        "OutputStartIndex": number
      },
      "Tags": [
        {
          "Key": "string",
          "Value": "string"
        }
      ],
      "TranscriptionJobName": "string"
    }
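
Most fields in the template are optional. As a minimal sketch, the Python snippet below builds a much smaller payload and prints it as JSON, ready to paste into the property or send as FlowFile content. It assumes the request needs `TranscriptionJobName`, `Media.MediaFileUri`, and one of `LanguageCode`, `IdentifyLanguage`, or `IdentifyMultipleLanguages`; please confirm this against the Transcribe API reference. The helper `build_minimal_payload`, the job name, and the S3 location are all placeholders invented for this example.

```python
import json


def build_minimal_payload(job_name: str, media_uri: str, language_code: str = "en-US") -> str:
    """Return a small StartTranscriptionJob payload as a JSON string.

    Hypothetical helper; the field names are taken from the template above.
    """
    payload = {
        "TranscriptionJobName": job_name,
        "LanguageCode": language_code,
        "Media": {"MediaFileUri": media_uri},
    }
    return json.dumps(payload, indent=2)


# Placeholder values; replace them with a real job name and S3 object.
minimal_json = build_minimal_payload("example-job", "s3://example-bucket/audio/example.wav")
print(minimal_json)
```

Other fields from the template, such as `OutputBucketName` or `Settings`, can be added to the dictionary in the same way when they are needed.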
    
Properties
Relationships
Name Description
original Upon successful completion, the original FlowFile will be routed to this relationship.
success FlowFiles are routed to the success relationship.
failure FlowFiles are routed to the failure relationship.
Writes Attributes
Name Description
awsTaskId The task ID that can be used to poll for Job completion in GetAwsTranscribeJobStatus
See Also