SiteToSiteProvenanceReportingTask

Description:

Publishes Provenance events using the Site To Site protocol.

Additional Details...

Tags:

provenance, lineage, tracking, site, site to site

Properties:

In the list below, the names of required properties appear in bold. Any other properties (not in bold) are considered optional. The table also indicates any default values, and whether a property supports the NiFi Expression Language.

Display NameAPI NameDefault ValueAllowable ValuesDescription
Destination URLDestination URLThe URL of the destination NiFi instance or, if clustered, a comma-separated list of address in the format of http(s)://host:port/nifi. This destination URL will only be used to initiate the Site-to-Site connection. The data sent by this reporting task will be load-balanced on all the nodes of the destination (if clustered).
Supports Expression Language: true (will be evaluated using variable registry only)
Input Port NameInput Port NameThe name of the Input Port to deliver data to.
Supports Expression Language: true (will be evaluated using variable registry only)
SSL Context ServiceSSL Context ServiceController Service API:
RestrictedSSLContextService
Implementation: StandardRestrictedSSLContextService
The SSL Context Service to use when communicating with the destination. If not specified, communications will not be secure.
Instance URLInstance URLhttp://${hostname(true)}:8080/nifiThe URL of this instance to use in the Content URI of each event.
Supports Expression Language: true (will be evaluated using variable registry only)
Compress EventsCompress Eventstrue
  • true
  • false
Indicates whether or not to compress the data being sent.
Communications TimeoutCommunications Timeout30 secsSpecifies how long to wait to a response from the destination before deciding that an error has occurred and canceling the transaction
Batch SizeBatch Size1000Specifies how many records to send in a single batch, at most.
Transport Protocols2s-transport-protocolRAW
  • RAW
  • HTTP
Specifies which transport protocol to use for Site-to-Site communication.
HTTP Proxy hostnames2s-http-proxy-hostnameSpecify the proxy server's hostname to use. If not specified, HTTP traffics are sent directly to the target NiFi instance.
HTTP Proxy ports2s-http-proxy-portSpecify the proxy server's port number, optional. If not specified, default port 80 will be used.
HTTP Proxy usernames2s-http-proxy-usernameSpecify an user name to connect to the proxy server, optional.
HTTP Proxy passwords2s-http-proxy-passwordSpecify an user password to connect to the proxy server, optional.
Sensitive Property: true
Record Writerrecord-writerController Service API:
RecordSetWriterFactory
Implementations: JsonRecordSetWriter
RecordSetWriterLookup
AvroRecordSetWriter
XMLRecordSetWriter
FreeFormTextRecordSetWriter
CSVRecordSetWriter
ParquetRecordSetWriter
ScriptedRecordSetWriter
Specifies the Controller Service to use for writing out the records.
Include Null Valuesinclude-null-valuesfalse
  • true
  • false
Indicate if null values should be included in records. Default will be false
PlatformPlatformnifiThe value to use for the platform field in each event.
Supports Expression Language: true (will be evaluated using variable registry only)
Event Type to Includes2s-prov-task-event-filterComma-separated list of event types that will be used to filter the provenance events sent by the reporting task. Available event types are [CREATE, RECEIVE, FETCH, SEND, REMOTE_INVOCATION, DOWNLOAD, DROP, EXPIRE, FORK, JOIN, CLONE, CONTENT_MODIFIED, ATTRIBUTES_MODIFIED, ROUTE, ADDINFO, REPLAY, UNKNOWN]. If no filter is set, all the events are sent. If multiple filters are set, the filters are cumulative.
Supports Expression Language: true (will be evaluated using variable registry only)
Event Type to Excludes2s-prov-task-event-filter-excludeComma-separated list of event types that will be used to exclude the provenance events sent by the reporting task. Available event types are [CREATE, RECEIVE, FETCH, SEND, REMOTE_INVOCATION, DOWNLOAD, DROP, EXPIRE, FORK, JOIN, CLONE, CONTENT_MODIFIED, ATTRIBUTES_MODIFIED, ROUTE, ADDINFO, REPLAY, UNKNOWN]. If no filter is set, all the events are sent. If multiple filters are set, the filters are cumulative. If an event type is included in Event Type to Include and excluded here, then the exclusion takes precedence and the event will not be sent.
Supports Expression Language: true (will be evaluated using variable registry only)
Component Type to Includes2s-prov-task-type-filterRegular expression to filter the provenance events based on the component type. Only the events matching the regular expression will be sent. If no filter is set, all the events are sent. If multiple filters are set, the filters are cumulative.
Supports Expression Language: true (will be evaluated using variable registry only)
Component Type to Excludes2s-prov-task-type-filter-excludeRegular expression to exclude the provenance events based on the component type. The events matching the regular expression will not be sent. If no filter is set, all the events are sent. If multiple filters are set, the filters are cumulative. If a component type is included in Component Type to Include and excluded here, then the exclusion takes precedence and the event will not be sent.
Supports Expression Language: true (will be evaluated using variable registry only)
Component ID to Includes2s-prov-task-id-filterComma-separated list of component UUID that will be used to filter the provenance events sent by the reporting task. If no filter is set, all the events are sent. If multiple filters are set, the filters are cumulative.
Supports Expression Language: true (will be evaluated using variable registry only)
Component ID to Excludes2s-prov-task-id-filter-excludeComma-separated list of component UUID that will be used to exclude the provenance events sent by the reporting task. If no filter is set, all the events are sent. If multiple filters are set, the filters are cumulative. If a component UUID is included in Component ID to Include and excluded here, then the exclusion takes precedence and the event will not be sent.
Supports Expression Language: true (will be evaluated using variable registry only)
Component Name to Includes2s-prov-task-name-filterRegular expression to filter the provenance events based on the component name. Only the events matching the regular expression will be sent. If no filter is set, all the events are sent. If multiple filters are set, the filters are cumulative.
Supports Expression Language: true (will be evaluated using variable registry only)
Component Name to Excludes2s-prov-task-name-filter-excludeRegular expression to exclude the provenance events based on the component name. The events matching the regular expression will not be sent. If no filter is set, all the events are sent. If multiple filters are set, the filters are cumulative. If a component name is included in Component Name to Include and excluded here, then the exclusion takes precedence and the event will not be sent.
Supports Expression Language: true (will be evaluated using variable registry only)
Start Positionstart-positionBeginning of Stream
  • Beginning of Stream Start reading provenance Events from the beginning of the stream (the oldest event first)
  • End of Stream Start reading provenance Events from the end of the stream, ignoring old events
If the Reporting Task has never been run, or if its state has been reset by a user, specifies where in the stream of Provenance Events the Reporting Task should start

State management:

ScopeDescription
LOCALStores the Reporting Task's last event Id so that on restart the task knows where it left off.

Restricted:

Required PermissionExplanation
export nifi detailsProvides operator the ability to send sensitive details contained in Provenance events to any external system.

System Resource Considerations:

None specified.