LookupRecord

Description:

Extracts one or more fields from a Record and looks up a value for those fields in a LookupService. If a result is returned by the LookupService, that result is optionally added to the Record. In this case, the processor functions as an Enrichment processor. Regardless, the Record is then routed to either the 'matched' relationship or 'unmatched' relationship (if the 'Routing Strategy' property is configured to do so), indicating whether or not a result was returned by the LookupService, allowing the processor to also function as a Routing processor. The "coordinates" to use for looking up a value in the Lookup Service are defined by adding a user-defined property. Each property that is added will have an entry added to a Map, where the name of the property becomes the Map Key and the value returned by the RecordPath becomes the value for that key. If multiple values are returned by the RecordPath, then the Record will be routed to the 'unmatched' relationship (or 'success', depending on the 'Routing Strategy' property's configuration). If one or more fields match the Result RecordPath, all fields that match will be updated. If there is no match in the configured LookupService, then no fields will be updated. I.e., it will not overwrite an existing value in the Record with a null value. Please note, however, that if the results returned by the LookupService are not accounted for in your schema (specifically, the schema that is configured for your Record Writer) then the fields will not be written out to the FlowFile.

Additional Details...

Tags:

lookup, enrichment, route, record, csv, json, avro, database, db, logs, convert, filter

Properties:

In the list below, the names of required properties appear in bold. Any other properties (not in bold) are considered optional. The table also indicates any default values, and whether a property supports the NiFi Expression Language.

Display NameAPI NameDefault ValueAllowable ValuesDescription
Record Readerrecord-readerController Service API:
RecordReaderFactory
Implementations: GrokReader
JsonTreeReader
WindowsEventLogReader
ReaderLookup
ParquetReader
CSVReader
Syslog5424Reader
ExcelReader
CEFReader
XMLReader
ScriptedReader
SyslogReader
JsonPathReader
AvroReader
YamlTreeReader
Specifies the Controller Service to use for reading incoming data
Record Writerrecord-writerController Service API:
RecordSetWriterFactory
Implementations: FreeFormTextRecordSetWriter
CSVRecordSetWriter
ParquetRecordSetWriter
RecordSetWriterLookup
ScriptedRecordSetWriter
XMLRecordSetWriter
JsonRecordSetWriter
AvroRecordSetWriter
Specifies the Controller Service to use for writing out the records
Lookup Servicelookup-serviceController Service API:
LookupService
Implementations: IPLookupService
ElasticSearchStringLookupService
XMLFileLookupService
DistributedMapCacheLookupService
ElasticSearchLookupService
KuduLookupService
HBase_2_RecordLookupService
CouchbaseRecordLookupService
SimpleScriptedLookupService
CSVRecordLookupService
SimpleCsvFileLookupService
DatabaseRecordLookupService
SimpleKeyValueLookupService
RestLookupService
PropertiesFileLookupService
CouchbaseKeyValueLookupService
SimpleDatabaseLookupService
ScriptedLookupService
MongoDBLookupService
The Lookup Service to use in order to lookup a value in each Record
Result RecordPathresult-record-pathA RecordPath that points to the field whose value should be updated with whatever value is returned from the Lookup Service. If not specified, the value that is returned from the Lookup Service will be ignored, except for determining whether the FlowFile should be routed to the 'matched' or 'unmatched' Relationship.
Supports Expression Language: true (will be evaluated using flow file attributes and variable registry)
Routing Strategyrouting-strategyRoute to 'success'
  • Route to 'success' Records will be routed to a 'success' Relationship regardless of whether or not there is a match in the configured Lookup Service
  • Route to 'matched' or 'unmatched' Records will be routed to either a 'matched' or an 'unmatched' Relationship depending on whether or not there was a match in the configured Lookup Service. A single input FlowFile may result in two different output FlowFiles.
Specifies how to route records after a Lookup has completed
Record Result Contentsresult-contentsInsert Entire Record
  • Insert Entire Record The entire Record that is retrieved from the Lookup Service will be inserted into the destination path.
  • Insert Record Fields All of the fields in the Record that is retrieved from the Lookup Service will be inserted into the destination path.
When a result is obtained that contains a Record, this property determines whether the Record itself is inserted at the configured path or if the contents of the Record (i.e., the sub-fields) will be inserted at the configured path.
Record Update Strategyrecord-update-strategyUse Property
  • Replace Existing Values The "Result RecordPath" property will be ignored and the lookup service must be a single simple key lookup service. Every dynamic property value should be a record path. For each dynamic property, the value contained in the field corresponding to the record path will be used as the key in the Lookup Service and the value returned by the Lookup Service will be used to replace the existing value. It is possible to configure multiple dynamic properties to replace multiple values in one execution. This strategy only supports simple types replacements (strings, integers, etc).
  • Use Property The "Result RecordPath" property will be used to determine which part of the record should be updated with the value returned by the Lookup Service
This property defines the strategy to use when updating the record with the value returned by the Lookup Service.
Cache Sizerecord-path-lookup-miss-result-cache-size0Specifies how many lookup values/records should be cached.Setting this property to zero means no caching will be done and the table will be queried for each lookup value in each record. If the lookup table changes often or the most recent data must be retrieved, do not use the cache.
Supports Expression Language: true (will be evaluated using variable registry only)

Dynamic Properties:

Supports Sensitive Dynamic Properties: No

Dynamic Properties allow the user to specify both the name and value of a property.

NameValueDescription
Value To LookupValid Record PathA RecordPath that points to the field whose value will be looked up in the configured Lookup Service
Supports Expression Language: true (will be evaluated using flow file attributes and variable registry)

Relationships:

NameDescription
successAll records will be sent to this Relationship if configured to do so, unless a failure occurs
failureIf a FlowFile cannot be enriched, the unchanged FlowFile will be routed to this relationship

Reads Attributes:

None specified.

Writes Attributes:

NameDescription
mime.typeSets the mime.type attribute to the MIME Type specified by the Record Writer
record.countThe number of records in the FlowFile

State management:

This component does not store state.

Restricted:

This component is not restricted.

Input requirement:

This component requires an incoming relationship.

System Resource Considerations:

None specified.

See Also:

ConvertRecord, SplitRecord, SimpleKeyValueLookupService, IPLookupService, DatabaseRecordLookupService