PutAccumuloRecord

Description:

This is a record aware processor that reads the content of the incoming FlowFile as individual records using the configured 'Record Reader' and writes them to Apache Accumulo.

Tags:

hadoop, accumulo, put, record

Properties:

In the list below, the names of required properties appear in bold. Any other properties (not in bold) are considered optional. The table also indicates any default values, and whether a property supports the NiFi Expression Language.

Display NameAPI NameDefault ValueAllowable ValuesDescription
Accumulo Connector Serviceaccumulo-connector-serviceController Service API:
BaseAccumuloService
Implementation: AccumuloService
Specifies the Controller Service to use for accessing Accumulo.
Table NameTable NameThe name of the Accumulo Table into which data will be placed
Supports Expression Language: true (will be evaluated using flow file attributes and Environment variables)
Create TableCreate TableFalse
  • True
  • False
Creates a table if it does not exist. This property will only be used when EL is not present in 'Table Name'
ThreadsThreads10Number of threads used for reading and writing
Accumulo Timeoutaccumulo-timeout30 secMax amount of time to wait for an unresponsive server. Set to 0 sec for no timeout. Entered value less than 1 second may be converted to 0 sec.
Record Readerrecord-readerController Service API:
RecordReaderFactory
Implementations: CSVReader
JsonPathReader
AvroReader
CEFReader
Syslog5424Reader
JsonTreeReader
WindowsEventLogReader
XMLReader
SyslogReader
JASN1Reader
ReaderLookup
ParquetReader
GrokReader
ScriptedReader
YamlTreeReader
ExcelReader
Specifies the Controller Service to use for parsing incoming data and determining the data's schema
Row Identifier Field NameRow Identifier Field NameSpecifies the name of a record field whose value should be used as the row id for the given record. If EL defines a value that is not a field name that will be used as the row identifier.
Supports Expression Language: true (will be evaluated using flow file attributes and Environment variables)
Row Identifier Field NameRow Identifier Field NameSpecifies the name of a record field whose value should be used as the row id for the given record. If EL defines a value that is not a field name that will be used as the row identifier.
Supports Expression Language: true (will be evaluated using flow file attributes and Environment variables)
Column FamilyColumn FamilyThe Column Family to use when inserting data into Accumulo
Supports Expression Language: true (will be evaluated using flow file attributes and Environment variables)
Column Family FieldColumn Family FieldField name used as the column family if one is not specified above.
Supports Expression Language: true (will be evaluated using flow file attributes and Environment variables)
Delete Keydelete-keyDeletes the key
Supports Expression Language: true (will be evaluated using flow file attributes and Environment variables)
Flush Every FlowFileflush-on-flow-fileTrue
  • True
  • False
Flushes the table writer on every flow file.
Field Delimiterfield-delimiterDelimiter between the record value and name.
Supports Expression Language: true (will be evaluated using flow file attributes and Environment variables)
Hex Encode Field Delimiterfield-delimiter-as-hexFalse
  • True
  • False
Allows you to hex encode the delimiter as a character. So 0x00 places a null character between the record name and value.
Memory SizeMemory Size10 MBThe maximum memory size Accumulo at any one time from the record set.
Record Value In Qualifierrecord-value-in-qualifierFalse
  • True
  • False
Places the record value into the column qualifier instead of the value.
Timestamp Fieldtimestamp-fieldSpecifies the name of a record field whose value should be used as the timestamp. If empty a timestamp will be recorded as the time of insertion
Supports Expression Language: true (will be evaluated using flow file attributes and Environment variables)
Visibility String Record Path Rootvisibility-pathA record path that points to part of the record which contains a path to a mapping of visibility strings to record paths
Default Visibilitydefault-visibilityDefault visibility when VISIBILITY_PATH is not defined.

Dynamic Properties:

Supports Sensitive Dynamic Properties: No

Dynamic Properties allow the user to specify both the name and value of a property.

NameValueDescription
visibility.<COLUMN FAMILY>visibility label for <COLUMN FAMILY>Visibility label for everything under that column family when a specific label for a particular column qualifier is not available.
Supports Expression Language: true (will be evaluated using flow file attributes and Environment variables)
visibility.<COLUMN FAMILY>.<COLUMN QUALIFIER>visibility label for <COLUMN FAMILY>:<COLUMN QUALIFIER>.Visibility label for the specified column qualifier qualified by a configured column family.
Supports Expression Language: true (will be evaluated using flow file attributes and Environment variables)

Relationships:

NameDescription
successA FlowFile is routed to this relationship after it has been successfully stored in Accumulo
failureA FlowFile is routed to this relationship if it cannot be sent to Accumulo

Reads Attributes:

None specified.

Writes Attributes:

None specified.

State management:

This component does not store state.

Restricted:

This component is not restricted.

Input requirement:

This component requires an incoming relationship.

System Resource Considerations:

None specified.