QueryAirtableTable

QueryAirtableTable 2.4.0

Bundle: org.apache.nifi | nifi-airtable-nar
Description: Query records from an Airtable table. Records are incrementally retrieved based on the last modified time of the records. Records can also be further filtered by setting the 'Custom Filter' property which supports the formulas provided by the Airtable API. This processor is intended to be run on the Primary Node only.
Tags: airtable, database, query
Input Requirement: FORBIDDEN
Supports Sensitive Dynamic Properties: false

Additional Details for QueryAirtableTable 2.4.0

QueryAirtableTable

Description

Airtable is a spreadsheet-database hybrid. In Airtable an application is called base and each base can have multiple tables. A table consists of records (rows) and each record can have multiple fields (columns). The QueryAirtableTable processor can query records from a single base and table via Airtable’s REST API. The processor utilizes streams to be able to handle a large number of records. It can also split large record sets to multiple FlowFiles just like a database processor.

Personal Access Token

Please note that API Keys were deprecated, Airtable now provides Personal Access Tokens (PATs) instead. Airtable REST API calls requires a PAT (Personal Access Token) that needs to be passed in a request. An Airtable account is required to generate the PAT.

API rate limit

The Airtable REST API limits the number of requests that can be sent on a per-base basis to avoid bottlenecks. Currently, this limit is 5 requests per second per base. If this limit is exceeded you can’t make another request for 30 seconds. It’s your responsibility to handle this rate limit via configuring Yield Duration and Run Schedule properly. It is recommended to start off with the default settings and to increase both parameters when rate limit issues occur.

Metadata API

Currently, the Metadata API of Airtable is unstable, and we don’t provide a way to use it. Until it becomes stable you can set up a ConvertRecord or MergeRecord processor with a JsonTreeReader to read the content and convert it into a Record with schema.

Properties

API URL
The URL for the Airtable REST API including the domain and the path to the API (e.g. https://api.airtable.com/v0).

Display Name

API URL

Description

The URL for the Airtable REST API including the domain and the path to the API (e.g. https://api.airtable.com/v0).

API Name

api-url

Default Value

https://api.airtable.com/v0

Expression Language Scope

Environment variables defined at JVM level and system properties

Sensitive

false

Required

true
Base ID
The ID of the Airtable base to be queried.

Display Name

Base ID

Description

The ID of the Airtable base to be queried.

API Name

base-id

Expression Language Scope

Environment variables defined at JVM level and system properties

Sensitive

false

Required

true
Custom Filter
Filter records by Airtable's formulas.

Display Name

Custom Filter

Description

Filter records by Airtable's formulas.

API Name

custom-filter

Expression Language Scope

Environment variables defined at JVM level and system properties

Sensitive

false

Required

false
Fields
Comma-separated list of fields to query from the table. Both the field's name and ID can be used.

Display Name

Fields

Description

Comma-separated list of fields to query from the table. Both the field's name and ID can be used.

API Name

fields

Expression Language Scope

Environment variables defined at JVM level and system properties

Sensitive

false

Required

false
Max Records Per FlowFile
The maximum number of result records that will be included in a single FlowFile. This will allow you to break up very large result sets into multiple FlowFiles. If no value specified, then all records are returned in a single FlowFile.

Display Name

Max Records Per FlowFile

Description

The maximum number of result records that will be included in a single FlowFile. This will allow you to break up very large result sets into multiple FlowFiles. If no value specified, then all records are returned in a single FlowFile.

API Name

max-records-per-flowfile

Expression Language Scope

Environment variables defined at JVM level and system properties

Sensitive

false

Required

false
Personal Access Token
The Personal Access Token (PAT) to use in queries. Should be generated on Airtable's account page.

Display Name

Personal Access Token

Description

The Personal Access Token (PAT) to use in queries. Should be generated on Airtable's account page.

API Name

pat

Expression Language Scope

Not Supported

Sensitive

true

Required

true
Query Page Size
Number of records to be fetched in a page. Should be between 1 and 100 inclusively.

Display Name

Query Page Size

Description

Number of records to be fetched in a page. Should be between 1 and 100 inclusively.

API Name

query-page-size

Expression Language Scope

Environment variables defined at JVM level and system properties

Sensitive

false

Required

false
Query Time Window Lag
The amount of lag to be applied to the query time window's end point. Set this property to avoid missing records when the clock of your local machines and Airtable servers' clock are not in sync. Must be greater than or equal to 1 second.

Display Name

Query Time Window Lag

Description

The amount of lag to be applied to the query time window's end point. Set this property to avoid missing records when the clock of your local machines and Airtable servers' clock are not in sync. Must be greater than or equal to 1 second.

API Name

query-time-window-lag

Default Value

3 s

Expression Language Scope

Environment variables defined at JVM level and system properties

Sensitive

false

Required

true
Table ID
The name or the ID of the Airtable table to be queried.

Display Name

Table ID

Description

The name or the ID of the Airtable table to be queried.

API Name

table-id

Expression Language Scope

Environment variables defined at JVM level and system properties

Sensitive

false

Required

true
Web Client Service Provider
Web Client Service Provider to use for Airtable REST API requests

Display Name

Web Client Service Provider

Description

Web Client Service Provider to use for Airtable REST API requests

API Name

web-client-service-provider

Service Interface

org.apache.nifi.web.client.provider.api.WebClientServiceProvider

Service Implementations

org.apache.nifi.web.client.provider.service.StandardWebClientServiceProvider

Expression Language Scope

Not Supported

Sensitive

false

Required

true

State Management

Scopes	Description
CLUSTER	The last successful query's time is stored in order to enable incremental loading. The initial query returns all the records in the table and each subsequent query filters the records by their last modified time. In other words, if a record is updated after the last successful query only the updated records will be returned in the next query. State is stored across the cluster, so this Processor can run only on the Primary Node and if a new Primary Node is selected, the new node can pick up where the previous one left off without duplicating the data.

Scopes

Description

CLUSTER

The last successful query's time is stored in order to enable incremental loading. The initial query returns all the records in the table and each subsequent query filters the records by their last modified time. In other words, if a record is updated after the last successful query only the updated records will be returned in the next query. State is stored across the cluster, so this Processor can run only on the Primary Node and if a new Primary Node is selected, the new node can pick up where the previous one left off without duplicating the data.

Relationships

Name	Description
success	For FlowFiles created as a result of a successful query.

Writes Attributes

Name	Description
record.count	Sets the number of records in the FlowFile.
fragment.identifier	If 'Max Records Per FlowFile' is set then all FlowFiles from the same query result set will have the same value for the fragment.identifier attribute. This can then be used to correlate the results.
fragment.count	If 'Max Records Per FlowFile' is set then this is the total number of FlowFiles produced by a single ResultSet. This can be used in conjunction with the fragment.identifier attribute in order to know how many FlowFiles belonged to the same incoming ResultSet.
fragment.index	If 'Max Records Per FlowFile' is set then the position of this FlowFile in the list of outgoing FlowFiles that were all derived from the same result set FlowFile. This can be used in conjunction with the fragment.identifier attribute to know which FlowFiles originated from the same query result set and in what order FlowFiles were produced