1 of 4

Getting Started

The following guide assumes that you are familiar with Fluent Bit, if that is not the case we suggest you review the official manual first:

Fluent Bit Manual

Requirements

>= v1.1.0 or Fluent Bit from
Basic understanding of Structured Query Language (SQL)

Technical Concepts

Concept

Description

Fluent Bit + SQL

Fluent Bit stream processor uses common SQL to perform record queries. The following section describe the features available and examples of it.

Statements

You can find the detailed query language syntax in BNF form here. The following section will be a brief introduction on how to write SQL queries for Fluent Bit stream processing.

SELECT Statement

Synopsis

Description

Select keys from records coming from a stream or records matching a specific Tag pattern. Note that a simple SELECT statement not associated from a stream creation will send the results to the standard output interface (stdout), useful for debugging purposes.

The query allows filtering the results by applying a condition using WHERE statement. We will explain WINDOW and GROUP BY statements later in aggregation functions section.

Examples

Select all keys from records coming from a stream called apache:

Select code key from records which Tag starts with apache.:

Since the TAG selector allows the use of wildcards, we put the value between single quotes.

CREATE STREAM Statement

Synopsis

Description

Create a new stream of data using the results from the SELECT statement. New stream created can be optionally re-ingested back into Fluent Bit pipeline if the property Tag is set in the WITH statement.

Examples

Create a new stream called hello from stream called apache:

Create a new stream called hello for all records which original Tag starts with apache:

Aggregation Functions

Aggregation functions are used in results_statement on the keys, allowing to perform data calculation on groups of records. Group of records that aggregation functions apply on are determined by WINDOW keyword. When WINDOW is not specified, aggregation functions apply on the current buffer of records received, which may have non-deterministic number of elements. Aggregation functions can be applied on records in a window of a specific time interval (see the syntax of WINDOW in select statement).

Fluent Bit streaming currently supports tumbling window, which is non-overlapping window type. That means, a window of size 5 seconds performs aggregation computations on records over a 5-second interval, and then starts new calculations for the next interval.

In addition, the syntax support GROUP BY statement, which groups the results by the one or more keys, when they have the same values.

AVG

Synopsis

Description

Calculates the average of request sizes in POST requests.

COUNT

Synopsis

Description

Count the number of records in 5 second windows group by host IP addresses.

MIN

Synopsis

Description

Gets the minimum value of a key in a set of records.

MAX

Synopsis

Description

Gets the maximum value of a key in a set of records.

SUM

Synopsis

Description

Calculates the sum of all values of key in a set of records.

Time Functions

Time functions adds a new key into the record with timing data

NOW

Synopsis

Description

Add system time using format: %Y-%m-%d %H:%M:%S. Output example: 2019-03-09 21:36:05.

UNIX_TIMESTAMP

Synopsis

Description

Add current Unix timestamp to the record. Output example: 1552196165 .

Record Functions

Record functions append new keys to the record using values from the record context.

RECORD_TAG

Synopsis

Description

Append Tag string associated to the record as a new key.

RECORD_TIME

Synopsis

WHERE Condition

Similar to conventional SQL statements, WHERE condition is supported in Fluent Bit query language. The language supports conditions over keys and subkeys, for instance:

It is possible to check the existence of a key in the record using record-specific function @record.contains:

And to check if the value of a key is/is not NULL:

Description

Append a new key with the record Timestamp in double format: seconds.nanoseconds. Output example: 1552196165.705683 .

Check Keys and NULL values

Feature available on Fluent Bit >= 1.2

When working with structured messages (records), there are certain cases where we want to know if a key exists, if it value is null or have a value different than null.

internal records are a binary serialization of maps with keys and values. A value can be null which is a valid data type. In our SQL language we provide the following statements that can be applied to the conditionals statements:

Hands On! 101

This article goes through very specific and simple steps to learn how Stream Processor works. For simplicity it uses a custom Docker image that contains the relevant components for testing.

Requirements

The following tutorial requires the following software components:

>= v1.2.0
(not mandatory if you already have Fluent Bit binary installed in your system)

In addition download the following data (130KB).

Stream Processing using the command line

For all next steps we will run Fluent Bit from the command line, and for simplicity we will use the official Docker image.

1. Fluent Bit version

2. Parse sample files

The samples file contains JSON records. On this command, we are appending the Parsers configuration file and instructing tail input plugin to parse the content as json:

The command above will simply print the parsed content to the standard output interface. The content will print the Tag associated to each record and an array with two fields: record timestamp and record map:

As of now there is no Stream Processing, on step #3 we will start doing some basic queries.

3. Selecting specific record keys

This command introduces a Stream Processor (SP) query through the -T option and changes the output plugin to null, this is done with the purpose of obtaining the SP results in the standard output interface and avoid confusions in the terminal.

The query above aims to retrieve all records that a key named country value matches the value Chile, and for each match compose and output a record using only the key fields word and num:

4. Calculate Average Value

The following query is similar to the one in the previous step, but this time we will use the aggregation function called AVG() to get the average value of the records ingested:

output:

why did we get multiple records? Answer: When Fluent Bit processes the data, records come in chunks and the Stream Processor runs the process over chunks of data, so the input plugin ingested 5 chunks of records and SP processed the query for each chunk independently. To process multiple chunks at once we have to group results during windows of time.

5. Grouping Results and Window

Grouping results aims to simplify data processing and when used in a defined window of time we can achieve great things. The next query group the results by country and calculate the average of num value, the processing window is 1 second which basically means: process all incoming chunks coming within 1 second window:

output:

6. Ingest Stream Processor results as new Stream of Data

Now we see a more real-world use case. Sending data results to the standard output interface is good for learning purposes, but now we will instruct the Stream Processor to ingest results as part of Fluent Bit data pipeline and attach a Tag to them.

This can be done using the CREATE STREAM statement that will also tag results with sp-results value. Note that output plugin parameter is now stdout matching all records tagged with sp-results:

output:

F.A.Q

Where STREAM name comes from?

Fluent Bit have the notion of streams, and every input plugin instance gets a default name. You can override that behavior by setting an alias. Check the alias parameter and new stream name in the following example:

Fluent Bit + SQL

Fluent Bit stream processor uses common SQL to perform record queries. The following section describe the features available and examples of it.

Statements

You can find the detailed query language syntax in BNF form here. The following section will be a brief introduction on how to write SQL queries for Fluent Bit stream processing.

SELECT Statement

Synopsis

Description

The query allows filtering the results by applying a condition using WHERE statement. We will explain WINDOW and GROUP BY statements later in aggregation functions section.

Examples

Select all keys from records coming from a stream called apache:

Select code key from records which Tag starts with apache.:

Since the TAG selector allows the use of wildcards, we put the value between single quotes.

CREATE STREAM Statement

Synopsis

Description

Examples

Create a new stream called hello from stream called apache:

Create a new stream called hello for all records which original Tag starts with apache:

Aggregation Functions

In addition, the syntax support GROUP BY statement, which groups the results by the one or more keys, when they have the same values.

AVG

Synopsis

Description

Calculates the average of request sizes in POST requests.

COUNT

Synopsis

Description

Count the number of records in 5 second windows group by host IP addresses.

MIN

Synopsis

Description

Gets the minimum value of a key in a set of records.

MAX

Synopsis

Description

Gets the maximum value of a key in a set of records.

SUM

Synopsis

Description

Calculates the sum of all values of key in a set of records.

Time Functions

Time functions adds a new key into the record with timing data

NOW

Synopsis

Description

Add system time using format: %Y-%m-%d %H:%M:%S. Output example: 2019-03-09 21:36:05.

UNIX_TIMESTAMP

Synopsis

Description

Add current Unix timestamp to the record. Output example: 1552196165 .

Record Functions

Record functions append new keys to the record using values from the record context.

RECORD_TAG

Synopsis

Description

Append Tag string associated to the record as a new key.

RECORD_TIME

Synopsis

WHERE Condition

Similar to conventional SQL statements, WHERE condition is supported in Fluent Bit query language. The language supports conditions over keys and subkeys, for instance:

It is possible to check the existence of a key in the record using record-specific function @record.contains:

And to check if the value of a key is/is not NULL:

Description

Append a new key with the record Timestamp in double format: seconds.nanoseconds. Output example: 1552196165.705683 .

Hands On! 101

This article goes through very specific and simple steps to learn how Stream Processor works. For simplicity it uses a custom Docker image that contains the relevant components for testing.

Requirements

The following tutorial requires the following software components:

>= v1.2.0
(not mandatory if you already have Fluent Bit binary installed in your system)

In addition download the following data (130KB).

Stream Processing using the command line

For all next steps we will run Fluent Bit from the command line, and for simplicity we will use the official Docker image.

1. Fluent Bit version

2. Parse sample files

The samples file contains JSON records. On this command, we are appending the Parsers configuration file and instructing tail input plugin to parse the content as json:

As of now there is no Stream Processing, on step #3 we will start doing some basic queries.

3. Selecting specific record keys

The query above aims to retrieve all records that a key named country value matches the value Chile, and for each match compose and output a record using only the key fields word and num:

4. Calculate Average Value

The following query is similar to the one in the previous step, but this time we will use the aggregation function called AVG() to get the average value of the records ingested:

output:

5. Grouping Results and Window

output:

6. Ingest Stream Processor results as new Stream of Data

output:

Getting Started

hashtagRequirements

hashtagTechnical Concepts

Fluent Bit + SQL

hashtagStatements

hashtagSELECT Statement

hashtagSynopsis

hashtagDescription

hashtagExamples

hashtagCREATE STREAM Statement

hashtagSynopsis

hashtagDescription

hashtagExamples

hashtagAggregation Functions

hashtagAVG

hashtagSynopsis

hashtagDescription

hashtagCOUNT

hashtagSynopsis

hashtagDescription

hashtagMIN

hashtagSynopsis

hashtagDescription

hashtagMAX

hashtagSynopsis

hashtagDescription

hashtagSUM

hashtagSynopsis

hashtagDescription

hashtagTime Functions

hashtagNOW

hashtagSynopsis

hashtagDescription

hashtagUNIX_TIMESTAMP

hashtagSynopsis

hashtagDescription

hashtagRecord Functions

hashtagRECORD_TAG

hashtagSynopsis

hashtagDescription

hashtagRECORD_TIME

hashtagSynopsis

hashtagWHERE Condition

hashtagDescription

Check Keys and NULL values

hashtag

Hands On! 101

hashtagRequirements

hashtagStream Processing using the command line

hashtag1. Fluent Bit version

hashtag2. Parse sample files

hashtag3. Selecting specific record keys

hashtag4. Calculate Average Value

hashtag5. Grouping Results and Window

hashtag6. Ingest Stream Processor results as new Stream of Data

hashtagF.A.Q

hashtagWhere STREAM name comes from?

Getting Started

hashtagRequirements

hashtagTechnical Concepts

Fluent Bit + SQL

hashtagStatements

hashtagSELECT Statement

hashtagSynopsis

hashtagDescription

hashtagExamples

hashtagCREATE STREAM Statement

hashtagSynopsis

hashtagDescription

hashtagExamples

hashtagAggregation Functions

hashtagAVG

hashtagSynopsis

hashtagDescription

hashtagCOUNT

hashtagSynopsis

hashtagDescription

hashtagMIN

hashtagSynopsis

hashtagDescription

Requirements

Technical Concepts

Statements

SELECT Statement

Synopsis

Description

Examples

CREATE STREAM Statement

Synopsis

Description

Examples

Aggregation Functions

AVG

Synopsis

Description

COUNT

Synopsis

Description

MIN

Synopsis

Description

MAX

Synopsis

Description

SUM

Synopsis

Description

Time Functions

NOW

Synopsis

Description

UNIX_TIMESTAMP

Synopsis

Description

Record Functions

RECORD_TAG

Synopsis

Description

RECORD_TIME

Synopsis

WHERE Condition

Description

Requirements

Stream Processing using the command line

1. Fluent Bit version

2. Parse sample files

3. Selecting specific record keys

4. Calculate Average Value

5. Grouping Results and Window

6. Ingest Stream Processor results as new Stream of Data

F.A.Q

Where STREAM name comes from?

Requirements

Technical Concepts

Statements

SELECT Statement

Synopsis

Description

Examples

CREATE STREAM Statement

Synopsis

Description

Examples

Aggregation Functions

AVG

Synopsis

Description

COUNT

Synopsis

Description

MIN

Synopsis

Description

MAX

Synopsis

Description

SUM

Synopsis

Description

Time Functions