Kafka output

The Kafka output sends events to Apache Kafka.
Compatibility: This output can connect to Kafka version 0.8.2.0 and later. Older versions might work as well, but are not supported.
This example configures a Kafka output called kafka-output in the Elastic Agent elastic-agent.yml file, with settings described in the sections that follow:

```yaml
outputs:
  kafka-output:
    type: kafka
    hosts:
      - 'kafka1:9092'
      - 'kafka2:9092'
      - 'kafka3:9092'
    client_id: Elastic
    version: 1.0.0
    compression: gzip
    compression_level: 4
    username: <my-kafka-username>
    password: <my-kafka-password>
    sasl:
      mechanism: SCRAM-SHA-256
    partition:
      round_robin:
        group_events: 1
    topics:
      - topic: '%{[fields.log_topic]}'
    headers: []
    timeout: 30
    broker_timeout: 30
    required_acks: 1
    ssl:
      verification_mode: full
```
Kafka output configuration settings

The kafka output supports the following settings, grouped by category. Many of these settings have sensible defaults that allow you to run Elastic Agent with minimal configuration.
| Setting | Description |
|---|---|
| `enabled` | (boolean) Enables or disables the output. If set to `false`, the output is disabled. |
| `hosts` | The addresses your Elastic Agents will use to connect to one or more Kafka brokers. For example: `hosts: ['localhost:9092', 'mykafkahost01:9092', 'mykafkahost02:9092']` |
| `version` | Kafka protocol version that Elastic Agent will request when connecting. Defaults to `1.0.0`. The protocol version controls the Kafka client features available to Elastic Agent; it does not prevent Elastic Agent from connecting to Kafka versions newer than the protocol version. |
Authentication settings

| Setting | Description |
|---|---|
| `username` | The username for connecting to Kafka. If username is configured, the password must be configured as well. |
| `password` | The password for connecting to Kafka. |
| `sasl.mechanism` | The SASL mechanism to use when connecting to Kafka. It can be one of: `PLAIN` for SASL/PLAIN, `SCRAM-SHA-256` for SCRAM-SHA-256, or `SCRAM-SHA-512` for SCRAM-SHA-512. |
| `ssl` | When sending data to a secured cluster through the `kafka` output, Elastic Agent can use SSL/TLS. For a list of available settings, refer to the SSL/TLS configuration documentation. |
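For example, the authentication settings combine as shown in this minimal sketch; the host name and credential values are placeholders, and your broker's SASL mechanism may differ:

```yaml
# Minimal sketch of SASL/SCRAM authentication over TLS.
# Host and credential values are placeholders.
outputs:
  kafka-output:
    type: kafka
    hosts:
      - 'kafka1:9093'
    username: <your-kafka-username>
    password: <your-kafka-password>
    sasl:
      mechanism: SCRAM-SHA-256
    ssl:
      verification_mode: full
```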
Memory queue settings

The memory queue keeps all events in memory.

The memory queue waits for the output to acknowledge or drop events. If the queue is full, no new events can be inserted into it. The queue frees up space to accept more events only after it receives that signal from the output.
The memory queue is controlled by the parameters queue.mem.flush.min_events and queue.mem.flush.timeout. If queue.mem.flush.timeout is 0s or queue.mem.flush.min_events is 0 or 1, events can be sent by the output as soon as they are available. If the output supports a bulk_max_size parameter, it controls the maximum batch size that can be sent.
If queue.mem.flush.min_events is greater than 1 and queue.mem.flush.timeout is greater than 0s, events will only be sent to the output when the queue contains at least queue.mem.flush.min_events events or the queue.mem.flush.timeout period has expired. In this mode the maximum batch size that can be sent by the output is queue.mem.flush.min_events. If the output supports a bulk_max_size parameter, values of bulk_max_size greater than queue.mem.flush.min_events have no effect. The value of queue.mem.flush.min_events should be evenly divisible by bulk_max_size to avoid sending partial batches to the output.
This sample configuration forwards events to the output if 512 events are available or the oldest available event has been waiting for 5s in the queue:
```yaml
queue.mem.events: 4096
queue.mem.flush.min_events: 512
queue.mem.flush.timeout: 5s
```
| Setting | Description |
|---|---|
| `queue.mem.events` | The number of events the queue can store. This value should be evenly divisible by `queue.mem.flush.min_events` to avoid sending partial batches to the output. Default: `3200 events` |
| `queue.mem.flush.min_events` | The minimum number of events required for publishing. If this value is set to 0 or 1, events are available to the output immediately. If this value is greater than 1, the output must wait for the queue to accumulate this minimum number of events or for `queue.mem.flush.timeout` to expire before publishing. Default: `1600 events` |
| `queue.mem.flush.timeout` | (int) The maximum wait time for `queue.mem.flush.min_events` to be fulfilled. When set to 0s, events are available to the output immediately. Default: `10s` |
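Conversely, if you want events handed to the output as soon as they arrive rather than batched, you can disable the flush wait entirely. This sketch uses illustrative values, not recommendations:

```yaml
# Illustrative low-latency sketch: with flush.timeout set to 0s,
# events are available to the output immediately.
queue.mem.events: 4096
queue.mem.flush.min_events: 0
queue.mem.flush.timeout: 0s
```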
Topics settings

Use these options to dynamically set the Kafka topic for each Elastic Agent event.

| Setting | Description |
|---|---|
| `topic` | The default Kafka topic used for produced events. You can set the topic dynamically by using a format string to access any event field. For example, this configuration uses a custom field, `fields.log_topic`, to set the topic for each event: `topic: '%{[fields.log_topic]}'` |
| `topics` | One or more topic processors including a condition, the event value to check against, and the resulting Kafka topic. Events that don't match against any defined processor are set to the default topic. Rule settings: `topic` sets the Kafka topic to use for matching events, and `when` sets the condition that events must satisfy to be routed to that topic. See the example that follows this table. |

As an example for setting up your processors, you might want to route log events based on severity. To do so, you can specify a default topic for all events not matched by other processors:

```yaml
outputs:
  kafka-output:
    type: kafka
    hosts:
      - 'kafka1:9092'
      - 'kafka2:9092'
      - 'kafka3:9092'
    topics:
      - topic: 'critical-%{[agent.version]}'
        when:
          contains:
            message: 'CRITICAL'
      - topic: 'error-%{[agent.version]}'
        when:
          contains:
            message: 'ERR'
      - topic: '%{[fields.log_topic]}'
```

All non-critical and non-error events will then route to the default topic.
Partition settings

The number of partitions created is set automatically by the Kafka broker based on the list of topics. Records are then published to partitions either randomly, in round-robin order, or according to a calculated hash.

In the following example, after each event is published to a partition, the partitioner selects the next partition in round-robin fashion.

```yaml
partition:
  round_robin:
    group_events: 1
```
| Setting | Description |
|---|---|
| `random.group_events` | Sets the number of events to be published to the same partition before the partitioner selects a new partition at random. The default value is 1, meaning a new partition is picked randomly after each event. |
| `round_robin.group_events` | Sets the number of events to be published to the same partition before the partitioner selects the next partition. The default value is 1, meaning the next partition is selected after each event. |
| `hash.hash` | List of fields used to compute the partitioning hash value from. If no field is configured, the event's `key` value is used. |
| `hash.random` | Randomly distribute events if no hash or key value can be computed. |
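For instance, to keep all events from the same host on the same partition, you could hash on the host name. This is a sketch, and `host.name` is just one field you might choose:

```yaml
# Sketch of hash-based partitioning: events with the same host.name
# value land on the same partition; if the hash cannot be computed,
# events are distributed randomly.
partition:
  hash:
    hash: ['host.name']
    random: true
```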
Header settings

A header is a key-value pair, and multiple headers can be included with the same key. Only string values are supported. These headers will be included in each produced Kafka message.

| Setting | Description |
|---|---|
| `key` | The key to set in the Kafka header. |
| `value` | The value to set in the Kafka header. |
| `client_id` | The configurable ClientID used for logging, debugging, and auditing purposes. The default is `Elastic`. |
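As an illustration, headers are configured as a list of key-value pairs under the output; the keys and values below are placeholders:

```yaml
# Sketch of static Kafka message headers; the keys and values
# shown are placeholder strings.
headers:
  - key: 'environment'
    value: 'production'
  - key: 'data-source'
    value: 'elastic-agent'
client_id: Elastic
```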
Other configuration settings

You can specify these other options in the kafka-output section of the agent configuration file.
| Setting | Description |
|---|---|
| `backoff.init` | (string) The number of seconds to wait before trying to reconnect to Kafka after a network error. After waiting `backoff.init` seconds, Elastic Agent tries to reconnect. If the attempt fails, the backoff timer is increased exponentially up to `backoff.max`. After a successful connection, the backoff timer is reset. Default: `1s` |
| `backoff.max` | (string) The maximum number of seconds to wait before attempting to connect to Kafka after a network error. Default: `60s` |
| `broker_timeout` | The maximum length of time a Kafka broker waits for the required number of ACKs before timing out (see the `required_acks` setting further in). Default: `10s` |
| `bulk_flush_frequency` | (int) Duration to wait before sending bulk Kafka request. `0` is no delay. Default: `0` |
| `bulk_max_size` | (int) The maximum number of events to bulk in a single Kafka request. Default: `2048` |
| `channel_buffer_size` | (int) Per Kafka broker number of messages buffered in output pipeline. Default: `256` |
| `codec` | Output codec configuration. You can specify either the `json` or `format` codec. By default the `json` codec is used.<br><br>Example configuration that uses the `json` codec:<br>`output.console:`<br>`  codec.json:`<br>`    pretty: true`<br>`    escape_html: false`<br><br>Example configuration that uses the `format` codec:<br>`output.console:`<br>`  codec.format:`<br>`    string: '%{[@timestamp]} %{[message]}'` |
| `compression` | Select a compression codec to use. Supported codecs are `snappy`, `lz4`, and `gzip`. |
| `compression_level` | For the `gzip` codec, you can choose a compression level. The level must be in the range of 1 (best speed) to 9 (best compression). Increasing the compression level reduces the network usage but increases the CPU usage. Default: `4` |
| `keep_alive` | (string) The keep-alive period for an active network connection. If `0s`, keep-alives are disabled. Default: `0s` |
| `max_message_bytes` | (int) The maximum permitted size of JSON-encoded messages. Bigger messages will be dropped. This value should be equal to or less than the broker's `message.max.bytes`. Default: `1000000` (bytes) |
| `metadata` | Kafka metadata update settings. The metadata contains information about brokers, topics, partitions, and active leaders to use for publishing. Sub-settings: `refresh_frequency` is the metadata refresh interval (defaults to 10 minutes); `full` controls whether metadata is fetched for all topics rather than only the configured ones (defaults to `false`); `retry.max` is the total number of metadata update retries (defaults to `3`); `retry.backoff` is the wait time between retries (defaults to `250ms`). |
| `required_acks` | The ACK reliability level required from broker. `0` = no response, `1` = wait for local commit, `-1` = wait for all replicas to commit. Note: If set to `0`, no ACKs are returned by Kafka. Messages might be lost silently on error. Default: `1` |
| `timeout` | The number of seconds to wait for responses from the Kafka brokers before timing out. Default: `30` (seconds) |