Set agent statuses

Agent statuses reflect the health of a Telegraf instance based on runtime data. The Telegraf heartbeat output plugin evaluates Common Expression Language (CEL) expressions against agent metrics, error counts, and plugin statistics to determine the status sent with each heartbeat.

Requires Telegraf v1.38.2+

Agent status evaluation in the Heartbeat output plugins requires Telegraf v1.38.2+.

Status values

Telegraf Controller displays the following agent statuses:

Status	Source	Description
Ok	Heartbeat plugin	The agent is healthy. Set when the `ok` CEL expression evaluates to `true`.
Warn	Heartbeat plugin	The agent has a potential issue. Set when the `warn` CEL expression evaluates to `true`.
Fail	Heartbeat plugin	The agent has a critical problem. Set when the `fail` CEL expression evaluates to `true`.
Undefined	Heartbeat plugin	No expression matched and the `default` is set to `undefined`, or the `initial` status is `undefined`.
Not Reporting	Telegraf Controller	The agent has not sent a heartbeat within the reporting rule threshold. Telegraf Controller applies this status automatically.

How status evaluation works

You define CEL expressions for ok, warn, and fail in the [outputs.heartbeat.status] section of your heartbeat plugin configuration. Telegraf evaluates expressions in a configurable order and assigns the status of the first expression that evaluates to true.

For full details on evaluation flow, configuration options, and available variables and functions, see the Agent status evaluation reference.

Configure agent statuses

To configure status evaluation, add "status" to the include list in your heartbeat plugin configuration and define CEL expressions in the [outputs.heartbeat.status] section.

Example: Basic health check

Report ok when metrics are flowing. If no metrics arrive, fall back to the fail status.

[[outputs.heartbeat]]
  url = "http://telegraf_controller.example.com/agents/heartbeat"
  instance_id = "&{agent_id}"
  token = "${INFLUX_TOKEN}"
  interval = "1m"
  include = ["hostname", "statistics", "configs", "logs", "status"]

  [outputs.heartbeat.status]
    ok = "metrics > 0"
    default = "fail"

Example: Error-based status

Warn when errors are logged, fail when the error count is high.

[[outputs.heartbeat]]
  url = "http://telegraf_controller.example.com/agents/heartbeat"
  instance_id = "&{agent_id}"
  token = "${INFLUX_TOKEN}"
  interval = "1m"
  include = ["hostname", "statistics", "configs", "logs", "status"]

  [outputs.heartbeat.status]
    ok = "log_errors == 0 && log_warnings == 0"
    warn = "log_errors > 0"
    fail = "log_errors > 10"
    order = ["fail", "warn", "ok"]
    default = "ok"

Example: Composite condition

Combine error count and buffer pressure signals.

[[outputs.heartbeat]]
  url = "http://telegraf_controller.example.com/agents/heartbeat"
  instance_id = "&{agent_id}"
  token = "${INFLUX_TOKEN}"
  interval = "1m"
  include = ["hostname", "statistics", "configs", "logs", "status"]

  [outputs.heartbeat.status]
    ok = "metrics > 0 && log_errors == 0"
    warn = "log_errors > 0 || (has(outputs.influxdb_v2) && outputs.influxdb_v2.exists(o, o.buffer_fullness > 0.8))"
    fail = "log_errors > 5 && has(outputs.influxdb_v2) && outputs.influxdb_v2.exists(o, o.buffer_fullness > 0.9)"
    order = ["fail", "warn", "ok"]
    default = "ok"

For more examples including buffer health, plugin-specific checks, and time-based expressions, see CEL expression examples.

View an agent’s status

In Telegraf Controller, go to Agents.
Check the Status column for each agent.
To see more details, click the More button (⋮) and select View Details.
The details page shows the reported status, reporting rule assignment, and the time of the last heartbeat.

Was this page helpful?

Thank you for your feedback!

Support and feedback

Thank you for being part of our community! We welcome and encourage your feedback and bug reports for and this documentation. To find support, use the following resources:

Customers with an annual or support contract can contact InfluxData Support.

Edit this page Submit docs issue Submit issue

Set agent statuses

Requires Telegraf v1.38.2+

Status values

How status evaluation works

Configure agent statuses

Example: Basic health check

Example: Error-based status

Example: Composite condition

View an agent’s status

Support and feedback

New in InfluxDB 3.8

InfluxDB Docker latest tag changing to InfluxDB 3 Core

Set agent statuses

Requires Telegraf v1.38.2+

Status values

How status evaluation works

Configure agent statuses

Example: Basic health check

Example: Error-based status

Example: Composite condition

View an agent’s status

Related

Support and feedback

Where are you running InfluxDB?

AWS

GCP

Azure

Default

Custom

Thank you for your feedback!

New in InfluxDB 3.8

InfluxDB Docker latest tag changing to InfluxDB 3 Core