Internal Metrics
When sending Cribl Stream metrics to a metric system of analysis – such as InfluxDB, Splunk, or Elasticsearch – some metrics are particularly valuable. You can use these metrics to set up alerts when Worker Nodes are having problems, a Node is down, a Destination is down, a Source stops providing incoming data, and so on.
Cribl Stream reports its internal metrics within the Cribl Stream UI, in the same way that it reports internal logs. Navigate to Monitoring > Logs (Cribl Stream), or to a Fleet’s Health > Sources tab (Cribl Edge).
To expose metrics for capture or routing, enable CriblMetrics on the Cribl Internal Source.
You can also export metrics (and logs) to JSON files.
By default, Cribl Stream generates internal metrics every 2 seconds. To consume metrics at longer intervals, you can use or adapt the cribl‑metrics_rollup Pipeline that ships with Cribl Stream. Attach it to your Cribl Internal Source as a pre‑processing Pipeline. The Pipeline’s Rollup Metrics Function has a default Time Window of 30 seconds, which you can adjust to a different granularity as needed.
You can also use our public endpoints to automate monitoring using your own external tools.
Counter-type metrics in Cribl Stream do not monotonically increase or decrease. They are reset at the end of each reporting period. Cribl Stream does not report counters when their value is 0. For example, if there aren’t any Destinations reporting dropped events then the total.dropped_events metric won’t be reported because its value would be 0.
Breaking Changes
- The
ciandcodimensions are deprecated as of Cribl Stream 4.1.2. See Dimensions for their replacements.- Disabled Sources no longer generate Cribl internal metrics as of Cribl Stream 4.2.
Total Throughput Metrics
Five important metrics below are prefixed with total. These power the top of Cribl Stream’s Monitoring Dashboard. The first two report on Sources, the remainder on Destinations.
- total.in_bytes
- total.in_events
- total.out_events
- total.out_bytes
- total.dropped_events– helpful for discovering situations such as: you’ve disabled a Destination without noticing.
Interpreting Total Metrics
These total. metrics’ values could reflect Cribl Stream’s health, but could also report low activity simply due to the Source system. For example, logs from a store site will be low at low buying periods.
Also, despite the total. prefix, these metrics are each specific to the Worker/Edge Node Process that’s generating them.
You can distinguish unique metrics by their #input=<id> dimension. For example, total.in_events|#input=foo would be one unique metric, while total.in_events|#input=bar would be another.
System Health Metrics
Health metrics are valuable for monitoring system health.
Cribl Stream composite metrics:
- health.inputs
- health.outputs– see the JSON Examples below for both- health.metrics.
Hardware or VM infrastructure metrics:
- system.load_avg
- system.free_mem
- system.disk_used– valuable if you know your disk size, especially for monitoring Persistent Queues. Here, a- 0value typically indicates that the disk-usage data provider has not yet provided the metric with data. (Getting the first value should take about one minute.)- The Cribl Internal Source does not export these metrics to Routes or Pipelines, you can obtain them only by using the REST endpoints listed on this page. 
All of the above metrics have these possible values:
- 0= green = healthy.
- 1= yellow = warning.
- 2= red = trouble.
Health Inputs/Outputs JSON Examples
The health.inputs metrics are reported per Source, and the health.outputs metrics per Destination. The health.inputs example below has two configured Sources, and two Cribl Stream-internal inputs. The health.outputs example includes the built-in devnull Destination, and six user-configured Destinations.
Given all the 0 values here, everything is in good shape!
  "health.inputs": [
    { "model": { "input": "http:http" }}, "val": 0},
    { "model": { "input": "cribl:CriblLogs" }}, "val": 0},
    { "model": { "input": "cribl:CriblMetrics" }}, "val": 0},
    { "model": { "input": "datagen:DatagenWeblog" }}, "val": 0 }
    ],
  "health.outputs": [
    { "model": { "output": "devnull:devnull" }}, "val": 0},
    { "model": { "output": "router:MyOut1" }}, "val": 0},
    { "model": { "output": "tcpjson:MyTcpOut1" }}, "val": 0},
    { "model": { "output": "router:MyOut2" }}, "val": 0},
    { "model": { "output": "tcpjson:MyTcpOut2" }}, "val": 0},
    { "model": { "output": "router:MyOut3" }}, "val": 0},
    { "model": { "output": "router:MyOut4" }}, "val": 0 }
    ],Worker Node Resource Metrics
The Cribl Internal Source reports two useful metrics on the resource usage of individual Worker Node Processes:
- system.cpu_perc– CPU percentage usage.
- system.mem_rss– RAM usage in bytes.
Worker Node Shutdown Metric
This metric tracks events lost during Worker Process shutdown:
- shutdown.lost_events– Number of events lost because they were not flushed before the Drain timeout elapsed during Worker Process shutdown, as explained on the Shutdown and Restart Sequence docs.
Persistent Queue Metrics
These metrics are valuable for monitoring persistent queues’ behavior:
- pq.queue_size– Total bytes in the queue.
- pq.in_bytes– Total bytes in the queue for the given Time Window.
- pq.in_events– Number of events in the queue for the given Time Window.
- pq.out_bytes– Total bytes flushed from the queue for the given Time Window.
- pq.out_events– Number of events flushed from the queue for the given Time Window.
These are aggregate metrics, but you can distinguish unique metrics per queue. On the Destination side, use the #output=<id> dimension. For example, pq.out_events|#output=kafka would be one unique metric; pq.out_events|#output=camus would be another.
Integration Metrics
Integration metrics provide enhanced observability into the performance and health of Sources and Destinations within Cribl Stream. See how to control the granularity of integration metrics.
iometricsare available only on Cribl.Cloud instances.
Metrics are aggregated every minute for timely monitoring. You can monitor integration metrics in three ways:
- Logs Tab: In the Sources and Destinations configuration modal, use the dropdown menu to select a specific metrics channel and view the details.
- Charts Tab: Performance charts are displayed directly on each supported Source and Destination configuration modal, offering immediate visualization and trend analysis.
- Through the Cribl Internal Source (CriblMetrics): You can query the metrics and build automations using the Internal Cribl Source.
The following metrics provide a detailed view of Source and Destination performance, covering connection status, request activity, latency, and data throughput:
- logged.criticals– Number of criticals logged to the channel.
- logged.errors– Number of errors logged to the channel.
- iometrics.activeCxn– Number of active inbound connections per Source.
- iometrics.closeCxn– Number of closed connections per Source.
- iometrics.openCxn– Number of opened connections per Source.
- iometrics.rejectCxn– Number of rejected connections per Source.
- iometrics.endpoints_healthy_percentage– Percentage of healthy load-balanced endpoints.
- iometrics.p95_duration_millis– 95th percentile of request processing time.
- iometrics.p99_duration_millis– 99th percentile of request processing time.
- iometrics.total_requests– Number of sent/received requests.
- iometrics.failed_requests– Number of failed requests.
- source.in_bytes– Inbound bytes per Source.
- source.in_events– Inbound events per Source.
- source.out_bytes– Outbound bytes per Source.
- source.out_events– Outbound events per Source.
- sourcetype.in_bytes– Inbound bytes per Source type.
- sourcetype.in_events– Inbound events per Source type.
- sourcetype.out_bytes– Outbound bytes per Source type.
- sourcetype.out_events– Outbound events per Source type.
- total.activeCxn– Total number of active inbound connections from all HTTP and TCP-based Sources.
- total.closeCxn– Total number of closed connections from all HTTP and TCP-based Sources.
- total.openCxn– Total number of opened connections from all HTTP and TCP-based Sources.
- total.rejectCxn– Total number of rejected connections from all HTTP and TCP-based Sources.
Other Internal Metrics
The Cribl Internal Source emits other metrics that, when displayed in downstream dashboards, can be useful for understanding Cribl Stream’s behavior and health. These include:
- pipe.in_events– Inbound events per Pipeline.
- pipe.out_events– Outbound events per Pipeline.
- pipe.dropped_events– Dropped events per Pipeline.
- metrics_pool.num_metrics– The total number of unique metrics that have been allocated into memory.
- collector_cache.size– Each Collector function (- default/cribl/collectors/<collector>/index.js) is loaded/initialized only once per job, and then cached. This metric represents the current size of this cache.
- cluster.metrics.sender.inflight– Number of metric packets currently being sent from a Worker/Edge Node Process to the API Process, via IPC (interprocess communication).
- backpressure.outputs– Destinations experiencing backpressure, causing events to be either blocked or dropped.
- blocked.outputs– Blocked Destinations. (This metric is more restrictive than the one listed just above.)
- pq.queue_size– Current queue size, in bytes, per Worker Process.
- host.in_bytes– Inbound bytes from a given host (host is a characteristic of the data).
- host.in_events– Inbound events from a given host (host is a characteristic of the data).
- host.out_bytes– Outbound bytes from a given host (host is a characteristic of the data).
- host.out_events– Outbound events from a given host (host is a characteristic of the data).
- index.in_bytes– Inbound bytes per index.
- index.in_events– Inbound events per index.
- index.out_bytes– Outbound bytes per index.
- index.out_events– Outbound events per index.
- route.in_bytes– Inbound bytes per Route.
- route.in_events– Inbound events per Route.
- route.out_bytes– Outbound bytes per Route.
- route.out_events– Outbound events per Route.
- nodecluster.primary.messages– Number of IPC (interprocess communication) calls made by a Worker/Edge Node Process to the API Process.
- cribl.logstream.project.in_events- Inbound events per Project.
- cribl.logstream.project.out_events- Outbound events per Project.
- cribl.logstream.project.dropped_events- Dropped events per Project.
- cribl.logstream.project.in_bytes- Inbound bytes per Project.
- cribl.logstream.project.out_byte- Outbound bytes per Project.
- cribl.logstream.subscription.in_bytes- Inbound bytes per Subscription.
- cribl.logstream.subscription.out_bytes- Outbound bytes per Subscription.
- cribl.logstream.subscription.in_events- Inbound events per Subscription.
- cribl.logstream.subscription.out_events- Outbound events per Subscription.
Dimensions
Cribl Stream internal metrics have extra dimensions. These include:
- cribl_wp– Identifies the Worker/Edge Node Process that processed the event.
- event_host– Machine’s hostname from which the event was made.
- event_source– Set as- cribl.
- group– Name of Cribl Worker Group from which the event was made.
- input– Entire Input ID, including the prefix, from which the event was made.
- output– Entire Output ID, including the prefix, from which the event was made.
- ci– Deprecated; use- inputinstead. Cribl plans to drop support for this dimension after version 4.x.
- co– Deprecated; use- outputinstead. Cribl plans to drop support for this dimension after version 4.x.
Control Metrics
You can control the volume of internal metrics that Cribl Stream emits, by adjusting metrics limits. You access these as follows, per deployment type:
- Single-instance: Settings > System > General Settings > Limits > Metrics.
- Distributed Leader: Settings > Global Settings > System > General Settings > Limits > Metrics.
- Distributed Worker Groups: [Select a Worker Group] > Worker Group/Fleet Settings > System > General Settings > Limits > Metrics.
Here, you can adjust the maximum number of metric series allowed, metrics’ cardinality, metrics to always include (Metrics never‑drop list), metrics to always exclude in order to optimize performance (Disable field metrics), and other options.
Here’s how these controls look in Cribl Stream:

They’re somewhat different in Cribl Edge – see the next section for an explanation:

Control Integration Metrics in Cribl Stream
On Cribl.Cloud instances, you can control the granularity of integration metrics emitted by the Internal Metrics Source to optimize observability while managing system performance and resource usage.
Access the Metrics Levels setting by navigating to the Worker Group > Worker Group/Fleet Settings > System > Metrics Levels.
Only
AdminandEditorroles can view or modify these settings.
On the Metrics Limits page, you’ll find a table listing all configured integrations. For each Source or Destination, you can select the desired metrics level from a dropdown.
| Level | Purpose | List of Metrics Sent | 
|---|---|---|
| Minimal | Reports total throughput metrics, offering a high-level view of overall data flow without integration-specific details. Ideal for low-cardinality environments. | logged.errors,logged.criticalstotal.openCxn,total.closeCxn,total.activeCxn,total.in_events,total.out_events,total.in_bytes,total.out_bytes,total.events, andtotal.bytesiometrics.endpoints_healthy_percentage,iometrics.openCxn,iometrics.closeCxn,iometrics.activeCxn, andiometrics.rejectCxn. | 
| Basic (default) | Includes health metrics, emphasizing error tracking, endpoint health, and success rates. Designed for monitoring the operational state of integrations. | health.inputsiometrics.total_requests, andiometrics.failed_requests. | 
| Detailed | Includes latency metrics, enabling precise performance analysis by capturing percentile-based processing times. Warning: This level increases system load. | iometrics.p95_duration_millisandiometrics.p99_duration_millis. | 
Control Metrics in Cribl Edge
Cribl Edge lets you choose what metrics each Fleet sends to its Leader Node. Your goal should be to collect the metrics most important to you, within the limits of the Leader’s capacity to process them. This is a matter of balancing Fleet cardinality and Leader performance, as explained in the Deployment Planning topic.
Use the Metrics to send from Edge Nodes buttons to select one of the metrics sets listed in the table below.
| Set | Purpose | List of Metrics Sent | 
|---|---|---|
| Minimal | Sends metrics for events and bytes in and out. Recommended for high-cardinality deployments. | Sent as aggregate totals per Fleet: total.in_events,total.out_events,total.in_bytes,total.out_bytesexcluding metrics that are coming directly from Sources or Destinations. | 
| Basic (default) | Contains all the metrics in the minimal set, and more. Sends metrics required to display all monitoring data available in the UI. | Sent as aggregate totals per Source/Destination: total.in_events,total.out_events,total.in_bytes,total.out_bytes.Also, Sent as aggregate totals per Source: health.inputsSent as aggregate totals per Destination: health.outputsSent as aggregate totals per Route: route.in_events,route.in_bytes,route.out_events,route.out_bytes,route.dropped_eventsSent as aggregate totals per Pipeline: pipe.in_events,pipe.out_events,pipe.dropped_eventsSent as aggregate totals per Pack: pack.in_events,pack.out_events,pack.err_events,pack.dropped_events | 
| All | Send all metrics with no filtering. | All metrics without filtering, including Basic, plus Worker/Edge Node Resource Metrics, Persistent Queue Metrics, and Other Internal Metrics. | 
| Custom | Send a customized set of metrics. | All metrics that match a JavaScript expression that you define, plus the metrics in the Minimal set. | 
Basic is the default selection, and includes the Minimal set of metrics. Every Edge Fleet always sends at least Minimal metrics, and selecting one of the other options adds more metrics.
Choosing the Minimal metrics set has the following effects in the Edge UI:
- Only one UI page, Edge Home > Monitor, will be fully populated. For the metrics that the Minimal set excludes, other pages will show zero values. This applies to the [Select a Fleet] > Health pages for Sources, Destinations, Routes, Pipelines, and Packs.
- In the [Select a Fleet] > More pages for Sources and Destinations, the health indicators in the tile and list views do not function when you select the Minimal set. Rather than red, green, or gray, they’ll always show white, because the relevant metrics, health.inputsandhealth.outputs, will not be sent to the Leader.
The simplest form of a Custom set just specifies additional desired metrics by name, using an expression of the form: name === "metric.name". Among the most useful metrics to consider adding are system.load_avg, system.free_mem, system.disk_used, system.cpu_perc, and system.mem_rss.
Custom Metrics and Their Impact on Monitoring
Enabling custom metrics switches monitoring from Basic to Minimal. This hides monitoring data for Fleets and Edge Nodes, giving the impression that data flow has stopped. To ensure system monitoring features work as expected with custom metrics, include the Basic metrics set within the custom configuration.
Other Metrics Endpoints
Below is basic information on using the /system/metrics endpoint, the /system/info endpoint, and the cribl_wp dimension.
/system/metrics Endpoint
/system/metrics is Cribl Stream’s primary public metrics endpoint, which returns most internal metrics. Many of these retrieved metrics report configuration only, not runtime behavior. For details, see our API Docs.
/system/info Endpoint
/system/info generates the JSON displayed in the Cribl Stream UI at Settings > Global Settings > Diagnostics > System Info. Its two most useful properties are loadavg and memory.
loadavg Example
"loadavg": [1.39599609375, 1.22265625, 1.31494140625],
This property is an array containing the 1-, 5-, and 15-minute load averages at the UNIX OS level. (On Windows, the return value is always [0, 0, 0].) For details, see the Node.js os.loadavg() documentation.
memory Example
"memory": { "free": 936206336, "total": 16672968704 },
Divide total / free to monitor memory pressure. If the result exceeds 90%, this indicates a risky situation: you’re running out of memory.
cpus Alternative
The cpus metric returns an array of CPU/memory key-value pairs. This provides an alternative way to understand CPU utilization, but it requires you to query all your CPUs individually.