Internal Logs
Cribl Edge generates internal application logs that monitor its own operations and health. They provide valuable insights into the system’s behavior, performance, and potential issues.
Distributed deployments emit a larger set of logs than single-instance deployments. We’ll describe the distributed set first.
You can display and export all internal logs by selecting Monitoring in the sidebar and then Logs (Stream), or by selecting Logs in the sidebar (Edge). Log persistence depends on event volume, not time; for details, see Log Rotation and Retention.
Several logs listed on this page are exposed only in customer-managed (on-prem) deployments. In Cribl.Cloud, Leaders expose Cribl Edge Worker Node logs only for hybrid Workers.
However, Organization Members who have the Admin Permission on Cribl Search can use that product to search the `cribl_internal_logs` Dataset for additional details about the Leader and its Cribl-managed Workers.
Leader Node Logs (Distributed)
The API/main process emits the following logs into the Leader Node’s $CRIBL_HOME/log/ directory.
| Logfile Name | Description | Equivalent on Logs page |
|---|---|---|
| `cribl.log` | Principal log in Cribl Edge. Includes telemetry/license-validation logs. Corresponds to top-level cribl.log on the Diag page. | Leader > API Process |
| `access.log` | API calls, for example, GET /api/v1/version/info. | Leader > Access |
| `audit.log` | Actions pertaining to files, for example, create, update, commit, deploy, delete. | Leader > Audit |
| `notifications.log` | Messages that appear in the Notification list in the UI. For each event, the time and _time fields record the start of the monitored interval (matching starttime), not when the Notification was written. | Leader > Notifications |
| `ui-access.log` | Interactions with different UI components described as URLs, for example, /settings/apidocs, /dashboard/logs. | Leader > UI Access |
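These internal logs are written as newline-delimited JSON, with fields such as time, channel, level, and message (as in the sample event later on this page). As an unofficial sketch, the following Python tallies events per log level in the Leader's cribl.log; the /opt/cribl fallback path is an assumption for when $CRIBL_HOME is unset:

```python
import json
import os
from collections import Counter

# Path assumption: $CRIBL_HOME defaults to /opt/cribl if unset.
log_path = os.path.join(os.environ.get("CRIBL_HOME", "/opt/cribl"),
                        "log", "cribl.log")

def level_counts(path):
    """Tally newline-delimited JSON log events by their "level" field."""
    counts = Counter()
    with open(path) as f:
        for line in f:
            try:
                event = json.loads(line)
            except json.JSONDecodeError:
                continue  # skip truncated or non-JSON lines
            counts[event.get("level", "unknown")] += 1
    return counts

if os.path.exists(log_path):
    for level, n in level_counts(log_path).most_common():
        print(f"{level}: {n}")
```

A spike in warn or error counts here is often the quickest signal that a Leader needs attention.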
The API/main process emits the following service logs into the Leader Node’s $CRIBL_HOME/log/service/ directory. Each service includes a cribl.log file that logs the service’s internal telemetry and an access.log file that logs which API calls the service has handled.
| Service Name | Description | Equivalent on Logs page |
|---|---|---|
| Connections Service | Handles all Worker connections and communication, including heartbeats, bundle deploys, teleporting, restarting, and so on. Workers are assigned to connection processes using a round-robin algorithm. | Leader > Connections Service |
| Lease Renewal Service | Handles lease renewal for the primary Leader Node. | Leader > Lease Renewal Service |
| Metrics Service | Handles in-memory metrics, merging of incoming packets, metrics persistence and rehydration, and UI queries for metrics. | Leader > Metrics Service |
| Notifications Service | Triggers Notifications based on its configuration. | Leader > Notifications Service |
The Config Helper process for each Worker Group/Fleet emits the following log in $CRIBL_HOME/log/group/GROUPNAME.
| Logfile Name | Description | Equivalent on Logs page |
|---|---|---|
| `cribl.log` | Messages about config maintenance, previews, and so on. | GROUPNAME > Config helper |
Outpost Logs
The Outpost service emits the following logs into the $CRIBL_HOME/log/service/outpost/N/ directory,
where N is the Outpost Node Process ID:
| Logfile Name | Description | Equivalent on the Logs tab in Outpost info drawer |
|---|---|---|
| `cribl.log` | Internal telemetry logs. | Outpost Service N |
| `metrics.log` | Outpost Node metrics. | Outpost Service N - Metrics |
Edge Node Logs (Distributed)
The API Process emits the following log in $CRIBL_HOME/log/.
| Logfile Name | Description | Equivalent on Logs page |
|---|---|---|
| `cribl.log` | Messages about the Worker/Edge Node communicating with the Leader Node (that is, with its API Process) and other API requests, for example, sending metrics, reaping job artifacts. | GROUPNAME > Worker:HOSTNAME > API Process |
Each Worker Process emits the following logs in $CRIBL_HOME/log/worker/N/, where N is the Worker/Edge Node Process ID. (The metrics.log file is written only when HTTP-based Destinations exist.)
| Logfile Name | Description | Equivalent on Logs page |
|---|---|---|
| `cribl.log` | Messages about the Worker/Edge Node processing data. | GROUPNAME > Worker:HOSTNAME > Worker Process N |
| `metrics.log` | Messages about the Worker/Edge Node’s outbound HTTP request statistics. | GROUPNAME > Worker:HOSTNAME > Worker Process N |
If TCP load balancing is enabled in at least one supported Source (such as Syslog, TCP JSON, or Cribl TCP),
the Worker Process also emits the following log to $CRIBL_HOME/log/worker/LB:
| Logfile Name | Description | Equivalent on Logs page |
|---|---|---|
| `cribl.log` | Internal telemetry of the load balancing process. | GROUPNAME > Worker:HOSTNAME > Load Balancer |
Load balancing logs use different channel names than Worker Process logs to indicate which Source they come from. To search by channel name, use `channel == "<source-id>_lb"` instead of `channel == "input:<source-id>"`. For example, for the built-in TCP JSON Source, enter `channel == "in_tcp_json_lb"` in the search box instead of `channel == "input:in_tcp_json"`.
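The same channel-name convention applies if you inspect the raw log files directly, since they are newline-delimited JSON. A minimal sketch (the file path in the commented usage is an assumption based on the layout described above):

```python
import json

def filter_channel(lines, channel):
    """Yield parsed log events whose "channel" field matches exactly."""
    for line in lines:
        try:
            event = json.loads(line)
        except json.JSONDecodeError:
            continue  # skip non-JSON or truncated lines
        if event.get("channel") == channel:
            yield event

# Hypothetical usage against the load balancing log described above:
# with open("/opt/cribl/log/worker/LB/cribl.log") as f:
#     for event in filter_channel(f, "in_tcp_json_lb"):
#         print(event["time"], event["message"])
```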
For convenience, the UI aggregates the Worker/Edge Node Process logs as follows.
| Logfile Name | Description | Equivalent on Logs page |
|---|---|---|
| N/A | Aggregation of all the Worker Process N logs and the API Process log. | GROUPNAME > WORKER_NAME |
In Cribl Edge, the logs listed above are currently available only on customer-managed hybrid Workers. The single-instance logs listed below are not relevant to Cribl.Cloud.
Single-Instance Logs
The API/main process emits the same logs as it does for a Distributed deployment, in $CRIBL_HOME/log/:
- `cribl.log`
- `access.log`
- `audit.log`
- `notifications.log`
- `ui-access.log`
Each Worker/Edge Node Process emits the following logs in $CRIBL_HOME/log/worker/N/, where N is the Worker/Edge Node Process ID. (The metrics.log file is written only when HTTP-based Destinations exist.)
| Logfile Name | Description | Equivalent on Logs page |
|---|---|---|
| `cribl.log` | Messages about the Worker/Edge Node processing data. | GROUPNAME > Worker:HOSTNAME > Worker Process N |
| `metrics.log` | Messages about the Worker/Edge Node’s outbound HTTP request statistics. | GROUPNAME > Worker:HOSTNAME > Worker Process N |
_raw stats Event Fields
Each Worker/Edge Node Process logs this information once per minute. So each event’s scope covers only that Worker/Edge Node Process, over a 1-minute time span defined by the starttime and endtime fields.
Sample Event
```json
{"time":"2022-11-17T16:54:05.349Z","cid":"w0","channel":"server","level":"info","message":"_raw stats","inEvents":307965,"outEvents":495848,"inBytes":52756162,"outBytes":83028013,"starttime":1668703980,"endtime":1668704040,"activeCxn":0,"openCxn":0,"closeCxn":0,"rejectCxn":0,"abortCxn":0,"pqInEvents":62000,"pqOutEvents":114591,"pqInBytes":12163896,"pqOutBytes":22481509,"pqTotalBytes":480467058,"droppedEvents":0,"tasksStarted":6,"tasksCompleted":6,"activeEP":9,"blockedEP":0,"cpuPerc":101.09,"eluPerc":97.81,"mem":{"heap":277,"heapTotal":287,"ext":46,"rss":453,"buffers":0}}
```

Field Descriptions
| Field | Description |
|---|---|
| `abortCxn` | Number of TCP connections that were aborted. |
| `activeCxn` | Number of TCP connections newly opened at the time the _raw stats are logged. (This is a gauge when exported in internal metrics, and can otherwise be ignored as an instantaneous measurement. Only some application protocols count toward this; for example, HTTP-based Sources do not count.) |
| `activeEP` | Number of currently active event processors (EPs). EPs process events through Event Breakers and Pipelines as the events are received from Sources and sent to Destinations. EPs are typically created per TCP connection (such as for HTTP). |
| `blockedEP` | Number of currently blocked event processors (caused by blocking Destinations). |
| `closeCxn` | Number of TCP connections that were closed. |
| `cpuPerc` | CPU utilization from the combined user and system activity over the last 60 seconds. |
| `droppedEvents` | Equivalent to the total.dropped_events metric. Drops can occur from Pipeline Functions, from Destination backpressure, or from other errors. Any event not sent to a Destination is considered dropped. |
| `eluPerc` | Event loop utilization. The percentage of time over the last 60 seconds that the Node.js runtime spent processing events within its event loop. |
| `endtime` | The end of the timespan represented by these metrics. (Always 60 seconds after starttime.) |
| `inBytes` | Number of bytes received from all Sources (based only on _raw). |
| `inEvents` | Number of events received from all inputs after Event Breakers are applied. This can be larger than outEvents if events are dropped via Drop, Aggregation, Suppression, Sampling, or similar Functions. |
| `mem.buffers` | Memory allocated for ArrayBuffers and SharedArrayBuffers. |
| `mem.ext` | External section of process memory, in MB. |
| `mem.heap` | Used heap section of process memory, in MB. |
| `mem.heapTotal` | Total heap section of process memory, in MB. |
| `mem.rss` | Resident set size section of process memory, in MB. |
| `openCxn` | Same as activeCxn, but tracked as a counter rather than a gauge. openCxn shows all connections newly opened each minute, and is more accurate than activeCxn. |
| `outBytes` | Number of bytes sent to all Destinations (based only on _raw). |
| `outEvents` | Number of events sent to all Destinations. This can be larger than inEvents when Functions create event clones or entirely new events (such as the Aggregation Function). |
| `pqInBytes` | Number of bytes written to persistent queues, across all Destinations. |
| `pqInEvents` | Number of events written to persistent queues, across all Destinations. |
| `pqOutBytes` | Number of bytes flushed from persistent queues, across all Destinations. |
| `pqOutEvents` | Number of events flushed from persistent queues, across all Destinations. |
| `pqTotalBytes` | Amount of data currently stored in persistent queues, across all Destinations. |
| `rejectCxn` | Number of TCP connections that were rejected. |
| `starttime` | The beginning of the timespan represented by these metrics. |
| `tasksCompleted` | Number of tasks the process has started and completed across all collection jobs for which it was executing tasks. |
| `tasksStarted` | Number of tasks the process started across all collection jobs for which it was executing tasks. |
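Because each _raw stats event covers a fixed 60-second window bounded by its start and end timestamps, per-second throughput falls out directly. A small illustration using the numbers from the sample event above:

```python
# Counter values copied from the sample _raw stats event above.
sample = {
    "inEvents": 307965, "outEvents": 495848,
    "inBytes": 52756162, "outBytes": 83028013,
    "starttime": 1668703980, "endtime": 1668704040,
}

def per_second_rates(event):
    """Convert one stats window's counters into per-second throughput."""
    window = event["endtime"] - event["starttime"]  # always 60 seconds
    return {key: event[key] / window
            for key in ("inEvents", "outEvents", "inBytes", "outBytes")}

rates = per_second_rates(sample)
print(rates["inEvents"])  # 5132.75 events/sec received by this process
```

Remember that each event scopes to a single Worker/Edge Node Process, so fleet-wide throughput means summing these rates across processes.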
Log Rotation and Retention
Cribl Edge writes multiple internal log files for Leaders, Edge Nodes, and supporting services. To prevent any single log file from consuming excessive disk space, Cribl Edge automatically rotates these files based on size.
By default, Cribl Edge:
- Rotates each log file when it reaches 5 MB.
- Keeps the five most recent rotated files per log.
This rotation policy applies to all internal log files, including:
- Leader/Node logs in $CRIBL_HOME/log/, such as: `cribl.log`, `access.log`, `audit.log`, `notifications.log`, and `ui-access.log`.
- Service logs in $CRIBL_HOME/log/service/....
- Fleet logs in $CRIBL_HOME/log/group/<GROUPNAME>/cribl.log.
- Edge Node logs in $CRIBL_HOME/log/worker/<N>/..., such as `metrics.log`.
- Stderr-based logs such as `cribl_stderr.log`.
Verbose logging levels (for example, debug or silly) and high system activity can cause logs to reach the 5 MB threshold more quickly.
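With these defaults, each log can occupy at most one active file plus five rotated copies, each up to 5 MB. A quick back-of-envelope bound on disk usage:

```python
ROTATE_MB = 5      # default rotation threshold per file
KEPT_ROTATED = 5   # default number of rotated copies retained per log

def worst_case_mb(num_logs, rotate_mb=ROTATE_MB, kept=KEPT_ROTATED):
    """Upper bound, in MB, on disk consumed by num_logs internal log files."""
    return num_logs * rotate_mb * (1 + kept)

print(worst_case_mb(1))   # 30 MB per individual log file
print(worst_case_mb(10))  # 300 MB for, say, ten Worker Process logs
```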
For guidance on forwarding internal logs and metrics to external observability platforms, see Forward Logs and Metrics Externally.
cribl_stderr and Per-Process stderr Logs
In rare cases, Cribl Edge’s Node.js backend can encounter a fatal error (such as an out-of-memory condition) that prevents normal logging. To help with troubleshooting, Cribl Edge writes details about these failures to special stderr log files, including:
- `cribl_stderr.log` at the node level.
- Per-process stderr log files (for example, `cribl_stderr_<PID>.log`) written for specific processes such as API or service processes.
Key points:
- These stderr logs are intended for Cribl Support and are not exposed in the UI.
- They follow the same size-based rotation policy as other internal logs (rotate at 5 MB, keep five rotated copies per file).
- Some stderr logs include prepended UTC timestamps to make it easier to correlate failures with other system activity.