
Configure Fleet Settings

Required Permission: Admin

In Cribl Edge, you can use Fleet Settings to configure Node teleporting, throughput throttling, logging, upgrading, and security at the Fleet level.

To edit Fleet Settings:

  1. Select the Fleet, then select Fleet Settings.
  2. Configure the following settings per Fleet.

General Settings

Fleet Configuration

You can configure the following general Fleet options on this tab:

Fleet Configuration Setting: Description
Enable teleporting to Nodes: Toggle on to enable authenticated access to the Edge Node UI from the Leader. Toggle off to disable. For more information about teleporting, see Teleport into an Edge Node.
Description: Optionally, add or edit the Fleet description.
Tags: Use tags to organize Fleets into logical categories. Then, you can search by tags on the View all Fleets interface. This search filters the list of Fleets, showing only those with the tag you entered.
Time to keep disconnected Nodes: Configure how long the Fleet should retain information about a Node that disconnects from the Leader. Format examples: 8h, 5d, 1w (format defaults to days if you only specify a number). Default: 1 day.
Inherited configuration: Optionally, select an existing Fleet that this Fleet will inherit configurations from.
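
The format rule for Time to keep disconnected Nodes can be illustrated with a small parser. This is a minimal sketch, not Cribl's implementation: the function name parse_retention is hypothetical, and only the unit suffixes shown in the examples above (h, d, w) are handled. Whether the product accepts other suffixes is not covered here.

```python
import re
from datetime import timedelta

# Unit suffixes taken from the documented examples (8h, 5d, 1w).
_UNITS = {"h": "hours", "d": "days", "w": "weeks"}


def parse_retention(value: str) -> timedelta:
    """Parse '8h', '5d', '1w', or a bare number (interpreted as days)."""
    match = re.fullmatch(r"\s*(\d+)\s*([hdw]?)\s*", value)
    if match is None:
        raise ValueError(f"unsupported retention value: {value!r}")
    amount = int(match.group(1))
    unit = match.group(2) or "d"  # a bare number defaults to days
    return timedelta(**{_UNITS[unit]: amount})


if __name__ == "__main__":
    for entry in ("8h", "5d", "1w", "3"):
        print(entry, "->", parse_retention(entry))
```

A bare entry such as 3 is read as three days, which mirrors the "defaults to days" rule in the table.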

API Server Settings

General

You can set the following options for the API server:

API Server Setting: Description
Host: The hostname or IP address you want to bind the API server to.
  • Defaults to 127.0.0.1 for standalone Edge Nodes and Edge Fleets.
  • You can override the value with the CRIBL_API_HOST environment variable.
Port: The API port to listen on.
  • Defaults to 9420 for standalone Edge Nodes and Edge Fleets.
  • You can override the value with the CRIBL_API_PORT environment variable.
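
The default-plus-override behavior for Host and Port can be sketched as follows. The helper resolve_api_bind is hypothetical and only illustrates the precedence described above (environment variable first, then the documented default). It does not model how the UI setting itself is stored.

```python
import os

# Defaults documented for standalone Edge Nodes and Edge Fleets.
DEFAULT_HOST = "127.0.0.1"
DEFAULT_PORT = 9420


def resolve_api_bind(env=None):
    """Return (host, port), letting CRIBL_API_HOST / CRIBL_API_PORT override the defaults."""
    env = os.environ if env is None else env
    host = env.get("CRIBL_API_HOST") or DEFAULT_HOST
    port = int(env.get("CRIBL_API_PORT") or DEFAULT_PORT)
    return host, port


if __name__ == "__main__":
    print(resolve_api_bind({}))
    print(resolve_api_bind({"CRIBL_API_HOST": "0.0.0.0", "CRIBL_API_PORT": "9000"}))
```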

TLS

For information on TLS options, see the documentation for any Source or Destination that supports TLS.

Advanced

You can set the following advanced options for the API server:

API Advanced Server Setting: Description
Retry count: The number of times to retry binding to the API port. Default is 120.
Retry period: The period between consecutive retries for API port binding, in seconds. Default is 5.
URL base path: The URL base path from which to serve all assets. Setting a URL base path may be useful when operating behind a proxy server.
Listen on port: Toggle on (default) to expose the API service to the network on the configured API port. When toggled off, Cribl Edge doesn’t listen on the port (9420 by default).
Local UI access: Toggle on (default) to allow direct browser access to the UI for Edge Nodes.
Logout on roles change: If role-based access control is enabled, toggle on to automatically log out users when their assigned Roles change. Default is toggled on.
Auth-token TTL: Authentication tokens’ valid lifetime, in seconds. Default is 3600 (60 minutes = 1 hour); minimum is 1.
Session idle time limit: How long to observe no user interaction before invalidating users’ session tokens, in seconds. Default is 3600 (60 minutes = 1 hour); minimum is 60.
Login rate limit: The number of login attempts allowed over the specified unit of time. For example, to limit login attempts to 50 per minute, specify the login rate limit 50/minute. Valid units of time are second, minute, hour, and day. Default is 2/second.
SSO/SLO callback rate limit: The number of requests to SSO and SLO callback endpoints allowed over the specified unit of time. For example, to limit requests to 10 per minute, specify 10/minute. Valid units of time are second, minute, hour, and day.
HTTP headers: One or more custom HTTP headers to send with every response.
Enable API cache: Toggle on to enable browser caching of frequent API requests. Default is toggled on. Toggling off can slow the UI response time.
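
The count/unit syntax used by the Login rate limit and SSO/SLO callback rate limit settings can be checked with a short parser. This is an illustrative sketch only: RateLimit and parse_rate_limit are hypothetical names, and the validation rules are inferred from the examples above (50/minute, 10/minute, 2/second) rather than taken from the product.

```python
from dataclasses import dataclass

# Valid units of time as listed in the table above.
_SECONDS_PER_UNIT = {"second": 1, "minute": 60, "hour": 3600, "day": 86400}


@dataclass(frozen=True)
class RateLimit:
    count: int
    unit: str

    @property
    def per_second(self) -> float:
        """Average allowed attempts per second."""
        return self.count / _SECONDS_PER_UNIT[self.unit]


def parse_rate_limit(text: str) -> RateLimit:
    """Parse a '<count>/<unit>' entry such as '50/minute'."""
    count_text, sep, unit = text.strip().partition("/")
    if not sep or not count_text.isdigit() or unit not in _SECONDS_PER_UNIT:
        raise ValueError(f"expected <count>/<second|minute|hour|day>, got {text!r}")
    return RateLimit(int(count_text), unit)


if __name__ == "__main__":
    print(parse_rate_limit("50/minute"))
    print(parse_rate_limit("2/second").per_second)
```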

Default TLS Settings

See System-wide TLS Settings Including Ciphers.

Limits

The Limits tab provides access to controls for metrics, storage, metadata, jobs, the Redis cache and connections to it, and CPU settings.

Metrics

Metrics Field: Description
Metrics worker tracking: Define which metrics are not assigned a Node ID dimension, which means they will not be split by Edge Node.
Number of metrics allowed: The total number of metrics series allowed.
Metrics cardinality limit: The maximum allowed number of unique combinations of metrics names and their dimensions.
Metrics GC period: The metrics garbage collector (GC) runs when the total number of stored metrics exceeds the Number of metrics allowed.
Metrics never-drop list: A list of internal metrics that the Cribl Edge Node will always send to the Leader.
Disable field metrics: A list of metrics that you don’t want to collect.
Metrics to send from Edge Node: Description
Minimal: Sends metrics for events and bytes in and out. Recommended for high-cardinality deployments. Sent as aggregate totals per Fleet:
  • total.in_events
  • total.out_events
  • total.in_bytes
  • total.out_bytes
Excludes metrics that are coming directly from Sources or Destinations.
Basic (default): Contains all the metrics in the Minimal set, and more. Sends metrics required to display all monitoring data available in the Edge UI.
Sent as aggregate totals per Source/Destination:
  • total.in_events
  • total.out_events
  • total.in_bytes
  • total.out_bytes
Sent as aggregate totals per Source:
  • health.inputs
Sent as aggregate totals per Destination:
  • health.outputs
Sent as aggregate totals per Route:
  • route.in_events
  • route.in_bytes
  • route.out_events
  • route.out_bytes
  • route.dropped_events
Sent as aggregate totals per Pipeline:
  • pipe.in_events
  • pipe.out_events
  • pipe.dropped_events
Sent as aggregate totals per Pack:
  • pack.in_events
  • pack.out_events
  • pack.err_events
  • pack.dropped_events
All: Sends all metrics with no filtering. This includes the Basic set, plus Worker/Edge Node Resource Metrics, Persistent Queue Metrics, and Other Internal Metrics.
Custom: Sends a customized set of metrics: all metrics that match a JavaScript expression that you define, plus the metrics in the Minimal set.
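
The Custom option combines two rules: a metric is sent if it matches your expression, or if it belongs to the Minimal set. The sketch below illustrates that union under stated assumptions. In the product the filter is a JavaScript expression; here a Python predicate stands in for it, and select_custom is a hypothetical helper that looks only at metric names.

```python
from typing import Callable, Iterable, List

# The Minimal set, as listed in the table above.
MINIMAL_SET = frozenset(
    {"total.in_events", "total.out_events", "total.in_bytes", "total.out_bytes"}
)


def select_custom(metric_names: Iterable[str], matches: Callable[[str], bool]) -> List[str]:
    """Keep metrics that match the predicate, plus everything in the Minimal set."""
    return [name for name in metric_names if name in MINIMAL_SET or matches(name)]


if __name__ == "__main__":
    names = ["total.in_events", "route.in_events", "pipe.out_events", "health.inputs"]
    # Stand-in for a user-defined expression: keep only Route metrics.
    print(select_custom(names, lambda name: name.startswith("route.")))
```

With this stand-in predicate, total.in_events is kept even though it does not start with route., because the Minimal set is always included.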

For more information about each of the settings above, go to Controlling Metrics Volume.

Storage

You can configure the following options for storage:

Storage Setting: Description
Sample size limit: Maximum file size, in binary units (KB, MB), for sample data files. Maximum: 3 MB. Default: 256 KB.
Min free disk space: The minimum amount of disk space on the host before various features take measures to prevent disk usage (KB, MB, etc.). Default: 5 GB. To avoid the “Unable to write to the filesystem, disk space low” error, ensure that the available disk space remains above this threshold by monitoring disk usage and configuring alerts.
Worker Process PQ size limit: Highest accepted value for the Queue size limit option used in individual Sources’ and Destinations’ persistent queues. Default: 1 TB. Consult Cribl Support before increasing beyond this value.

Metadata

Event metadata sources: List of event metadata sources to enable. No sources are enabled by default.

Jobs and Tasks

Job Limits Setting: Description
Disable jobs/tasks: When enabled, the Edge Nodes won’t poll the Leader for jobs/tasks. The job limits settings below will not affect Edge Nodes on version 4.5.0 and newer. Edge Nodes running 4.4.4 and older still use these jobs settings even if jobs/tasks are disabled here.
Concurrent job limit: The total number of jobs that can run concurrently. Defaults to 10.
Concurrent system job limit: The total number of system jobs that can run concurrently. Defaults to 10. Minimum 1.
Concurrent scheduled job limit: The total number of scheduled jobs that can run concurrently. This limit is set as an offset relative to the Concurrent job limit. Defaults to -2.

Skipped jobs indicate that a Fleet’s Concurrent job limit has been reached or exceeded. Increase this limit to reduce the number of skipped jobs. For resource-intensive jobs, this might also require deploying more Edge Nodes.
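
Because the Concurrent scheduled job limit is expressed as an offset, its effective value depends on the Concurrent job limit. The arithmetic can be sketched as below. This assumes the offset is simply added to the Concurrent job limit, and the clamp at zero is a further assumption. The function name is hypothetical.

```python
def scheduled_job_limit(concurrent_job_limit: int = 10, offset: int = -2) -> int:
    """Effective scheduled-job limit, assuming the offset is added to the job limit."""
    # Clamping at 0 is an assumption; the source does not say how negative results are handled.
    return max(0, concurrent_job_limit + offset)


if __name__ == "__main__":
    print(scheduled_job_limit())        # documented defaults: 10 and -2
    print(scheduled_job_limit(20, -5))  # a raised job limit with a larger offset
```

Under these assumptions, the documented defaults leave room for 8 concurrent scheduled jobs.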

Task Limits Setting: Description
Concurrent task limit: The total number of tasks that a Worker Process can run concurrently. Defaults to 2. Minimum 1.
Concurrent system task limit: The number of system tasks that a Worker Process can run concurrently. Defaults to 1. Minimum 1.
Task usage percentage limit: Value, between 0 and 1, representing the percentage of total tasks on a Worker Process that any single job may consume. Defaults to 0.5 (50%).
Task poll timeout: The number of milliseconds that a Worker’s task handler will wait to receive a task, before retrying a request for a task. Defaults to 60000 (60 seconds). Minimum 10000 (10 seconds).

Completion Limits

Completion Limits Setting: Description
Artifact reaper period: Interval on which Cribl Edge attempts to reap jobs’ stale disk artifacts. Defaults to 30m.
Finished job artifacts limit: Maximum number of finished job artifacts to keep on disk. Defaults to 100. Minimum 0.
Finished task artifacts limit: Maximum number of finished task artifacts to keep on disk, per job, on each Edge Node. Defaults to 500. Minimum 0.

Task Manifest and Buffering Limits

Task Manifest and Buffering Setting: Description
Manifest flush period: The rate (in milliseconds) at which to refresh the task manifest of a job. Defaults to 100 ms. Minimum 100, maximum 10000.
Manifest buffer size limit: The maximum number of tasks that the task manifest can hold in memory before flushing to disk. Defaults to 1000. Minimum 100, maximum 10000.
Manifest reader buffer size: The number of bytes that the task manifest reader should pull from disk. Defaults to 4 KB.
Job dispatching: The method by which tasks are assigned to Worker Processes. Defaults to Least In‑Flight Tasks, to optimize available capacity. Round Robin is also available.
Job timeout: Maximum time a job is allowed to run. Defaults to 0, for unlimited time. Units are seconds if not specified. Sample entries: 30, 45s, 15m.
Task heartbeat period: The heartbeat period (in seconds) for tasks to report back to the Leader/API. Defaults to 60 seconds. Minimum 60.

Redis

Cache

Redis Cache Setting: Description
Key TTL in seconds: Maximum time to live of a key in the cache (seconds). 0 indicates no limit. Defaults to 10 minutes.
Max # of keys: Maximum number of keys to retain in the cache. 0 indicates no limit. Defaults to 0.
Max cache size (bytes): Maximum number of bytes to retain in the cache. 0 indicates no limit. Defaults to 0.
Service period (seconds): Frequency of cache limit enforcement. Defaults to every 30 seconds.
Server assisted: Default is toggled off. When toggled on, the following control appears:
  • Client tracking mechanism: Mechanism for invalidation message delivery. In default mode, the server remembers which keys a client has requested and only sends invalidations for those, using more Redis server memory. In broadcast mode, it sends all invalidations, requiring more processing by Cribl Edge.

Connections

Redis Connections Setting: Description
Reuse Redis connections: Toggle on if you want Cribl Edge to try to reuse Redis connections when multiple Redis Functions (or references to them) are present. When enabled, displays the following additional control:
  • Connection limit: The maximum number of identical connections allowed before Cribl Edge tries to reuse connections. Defaults to 0, meaning unlimited connections are allowed (equivalent to leaving Reuse Redis connections toggled off). Setting a non-zero integer value forces Cribl Edge to try to reuse connections for each individual Worker Process (not to reuse connections among Worker Processes).

To understand why and when to employ these controls, see Reusing Redis Connections.

Other

Other Setting: Description
CPU profile TTL: The time-to-live for collected CPU profiles.
Default managed node heartbeat period: How many seconds a managed Node will wait to send back a heartbeat to the Cribl control plane.
Config bundle download timeout: How many seconds a Cribl Stream Worker will wait for a successful Leader connection before canceling a download of a new configuration bundle. This timeout helps prevent Workers from hanging indefinitely when there are network issues or other delays during the download process. A 0 value means wait indefinitely, which could cause Workers to hang.
Use Windows tools to collect process info: Enable PowerShell to collect process information instead of using the native API. This legacy setting will be removed in a future release. We highly recommend keeping this option disabled and using the newer native capabilities, which are faster and more reliable.

When using PowerShell to collect process information, the environment variables displayed in the Explore > Processes tab might be inaccurate. They will reflect the environment of the PowerShell process itself, not the actual environment of the process being viewed.

Proxy Settings

Use proxy env vars: Honors the HTTP_PROXY/HTTPS_PROXY environment variables. Defaults to toggled on.

Cribl prioritizes environment variables for proxy settings in this order: Process, User, and System.

If your Cribl service is managed by a service manager other than systemd (such as upstart or init), the Use proxy env vars toggle might not behave as expected because Cribl might prioritize environment variables set by the service manager instead of using the proxy settings you intended.

Sockets

Directory: Holds sockets for inter-process communication (IPC), such as communications between a load-balancing process and a Worker Process. Defaults to /tmp (your system’s temp directory).

Shutdown Settings

Drain timeout (sec): Determines how long a Cribl server will wait for writes to complete before the server shuts down on individual Worker Processes. If you notice that Workers are under-ingesting available data upon shutdown or restart, increase the 10-second default. Acceptable range of values: minimum 1 second, maximum 600 seconds (10 minutes).

Worker Processes

Worker Processes Setting: Description
Process count: This setting only applies to the Kubernetes Logs Source. Indicates the number of Worker Processes to spawn. Positive numbers specify an absolute number of Workers. Negative numbers specify a number of Workers relative to the number of CPUs in the system, for example: {<number of CPUs available> minus <this setting>}. The default in Cribl Edge is 1.
  • You can use Process count to adjust the number of Worker Processes only for the Kubernetes Logs Source. This is necessary when running the Kubernetes Logs Source with load balancing enabled. Load balancing increases performance when you have a higher event ingest volume than one Worker Process can handle.
  • Cribl Edge will provision at least the Minimum process count set below, but no more than the number of CPUs the host has available.
Minimum process count: Indicates the minimum number of Worker Processes to spawn. Overrides the lowest Process count result. Cribl Edge interprets a 0 entry as “default,” which here yields 1 Process.
Memory (MB): Amount of heap memory available to each Worker Process, in MB. The OS allocates heap memory dynamically as the Process requests it, up to the amount dictated by this setting. The Memory (MB) setting does not govern external (ext) memory.
Enable heap snapshots: Default is toggled off. Toggle on for Cribl Edge to automatically create memory snapshots for Worker Processes when they approach or exceed memory limits. Only the two most recent heap snapshots are retained. Older snapshots are automatically deleted. This behavior cannot be modified.

The Enable heap snapshots setting is available for hybrid or on-prem deployments only. Enable it only if Cribl Support recommends it or if you are experiencing out-of-memory issues. Be aware that enabling this setting may impact system performance and storage usage.
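
The interaction between Process count, Minimum process count, and the host's CPU count described above can be sketched as a small function. This is an illustration under stated assumptions, not Cribl's code: the function name is hypothetical, a Process count of 0 is treated like a relative value, and the CPU cap is assumed to win if the minimum exceeds the number of CPUs.

```python
def worker_process_count(process_count: int, min_process_count: int, cpus: int) -> int:
    """Resolve how many Worker Processes to spawn on a host with `cpus` CPUs."""
    if process_count > 0:
        requested = process_count          # positive: absolute number of Workers
    else:
        requested = cpus + process_count   # negative: CPUs minus the setting's magnitude
    # A Minimum process count of 0 means "default", which yields 1 Process.
    minimum = min_process_count if min_process_count > 0 else 1
    # At least the minimum, but no more than the CPUs available (assumed precedence).
    return min(max(requested, minimum), cpus)


if __name__ == "__main__":
    print(worker_process_count(1, 0, 8))    # Edge default
    print(worker_process_count(-2, 0, 8))   # relative to CPUs
    print(worker_process_count(16, 0, 8))   # capped at CPU count
```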

Non-configurable Settings

These settings can’t be configured in Cribl Edge, and instead have default values.

Max connections at startup: Maximum number of connections accepted at Worker Process startup. Defaults to 1.

Startup throttling duration (ms): Maximum time (in milliseconds) to continue throttling connections after Worker Process startup. Defaults to 10000 ms (10 sec).

Load throttle %: Sets a threshold to prevent overwhelming Workers. If 90% of a Worker Process’ CPU utilization readings exceed this threshold over one minute, the process will reject new connections until the CPU load stabilizes. Another process that is below the threshold will accept the connection the next time it is established. Defaults to 0% (no throttling). Enter a percentage between 1 and 100 to enable throttling.

You can configure the CPU saturation threshold, but the 90% sampling trigger is not configurable. Also, _raw stats > cpuPerc values might diverge from your Load throttle % threshold. This is because cpuPerc is sampled and averaged once per minute, whereas the Load throttle % is evaluated every second, with a rolling 1-minute lookback sample. (These intervals are also not configurable.)
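
The Load throttle % rule (a rolling one-minute window of per-second readings, with a fixed 90% trigger) can be sketched as follows. This is a simplified model, not the product's implementation: the LoadThrottle class is hypothetical, it assumes exactly one reading per second, and it assumes no throttling decision is made until a full minute of readings exists.

```python
from collections import deque


class LoadThrottle:
    """Rolling-window check: reject when 90% of recent CPU readings exceed the threshold."""

    def __init__(self, threshold_pct: float, window: int = 60, trigger_fraction: float = 0.9):
        self.threshold_pct = threshold_pct        # Load throttle %; 0 disables throttling
        self.trigger_fraction = trigger_fraction  # the fixed 90% sampling trigger
        self.samples = deque(maxlen=window)       # one reading per second, 1-minute lookback

    def record(self, cpu_pct: float) -> None:
        """Store one CPU utilization reading; the oldest reading falls out of the window."""
        self.samples.append(cpu_pct)

    def should_reject(self) -> bool:
        """Return True if new connections should be rejected right now."""
        if self.threshold_pct <= 0 or len(self.samples) < self.samples.maxlen:
            return False  # disabled, or not enough history yet (assumption)
        over = sum(1 for reading in self.samples if reading > self.threshold_pct)
        return over / len(self.samples) >= self.trigger_fraction


if __name__ == "__main__":
    throttle = LoadThrottle(threshold_pct=80)
    for _ in range(60):
        throttle.record(95.0)
    print(throttle.should_reject())
```

Because the check looks at the share of individual readings rather than their average, a once-per-minute average such as cpuPerc can sit below the threshold while the throttle is active, or above it while the throttle is not.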

Other Settings

This page’s remaining options work essentially the same way as their counterparts in Settings > Global. Use the following links for details about: logging levels/redactions, access management, security, scripts, and diagnostics.