Google Cloud Storage Destination
Google Cloud Storage is a non-streaming Destination type.
Type: Non-Streaming | TLS Support: Yes | PQ Support: No
Configure Cloud Storage Permissions
Configure access permissions on the Cloud Storage side based on the authentication method you’ll use on this Destination:
- Auto authentication: Supports both uniform and fine-grained access control.
- Manual authentication: Requires fine-grained access control on the buckets because it relies on ACLs to access resources.
The storage.objects.create permission is required.
If you enable Verify if bucket exists in the Advanced Settings, the storage.objects.list permission is also required.
Assign either of the following roles to grant the required permissions:
- roles/storage.admin
- roles/storage.legacyBucketWriter
To grant more limited access, combine these roles:
- roles/storage.objectCreatorto provide the- storage.objects.createpermission
- roles/storage.objectViewerto provide the- storage.objects.listpermission
For details, see the Cloud Storage Overview of Access Control and Understanding Roles documentation.
Configure Cribl Edge to Output to Cloud Storage Destinations
- On the top bar, select Products, and then select Cribl Edge. Under Fleets, select a Fleet. Next, you have two options: - To configure via QuickConnect, navigate to Routing > QuickConnect (Stream) or Collect (Edge). Select Add Destination and select the Destination you want from the list, choosing either Select Existing or Add New.
- To configure via the Routes, select Data > Destinations or More > Destinations (Edge). Select the Destination you want. Next, select Add Destination.
 
- In the New Destination modal, configure the following under General Settings: - Output ID: Enter a unique name to identify this Cloud Storage definition. If you clone this Destination, Cribl Edge will add -CLONEto the original Output ID.
- Description: Optionally, enter a description.
- Bucket name: Name of the destination bucket. This value can be a constant, or a JavaScript expression that can be evaluated only at init time. For example, referencing a Global Variable: myBucket-${C.vars.myVar}.
- Region: Region where the bucket is located.
- Staging location: Filesystem location in which to locally buffer files before compressing and moving to final destination. Cribl recommends that this location be stable and high-performance.In Cribl Stream, the Staging location field is not displayed or available on Cribl.Cloud-managed Worker Groups. 
- Key prefix: Root directory to prepend to path before uploading. Enter a constant, or a JS expression enclosed in single quotes, double quotes, or backticks.
- Data format: The output data format defaults to JSON.RawandParquetare also available. SelectingParquet(supported only on Linux, not Windows) exposes a Parquet Settings left tab, where you must configure certain options in order to export data in Parquet format.
 
- Output ID: Enter a unique name to identify this Cloud Storage definition. If you clone this Destination, Cribl Edge will add 
- Next, you can configure the following Optional Settings: - Partitioning expression: JavaScript expression that defines how files are partitioned and organized. Default is date-based. If blank, Cribl Edge will fall back to the event’s - __partitionfield value (if present); or otherwise to the root directory of the Output Location and Staging Location.
- Compression: Data compression format used before moving to final destination. Defaults to - gzip(recommended). This setting is not available when Data format is set to- Parquet.
- File name prefix expression: The output filename prefix. Must be a JavaScript expression (which can evaluate to a constant), enclosed in quotes or backticks. Defaults to - CriblOut.
- File name suffix expression: The output filename suffix. Must be a JavaScript expression (which can evaluate to a constant), enclosed in quotes or backticks. Defaults to - `.${C.env["CRIBL_WORKER_ID"]}.${__format}${__compression === "gzip" ? ".gz" : ""}`, where- __formatcan be- jsonor- raw, and- __compressioncan be- noneor- gzip.- To prevent files from being overwritten, Cribl appends a random sequence of 6 characters to the end of their names. This also applies to prefix and suffix expressions in file names. - For example, if you set the File name prefix expression to - CriblExecand set the File name suffix expression to- .csv, the file name might display as- CriblExec-adPRWM.csvwith- adPRWMappended.
- Backpressure behavior: Select whether to block or drop events when all receivers are exerting backpressure. (Causes might include an accumulation of too many files needing to be closed.) Defaults to - Block.
- Tags: Optionally, add tags that you can use to filter and group Destinations on the Destinations page. These tags aren’t added to processed events. Use a tab or hard return between (arbitrary) tag names. 
 
- Optionally, you can adjust the Authentication, Parquet, Processing, Retries and Advanced settings outlined in the sections below. 
- Select Save, then Commit & Deploy. 
Authentication
Use the Authentication method drop-down to select one of these options:
- Auto: This option authenticates with the attached Google Cloud Platform (GCP) Service Account, relying on GCP IAM roles to access the appropriate GCP resources. This option is available only when Cribl Edge is on-prem, and the Edge Nodes are running in Google Compute Engine VMs on GCP. Supports both uniform and fine-grained access control.
- Manual: With this default option, authentication is via HMAC (Hash-based Message Authentication Code). To create a key and secret, see Google Cloud’s Managing HMAC Keys for Service Accounts documentation. This option requires fine-grained access control on the GCS bucket because it relies on ACLs to access resources. Uniform access control is not supported with this method. This option exposes these two fields:- Access key: Enter the HMAC access key.
- Secret key: Enter the HMAC secret.
 
The values for Access key and Secret key can be a constant, or a JavaScript expression (such as ${C.env.MY_VAR}) enclosed in quotes or backticks, which allows configuration with environment variables.
- Secret: This option exposes a Secret key pair drop-down, in which you can select a stored secret that references the secret key pair described above. A Create link is available to store a new, reusable secret.
Processing Settings
Post‑Processing
Pipeline: Pipeline or Pack to process data before sending the data out using this output.
System fields: A list of fields to automatically add to events that use this output. By default, includes cribl_pipe (identifying the Cribl Edge Pipeline that processed the event). Supports c* wildcards. Other options include:
- cribl_host– Cribl Edge Node that processed the event.
- cribl_input– Cribl Edge Source that processed the event.
- cribl_output– Cribl Edge Destination that processed the event.
- cribl_route– Cribl Edge Route (or QuickConnect) that processed the event.
- cribl_wp– Cribl Edge Worker Process that processed the event.
Parquet Settings
To write out Parquet files, note that:
- On Linux, you can use the Cribl Edge CLI’s parquetcommand to view a Parquet file, its metadata, or its schema.
- Cribl Edge Workers support Parquet only when running on Linux, not on Windows.
- See Working with Parquet for pointers on how to avoid problems such as data mismatches.
Automatic schema: Toggle on to automatically generate a Parquet schema based on the events of each Parquet file that Cribl Edge writes. Toggle off (default) to expose the following additional field:
- Parquet schema: Select a schema from the drop-down.
If you need to modify a schema or add a new one, follow the instructions in our Parquet Schemas topic. These steps will propagate the freshest schema back to this drop-down.
Parquet version: Determines which data types are supported, and how they are represented. Defaults to 2.6; 2.4 and 1.0 are also available.
Data page version: Serialization format for data pages. Defaults to V2. If your toolchain includes a Parquet reader that does not support V2, use V1.
Group row limit: The number of rows that every group will contain. The final group can contain a smaller number of rows. Defaults to 10000.
Page size: Set the target memory size for page segments. Generally, set lower values to improve reading speed, or set higher values to improve compression. Value must be a positive integer smaller than the Row group size value, with appropriate units. Defaults to 1 MB.
Log invalid rows: Toggle on to output up to 20 unique rows that were skipped due to data format mismatch. Log level must be set to debug for output to be visible.
Write statistics: Toggle on (default) if you have Parquet tools configured to view statistics – these profile an entire file in terms of minimum/maximum values within data, numbers of nulls, etc.
Write page indexes: Toggle on (default) if your Parquet reader uses statistics from Page Indexes to enable page skipping. One Page Index contains statistics for one data page.
Write page checksum: Toggle on if you have configured Parquet tools to verify data integrity using the checksums of Parquet pages.
Metadata (optional): The metadata of files the Destination writes will include the properties you add here as key-value pairs. For example, one way to tag events as belonging to the OCSF category for security findings would be to set Key to OCSF Event Class and Value to 2001.
Advanced Settings
File size limit (MB): Maximum uncompressed output file size. Files of this size will be closed and moved to final output location. Defaults to 32.
File open time limit (sec): Maximum amount of time to write to a file. Files open for longer than this limit will be closed and moved to final output location. Defaults to 300.
Idle time limit (sec): Maximum amount of time to keep inactive files open. Files open for longer than this limit will be closed and moved to final output location. Defaults to 30.
Open file limit: Maximum number of files to keep open concurrently. When exceeded, the oldest open files will be closed and moved to final output location. Defaults to 100.
Cribl Edge will close files when either of the
File size limit (MB)or theMax file open time (sec)conditions are met.
Add Output ID: Whether to append output’s ID to staging location. Defaults to toggled on.
Disk space protection: Specifies whether to Block (default) or Drop incoming events when the disk space falls below the globally defined Min free disk space amount.
Remove staging dirs: Toggle on to delete empty staging directories after moving files. This prevents the proliferation of orphaned empty directories. When enabled, exposes this additional option:
- Staging cleanup period: How often (in seconds) to delete empty directories when Remove staging dirs is enabled. Defaults to 300seconds (every 5 minutes). Minimum configurable interval is10seconds; maximum is86400seconds (every 24 hours).
Enable dead-lettering: Toggle on to set a maximum number of retries, and to move files to a designated directory when write failures exceed that limit. This prevents data flow blockage and excessive error logging due to undeliverable files. When enabled, exposes two additional fields:
- Dead-letter location: Specify the storage location for undeliverable files. Defaults to $CRIBL_HOME/state/outputs/dead-letter.
- Maximum retry limit: Configure the retry limit for failed file deliveries. This setting defines how many times the system will attempt to move a file to its intended location before it is deemed undeliverable and placed in the dead-letter directory. Defaults to 20.
Endpoint: The Google Cloud Storage service endpoint. Typically, there is no reason to change the default https://storage.googleapis.com endpoint.
Object ACL: Select an Access Control List to assign to uploaded objects. Defaults to private.
Storage class: Select a storage class for uploaded objects.
Signature version: Signature version to use for signing requests. Defaults to v4.
Reuse connections: Whether to reuse connections between requests. Toggling on (default) can improve performance.
Reject unauthorized certificates: Whether to accept certificates that cannot be verified against a valid Certificate Authority (for example, self-signed certificates). Defaults to toggled on.
Environment: If you’re using GitOps, optionally use this field to specify a single Git branch on which to enable this configuration. If empty, the config will be enabled everywhere.
Internal Fields
Cribl Edge uses a set of internal fields to assist in forwarding data to a Destination.
Field for this Destination:
- __partition
Troubleshooting
The Destination’s configuration modal has helpful tabs for troubleshooting:
Live Data: Try capturing live data to see real-time events as they flow through the Destination. On the Live Data tab, click Start Capture to begin viewing real-time data.
Logs: Review and search the logs that provide detailed information about the delivery process, including any errors or warnings that may have occurred.
Test: Ensures that the Destination is correctly set up and reachable. Verify that sample events are sent correctly by clicking Run Test.
You can also view the Monitoring page that provides a comprehensive overview of data volume and rate, helping you identify delivery issues. Analyze the graphs showing events and bytes in/out over time.
Common Issue
Nonspecific messages from Google Cloud of the form Error: failed to close file can indicate problems with the permissions listed above.