Lakehouse Engines in Cribl Search
Add a lakehouse engine so you can ingest data directly into Cribl Search.
Highlights
- Lakehouse engines store and accelerate your data within Cribl Search.
- Choose an engine size that covers your raw, uncompressed daily ingest. Resize or add more engines as needed.
- Storage auto-scales with your data volume and retention settings.
Lakehouse Engines Store and Accelerate Data
When you send data into Cribl Search with Sources, it’s parsed by Datatypes and stored in Search Datasets on a lakehouse engine.
A Lakehouse engine is a storage-plus-compute unit that ingests, stores, and accelerates your data inside Cribl Search. It can keep your data hot for up to 10 years, with no storage tiering to manage.
Each lakehouse engine:
- Accepts data from one or more Sources.
- Uses Datatypes to break events into fields.
- Stores data in Search Datasets until their retention expires.
- Powers fast, schema-aware searches and AI workflows over that stored data.
Lakehouse Engine Sizes
To choose the right lakehouse engine size, think about the amount of raw, uncompressed data you expect per day, then include headroom for spikes and growth.
If your ingest rate changes, or you experience ingest or search latency, you can resize your lakehouse engine. If the available sizes are not enough, you can add more lakehouse engines to distribute the workload.
Lakehouse Engine Sizes Available
The ingest rate limit applies only to raw incoming data, not to fields you add or transform during processing.
| Lakehouse Engine Size | Maximum Ingest per Day |
|---|---|
| 3X-Small | 75 GB |
| 2X-Small | 150 GB |
| X-Small | 300 GB |
| Small | 600 GB |
| Medium | 1,200 GB |
| Large | 2,400 GB |
| X-Large | 4,800 GB |
| 2X-Large | 9,600 GB |
| 3X-Large Contact Support | 14 TB |
| 4X-Large Contact Support | 19 TB |
| 5X-Large Contact Support | 24 TB |
| 6X-Large Contact Support | 28 TB |
Lakehouse Engine Compression Ratio
Cribl Search compresses ingested data at rest. The exact compression ratio depends on many factors, including the shape and content of your events, but it typically falls between 10:1 and 12:1.
Estimate Lakehouse Engine Costs
Because engine size acts as a hard limit on ingest, your costs are bounded, with no surprises from traffic spikes. You can scale your lakehouse engine up or down at any time to match your actual data needs.
With each lakehouse engine, you’re charged for two things:
| Component | Billing Basis | How It’s Measured |
|---|---|---|
| Engine size | Maximum data ingest per day. | Measured before compression or processing. |
| Storage | Amount of data retained over time. This auto-scales with your data volume and retention periods. | Measured after compression. Estimated compression ratio is 10:1 to 12:1. |
To estimate and optimize storage, set individual retention periods of your Search Datasets. See Plan Your Search Datasets for details.
To see how engine size and storage translate to costs, see Cribl Search Pricing.
Lakehouse Engine Retention
You don’t set retention for a lakehouse engine as a whole, but for each of its Search Datasets individually. Each Search Dataset can keep data for 1 day to 10 years, and its storage scales accordingly. After the retention period ends, Cribl Search deletes the data.
For details, see Create Search Datasets and Organize Data with Dataset Rules.
Add a New Lakehouse Engine
Search Admins and above can add lakehouse engines from the Cribl Search Engines tab.
- On the Cribl.Cloud top bar, select Products > Search > Data.
- Select the Engines tab, then Add Engine.
- Give your engine an ID (for example,
palo_alto_logs) unique across your Workspace. You won’t be able to change it later.The
mainID is reserved. - Set the Lakehouse engine Size. You can resize it later if needed.
- Confirm with Save.
When the lakehouse engine status is Ready, you can create Search Datasets and connect your Sources.
Check Lakehouse Engine Status
Select Refresh page to check for status updates.
| Status | Meaning |
|---|---|
| Provisioning | Setting up the engine. |
| Delayed | Setup is taking longer than expected. |
| Failed | Engine hit an error and can’t recover. |
| Ready | Engine is fully operational. |
| Blocked | Engine is down and trying to recover. |
| Resizing | Engine size is being changed. |
| Terminated | Engine is being deleted. |
Resize a Lakehouse Engine
Search Admins and above can resize lakehouse engines from the Cribl Search Engines tab.
- On the Cribl.Cloud top bar, select Products > Search > Data > Engines.
- Select the lakehouse engine you want to resize.
- Set the new lakehouse engine Size. See Lakehouse Engine Sizes.
- Confirm with Save.
Wait until the lakehouse engine status changes from Resizing to Ready again.
Next Steps
Now that your lakehouse engine is ready, create Search Datasets to organize your data and set retention.