loki.write: implement sharding #4882
base: main
Conversation
Force-pushed 4326897 to 6f05b9e
Pull Request Overview
This PR implements queue-based sharding for the loki.write component, introducing a new architecture that distributes log entries across multiple parallel queues based on label fingerprints. The implementation unifies the handling of both normal and WAL-enabled clients through a shared shards structure, eliminating significant code duplication. The queue_config block, previously WAL-only, now applies to all endpoints and controls both queue capacity and shard count.
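The fingerprint-based distribution described above can be sketched roughly as follows. This is an illustrative sketch, not the PR's actual code: the names (`shardFor`) are hypothetical, and the label fingerprint is approximated here with an FNV-1a hash over the sorted label set.

```go
package main

import (
	"fmt"
	"hash/fnv"
	"sort"
)

// shardFor picks a queue index for a log entry from its label set.
// Hypothetical sketch: the PR shards by label fingerprint (as Prometheus
// remote_write does); here the fingerprint is approximated with FNV-1a.
func shardFor(labels map[string]string, numShards int) int {
	keys := make([]string, 0, len(labels))
	for k := range labels {
		keys = append(keys, k)
	}
	sort.Strings(keys) // deterministic order so equal label sets hash equally
	h := fnv.New64a()
	for _, k := range keys {
		h.Write([]byte(k))
		h.Write([]byte{0xff}) // separator to avoid ambiguous concatenations
		h.Write([]byte(labels[k]))
		h.Write([]byte{0xff})
	}
	return int(h.Sum64() % uint64(numShards))
}

func main() {
	labels := map[string]string{"app": "api", "env": "prod"}
	// Entries with the same label set always land on the same shard,
	// which preserves per-stream ordering within that shard.
	fmt.Println(shardFor(labels, 4) == shardFor(labels, 4)) // true
}
```

The key property is that a given stream always maps to the same shard, so ordering within a stream is preserved while different streams are processed in parallel.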
Key changes:
- Introduces `shards.go` with a new sharding architecture for parallel processing via multiple queues
- Refactors WAL and fanout clients to use the shared shards implementation, removing ~500 lines of duplicated code
- Adds a `min_shards` configuration parameter to control the parallelism level
Reviewed Changes
Copilot reviewed 7 out of 7 changed files in this pull request and generated 3 comments.
| File | Description |
|---|---|
| internal/component/loki/write/types.go | Adds MinShards field to QueueConfig and updates documentation to reflect queue config is now always used |
| internal/component/common/loki/client/shards.go | New file implementing the core sharding logic with queue management and parallel batch sending |
| internal/component/common/loki/client/shards_test.go | Comprehensive test coverage for queue operations including append, drain, and flush/shutdown scenarios |
| internal/component/common/loki/client/consumer_wal.go | Refactored to delegate batching and sending to the shards implementation, significantly simplified |
| internal/component/common/loki/client/consumer_fanout.go | Refactored to use shards implementation, removing duplicated send/batch logic |
| internal/component/common/loki/client/config.go | Adds MinShards field definition to QueueConfig struct |
| docs/sources/reference/components/loki/loki.write.md | Documents the new min_shards parameter and clarifies queue config usage |
thampiotr
left a comment
I still think we need to get a coherent story with naming established and then make sure it is reflected in docs and in the code. But we're on the right track now.
> ### `queue_config`
>
> {{< docs/shared lookup="stability/experimental_feature.md" source="alloy" version="<ALLOY_VERSION>" >}}
Shouldn't this continue to be experimental?
Yeah, we could keep this as experimental, but after this PR we would always use this config. What would be considered experimental with it would be the naming and changing defaults, I guess.
> | Name | Type | Description | Default | Required |
> | --- | --- | --- | --- | --- |
> | `capacity` | `string` | Controls the size of the underlying send queue buffer. This setting should be considered a worst-case scenario of memory consumption, in which all enqueued batches are full. | `10MiB` | no |
```diff
-| `capacity` | `string` | Controls the size of the underlying send queue buffer. This setting should be considered a worst-case scenario of memory consumption, in which all enqueued batches are full. | `10MiB` | no |
+| `capacity` | `string` | Controls the size of the underlying send queue buffer of each shard. Consider this setting as the worst-case scenario of memory consumption, in which all enqueued batches are full. | `10MiB` | no |
```

What does 'all enqueued batches are full' even mean? Shouldn't it say that it's the total size of all the enqueued batches instead?
Yeah, this was there before and I didn't check or alter it.
But essentially, whenever the capacity is full, the queue of batches is full and we cannot enqueue another one, so we block here until we get more capacity.
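The blocking behaviour described above can be illustrated with a minimal sketch. This is not the PR's actual code: the type and method names below are hypothetical, and only the semantics (enqueue blocks once the total bytes of enqueued batches reach the capacity) are taken from the discussion.

```go
package main

import (
	"fmt"
	"sync"
)

// byteBoundedQueue is an illustrative sketch of the capacity semantics:
// Enqueue blocks once the total size of enqueued batches reaches capacity.
type byteBoundedQueue struct {
	mu       sync.Mutex
	notFull  *sync.Cond
	capacity int
	used     int
	batches  [][]byte
}

func newQueue(capacity int) *byteBoundedQueue {
	q := &byteBoundedQueue{capacity: capacity}
	q.notFull = sync.NewCond(&q.mu)
	return q
}

// Enqueue blocks until the batch fits under the capacity limit.
func (q *byteBoundedQueue) Enqueue(batch []byte) {
	q.mu.Lock()
	defer q.mu.Unlock()
	for q.used+len(batch) > q.capacity {
		q.notFull.Wait() // block until a dequeue frees capacity
	}
	q.batches = append(q.batches, batch)
	q.used += len(batch)
}

// Dequeue pops the oldest batch and wakes blocked producers.
func (q *byteBoundedQueue) Dequeue() []byte {
	q.mu.Lock()
	defer q.mu.Unlock()
	b := q.batches[0]
	q.batches = q.batches[1:]
	q.used -= len(b)
	q.notFull.Broadcast()
	return b
}

func main() {
	q := newQueue(10)
	q.Enqueue(make([]byte, 6))
	done := make(chan struct{})
	go func() {
		q.Enqueue(make([]byte, 6)) // blocks: 6+6 > 10
		close(done)
	}()
	fmt.Println(len(q.Dequeue())) // frees capacity; the blocked enqueue proceeds
	<-done
	fmt.Println("second enqueue completed")
}
```

This also shows why drain-on-shutdown matters: if nothing ever dequeues, a blocked producer would wait forever.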
> | `capacity` | `string` | Controls the size of the underlying send queue buffer. This setting should be considered a worst-case scenario of memory consumption, in which all enqueued batches are full. | `10MiB` | no |
> | `drain_timeout` | `duration` | Configures the maximum time the client can take to drain the send queue upon shutdown. During that time, it enqueues pending batches and drains the send queue, sending each. | `"1m"` | no |
> | `min_shards` | `number` | Minimum amount of concurrent shards sending samples to the endpoint. | `1` | no |
Should we consider calling it something more intuitive than shards? I feel this term is just inherited from Prometheus 🤔
Discussed this in Slack; the naming is fine because it matches what we use in `prometheus.remote_write`.
```diff
-The optional `queue_config` block configures, when WAL is enabled, how the underlying client queues batches of logs sent to Loki.
-Refer to [Write-Ahead block](#wal) for more information.
+The optional `queue_config` block configures how the endpoint queues batches of logs sent to Loki.
```
We call it `queue_config`, but we are actually configuring both sharding and queuing in one block.
Maybe this should be called something like `parallelism` in `write.queue`?
Discussed this in Slack; the naming is fine because it matches what we use in `prometheus.remote_write`.
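For reference, a hedged sketch of how the block would be used after this PR. The endpoint URL and values are illustrative; the attribute names (`capacity`, `drain_timeout`, `min_shards`) are taken from the table discussed above.

```alloy
loki.write "default" {
  endpoint {
    url = "https://loki.example.com/loki/api/v1/push"

    queue_config {
      capacity      = "10MiB"
      drain_timeout = "1m"
      min_shards    = 4
    }
  }
}
```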
In ffb2bec I renamed the two client implementations we have to `endpoint` and `walEndpoint`. I will look into the other ways we discussed structuring this, but will keep this as a fallback if that doesn't work out.
Pull Request Overview
Copilot reviewed 11 out of 11 changed files in this pull request and generated 11 comments.
Comments suppressed due to low confidence (1)
internal/component/common/loki/client/consumer_wal.go:135
`sync.WaitGroup` does not have a `Go` method. The standard library's `sync.WaitGroup` has `Add()`, `Done()`, and `Wait()` methods. This should be:

```go
stopWG.Add(1)
go func() {
	defer stopWG.Done()
	pair.Stop(drain)
}()
```

instead of:

```go
stopWG.Go(func() {
	pair.Stop(drain)
})
```
Force-pushed 6132ec6 to eaac392
@thampiotr I updated the PR with an attempt to share stuff between the non-WAL and WAL implementations. So we now have one `endpoint` struct that will handle shards. This implementation has one method we can use directly in `Fanout`, and we no longer need channels between fanout and endpoint. For the WAL implementation, I renamed it `walEndpointAdapter`. This implements the interface that the watcher expects and will just call enqueue on the endpoint, which internally handles retries etc. Naming could be a bit off, but this would be an option we can go with. Let me know what you think :)
Pull Request Overview
Copilot reviewed 13 out of 13 changed files in this pull request and generated 4 comments.
Comments suppressed due to low confidence (1)
internal/component/common/loki/client/consumer_wal.go:134
`sync.WaitGroup` does not have a `Go()` method. This code will not compile. You should either:

- Use the `stopWG.Add(1)` and `go func(p endpointWatcherPair) { defer stopWG.Done(); p.Stop(drain) }(pair)` pattern (note: capture `pair` as a parameter to avoid closure issues)
- Or use `golang.org/x/sync/errgroup`, which has a `Go()` method

```go
stopWG.Go(func() {
	pair.Stop(drain)
})
```
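A self-contained sketch of the stdlib `Add`/`Done` pattern suggested above. The `pairStub` type and `stopAll` helper are hypothetical stand-ins for the PR's `endpointWatcherPair` handling; note also that `sync.WaitGroup.Go` does exist from Go 1.25 on, so the Copilot comment only applies to earlier toolchains.

```go
package main

import (
	"fmt"
	"sync"
	"sync/atomic"
)

// pairStub is a hypothetical stand-in for the PR's endpointWatcherPair.
type pairStub struct{ id int }

func (p pairStub) Stop(drain bool) { fmt.Println("stopped pair", p.id) }

// stopAll stops every pair concurrently with the Add/Done pattern,
// passing each pair as a goroutine parameter so the closure never
// captures the loop variable, then waits for all Stop calls to return.
func stopAll(pairs []pairStub, drain bool) int {
	var stopWG sync.WaitGroup
	var stopped int32
	for _, pair := range pairs {
		stopWG.Add(1)
		go func(p pairStub) {
			defer stopWG.Done()
			p.Stop(drain)
			atomic.AddInt32(&stopped, 1)
		}(pair)
	}
	stopWG.Wait() // every Stop has completed here
	return int(atomic.LoadInt32(&stopped))
}

func main() {
	fmt.Println(stopAll([]pairStub{{1}, {2}, {3}}, true))
}
```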
Force-pushed 0565e18 to c889a4a
Add capability to perform sharding with loki.write
Co-authored-by: Copilot <[email protected]>
Co-authored-by: Piotr <[email protected]>
Force-pushed c889a4a to c55de0c
[…] queue. This would deadlock because we would not be able to drain, and a hard shutdown would not cancel it.
Nothing really stands out for the small doc changes. It's good as-is. |
PR Description
This PR implements `queue_config` for the `loki.write` component, enabling users to configure queue-based batching and parallel processing. The implementation introduces a new sharding architecture that distributes log entries across multiple parallel queues based on label fingerprints. This implementation is based on Prometheus remote-write sharding. The shards implementation is used with both "normal" clients and "WAL" clients, so we get rid of a lot of duplicated logic.

Before this PR we had a `queue_config` block that was only used when WAL was enabled. It is now always used and will affect clients regardless.

Currently no automatic "resharding" is implemented. Implementing this without the WAL will most likely be pretty primitive, so for now `min_shards` is the only configurable value until we address this.

Ideally we would move a couple of attributes from the `endpoint` block to the `queue_config` block to more closely match `prometheus.remote_write`, but we can't do that without a breaking change. These attributes are:

- `retry_on_http_429`
- `max_backoff_period`
- `min_backoff_period`
- `batch_size`
- `batch_wait`

Which issue(s) this PR fixes
Part of: #4728
Notes to the Reviewer
I moved WAL writer ownership into `client.Manager`. No need to expose it to the `loki.write` component.

I plan to work on resharding in a follow-up PR.
PR Checklist