Backpressure for the impatient: channels vs. semaphores vs. tokens

Sun, 19 Feb 2023 16:00:00 +0100

A pipeline has backpressure when the consumer can make the producer slow down, or refuse the work, or both. A bigger buffer does neither. It just defers the same problem, with more memory held hostage in the meantime. Backpressure and buffering get confused a lot, and the difference shows up unmistakably in the tail latency numbers, so this post compares the mechanisms directly.

When load exceeds capacity you have three sane options:

1. Block the caller until a slot frees. Throughput pins to whatever the consumer sustains; tail latency degrades cleanly.
2. Reject the caller so they retry, fall back, or shed. Latency stays low for accepted work; some work never happens.
3. Rate-limit the caller so they never get close to overloading you. Throughput is predictable; the excess is queued or dropped.
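
To make the first two options concrete, here is a minimal sketch using a bounded channel. The helper names submitBlocking and submitOrReject and the capacity of 2 are made up for illustration; only the channel send and the select/default form are standard Go.

```go
package main

import "fmt"

// submitBlocking parks the caller until the queue has room.
// This is the "block" option: the producer runs at the consumer's pace.
func submitBlocking(queue chan<- int, job int) {
	queue <- job
}

// submitOrReject never waits. It reports false when the queue is full,
// leaving the caller to retry, fall back, or shed the work.
func submitOrReject(queue chan<- int, job int) bool {
	select {
	case queue <- job:
		return true
	default:
		return false
	}
}

func main() {
	// Capacity 2 is an arbitrary bound chosen for the example.
	queue := make(chan int, 2)

	submitBlocking(queue, 1)
	submitBlocking(queue, 2) // the queue is now full

	fmt.Println(submitOrReject(queue, 3)) // false: no free slot, job refused

	<-queue // a consumer takes one job, freeing a slot

	fmt.Println(submitOrReject(queue, 3)) // true: accepted this time
}
```

The only difference between the two helpers is the default case. Without it, a full queue stalls the producer; with it, a full queue becomes an immediate "no".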

Go has a direct idiom for each: a bounded channel for option 1, the golang.org/x/sync/semaphore package for option 1 with per-item weighting, and a time.Ticker token bucket for option 3 (and trivially option 2). They are not interchangeable. They produce three very different latency curves under the same workload, which is what we will see below.
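
As a rough sketch of the rate-limiting idiom, a token bucket can be built from a time.Ticker and a buffered channel. The TokenBucket type, its method names, and the interval and burst values below are assumptions made for this example, not a library API. Wait gives the rate-limited behaviour, and TryTake shows why rejection comes almost for free.

```go
package main

import (
	"fmt"
	"time"
)

// TokenBucket is a hypothetical helper: a buffered channel holds the
// tokens and a time.Ticker adds one per interval, up to the burst size.
type TokenBucket struct {
	tokens chan struct{}
	ticker *time.Ticker
	done   chan struct{}
}

// NewTokenBucket starts full and refills one token every interval.
func NewTokenBucket(interval time.Duration, burst int) *TokenBucket {
	b := &TokenBucket{
		tokens: make(chan struct{}, burst),
		ticker: time.NewTicker(interval),
		done:   make(chan struct{}),
	}
	for i := 0; i < burst; i++ {
		b.tokens <- struct{}{}
	}
	go func() {
		for {
			select {
			case <-b.ticker.C:
				select {
				case b.tokens <- struct{}{}:
				default: // bucket already full, discard this tick
				}
			case <-b.done:
				return
			}
		}
	}()
	return b
}

// Wait blocks until a token is available (rate-limit the caller).
func (b *TokenBucket) Wait() { <-b.tokens }

// TryTake takes a token only if one is ready (reject the caller otherwise).
func (b *TokenBucket) TryTake() bool {
	select {
	case <-b.tokens:
		return true
	default:
		return false
	}
}

// Stop releases the ticker and the refill goroutine. Call it once.
func (b *TokenBucket) Stop() {
	b.ticker.Stop()
	close(b.done)
}

func main() {
	// 50ms per token and a burst of 2 are illustrative values.
	b := NewTokenBucket(50*time.Millisecond, 2)
	defer b.Stop()

	fmt.Println(b.TryTake(), b.TryTake(), b.TryTake())

	b.Wait() // blocks until the ticker refills a token
	fmt.Println("took a refilled token")
}
```

The burst size is the only buffering in this design, and it is bounded by construction. Once the burst is spent, callers proceed at the ticker's rate or are turned away.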

Concurrency on Segflow
