Something not as obvious is the relationship between service time, utilisation, ...

roenxi · on Oct 15, 2022

If we're intellectually honest, how many hours of studying Markov chains does any one insight there really justify? And what are the odds that any one insight is useful even while dealing with an honest-to-goodness queue? We're not exactly talking e^{i\pi} levels of "wow!" which almost justify teaching complex numbers just to hit people with the one equation.

The power is in the sheer number of semi-trivial observations that a queue theorist can start making after seeing only a small part of the system. And that is impressive - but mostly because you don't need to consider all those individual things as variables once the basic theory is understood. So the theorist can start ignoring all those variables really quickly and move on to dealing with the problem at hand.

gillh · on Oct 15, 2022

You should really check out the recent Aperture[1] project on GitHub that applies all these ideas in practice to protect services from cascading failures.

1. Aperture automatically detects queue buildup based on metrics such as latency.

2. It adjusts the concurrency on a service.

3. It applies weighted fair scheduling to workloads (i.e. APIs) based on their labels.

[1] https://github.com/fluxninja/aperture
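
To make steps 1 and 2 concrete, here is a minimal sketch of a latency-driven concurrency limiter. It is not Aperture's algorithm or API: the class name, the parameters, and the additive-increase/multiplicative-decrease rule are all assumptions chosen for illustration.

```python
class LatencyConcurrencyLimiter:
    """Hypothetical AIMD limiter: shrink the concurrency limit when observed
    latency suggests a queue is building up, grow it slowly otherwise."""

    def __init__(self, baseline_latency_ms, tolerance=1.5, backoff=0.5,
                 initial_limit=10, min_limit=1, max_limit=100):
        self.baseline_latency_ms = baseline_latency_ms  # latency with no queueing (assumed known)
        self.tolerance = tolerance    # how far above baseline counts as "queue buildup"
        self.backoff = backoff        # multiplicative decrease factor
        self.min_limit = min_limit
        self.max_limit = max_limit
        self.limit = initial_limit    # current number of allowed in-flight requests

    def observe(self, latency_ms):
        """Feed one latency sample and return the updated concurrency limit."""
        if latency_ms > self.baseline_latency_ms * self.tolerance:
            # Latency well above baseline: assume requests are queueing, back off hard.
            self.limit = max(self.min_limit, int(self.limit * self.backoff))
        else:
            # Latency looks healthy: probe for more capacity one slot at a time.
            self.limit = min(self.max_limit, self.limit + 1)
        return self.limit


limiter = LatencyConcurrencyLimiter(baseline_latency_ms=100)
healthy_limit = limiter.observe(100)     # at baseline, so the limit grows by one
overloaded_limit = limiter.observe(400)  # 4x baseline, so the limit is halved
```

The asymmetry (slow growth, fast shrink) is a common choice for this kind of loop because overshooting the limit is usually more costly than undershooting it.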

kqr · on Oct 15, 2022

I'm confused as to why the decision would be based on latency, which is a somewhat lagging indicator. Queue length would seem more efficient, since it's a leading one.
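
A toy simulation of the lag, as a rough sketch: a single FIFO server that completes one request per tick, hit by a burst of arrivals. The arrival pattern, the function names, and the shared threshold of 3 are assumptions made up for this example. With a service rate of one request per tick, a queue of N requests corresponds to roughly N ticks of waiting, so the same number is a comparable threshold for both signals.

```python
from collections import deque


def simulate(arrivals_per_tick):
    """Single FIFO server, one completion per tick.

    Returns, per tick, the queue length right after arrivals and the latency
    (in ticks) of the request completed during that tick (0 if idle)."""
    queue = deque()  # holds the arrival tick of each waiting request
    queue_lengths = []
    completed_latencies = []
    for tick, n_arrivals in enumerate(arrivals_per_tick):
        queue.extend([tick] * n_arrivals)
        queue_lengths.append(len(queue))
        if queue:
            arrived_at = queue.popleft()
            # The request finishes at the end of this tick.
            completed_latencies.append(tick - arrived_at + 1)
        else:
            completed_latencies.append(0)
    return queue_lengths, completed_latencies


def first_tick_above(series, threshold):
    """Index of the first value strictly above threshold, or None."""
    for tick, value in enumerate(series):
        if value > threshold:
            return tick
    return None


# Five quiet ticks, a five-tick burst of 4 arrivals per tick, then silence.
ARRIVALS = [1] * 5 + [4] * 5 + [0] * 20
queue_lengths, latencies = simulate(ARRIVALS)

queue_alarm_tick = first_tick_above(queue_lengths, 3)
latency_alarm_tick = first_tick_above(latencies, 3)
print(queue_alarm_tick, latency_alarm_tick)  # prints: 5 8
```

The queue-length signal crosses the threshold on the tick the burst starts, while completed-request latency only crosses it once requests that waited in the queue actually finish.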

gillh · on Oct 15, 2022

The metric and the control circuit are both configurable/programmable [1].

[1] https://github.com/fluxninja/aperture/blob/main/blueprints/b...