I just finished implementing a simple backpressure algorithm. Once the queue is ...

usrbin · on March 8, 2022

What is the purpose of cutting off the queue at a n% full? Isn't that effectively saying that the queue size is actually n% of the stated size, or am I missing detail?

bombcar · on March 8, 2022

Perhaps "rejects new items" is the key - a web server that hits a certain percentage utilization only allow existing clients/connections/logged in users/whatever to connect; refuse new connections.

cjg · on March 8, 2022

It's probabilistic.

morelisp · on March 8, 2022

Wait until you hear about hash table capacities.

SketchySeaBeast · on March 8, 2022

Haven't you just moved the pressure back a step? Now you have a bunch of items retrying to get their stuff on the queue and a bunch of places to try and locate the problem, instead of an overly large queue. Is the idea that the messages aren't important enough to be placed on the queue in the first place if demand is high?

cjg · on March 8, 2022

The idea is that it's not a sudden hard failure. Instead only a few things fail which helps in two ways: reducing the load by discarding some work and alerting that the system is beginning to be overloaded.

gpderetta · on March 9, 2022

this is similar to Random Early Detection [1].

[1] https://en.wikipedia.org/wiki/Random_early_detection

cjg · on March 11, 2022

bmm6o · on March 9, 2022

I think 429 is the preferred status code for this. Of course you can use whatever you want, but 503 can presumably come up in other scenarios and it's nice to have something unambiguous.

cjg · on March 11, 2022

429 is only applicable if it's the same user sending all the traffic.

503 explicitly mentions server overload.