Document updated on Jan 28, 2022
The Spike Arrest policy ensures a minimum time between different requests. KrakenD will enable Spike Arrest after exhausting the burst capacity of the rate-limiting features.
The bursting control is the policy that defines what to do when you reach the throttling capacity of the system within a second.
When users consume content with rate-limiting enabled, the capacity
of the rate limit defines the bursting they can have. Bursting makes users have a relatively higher number of requests for a short time. When this burst is exhausted, no additional requests are processed until the Token Bucket algorithm credits the user again.
The bursting control is automatically set on endpoint rate limiting with a capacity equal to the rate limit, and is configurable on backend rate limit.
The Spike Arrest policy defines the quickest time between two sequential requests when the users consume the maximum capacity.
After an emptied bucket (capacity exhausted), the following requests are in Spike Arrest mode and will need to have a delay of at least 1 ÷ max_rate
to be processed again. Krakend will reject connections requesting content faster than this rate.
Depending on the rate limit you implement, you might see rejected connections with status codes 503 Service Unavailable
or 429 Too Many Requests
.
The documentation is only a piece of the help you can get! Whether you are looking for Open Source or Enterprise support, see more support channels that can help you.