Throttling

Throttling is the process of limiting the number of API requests a user can make in a certain period.

When talking about a service api, throttling can be applied to one or more combinations of consumers:

  • Per Tenant.
  • Per App.
  • Per Service.
  • Per Geo localization.
  • Others.

Usually throttling is applied at the service level or gateway side level.

To apply service throttling to a consumer, it usually takes into consideration aggregation times of Request per Second (RPS) and/or Requests per Minute (RPM).