Throttling
Throttling is the process of limiting the number of API requests a user can make in a certain period.
When talking about a service api, throttling can be applied to one or more combinations of consumers:
- Per Tenant.
- Per App.
- Per Service.
- Per Geo localization.
- Others.
Usually throttling is applied at the service level or gateway side level.
To apply service throttling to a consumer, it usually takes into consideration aggregation times of Request per Second (RPS) and/or Requests per Minute (RPM).