Rate Limiting & Throttling in ASP.NET Core

8 min read Updated 7/8/2026

Protecting the API: Rate Limiting

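An API without rate limiting is an easy target for Denial of Service (DoS) attacks, and it is just as exposed to accidental overload from a buggy client. Rate limiting caps how many requests a client may make in a given period, so that a single user sending 10,000 requests a second cannot exhaust your server.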

1. Fixed Window vs Sliding Window

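Fixed Window: "100 requests per 1 minute." Simple, but a client can burst at the window boundary: 100 requests in the last second of one window plus 100 in the first second of the next adds up to 200 requests in about two seconds, double the intended limit.
Sliding Window: More accurate. It evaluates the limit over the most recent 60 seconds rather than over a clock-aligned window, which prevents boundary bursts. Implementations either log exact request timestamps or approximate the window with smaller segments (the built-in ASP.NET Core sliding window limiter uses segments).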
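The boundary burst can be reproduced with a small, deterministic sketch. The two classes below, `FixedWindowCounter` and `SlidingWindowLog`, are hypothetical hand-rolled illustrations (they are not the ASP.NET Core limiters, and they are not thread-safe); timestamps are passed in explicitly so the result does not depend on the real clock.

```csharp
using System;
using System.Collections.Generic;

// Replay the same traffic against both limiters: 100 requests in the last
// second of one minute, then 100 more in the first second of the next.
var boundary = new DateTimeOffset(2026, 1, 1, 0, 1, 0, TimeSpan.Zero);
var fixedWindow = new FixedWindowCounter(100, TimeSpan.FromMinutes(1));
var slidingWindow = new SlidingWindowLog(100, TimeSpan.FromMinutes(1));

int fixedAccepted = 0, slidingAccepted = 0;
foreach (var time in new[] { boundary.AddSeconds(-1), boundary })
{
    for (int i = 0; i < 100; i++)
    {
        if (fixedWindow.TryAcquire(time)) fixedAccepted++;
        if (slidingWindow.TryAcquire(time)) slidingAccepted++;
    }
}

Console.WriteLine($"Fixed window accepted:   {fixedAccepted}");   // 200
Console.WriteLine($"Sliding window accepted: {slidingAccepted}"); // 100

// Fixed window: one counter that resets whenever a new clock-aligned window starts.
sealed class FixedWindowCounter
{
    private readonly int _limit;
    private readonly long _windowMs;
    private long _currentWindow = -1;
    private int _count;

    public FixedWindowCounter(int limit, TimeSpan window)
    {
        _limit = limit;
        _windowMs = (long)window.TotalMilliseconds;
    }

    public bool TryAcquire(DateTimeOffset now)
    {
        long windowIndex = now.ToUnixTimeMilliseconds() / _windowMs;
        if (windowIndex != _currentWindow) { _currentWindow = windowIndex; _count = 0; }
        if (_count >= _limit) return false;
        _count++;
        return true;
    }
}

// Sliding window log: remembers the timestamps of accepted requests
// and only counts those that fall inside the most recent window.
sealed class SlidingWindowLog
{
    private readonly int _limit;
    private readonly TimeSpan _window;
    private readonly Queue<DateTimeOffset> _accepted = new();

    public SlidingWindowLog(int limit, TimeSpan window)
    {
        _limit = limit;
        _window = window;
    }

    public bool TryAcquire(DateTimeOffset now)
    {
        // Drop timestamps that have aged out of the window.
        while (_accepted.Count > 0 && now - _accepted.Peek() >= _window) _accepted.Dequeue();
        if (_accepted.Count >= _limit) return false;
        _accepted.Enqueue(now);
        return true;
    }
}
```

The log-based variant is exact but stores one timestamp per accepted request, which is why segment-based approximations are a common trade-off when limits are large.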

2. Token Bucket Algorithm

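This is often the most flexible approach. Each user gets a 'Bucket' that holds up to a fixed number of tokens. Every request consumes a token, and a request that finds the bucket empty is rejected (or queued). The bucket 'Refills' at a steady rate, up to its capacity. This allows users to burst occasionally, up to the bucket size, but prevents sustained high-volume abuse, because the long-run rate can never exceed the refill rate.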
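As a minimal sketch of the burst behaviour, the snippet below uses `TokenBucketRateLimiter` from `System.Threading.RateLimiting` directly in a console program (outside ASP.NET Core this is, as far as I know, a separate NuGet package). The numbers are illustrative assumptions: a bucket of 5 tokens refilled at 1 token per second. `AutoReplenishment` is turned off so no refill happens during the loop and the outcome is deterministic.

```csharp
using System;
using System.Threading.RateLimiting;

var options = new TokenBucketRateLimiterOptions
{
    TokenLimit = 5,                                 // bucket capacity = maximum burst size
    TokensPerPeriod = 1,                            // tokens added per replenishment period
    ReplenishmentPeriod = TimeSpan.FromSeconds(1),
    AutoReplenishment = false,                      // no background timer in this demo
    QueueLimit = 0                                  // reject immediately instead of queueing
};

using var limiter = new TokenBucketRateLimiter(options);

int accepted = 0, rejected = 0;
for (int i = 0; i < 8; i++)
{
    // Each attempt tries to take exactly one token from the bucket.
    using RateLimitLease lease = limiter.AttemptAcquire(1);
    if (lease.IsAcquired) accepted++; else rejected++;
}

Console.WriteLine($"Burst of 8 requests: {accepted} accepted, {rejected} rejected");
// Burst of 8 requests: 5 accepted, 3 rejected
```

The bucket starts full, so the first five requests pass as a burst and the rest are rejected until tokens are replenished. In an ASP.NET Core app the same options type is passed to `AddTokenBucketLimiter` when registering a named policy.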

3. Global vs User-Specific

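Implement both layers: **Global Throttling** to protect shared resources such as your database, and **User-Specific Throttling** (based on API Key or IP) to ensure fair usage among your customers. Prefer the API key where one exists, since many users can share one IP address behind a proxy or NAT.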
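One way to combine the two layers in ASP.NET Core (.NET 7 or later) is to chain two partitioned limiters and assign the result to `GlobalLimiter`. This is a sketch under assumptions: the `X-Api-Key` header name, the `/orders` endpoint, the `ClientKey` helper, and every numeric limit are hypothetical and should be replaced with values that suit your API.

```csharp
using System;
using System.Threading.RateLimiting;
using Microsoft.AspNetCore.Builder;
using Microsoft.AspNetCore.Http;
using Microsoft.AspNetCore.RateLimiting;
using Microsoft.Extensions.DependencyInjection;

var builder = WebApplication.CreateBuilder(args);

builder.Services.AddRateLimiter(options =>
{
    // Rejected requests get 429 instead of the default 503.
    options.RejectionStatusCode = StatusCodes.Status429TooManyRequests;

    options.GlobalLimiter = PartitionedRateLimiter.CreateChained(
        // Layer 1: one shared partition, so this caps total traffic to the app.
        PartitionedRateLimiter.Create<HttpContext, string>(httpContext =>
            RateLimitPartition.GetFixedWindowLimiter("global", key =>
                new FixedWindowRateLimiterOptions
                {
                    PermitLimit = 1000,
                    Window = TimeSpan.FromSeconds(1),
                    QueueLimit = 0
                })),
        // Layer 2: one partition per client, so each client has its own bucket.
        PartitionedRateLimiter.Create<HttpContext, string>(httpContext =>
            RateLimitPartition.GetTokenBucketLimiter(ClientKey(httpContext), key =>
                new TokenBucketRateLimiterOptions
                {
                    TokenLimit = 20,
                    TokensPerPeriod = 5,
                    ReplenishmentPeriod = TimeSpan.FromSeconds(1),
                    QueueLimit = 0
                })));
});

var app = builder.Build();

app.UseRateLimiter();                 // must run before the endpoints it protects
app.MapGet("/orders", () => "ok");    // hypothetical endpoint
app.Run();

// Hypothetical helper: identify the caller by API key, falling back to the IP address.
static string ClientKey(HttpContext httpContext)
{
    string? apiKey = httpContext.Request.Headers["X-Api-Key"];
    if (!string.IsNullOrEmpty(apiKey)) return $"key:{apiKey}";
    return $"ip:{httpContext.Connection.RemoteIpAddress?.ToString() ?? "unknown"}";
}
```

A request must pass both limiters to be served. Note that behind a reverse proxy `RemoteIpAddress` may be the proxy's address unless forwarded headers are configured, so the IP fallback is the weaker of the two identifiers.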

4. Interview Mastery

Q: "What is 'Distributed Rate Limiting'?"

Architect Answer: "In-memory rate limiting is only accurate if you have 1 server, because each instance keeps its own counters. If you have 10 servers behind a Load Balancer, a user could send 100 requests to EACH server, for 1,000 requests in total, bypassing your limit. We solve this by using **Redis** to store the rate limit counters centrally. All 10 servers increment and check the same Redis key, ensuring the user is capped at 100 total requests regardless of which server they hit."
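A minimal sketch of the shared-counter idea, assuming the StackExchange.Redis client and a fixed-window counter per client. The class name `RedisFixedWindowLimiter`, the `ratelimit:` key prefix, and the connection string in the usage comment are hypothetical. The window index is derived from each server's own clock, so this assumes the servers' clocks are roughly in sync.

```csharp
using System;
using System.Threading.Tasks;
using StackExchange.Redis;

public sealed class RedisFixedWindowLimiter
{
    private readonly IDatabase _db;
    private readonly int _limit;
    private readonly TimeSpan _window;

    public RedisFixedWindowLimiter(IDatabase db, int limit, TimeSpan window)
    {
        _db = db;
        _limit = limit;
        _window = window;
    }

    // Every server computes the same key for the same client and window,
    // so they all increment one shared counter.
    public static string WindowKey(string clientId, DateTimeOffset now, TimeSpan window)
    {
        long windowIndex = now.ToUnixTimeSeconds() / (long)window.TotalSeconds;
        return $"ratelimit:{clientId}:{windowIndex}";
    }

    public async Task<bool> IsAllowedAsync(string clientId)
    {
        string key = WindowKey(clientId, DateTimeOffset.UtcNow, _window);

        // INCR is atomic in Redis, so concurrent servers never lose an update.
        long count = await _db.StringIncrementAsync(key);

        // The first request in a window sets the TTL so stale counters clean themselves up.
        if (count == 1)
        {
            await _db.KeyExpireAsync(key, _window);
        }

        return count <= _limit;
    }
}

// Usage sketch (assumes a Redis instance reachable at localhost:6379):
//   var redis = await ConnectionMultiplexer.ConnectAsync("localhost:6379");
//   var limiter = new RedisFixedWindowLimiter(redis.GetDatabase(), 100, TimeSpan.FromMinutes(1));
//   bool allowed = await limiter.IsAllowedAsync("api-key-123");
```

Two caveats on this design: the increment and the expiry are separate commands, so a crash between them could leave a key without a TTL (a Lua script that does both in one step is a common hardening), and because it is a fixed window it inherits the boundary burst described in section 1.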

C# & .NET 8 Architect Mastery