To learn more about what local rate limiting is and the differences between local and global rate limiting, see About local rate limiting.

Before you begin

  1. Follow the Get started guide to install Gloo Gateway, set up a gateway resource, and deploy the httpbin sample app.

  2. Get the external address of the gateway and save it in an environment variable.

Apply local rate limit settings to Layer 4 traffic

  1. Create an HttpListenerOption resource with your local rate limit settings.

      kubectl apply -f- <<EOF
    apiVersion: gateway.solo.io/v1
    kind: HttpListenerOption
    metadata:
      name: local-ratelimit
      namespace: gloo-system
    spec:
      targetRefs: 
      - group: gateway.networking.k8s.io
        kind: Gateway
        name: http
      options: 
        networkLocalRatelimit: 
          maxTokens: 1
          tokensPerFill: 1
          fillInterval: 100s  
    EOF
      
  2. Send a request to the httpbin app. Verify that your request succeeds and a 200 HTTP response code is returned.

    Example output:

      ...
    * Mark bundle as not supporting multiuse
    < HTTP/1.1 200 OK
    HTTP/1.1 200 OK
    < content-type: application/xml
    content-type: application/xml
    ...
      
  3. Send another request to the httpbin app. Note that this time the request is denied immediately. Because the gateway is configured with only 1 token that is refilled every 100 seconds, the token was assigned to the connection of the first request. No tokens were available to be assigned to the second request. Because the request is rejected on Layer 4, no HTTP response code or message is returned.

    Example output:

      * Recv failure: Connection reset by peer
    * Closing connection 0
    curl: (56) Recv failure: Connection reset by peer
      

Apply local rate limit settings to Layer 7 traffic

  1. Change the local rate limiting settings in the HttpListenerOption resource to apply to Layer 7 traffic instead of Layer 4 traffic by using the httpLocalRatelimit option. The following example configures the gateway with a token bucket with a maximum of 1 token that is refilled every 100 seconds. To verify that your rate limiting settings are working as expected and to simplify troubleshooting, set enableXRatelimitHeaders: true. This option adds rate limiting headers to your response that indicate the local rate limiting settings that are applied, the number of tokens that are left in the bucket, and the number of seconds until the token bucket is refilled. For more information, see the Envoy documentation.

      kubectl apply -f- <<EOF
    apiVersion: gateway.solo.io/v1
    kind: HttpListenerOption
    metadata:
      name: local-ratelimit
      namespace: gloo-system
    spec: 
      targetRefs: 
      - group: gateway.networking.k8s.io
        kind: Gateway
        name: http
      options: 
        httpLocalRatelimit: 
          defaultLimit:
            maxTokens: 1
            tokensPerFill: 1
            fillInterval: 100s
          enableXRatelimitHeaders: true
    EOF
      
  2. Send a request to the httpbin app. Verify that your request succeeds and a 200 HTTP response code is returned. In addition, review the x-ratelimit-* headers that are returned. The x-ratelimit-limit header represents the token limit that is set on the gateway. To check how many tokens are available for subsequent requests, review the x-ratelimit-remaining header. Use the x-ratelimit-reset header to view how many seconds are left until the token bucket is refilled.

    Example output:

      ...
    * Mark bundle as not supporting multiuse
    < HTTP/1.1 200 OK
    HTTP/1.1 200 OK
    < x-ratelimit-limit: 1
    x-ratelimit-limit: 1
    < x-ratelimit-remaining: 0
    x-ratelimit-remaining: 0
    < x-ratelimit-reset: 95
    x-ratelimit-reset: 95
    ...
      
  3. Send another request to the httpbin app. Note that this time the request is denied with a 429 HTTP response code and a local_rate_limited message in your CLI output. Because the gateway is configured with only 1 token that is refilled every 100 seconds, the token was assigned to the connection of the first request. No tokens were available to be assigned to the second request. If you wait for 100 seconds, the token bucket is refilled and a new connection can be accepted by the gateway.

    Example output:

      ...
    * Mark bundle as not supporting multiuse
    < HTTP/1.1 429 Too Many Requests
    HTTP/1.1 429 Too Many Requests
    < x-ratelimit-limit: 1
    x-ratelimit-limit: 1
    < x-ratelimit-remaining: 0
    x-ratelimit-remaining: 0
    < x-ratelimit-reset: 79
    x-ratelimit-reset: 79
    ...
    Connection #0 to host 34.XXX.XX.XXX left intact
    local_rate_limited      
      

Cleanup

You can optionally clean up the resources that you created as part of this guide.

  kubectl delete httplisteneroption local-ratelimit -n gloo-system