    This documentation uses Gloo Gateway proxies with the Kubernetes Gateway API. To use the Edge API instead, see the Edge API documentation. Note that some features, such as AI gateways, are not supported with the Edge API.

    Tutorials

    Get started with Gloo AI Gateway and explore key capabilities.

    Set up Gloo AI Gateway

    Use a custom GatewayParameters resource to set up Gloo AI Gateway.

    Authenticate with API keys

    Learn how to use an API key to allow the AI Gateway to authenticate with an LLM provider.

    Control access

    Use JSON Web Tokens (JWTs) to ensure that only authenticated and authorized users can access the LLM APIs.

    Observability

    Gather and observe key metrics related to LLM provider usage.

    Rate limit requests

    Limit the number of requests that can be sent to the LLM provider.

    Manage and enrich prompts

    Effectively manage system and user prompts to improve LLM outputs.

    Set up prompt guards

    Secure access to the LLM and the data that is returned with Web Application Firewall (WAF) and Data Loss …

    Retrieval augmented generation (RAG)

    Provide relevant context for an LLM provider by retrieving data from one or more datasets.

    Semantic caching

    Cache LLM responses and reuse them for semantically similar prompts to reduce the number of requests that are sent to the LLM provider.

    Cleanup

    Remove the Gloo AI Gateway resources that you created as part of the tutorials.

    Solo.io copyright 2025