Configure OpenAI-compatible LLM providers such as Mistral, DeepSeek, or any other provider that implements the OpenAI API format in Solo Enterprise for agentgateway.

Overview

Many LLM providers offer APIs that are compatible with OpenAI’s API format. You can configure these providers in agentgateway by using the openai provider type with custom host, port, path, and authHeader overrides.

Note that when you specify a custom host override, agentgateway requires explicit TLS configuration via BackendTLSPolicy for HTTPS endpoints. This differs from well-known providers (like OpenAI) where TLS is automatically enabled when using default hosts.

Before you begin

Set up an agentgateway proxy.

Set up access to an OpenAI-compatible provider

Review the following examples for common OpenAI-compatible provider endpoints:

Mistral AI example

Set up OpenAI-compatible provider access to Mistral AI models.

  1. Get a Mistral AI API key.

  2. Save the API key in an environment variable.

      export MISTRAL_API_KEY=<insert your API key>
      
  3. Create a Kubernetes secret to store your Mistral AI API key.

      kubectl apply -f- <<EOF
    apiVersion: v1
    kind: Secret
    metadata:
      name: mistral-secret
      namespace: gloo-system
    type: Opaque
    stringData:
      Authorization: $MISTRAL_API_KEY
    EOF
      
  4. Create a Backend resource using the openai provider type with custom host and port overrides.

      kubectl apply -f- <<EOF
    apiVersion: gateway.kgateway.dev/v1alpha1
    kind: Backend
    metadata:
      name: mistral
      namespace: gloo-system
    spec:
      type: AI
      ai:
        llm:
          openai:
            authToken:
              kind: SecretRef
              secretRef:
                name: mistral-secret
            model: "mistral-medium-2505"
          host: api.mistral.ai
          port: 443
          path:
            full: "/v1/chat/completions"
    EOF
      

    Review the following table to understand this configuration.

    SettingDescription
    typeSet to AI to configure this Backend for an AI provider.
    aiDefine the AI backend configuration.
    openaiUse the openai provider type for OpenAI-compatible providers.
    hostRequired: The hostname of the OpenAI-compatible provider, such as api.mistral.ai.
    portRequired: The port number (typically 443 for HTTPS). Both host and port must be set together.
    pathOptional: Override the API path. Defaults to /v1/chat/completions if not specified.
    authHeaderOptional: Override the authentication header format. Defaults to Authorization: Bearer <token>.
    authTokenConfigure the authentication token. The token is sent in the header specified by authHeader. Defaults to the Authorization header.
    modelOptional: Override the model name. If unset, the model name is taken from the request. For models, see the Mistral docs.
  5. Create an HTTPRoute resource that routes incoming traffic to the Backend. Note that Gloo Gateway automatically rewrites the endpoint to the OpenAI chat completion endpoint of the LLM provider for you, based on the LLM provider that you set up in the Backend resource.

      kubectl apply -f- <<EOF
    apiVersion: gateway.networking.k8s.io/v1
    kind: HTTPRoute
    metadata:
      name: mistral
      namespace: gloo-system
    spec:
      parentRefs:
        - name: agentgateway
          namespace: gloo-system
      rules:
      - matches:
        - path:
            type: PathPrefix
            value: /mistral
        filters:
          - type: URLRewrite
            urlRewrite:
              hostname: api.mistral.ai
        backendRefs:
        - name: mistral
          namespace: gloo-system
          group: gateway.kgateway.dev
          kind: Backend
    EOF
      
  6. Create a BackendTLSPolicy to enable TLS for the external Mistral API.

      kubectl apply -f- <<EOF
    apiVersion: gateway.networking.k8s.io/v1
    kind: BackendTLSPolicy
    metadata:
      name: mistral-tls
      namespace: gloo-system
    spec:
      targetRefs:
      - name: mistral
        kind: Backend
        group: gateway.kgateway.dev
      validation:
        hostname: api.mistral.ai
        wellKnownCACertificates: System
    EOF
      

    Review the following table to understand this configuration.

    SettingDescription
    targetRefsReferences the Backend resource to apply TLS to.
    validation.hostnameThe hostname to validate in the server certificate (must match the host value in your Backend, e.g., api.mistral.ai).
    validation.wellKnownCACertificatesUse the system’s trusted CA certificates to verify the server certificate.
  7. Send a request to the LLM provider API. Verify that the request succeeds and that you get back a response from the chat completion API.

    Example output:

      {
      "model": "mistral-medium-2505",
      "usage": {
        "prompt_tokens": 20,
        "completion_tokens": 18,
        "total_tokens": 38
      },
      "choices": [
        {
          "message": {
            "content": "Silent circuits hum,\nLearning echoes through the void,\nWisdom without warmth.",
            "role": "assistant",
            "tool_calls": null
          },
          "index": 0,
          "finish_reason": "stop"
        }
      ],
      "id": "d05ef3973085435a8db8b51b580eeef8",
      "created": 1764614501,
      "object": "chat.completion"
    }
      

DeepSeek example

Set up OpenAI-compatible provider access to DeepSeek models.

  1. Get a DeepSeek API key.

  2. Save the API key in an environment variable.

      export DEEPSEEK_API_KEY=<insert your API key>
      
  3. Create a Kubernetes secret to store your DeepSeek API key.

      kubectl apply -f- <<EOF
    apiVersion: v1
    kind: Secret
    metadata:
      name: deepseek-secret
      namespace: gloo-system
    type: Opaque
    stringData:
      Authorization: $DEEPSEEK_API_KEY
    EOF
      
  4. Create a Backend resource using the openai provider type with custom host and port overrides.

      kubectl apply -f- <<EOF
    apiVersion: gateway.kgateway.dev/v1alpha1
    kind: Backend
    metadata:
      name: deepseek
      namespace: gloo-system
    spec:
      type: AI
      ai:
        llm:
          openai:
            authToken:
              kind: SecretRef
              secretRef:
                name: deepseek-secret
            model: "deepseek-chat"
          host: "api.deepseek.com"
          port: 443
          path:
            full: "/v1/chat/completions"
    EOF
      
  5. Create an HTTPRoute resource that routes incoming traffic to the Backend. Note that Gloo Gateway automatically rewrites the endpoint to the OpenAI chat completion endpoint of the LLM provider for you, based on the LLM provider that you set up in the Backend resource.

      kubectl apply -f- <<EOF
    apiVersion: gateway.networking.k8s.io/v1
    kind: HTTPRoute
    metadata:
      name: deepseek
      namespace: gloo-system
    spec:
      parentRefs:
        - name: agentgateway
          namespace: gloo-system
      rules:
      - matches:
        - path:
            type: PathPrefix
            value: /deepseek
        backendRefs:
        - name: deepseek
          namespace: gloo-system
          group: gateway.kgateway.dev
          kind: Backend
    EOF
      
  6. Create a BackendTLSPolicy to enable TLS for the external DeepSeek API.

      kubectl apply -f- <<EOF
    apiVersion: gateway.networking.k8s.io/v1
    kind: BackendTLSPolicy
    metadata:
      name: deepseek-tls
      namespace: gloo-system
    spec:
      targetRefs:
      - name: deepseek
        kind: Backend
        group: gateway.kgateway.dev
      validation:
        hostname: api.deepseek.com
        wellKnownCACertificates: System
    EOF
      

    Review the following table to understand this configuration.

    SettingDescription
    targetRefsReferences the Backend resource to apply TLS to.
    validation.hostnameThe hostname to validate in the server certificate (must match the host value in your Backend, e.g., api.deepseek.com).
    validation.wellKnownCACertificatesUse the system’s trusted CA certificates to verify the server certificate.
  7. Send a request to the LLM provider API. Verify that the request succeeds and that you get back a response from the chat completion API.

    Example output:

      {
      "id": "chatcmpl-deepseek-12345",
      "object": "chat.completion",
      "created": 1727967462,
      "model": "deepseek-chat",
      "choices": [
        {
          "index": 0,
          "message": {
            "role": "assistant",
            "content": "Neural networks learn,\nPatterns emerge from data streams,\nMind in silicon grows."
          },
          "finish_reason": "stop"
        }
      ],
      "usage": {
        "prompt_tokens": 20,
        "completion_tokens": 17,
        "total_tokens": 37
      }
    }
      

Next steps