OpenAI-compatible providers

Configure OpenAI-compatible LLM providers in kgateway, such as Mistral, DeepSeek, or any other provider that implements the OpenAI API format.

Overview

Many LLM providers offer APIs that are compatible with OpenAI’s API format. You can configure these providers in agentgateway by using the openai provider type with custom host, port, path, and authHeader overrides.

Note that when you specify a custom host override, agentgateway requires explicit TLS configuration through a BackendTLSPolicy for HTTPS endpoints. This differs from well-known providers such as OpenAI, where TLS is enabled automatically when you use the default host.
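For comparison, the following sketch shows a Backend for a well-known provider with no host override; the secret name openai-secret and the model value are assumptions for illustration. Because the default host is used, TLS is enabled automatically and no BackendTLSPolicy is needed.

```yaml
apiVersion: gateway.kgateway.dev/v1alpha1
kind: Backend
metadata:
  name: openai
  namespace: kgateway-system
spec:
  type: AI
  ai:
    llm:
      openai:
        authToken:
          kind: SecretRef
          secretRef:
            name: openai-secret   # assumed secret that holds the OpenAI API key
        model: "gpt-4o-mini"      # optional, as with any provider
        # No host or port: the default OpenAI host is used, with TLS enabled.
```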

Before you begin

Set up an agentgateway proxy.

Set up access to an OpenAI-compatible provider

Review the following examples for common OpenAI-compatible provider endpoints:

Mistral AI example

Set up OpenAI-compatible provider access to Mistral AI models.

  1. Get a Mistral AI API key.

  2. Save the API key in an environment variable.

    export MISTRAL_API_KEY=<insert your API key>
  3. Create a Kubernetes secret to store your Mistral AI API key.

    kubectl apply -f- <<EOF
    apiVersion: v1
    kind: Secret
    metadata:
      name: mistral-secret
      namespace: kgateway-system
    type: Opaque
    stringData:
      Authorization: $MISTRAL_API_KEY
    EOF
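The stringData field lets you supply the API key in plain text; Kubernetes base64-encodes it into the secret's data field for you. If you ever need to produce the encoded form yourself (for example, to write a data field directly), it is plain base64, as this sketch with a placeholder key shows:

```shell
# Placeholder value for illustration only; substitute your real key.
MISTRAL_API_KEY=example-key
# This is what Kubernetes stores under .data.Authorization:
echo -n "$MISTRAL_API_KEY" | base64
# → ZXhhbXBsZS1rZXk=
```

The -n flag matters: a trailing newline in the encoded value would corrupt the Authorization header.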
  4. Create a Backend resource using the openai provider type with custom host and port overrides.

    kubectl apply -f- <<EOF
    apiVersion: gateway.kgateway.dev/v1alpha1
    kind: Backend
    metadata:
      name: mistral
      namespace: kgateway-system
    spec:
      type: AI
      ai:
        llm:
          openai:
            authToken:
              kind: SecretRef
              secretRef:
                name: mistral-secret
            model: "mistral-medium-2505"
          host: api.mistral.ai
          port: 443
          path:
            full: "/v1/chat/completions"
    EOF

    Review the following table to understand this configuration.

    Setting | Description
    type | Set to AI to configure this Backend for an AI provider.
    ai | Define the AI backend configuration.
    openai | Use the openai provider type for OpenAI-compatible providers.
    host | Required: The hostname of the OpenAI-compatible provider, such as api.mistral.ai.
    port | Required: The port number (typically 443 for HTTPS). Both host and port must be set together.
    path | Optional: Override the API path. Defaults to /v1/chat/completions if not specified.
    authHeader | Optional: Override the authentication header format. Defaults to Authorization: Bearer <token>.
    authToken | Configure the authentication token. The token is sent in the header that authHeader specifies, which is Authorization by default.
    model | Optional: Override the model name. If unset, the model name is taken from the request. For available models, see the Mistral docs.
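As the table notes, model is optional. The following sketch shows the same llm block with the model override omitted, so that the model named in each request body is used instead:

```yaml
llm:
  openai:
    authToken:
      kind: SecretRef
      secretRef:
        name: mistral-secret
    # No "model" override: the model in each request body is passed through.
  host: api.mistral.ai
  port: 443
  path:
    full: "/v1/chat/completions"
```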
  5. Create an HTTPRoute resource that routes incoming traffic to the Backend. Note that kgateway automatically rewrites the request to the chat completion endpoint of the LLM provider that you set up in the Backend resource.

    kubectl apply -f- <<EOF
    apiVersion: gateway.networking.k8s.io/v1
    kind: HTTPRoute
    metadata:
      name: mistral
      namespace: kgateway-system
    spec:
      parentRefs:
        - name: agentgateway
          namespace: kgateway-system
      rules:
      - matches:
        - path:
            type: PathPrefix
            value: /mistral
        filters:
          - type: URLRewrite
            urlRewrite:
              hostname: api.mistral.ai
        backendRefs:
        - name: mistral
          namespace: kgateway-system
          group: gateway.kgateway.dev
          kind: Backend
    EOF
  6. Create a BackendTLSPolicy to enable TLS for the external Mistral API.

    kubectl apply -f- <<EOF
    apiVersion: gateway.networking.k8s.io/v1
    kind: BackendTLSPolicy
    metadata:
      name: mistral-tls
      namespace: kgateway-system
    spec:
      targetRefs:
      - name: mistral
        kind: Backend
        group: gateway.kgateway.dev
      validation:
        hostname: api.mistral.ai
        wellKnownCACertificates: System
    EOF

    Review the following table to understand this configuration.

    Setting | Description
    targetRefs | References the Backend resource to apply TLS to.
    validation.hostname | The hostname to validate in the server certificate. This must match the host value in your Backend, such as api.mistral.ai.
    validation.wellKnownCACertificates | Use the system's trusted CA certificates to verify the server certificate.
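If your OpenAI-compatible endpoint presents a certificate from a private CA (common for self-hosted gateways), you can point validation at your own CA bundle instead of the system store. This is a sketch only; the ConfigMap name mistral-ca and its ca.crt key are assumptions:

```yaml
apiVersion: gateway.networking.k8s.io/v1
kind: BackendTLSPolicy
metadata:
  name: mistral-tls
  namespace: kgateway-system
spec:
  targetRefs:
  - name: mistral
    kind: Backend
    group: gateway.kgateway.dev
  validation:
    hostname: api.mistral.ai
    caCertificateRefs:
    - group: ""
      kind: ConfigMap
      name: mistral-ca   # assumed ConfigMap with the CA bundle under "ca.crt"
```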
  7. Send a request to the LLM provider API. Verify that the request succeeds and that you get back a response from the chat completion API.

    # If your Gateway is exposed through a load balancer:
    curl "$INGRESS_GW_ADDRESS:8080/mistral" -H "content-type: application/json" -d '{
       "model": "",
       "messages": [
         {
           "role": "system",
           "content": "You are a helpful assistant."
         },
         {
           "role": "user",
           "content": "Write a short haiku about artificial intelligence."
         }
       ]
     }' | jq

    # If you port-forward the Gateway to localhost instead:
    curl "localhost:8080/mistral" -H "content-type: application/json" -d '{
       "model": "",
       "messages": [
         {
           "role": "system",
           "content": "You are a helpful assistant."
         },
         {
           "role": "user",
           "content": "Write a short haiku about artificial intelligence."
         }
       ]
     }' | jq

    Example output:

    {
      "id": "chatcmpl-mistral-12345",
      "object": "chat.completion",
      "created": 1727967462,
      "model": "mistral-medium-2505",
      "choices": [
        {
          "index": 0,
          "message": {
            "role": "assistant",
            "content": "Neural networks learn,\nPatterns emerge from data streams,\nMind in silicon grows."
          },
          "finish_reason": "stop"
        }
      ],
      "usage": {
        "prompt_tokens": 20,
        "completion_tokens": 17,
        "total_tokens": 37
      }
    }
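When you only need the completion text, you can extract it with jq rather than printing the whole response. A minimal sketch against a saved response file, whose shape matches the example output above:

```shell
# Save a trimmed-down chat-completion response, then extract the content field.
cat > /tmp/chat-response.json <<'EOF'
{
  "choices": [
    {
      "index": 0,
      "message": {
        "role": "assistant",
        "content": "Neural networks learn,\nPatterns emerge from data streams,\nMind in silicon grows."
      }
    }
  ]
}
EOF
# -r prints the raw string, turning \n escapes into real line breaks.
jq -r '.choices[0].message.content' /tmp/chat-response.json
```

In the curl commands above, you can replace the trailing `| jq` with `| jq -r '.choices[0].message.content'` to get the same effect on a live response.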

DeepSeek example

Set up OpenAI-compatible provider access to DeepSeek models.

  1. Get a DeepSeek API key.

  2. Save the API key in an environment variable.

    export DEEPSEEK_API_KEY=<insert your API key>
  3. Create a Kubernetes secret to store your DeepSeek API key.

    kubectl apply -f- <<EOF
    apiVersion: v1
    kind: Secret
    metadata:
      name: deepseek-secret
      namespace: kgateway-system
    type: Opaque
    stringData:
      Authorization: $DEEPSEEK_API_KEY
    EOF
  4. Create a Backend resource using the openai provider type with custom host and port overrides.

    kubectl apply -f- <<EOF
    apiVersion: gateway.kgateway.dev/v1alpha1
    kind: Backend
    metadata:
      name: deepseek
      namespace: kgateway-system
    spec:
      type: AI
      ai:
        llm:
          openai:
            authToken:
              kind: SecretRef
              secretRef:
                name: deepseek-secret
            model: "deepseek-chat"
          host: api.deepseek.com
          port: 443
          path:
            full: "/v1/chat/completions"
    EOF
  5. Create an HTTPRoute resource that routes incoming traffic to the Backend. Note that kgateway automatically rewrites the request to the chat completion endpoint of the LLM provider that you set up in the Backend resource.

    kubectl apply -f- <<EOF
    apiVersion: gateway.networking.k8s.io/v1
    kind: HTTPRoute
    metadata:
      name: deepseek
      namespace: kgateway-system
    spec:
      parentRefs:
        - name: agentgateway
          namespace: kgateway-system
      rules:
      - matches:
        - path:
            type: PathPrefix
            value: /deepseek
        backendRefs:
        - name: deepseek
          namespace: kgateway-system
          group: gateway.kgateway.dev
          kind: Backend
    EOF
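Unlike the Mistral route earlier on this page, this route does not rewrite the hostname. If your provider rejects requests whose Host header does not match its API hostname, you can add the same URLRewrite filter that the Mistral example uses. A sketch of the rules section only:

```yaml
rules:
- matches:
  - path:
      type: PathPrefix
      value: /deepseek
  filters:
  - type: URLRewrite
    urlRewrite:
      hostname: api.deepseek.com
  backendRefs:
  - name: deepseek
    namespace: kgateway-system
    group: gateway.kgateway.dev
    kind: Backend
```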
  6. Create a BackendTLSPolicy to enable TLS for the external DeepSeek API.

    kubectl apply -f- <<EOF
    apiVersion: gateway.networking.k8s.io/v1
    kind: BackendTLSPolicy
    metadata:
      name: deepseek-tls
      namespace: kgateway-system
    spec:
      targetRefs:
      - name: deepseek
        kind: Backend
        group: gateway.kgateway.dev
      validation:
        hostname: api.deepseek.com
        wellKnownCACertificates: System
    EOF

    Review the following table to understand this configuration.

    Setting | Description
    targetRefs | References the Backend resource to apply TLS to.
    validation.hostname | The hostname to validate in the server certificate. This must match the host value in your Backend, such as api.deepseek.com.
    validation.wellKnownCACertificates | Use the system's trusted CA certificates to verify the server certificate.
  7. Send a request to the LLM provider API. Verify that the request succeeds and that you get back a response from the chat completion API.

    # If your Gateway is exposed through a load balancer:
    curl "$INGRESS_GW_ADDRESS:8080/deepseek" -H "content-type: application/json" -d '{
       "model": "",
       "messages": [
         {
           "role": "system",
           "content": "You are a helpful assistant."
         },
         {
           "role": "user",
           "content": "Write a short haiku about artificial intelligence."
         }
       ]
     }' | jq

    # If you port-forward the Gateway to localhost instead:
    curl "localhost:8080/deepseek" -H "content-type: application/json" -d '{
       "model": "",
       "messages": [
         {
           "role": "system",
           "content": "You are a helpful assistant."
         },
         {
           "role": "user",
           "content": "Write a short haiku about artificial intelligence."
         }
       ]
     }' | jq

    Example output:

    {
      "id": "chatcmpl-deepseek-12345",
      "object": "chat.completion",
      "created": 1727967462,
      "model": "deepseek-chat",
      "choices": [
        {
          "index": 0,
          "message": {
            "role": "assistant",
            "content": "Neural networks learn,\nPatterns emerge from data streams,\nMind in silicon grows."
          },
          "finish_reason": "stop"
        }
      ],
      "usage": {
        "prompt_tokens": 20,
        "completion_tokens": 17,
        "total_tokens": 37
      }
    }