KrakenD EE v2.11: New AI integrations and Conditional Routing

Document updated on Sep 17, 2025

Anthropic integration

The Anthropic interface allows KrakenD to use Anthropic’s API (Claude) without writing custom integration code, enabling intelligent automation, content generation, or any LLM-powered use case within your existing API infrastructure.

This component abstracts away the Anthropic API, allowing the consumer to concentrate on the prompt only: for each request to an endpoint, KrakenD builds the Anthropic request with all the elements their API requires and returns a unified response, so if you use other vendors you get a consistent way of working with LLM models.

In other words, the user sends the content, like “tell me a joke!”, and then KrakenD builds the API payload necessary to talk to Anthropic.

This Anthropic interface configures a backend within KrakenD that transparently forwards REST requests to Anthropic’s API endpoints. It manages authentication, versioning, and payload formatting using a customizable templating system, so you can call Anthropic models without writing any integration code.

A simple configuration looks like this:

{
  "endpoint": "/anthropic",
  "method": "POST",
  "backend": [
    {
      "url_pattern": "/v1/messages",
      "host": [
        "https://api.anthropic.com"
      ],
      "extra_config": {
        "ai/llm": {
          "anthropic": {
            "v1": {
              "credentials": "xxxxx",
              "debug": false,
              "variables": {
                "model": "claude-opus-4-1-20250805"
              }
            }
          }
        }
      }
    }
  ]
}

To interact with the LLM, the user can send the following fields in the request:

  • instructions (optional): If you want to add a system prompt
  • contents: The content you want to send to the template

Like this:

$ curl -XPOST --json '{"instructions": "Act as a 1000 dollar consultant", "contents": "Tell me a consultant joke"}' http://localhost:8080/anthropic
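
The gateway returns the unified response format built by the default output template (described later in this document). The joke and token count below are illustrative, but the shape is:

{
  "ai_gateway_response": [
    {
      "contents": [
        "Why did the consultant cross the road? To bill both sides of the street."
      ]
    }
  ],
  "usage": "35"
}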

Configuration of Anthropic

To configure Anthropic, add the ai/llm namespace with the anthropic vendor under your backend’s extra_config.

Fields of Anthropic integration
* required fields

v1 object
All settings depend on a specific version, as the vendor might change the API over time.
credentials * string
Your Anthropic API key. You can set it as an environment variable for better security.
debug boolean
Enables the debug mode to log activity for troubleshooting. Do not set this value to true in production as it may log sensitive data.
Defaults to false
input_template string
A path to a custom Go template that sets the payload format sent to Anthropic. You don’t need to set this value unless you want to override the default template, which makes use of all the variables listed in this configuration.
output_template string
A path to a custom Go template that sets how the response from Anthropic is transformed before being sent to the client. The default template extracts the text from the first choice returned by Anthropic so in most cases you don’t need to set a custom output template.
variables * object
The variables specific to the Anthropic usage that are used to construct the payload (see the fuller example after this list).
extra_payload object
A map of additional payload attributes you want to use in your custom input_template (this payload is not used in the default template). The attributes set here are accessible in your custom template as {{ .variables.extra_payload.yourchosenkey }}. This option helps you add rare customizations and adopt future attributes.
max_tokens integer
Maximum number of tokens that can be generated in the response. A token is approximately four characters. 100 tokens correspond to roughly 60-80 words.
Defaults to 1024
model * string
The name of the Anthropic model you want to use.
Examples: "claude-opus-4-1-20250805" , "claude-sonnet-4-20250514" , "claude-3-7-sonnet-latest" , "claude-3-5-haiku-latest"
stop_sequences array
An array of sequences where the model will stop generating further tokens if found. This can be useful to control the length and content of the output.
temperature number
The temperature is used for sampling during response generation, which occurs when top_p and top_k are applied. Temperature controls the degree of randomness in token selection. Lower temperatures are good for prompts that require a less open-ended or creative response, while higher temperatures can lead to more diverse or creative results.
top_k integer
Top-K changes how the model selects tokens for output. A top-K of 1 means the next selected token is the most probable among all tokens in the model’s vocabulary (also called greedy decoding), while a top-K of 3 means that the next token is selected from among the three most probable tokens by using temperature.
top_p number
Top-P changes how the model selects tokens for output. Tokens are selected from the most probable to least probable until the sum of their probabilities equals the top-P value. For example, if tokens A, B, and C have a probability of 0.3, 0.2, and 0.1 and the top-P value is 0.5, then the model will select either A or B as the next token by using temperature and will exclude C as a candidate.
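
Putting several of these fields together, a fuller variables block could look like this (all values are illustrative, and you should tune them to your use case):

"ai/llm": {
  "anthropic": {
    "v1": {
      "credentials": "xxxxx",
      "variables": {
        "model": "claude-3-5-haiku-latest",
        "max_tokens": 512,
        "temperature": 0.2,
        "top_k": 3,
        "top_p": 0.9,
        "stop_sequences": ["END"]
      }
    }
  }
}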

Customizing the payload sent and received from Anthropic

As with all LLM interfaces in KrakenD, you can completely replace the request and the response to get a custom interaction with the LLM. While the default templates should cover most day-to-day jobs, you might need to extend them with your own templates.

You may override the input and output Go templates by specifying:

  • input_template: Path to a custom template controlling how the request data is formatted before sending to Anthropic.
  • output_template: Path to a custom template to transform and extract the desired pieces from Anthropic’s response.

The sections below show how to change this interaction.
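
For instance, a backend could point both settings to your own template files (the paths below are hypothetical):

"ai/llm": {
  "anthropic": {
    "v1": {
      "credentials": "xxxxx",
      "input_template": "./templates/anthropic_input.tmpl",
      "output_template": "./templates/anthropic_output.tmpl",
      "variables": {
        "model": "claude-opus-4-1-20250805"
      }
    }
  }
}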

Default input_template for Anthropic v1

When you don’t set any input_template, KrakenD will create the JSON payload sent to Anthropic using the following template:

{
	"model": {{ .variables.model | toJson }},
	"max_tokens": {{ .variables.max_tokens }},
	"stream": false,
	{{ $temperature := .variables.temperature }}{{ if ge $temperature 0.0 }}"temperature": {{ $temperature }},{{ end }}
	{{ $top_p := .variables.top_p }}{{ if ge $top_p 0.0 }}"top_p": {{ $top_p }},{{ end }}
	{{ $top_k := .variables.top_k }}{{ if ge $top_k 0 }}"top_k": {{ $top_k }},{{ end }}
	"stop_sequences": {{ .variables.stop_sequences | toJson }},
	{{- if hasKey .req_body "instructions" }}
	"system": [
		{
			"type": "text",
			"cache_control": {
				"type": "ephemeral",
				"ttl": "5m"
			},
			"text": {{ .req_body.instructions | toJson }}
		}
	],
	{{- end }}
	"messages": [
		{
			"role": "user",
			"content": {{ .req_body.contents | toJson }}
		}
	]
}
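
For example, with the configuration and curl request shown earlier, this default template would render a payload roughly like the following (a sketch assuming no sampling variables are set, so those fields are omitted, and stop_sequences defaults to an empty list):

{
	"model": "claude-opus-4-1-20250805",
	"max_tokens": 1024,
	"stream": false,
	"stop_sequences": [],
	"system": [
		{
			"type": "text",
			"cache_control": {
				"type": "ephemeral",
				"ttl": "5m"
			},
			"text": "Act as a 1000 dollar consultant"
		}
	],
	"messages": [
		{
			"role": "user",
			"content": "Tell me a consultant joke"
		}
	]
}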

Remember you can access your own variables declared in the configuration using {{ .variables.xxx }}.
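
As a sketch, a custom input_template could also inject extra_payload attributes. The metadata key below is a hypothetical attribute you would declare under variables.extra_payload in your configuration; it is not part of the default template:

{
	"model": {{ .variables.model | toJson }},
	"max_tokens": {{ .variables.max_tokens }},
	"stream": false,
	"metadata": {{ .variables.extra_payload.metadata | toJson }},
	"messages": [
		{
			"role": "user",
			"content": {{ .req_body.contents | toJson }}
		}
	]
}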

Default output_template for Anthropic v1

When you don’t declare an output_template, the response from the AI is transformed to have the following format:

{
	"ai_gateway_response": [
		{
			"contents": [
				{{- range $index, $part := .resp_body.content }}
				{{- if $index }},{{ end }}
				{{ $part.text | toJson }}
				{{- end }}
			]
		}
	],
	"usage": "{{ add .resp_body.usage.output_tokens .resp_body.usage.input_tokens }}
}

As you can see in the response template, you get the total number of tokens consumed, aggregated across input and output.
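
For reference, the template above reads Anthropic’s native response. A simplified, illustrative fragment of such a response looks like this; with these numbers, the usage field would render as "35":

{
	"content": [
		{
			"type": "text",
			"text": "Why did the consultant cross the road?..."
		}
	],
	"usage": {
		"input_tokens": 20,
		"output_tokens": 15
	}
}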
