Skip to content

Opus 4.6 Fast Mode available on AI Gateway

Authors

1 min read

Fast mode support for Claude Opus 4.6 is now available on AI Gateway.

Fast mode is a premium high-speed option that delivers 2.5x faster output token speeds with the same model intelligence. This is an early, experimental feature.

Fast mode's increased output token speeds enable new use cases, especially for human-in-the-loop workflows. Run large coding tasks without needing to context switch and get planning results without extended waits.

To enable fast mode, pass speed: 'fast' in the anthropic provider options in AI SDK:

import { streamText } from "ai";
const { text } = await streamText({
model: 'anthropic/claude-opus-4.6',
prompt:
`Analyze this codebase structure and create a step-by-step plan
to add user authentication.`,
providerOptions: {
anthropic: {
speed: 'fast',
},
},
});

You can use fast mode with Claude Code via AI Gateway by setting "fastMode": true in your settings.json.

{
"model": "opus[1m]",
"fastMode": true
}

Try fast mode directly in the AI Gateway playground for Opus 4.6.

Fast mode is priced at 6x standard Opus rates.

Standard

Fast Mode

Input: $5 / 1M tokens
Output: $25 / 1M tokens

Input: $30 / 1M tokens
Output: $150 / 1M tokens

All standard pricing multipliers (e.g., prompt caching) apply on top of these rates.

AI Gateway: Track top AI models by usage

The AI Gateway model leaderboard ranks the most used models over time by total token volume across all traffic through the Gateway. Updates regularly.

View the leaderboard