Opus 4.6 Fast Mode available on AI Gateway

Apr 7, 2026

Fast mode support for Claude Opus 4.6 is now available on AI Gateway.

Fast mode is a premium, high-speed option that delivers up to 2.5x faster output token speeds with the same model intelligence. It is an early, experimental feature.

Fast mode's increased output token speeds enable new use cases, especially for human-in-the-loop workflows. Run large coding tasks without needing to context switch and get planning results without extended waits.

To enable fast mode, pass speed: 'fast' in the anthropic provider options in AI SDK:

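import { streamText } from "ai";

// streamText returns immediately; the response is consumed as a stream.
const result = streamText({
  model: 'anthropic/claude-opus-4.6',
  prompt:
   `Analyze this codebase structure and create a step-by-step plan
    to add user authentication.`,
  providerOptions: {
    anthropic: {
      speed: 'fast',
    },
  },
});

// Print text chunks as they arrive.
for await (const chunk of result.textStream) {
  process.stdout.write(chunk);
}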
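Because fast mode is billed at a premium, it may make sense to enable it only for interactive, human-in-the-loop requests. The following is a minimal sketch of that idea. `fastModeOptions` and `AnthropicSpeedOptions` are hypothetical names introduced here for illustration and are not part of AI SDK; the only thing taken from the example above is the shape of the `providerOptions` object.

```typescript
// Hypothetical helper (not part of AI SDK): returns the providerOptions
// object from the example above when fast mode is wanted, otherwise undefined.
type AnthropicSpeedOptions = { anthropic: { speed: "fast" } };

function fastModeOptions(enabled: boolean): AnthropicSpeedOptions | undefined {
  return enabled ? { anthropic: { speed: "fast" } } : undefined;
}

// Assumption for illustration: the caller knows whether a person is waiting.
const isInteractive = true;
const providerOptions = fastModeOptions(isInteractive);

console.log(JSON.stringify(providerOptions));
```

The result would then be passed as `providerOptions: fastModeOptions(isInteractive)` in the `streamText` call, so background jobs fall back to standard speed and standard pricing.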

You can use fast mode with Claude Code via AI Gateway by setting "fastMode": true in your settings.json.

{
  "model": "opus[1m]",
  "fastMode": true
}

Try fast mode directly in the AI Gateway playground for Opus 4.6.

Fast mode is priced at 6x standard Opus rates.

	Standard	Fast Mode
Input (per 1M tokens)	$5	$30
Output (per 1M tokens)	$25	$150

All standard pricing multipliers (e.g., prompt caching) apply on top of these rates.
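To make the price difference concrete, here is a small sketch that estimates the base cost of a single request in each mode using the rates listed above. The function name `estimateCost` and the token counts are illustrative assumptions, and the result excludes any multipliers such as prompt caching.

```typescript
// Per-million-token rates in USD, copied from the pricing table above.
type Mode = "standard" | "fast";

interface Rates {
  inputPerMillion: number;
  outputPerMillion: number;
}

const RATES: Record<Mode, Rates> = {
  standard: { inputPerMillion: 5, outputPerMillion: 25 },
  fast: { inputPerMillion: 30, outputPerMillion: 150 },
};

// Base cost in USD for one request, before multipliers such as prompt caching.
function estimateCost(mode: Mode, inputTokens: number, outputTokens: number): number {
  const rates = RATES[mode];
  return (
    (inputTokens / 1_000_000) * rates.inputPerMillion +
    (outputTokens / 1_000_000) * rates.outputPerMillion
  );
}

// Illustrative request: 200k input tokens and 20k output tokens.
const standardCost = estimateCost("standard", 200_000, 20_000);
const fastCost = estimateCost("fast", 200_000, 20_000);

console.log(`standard: $${standardCost.toFixed(2)}, fast: $${fastCost.toFixed(2)}`);
```

For this hypothetical request the standard cost works out to $1.50 and the fast mode cost to $9.00, which matches the 6x ratio.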
