Fast Mode
High-speed output mode for Claude Opus 4.6 that provides faster token generation without switching to a different model. Currently available as a research preview.
# Toggle fast mode from the prompt
/fast
# Or configure in settings.json
{
  "fastMode": true
}
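If the setting needs to be applied from a script rather than by hand, the sketch below shows one way to do it. It assumes only what is stated above, namely that `settings.json` is a JSON object with a boolean `fastMode` key. The helper name `set_fast_mode` is hypothetical, and the file location is left to the caller because this document does not specify it.

```python
import json
from pathlib import Path


def set_fast_mode(settings_path: Path, enabled: bool = True) -> dict:
    """Set the "fastMode" key in a JSON settings file, keeping all other keys.

    Hypothetical helper: the path of settings.json is supplied by the caller.
    """
    settings = {}
    if settings_path.exists():
        text = settings_path.read_text(encoding="utf-8")
        if text.strip():
            # Preserve whatever is already configured.
            settings = json.loads(text)

    settings["fastMode"] = enabled

    # Create the parent directory if needed, then write the file back.
    settings_path.parent.mkdir(parents=True, exist_ok=True)
    settings_path.write_text(json.dumps(settings, indent=2) + "\n", encoding="utf-8")
    return settings
```

Calling `set_fast_mode(Path("settings.json"))` updates a file in the current directory; point it at the real settings file for your installation.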
- Uses the same Opus 4.6 model with accelerated output generation
- Billed as extra usage at a premium rate with separate rate limits
- Interactive toggle shows current status: "Fast mode ON" or "Fast mode OFF"
- When enabled, displays with the lightning bolt icon (↯) and orange color theming
- When the fast mode rate limit is reached, fast mode enters a cooldown period and automatically re-enables once the limit resets
- Enabling fast mode automatically switches the model to Opus 4.6 if it is not already selected
Availability:
- Requires a paid subscription
- Requires extra usage billing to be enabled (/extra-usage)
- Requires the native binary (not available via npm install)
- Not available on Bedrock, Vertex, Foundry, or in the Agent SDK
- Organizations can disable fast mode for their accounts
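The availability rules above amount to a checklist, which can be sketched as a small function that reports which requirements are unmet. This is an illustration only, not the product's actual check: the `Environment` fields, the provider labels, and the function name `fast_mode_blockers` are all hypothetical.

```python
from dataclasses import dataclass

# Provider labels are hypothetical; the list of platforms comes from the text above.
UNSUPPORTED_PROVIDERS = {"bedrock", "vertex", "foundry"}


@dataclass
class Environment:
    """Hypothetical description of the caller's setup."""
    paid_subscription: bool
    extra_usage_enabled: bool
    native_binary: bool
    provider: str = "first-party"
    agent_sdk: bool = False
    org_disabled: bool = False


def fast_mode_blockers(env: Environment) -> list[str]:
    """Return the unmet requirements; an empty list means none were found."""
    blockers = []
    if not env.paid_subscription:
        blockers.append("requires a paid subscription")
    if not env.extra_usage_enabled:
        blockers.append("requires extra usage billing (/extra-usage)")
    if not env.native_binary:
        blockers.append("requires the native binary")
    if env.provider.lower() in UNSUPPORTED_PROVIDERS:
        blockers.append(f"not available on {env.provider}")
    if env.agent_sdk:
        blockers.append("not available in the Agent SDK")
    if env.org_disabled:
        blockers.append("disabled by the organization")
    return blockers
```

Returning the full list of blockers, rather than a single boolean, makes it possible to tell the user everything that needs fixing in one pass.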
Rate Limiting:
- Displays "Fast limit reached and temporarily disabled · resets in X" when hitting limits
- Shows "Fast limit reset · now using fast mode" when cooldown expires
- Tracks remaining time to reset in the UI
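The cooldown behavior described above can be modeled as a small state machine. The sketch below is an assumption-laden illustration rather than the real implementation: the class `FastModeState`, its method names, and the duration format produced by `format_remaining` are hypothetical (the source only shows "X" for the remaining time). The status strings themselves are the ones quoted in this section.

```python
from dataclasses import dataclass
from typing import Optional


def format_remaining(seconds: float) -> str:
    """Hypothetical duration format; the source only shows 'resets in X'."""
    total = max(0, int(seconds))
    hours, rem = divmod(total, 3600)
    minutes, secs = divmod(rem, 60)
    if hours:
        return f"{hours}h {minutes}m"
    if minutes:
        return f"{minutes}m"
    return f"{secs}s"


@dataclass
class FastModeState:
    enabled: bool = False
    cooldown_until: Optional[float] = None  # timestamp in seconds, None if no cooldown

    def toggle(self) -> str:
        """Flip the user's preference and return the status text."""
        self.enabled = not self.enabled
        return "Fast mode ON" if self.enabled else "Fast mode OFF"

    def hit_limit(self, now: float, reset_in_seconds: float) -> str:
        """Start a cooldown after the fast mode rate limit is reached."""
        self.cooldown_until = now + reset_in_seconds
        remaining = format_remaining(reset_in_seconds)
        return f"Fast limit reached and temporarily disabled · resets in {remaining}"

    def tick(self, now: float) -> Optional[str]:
        """Clear an expired cooldown; return the reset notice if fast mode is on."""
        if self.cooldown_until is None or now < self.cooldown_until:
            return None
        self.cooldown_until = None
        return "Fast limit reset · now using fast mode" if self.enabled else None

    def is_active(self, now: float) -> bool:
        """Fast mode is in effect only when enabled and not cooling down."""
        cooling = self.cooldown_until is not None and now < self.cooldown_until
        return self.enabled and not cooling
```

Keeping the user's preference (`enabled`) separate from the cooldown timestamp is what lets fast mode re-enable itself automatically: the preference is never cleared, so once the cooldown expires the mode is active again without another toggle.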
Promotions:
- May show discount percentages for extra usage (e.g., "50% off through [date]")
Fast mode system (search for "Fast mode", "tengu_penguins_enabled", "/fast")