claude-sonnet-4-6 - API, Pricing & Context Window | Vivgrid
claude-sonnet-4-6 on Vivgrid: Anthropic's balanced Sonnet model with a 1M-token context window, Messages API, and geo-distributed acceleration.
claude-sonnet-4-6 is Anthropic's balanced Sonnet model, offering much of Opus's coding and reasoning quality at a lower price and higher speed. Its 1M-token context window makes it a versatile workhorse for production agents.
On Vivgrid, claude-sonnet-4-6 is geo-distributed across AMER and EMEA and reachable on the Messages API through the same unified key as the rest of the catalog.
Specifications
| Specification | Value |
|---|---|
| Provider | Anthropic |
| Model ID | claude-sonnet-4-6 |
| Best for | Coding |
| Context window | 1,000,000 tokens |
| Max output | 128,000 tokens |
| Modalities | Text, Image |
| Tool / function calling | Yes |
| Knowledge cutoff | 2025-08 |
| Acceleration | ⚡ Geo-Distributed: AMER, EMEA |
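
The context-window and max-output figures above can be turned into a simple pre-flight check. The sketch below assumes that a request's input tokens and its requested `max_tokens` share the context window, and that Vivgrid enforces the same limits as the table lists; `fits_budget` is a hypothetical helper name, and the token counts must come from a real tokenizer or from the API's usage data.

```python
CONTEXT_WINDOW = 1_000_000  # tokens, from the specifications table
MAX_OUTPUT = 128_000        # tokens, from the specifications table


def fits_budget(input_tokens: int, max_tokens: int) -> bool:
    """Return True if a request stays within the published limits.

    Assumption: input tokens and requested output tokens share the
    context window. This helper only does the arithmetic; it does not
    count tokens itself.
    """
    if input_tokens < 0 or max_tokens <= 0:
        raise ValueError("input_tokens must be >= 0 and max_tokens must be > 0")
    within_output_cap = max_tokens <= MAX_OUTPUT
    within_window = input_tokens + max_tokens <= CONTEXT_WINDOW
    return within_output_cap and within_window
```

For instance, `fits_budget(800_000, 128_000)` passes, while `fits_budget(900_000, 128_000)` fails because the two figures together exceed 1,000,000 tokens.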
Pricing
Prices are in USD per 1M tokens and match the provider's rates.
| Input | Cached input | Output |
|---|---|---|
| $4.00 | $1.00 | $15.00 |
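
As a worked example of the per-1M-token rates, the sketch below estimates the cost of a single request. It assumes cached input tokens are billed at the cached rate in place of the regular input rate, and it ignores any cache-write surcharge, which the table does not list. `estimate_cost` is a hypothetical helper, not part of any Vivgrid SDK.

```python
from decimal import Decimal

# USD per 1M tokens, copied from the pricing table.
RATES = {
    "input": Decimal("4.00"),
    "cached_input": Decimal("1.00"),
    "output": Decimal("15.00"),
}
PER_MILLION = Decimal(1_000_000)


def estimate_cost(input_tokens: int, cached_input_tokens: int, output_tokens: int) -> Decimal:
    """Estimate the USD cost of one request from its token counts.

    Assumption: input_tokens counts only uncached input, and cached
    input is billed separately at the cached rate.
    """
    if min(input_tokens, cached_input_tokens, output_tokens) < 0:
        raise ValueError("token counts must be non-negative")
    total = (
        input_tokens * RATES["input"]
        + cached_input_tokens * RATES["cached_input"]
        + output_tokens * RATES["output"]
    )
    return total / PER_MILLION


# 200k uncached input, 800k cached input, 10k output:
# 0.80 + 0.80 + 0.15 = 1.75 USD
cost = estimate_cost(200_000, 800_000, 10_000)
```

Using `Decimal` rather than floats keeps the arithmetic exact, which matters when many small per-request costs are summed.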
Quick start
Call claude-sonnet-4-6 through Vivgrid's unified, OpenAI-compatible endpoint. Get an API key from the Vivgrid Console.

    curl https://api.vivgrid.com/v1/messages \
      -H "Authorization: Bearer $VIVGRID_API_KEY" \
      -H "Content-Type: application/json" \
      -d '{
        "model": "claude-sonnet-4-6",
        "max_tokens": 1024,
        "messages": [
          { "role": "user", "content": "Say hello in English, Chinese and Spanish." }
        ],
        "stream": true
      }'

Ideal use cases
- Everyday coding agents balancing quality, speed, and cost
- High-volume reasoning and content workflows
- Long-context analysis within a 1M-token budget
- Production assistants needing reliable tool use
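
For the tool-use case, a request could look like the sketch below. It reuses the endpoint and headers from the Quick start and assumes Vivgrid passes the `tools` field through in the shape used by Anthropic's Messages API (`name`, `description`, `input_schema`). The `get_weather` tool and the `build_request` helper are hypothetical.

```python
import json
import os
import urllib.request

API_URL = "https://api.vivgrid.com/v1/messages"  # endpoint from the Quick start


def build_request(prompt: str, api_key: str) -> urllib.request.Request:
    """Build (but do not send) a Messages API request that declares one tool."""
    payload = {
        "model": "claude-sonnet-4-6",
        "max_tokens": 1024,
        "messages": [{"role": "user", "content": prompt}],
        # Hypothetical tool, declared in the Anthropic Messages API shape.
        # Assumption: Vivgrid forwards this field unchanged.
        "tools": [
            {
                "name": "get_weather",
                "description": "Look up the current weather for a city.",
                "input_schema": {
                    "type": "object",
                    "properties": {"city": {"type": "string"}},
                    "required": ["city"],
                },
            }
        ],
    }
    return urllib.request.Request(
        API_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


if __name__ == "__main__":
    key = os.environ.get("VIVGRID_API_KEY")
    if key:  # only call the network when a key is configured
        req = build_request("What is the weather in Madrid?", key)
        with urllib.request.urlopen(req, timeout=60) as resp:
            print(json.loads(resp.read()))
```

Keeping request construction separate from sending makes the payload easy to inspect or unit-test without spending tokens.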
Related models
- claude-opus-4-7: the frontier Opus model
- claude-haiku-4-5: the fastest, cheapest Claude
- gpt-5.4: comparable OpenAI coding model
claude-opus-4-6
claude-opus-4-6 on Vivgrid: Anthropic's high-end Opus coding model with a 1M-token context window, Messages API, and geo-distributed acceleration.
claude-haiku-4-5
claude-haiku-4-5 on Vivgrid: Anthropic's fast, low-cost Haiku model with a 200K context window, Messages API, and geo-distributed acceleration.