deepseek-v4-pro β API, Pricing & Context Window | Vivgrid
deepseek-v4-pro on Vivgrid: DeepSeek's flagship coding model with a 1M-token context window, up to 384K output tokens, and competitive pricing.
deepseek-v4-pro is DeepSeek's flagship coding model, pairing strong agentic performance with a standout 1M-token context window and an unusually large 384K-token max output. Its competitive pricing makes frontier-class coding accessible at scale.
On Vivgrid, deepseek-v4-pro is available through the unified, OpenAI-compatible endpoint, so you can route DeepSeek alongside OpenAI, Anthropic, and Google models with a single API key.
Specifications
| Provider | DeepSeek |
| Model ID | deepseek-v4-pro |
| Best for | Coding |
| Context window | 1,000,000 tokens |
| Max output | 384,000 tokens |
| Modalities | Text |
| Tool / function calling | Yes |
| Knowledge cutoff | 2025-05 |
| Acceleration | π Global (Centralized) |
Pricing
Pricing in USD per 1M tokens, matching the provider's rates.
| Input | Cached input | Output |
|---|---|---|
| $1.74 | $0.14 | $3.48 |
Quick start
Call deepseek-v4-pro through Vivgrid's unified, OpenAI-compatible endpoint. Get an API key from the Vivgrid Console.
curl https://api.vivgrid.com/v1/chat/completions \
-H "Authorization: Bearer $VIVGRID_API_KEY" \
-H "Content-Type: application/json" \
-d '{
"model": "deepseek-v4-pro",
"messages": [
{ "role": "user", "content": "Say hello in English, Chinese and Spanish." }
],
"stream": true
}'Ideal use cases
- Cost-effective coding agents needing very long context
- Tasks that generate large outputs (up to 384K tokens)
- Whole-repository reasoning and refactoring
- High-volume engineering workloads on a budget
Related models
- deepseek-v4-flash β faster, cheaper V4 sibling
- deepseek-v3.2 β prior-generation model
- glm-5.1 β alternative low-cost coding model
gemini-2.0-flash
gemini-2.0-flash on Vivgrid: Google's efficient multimodal model with a 1M-token context window across text, image, video, audio and PDF.
deepseek-v4-flash
deepseek-v4-flash on Vivgrid: DeepSeek's fast, ultra-affordable model with a 1M-token context window and up to 384K output tokens.