deepseek-v4-pro — API, Pricing & Context Window

deepseek-v4-pro on Vivgrid: DeepSeek's flagship coding model with a 1M-token context window, up to 384K output tokens, and competitive pricing.

deepseek-v4-pro is DeepSeek's flagship coding model, pairing strong agentic performance with a standout 1M-token context window and an unusually large 384K-token max output. Its competitive pricing makes frontier-class coding accessible at scale.

On Vivgrid, deepseek-v4-pro is available through the unified, OpenAI-compatible endpoint, so you can route DeepSeek alongside OpenAI, Anthropic, and Google models with a single API key.

Specifications

Provider	DeepSeek
Model ID	`deepseek-v4-pro`
Best for	Coding
Context window	1,000,000 tokens
Max output	384,000 tokens
Modalities	Text
Tool / function calling	Yes
Knowledge cutoff	2025-05
Acceleration	🌐 Global (Centralized)

Pricing

Pricing in USD per 1M tokens, matching the provider's rates.

Input	Cached input	Output
$1.74	$0.14	$3.48

Quick start

Call deepseek-v4-pro through Vivgrid's unified, OpenAI-compatible endpoint. Get an API key from the Vivgrid Console.

curl https://api.vivgrid.com/v1/chat/completions \
  -H "Authorization: Bearer $VIVGRID_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "deepseek-v4-pro",
    "messages": [
      { "role": "user", "content": "Say hello in English, Chinese and Spanish." }
    ],
    "stream": true
  }'

Ideal use cases

Cost-effective coding agents needing very long context
Tasks that generate large outputs (up to 384K tokens)
Whole-repository reasoning and refactoring
High-volume engineering workloads on a budget

deepseek-v4-flash — faster, cheaper V4 sibling
deepseek-v3.2 — prior-generation model
glm-5.1 — alternative low-cost coding model

deepseek-v4-pro — API, Pricing & Context Window | Vivgrid

Specifications

Pricing

Quick start

Ideal use cases

On this page

deepseek-v4-pro — API, Pricing & Context Window | Vivgrid

Specifications

Pricing

Quick start

Ideal use cases

Related models

On this page