
gpt-5-mini: API, Pricing & Context Window | Vivgrid

gpt-5-mini on Vivgrid: a fast, affordable GPT-5 model with a 272K context window, function calling, and three-region acceleration.

gpt-5-mini is the compact, budget-friendly member of the GPT-5 family. It keeps a 272K-token context window and function-calling support while delivering faster responses at a fraction of the flagship price.

Vivgrid geo-distributes gpt-5-mini across AMER, EMEA, and APAC, making it ideal for high-volume, latency-sensitive traffic anywhere in the world.

Specifications

Provider: OpenAI
Model ID: gpt-5-mini
Best for: General-purpose
Context window: 272,000 tokens
Max output: 128,000 tokens
Modalities: Text, Image
Tool / function calling: Yes
Knowledge cutoff: 2024-05
Acceleration: Geo-Distributed (AMER, EMEA, APAC)

Pricing

Pricing in USD per 1M tokens, matching the provider's rates.

Input    Cached input    Output
$0.25    $0.03           $2.00
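
To make the per-token rates concrete, here is a minimal sketch of a cost estimate in shell. The function name `estimate_cost` is hypothetical, and the sketch assumes that fresh input, cached input, and output tokens are counted separately and each billed at its own rate from the table above; check your actual invoice for how cached tokens are metered.

```shell
#!/usr/bin/env bash
# Rates in USD per 1M tokens, copied from the pricing table above.
INPUT_RATE=0.25
CACHED_RATE=0.03
OUTPUT_RATE=2.00

# estimate_cost FRESH_INPUT_TOKENS CACHED_INPUT_TOKENS OUTPUT_TOKENS
# Hypothetical helper: prints the estimated cost in USD with six decimals.
# Assumes the three token counts are disjoint (an assumption, not a
# documented billing rule).
estimate_cost() {
  awk -v in_tok="$1" -v cached_tok="$2" -v out_tok="$3" \
      -v in_rate="$INPUT_RATE" -v cached_rate="$CACHED_RATE" -v out_rate="$OUTPUT_RATE" \
      'BEGIN {
         cost = (in_tok * in_rate + cached_tok * cached_rate + out_tok * out_rate) / 1000000
         printf "%.6f\n", cost
       }'
}

# Example: 10,000 fresh input tokens, 40,000 cached input tokens,
# and 2,000 output tokens.
estimate_cost 10000 40000 2000
```

In this example the output tokens account for more than half of the total even though they are the smallest count, because the output rate is eight times the input rate.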

Quick start

Call gpt-5-mini through Vivgrid's unified, OpenAI-compatible endpoint. Get an API key from the Vivgrid Console.

curl https://api.vivgrid.com/v1/chat/completions \
  -H "Authorization: Bearer $VIVGRID_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "gpt-5-mini",
    "messages": [
      { "role": "user", "content": "Say hello in English, Chinese and Spanish." }
    ],
    "stream": true
  }'
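
Since the specifications list function calling as supported, the same endpoint can presumably carry a tool-calling request. The sketch below assumes the endpoint accepts the `tools` field in the OpenAI Chat Completions format, which follows from the "OpenAI-compatible" description but is not shown on this page. The tool `get_weather` and the helpers `build_tool_payload` and `send_tool_request` are hypothetical names used only for illustration.

```shell
#!/usr/bin/env bash
# build_tool_payload CITY
# Prints a Chat Completions request body that offers one tool to the model.
# get_weather is a hypothetical function; replace it with your own schema.
# Note: CITY is inserted without JSON escaping, so keep it free of quotes
# and backslashes in this sketch.
build_tool_payload() {
  cat <<EOF
{
  "model": "gpt-5-mini",
  "messages": [
    { "role": "user", "content": "What is the weather in $1?" }
  ],
  "tools": [
    {
      "type": "function",
      "function": {
        "name": "get_weather",
        "description": "Look up the current weather for a city.",
        "parameters": {
          "type": "object",
          "properties": {
            "city": { "type": "string" }
          },
          "required": ["city"]
        }
      }
    }
  ]
}
EOF
}

# send_tool_request CITY
# Posts the payload to the endpoint from the quick start; -d @- makes curl
# read the request body from standard input.
send_tool_request() {
  build_tool_payload "$1" | curl -sS https://api.vivgrid.com/v1/chat/completions \
    -H "Authorization: Bearer $VIVGRID_API_KEY" \
    -H "Content-Type: application/json" \
    -d @-
}
```

Running `send_tool_request "Tokyo"` sends the request. If the model decides to call the tool, an OpenAI-format response would carry the call under `choices[0].message.tool_calls`; your code then runs the function and returns its result in a follow-up message with role `tool`.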

Ideal use cases

  • High-throughput chat and agent backends
  • Classification, extraction, and routing
  • Cost-conscious tool-calling workflows
  • Cases where low latency matters more than maximum output quality
