gpt-5-mini: API, Pricing & Context Window | Vivgrid
gpt-5-mini on Vivgrid: a fast, affordable GPT-5 model with a 272K context window, function calling, and three-region acceleration.
gpt-5-mini is the compact, budget-friendly member of the GPT-5 family. It keeps a 272K-token context window and function-calling support while delivering faster responses at a fraction of the flagship price.
Vivgrid geo-distributes gpt-5-mini across AMER, EMEA, and APAC, making it ideal for high-volume, latency-sensitive traffic anywhere in the world.
Specifications
| Specification | Value |
|---|---|
| Provider | OpenAI |
| Model ID | gpt-5-mini |
| Best for | General-purpose |
| Context window | 272,000 tokens |
| Max output | 128,000 tokens |
| Modalities | Text, Image |
| Tool / function calling | Yes |
| Knowledge cutoff | 2024-05 |
| Acceleration | Geo-Distributed: AMER, EMEA, APAC |
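The two token limits in the table can be turned into a pre-flight check before a request is sent. The sketch below is illustrative only: `check_limits` is a hypothetical helper, and it assumes the context window bounds the prompt and the max-output figure bounds the completion, each checked on its own (the table does not say whether the window must also hold the output).

```python
# Limits copied from the specifications table.
CONTEXT_WINDOW_TOKENS = 272_000
MAX_OUTPUT_TOKENS = 128_000


def check_limits(prompt_tokens: int, max_output_tokens: int) -> list[str]:
    """Return a list of problems; an empty list means the request fits.

    Assumption: the prompt is checked against the context window and the
    requested completion against the max-output cap, independently.
    """
    problems = []
    if prompt_tokens > CONTEXT_WINDOW_TOKENS:
        over = prompt_tokens - CONTEXT_WINDOW_TOKENS
        problems.append(f"prompt is {over} tokens over the context window")
    if max_output_tokens > MAX_OUTPUT_TOKENS:
        over = max_output_tokens - MAX_OUTPUT_TOKENS
        problems.append(f"requested output is {over} tokens over the cap")
    return problems
```

Calling `check_limits(100_000, 4_000)` returns an empty list, while a 300,000-token prompt is reported as 28,000 tokens over the window.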
Pricing
Prices are in USD per 1M tokens and match the provider's rates.
| Input | Cached input | Output |
|---|---|---|
| $0.25 | $0.03 | $2.00 |
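To see how these rates translate into a per-request cost, here is a minimal sketch. The function name `estimate_cost_usd` is hypothetical, and it assumes the three token counts are disjoint, meaning cached input tokens are counted separately from uncached ones rather than as a subset of the input total.

```python
# USD per 1M tokens, copied from the pricing table.
INPUT_RATE = 0.25
CACHED_INPUT_RATE = 0.03
OUTPUT_RATE = 2.00


def estimate_cost_usd(
    uncached_input_tokens: int,
    cached_input_tokens: int,
    output_tokens: int,
) -> float:
    """Estimate the cost of one request in USD.

    Assumption: the three counts do not overlap.
    """
    total = (
        uncached_input_tokens * INPUT_RATE
        + cached_input_tokens * CACHED_INPUT_RATE
        + output_tokens * OUTPUT_RATE
    )
    return total / 1_000_000


# 10,000 uncached + 40,000 cached input tokens and 2,000 output tokens:
# (2,500 + 1,200 + 4,000) / 1,000,000 = 0.0077 USD
print(f"{estimate_cost_usd(10_000, 40_000, 2_000):.4f}")
```

At these rates, output tokens dominate the bill for chat-style workloads: 2,000 output tokens cost more than 40,000 cached and 10,000 uncached input tokens combined.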
Quick start
Call gpt-5-mini through Vivgrid's unified, OpenAI-compatible endpoint. Get an API key from the Vivgrid Console.
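For Python callers, the curl request shown in this section could be mirrored with the standard library alone. This is a sketch, not an official client. The helper names (`build_request`, `iter_text`, `stream_reply`) are hypothetical, and the stream parser assumes that an OpenAI-compatible endpoint returns server-sent `data:` lines ending with `data: [DONE]`, which this page does not spell out.

```python
import json
import os
import urllib.request

# Endpoint taken from the curl example in this section.
API_URL = "https://api.vivgrid.com/v1/chat/completions"


def build_request(api_key: str, prompt: str) -> urllib.request.Request:
    """Build the same POST request the curl command sends."""
    body = {
        "model": "gpt-5-mini",
        "messages": [{"role": "user", "content": prompt}],
        "stream": True,
    }
    return urllib.request.Request(
        API_URL,
        data=json.dumps(body).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


def iter_text(lines):
    """Yield text fragments from OpenAI-style streaming lines (bytes).

    Assumption: each event is a line of the form 'data: {json}' and the
    stream ends with 'data: [DONE]'.
    """
    for raw in lines:
        line = raw.decode("utf-8").strip()
        if not line.startswith("data:"):
            continue  # skip blank keep-alive lines and comments
        payload = line[len("data:"):].strip()
        if payload == "[DONE]":
            break
        chunk = json.loads(payload)
        for choice in chunk.get("choices", []):
            text = (choice.get("delta") or {}).get("content")
            if text:
                yield text


def stream_reply(prompt: str) -> None:
    """Send the request and print the reply as it streams in."""
    request = build_request(os.environ["VIVGRID_API_KEY"], prompt)
    with urllib.request.urlopen(request) as response:
        for fragment in iter_text(response):
            print(fragment, end="", flush=True)
    print()
```

With `VIVGRID_API_KEY` set in the environment, `stream_reply("Say hello in English, Chinese and Spanish.")` sends the same payload as the curl command. Keeping `iter_text` separate from the network call makes the parsing logic testable without an API key.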
```shell
curl https://api.vivgrid.com/v1/chat/completions \
  -H "Authorization: Bearer $VIVGRID_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "gpt-5-mini",
    "messages": [
      { "role": "user", "content": "Say hello in English, Chinese and Spanish." }
    ],
    "stream": true
  }'
```
Ideal use cases
- High-throughput chat and agent backends
- Classification, extraction, and routing
- Cost-conscious tool-calling workflows
- Edge cases where latency beats maximum quality
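The routing and tool-calling use cases above can be combined: a single function definition with an enumerated argument turns the model into a classifier with machine-readable output. The sketch below builds such a request body and reads the result. It assumes the endpoint accepts the OpenAI Chat Completions `tools` and `tool_choice` fields (the page states function calling is supported but does not show the schema). The queue names, the `route_ticket` function, and both helper names are hypothetical.

```python
import json

# Hypothetical routing labels, for illustration only.
ROUTES = ["billing", "technical_support", "sales"]


def build_routing_payload(message: str) -> dict:
    """Build a chat-completions body that forces one routing tool call."""
    return {
        "model": "gpt-5-mini",
        "messages": [
            {"role": "system", "content": "Route the message to exactly one queue."},
            {"role": "user", "content": message},
        ],
        "tools": [
            {
                "type": "function",
                "function": {
                    "name": "route_ticket",
                    "description": "Assign the message to a support queue.",
                    "parameters": {
                        "type": "object",
                        "properties": {
                            "queue": {"type": "string", "enum": ROUTES},
                        },
                        "required": ["queue"],
                    },
                },
            }
        ],
        # Force the model to answer through the tool instead of free text.
        "tool_choice": {"type": "function", "function": {"name": "route_ticket"}},
    }


def parse_route(response: dict) -> str:
    """Extract the chosen queue from an OpenAI-style, non-streaming response."""
    call = response["choices"][0]["message"]["tool_calls"][0]
    arguments = json.loads(call["function"]["arguments"])
    return arguments["queue"]
```

Constraining the answer to an `enum` keeps the output short, which matters here because output tokens are the most expensive part of a request.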
Related models
- gpt-5: the full-size GPT-5 model
- gpt-5.4-mini: newer-generation mini
- gpt-5-chat: conversation-tuned variant