Models
gemini-2.5-flash β API, Pricing & Context Window | Vivgrid
gemini-2.5-flash on Vivgrid: Google's fast, affordable multimodal model with a 1M-token context window across text, image, video, audio and PDF.
gemini-2.5-flash is Google's fast, affordable multimodal model, popular for high-volume workloads that mix media. It pairs a 1M-token context window with text, image, video, audio, and PDF inputs.
On Vivgrid it is served as a globally centralized model through the same unified API key as the rest of the catalog, making it easy to scale or swap.
Specifications
| Provider | |
| Model ID | gemini-2.5-flash |
| Best for | General-purpose |
| Context window | 1,048,576 tokens |
| Modalities | Text, Image, Video, Audio, Pdf |
| Tool / function calling | Yes |
| Knowledge cutoff | 2025-01 |
| Acceleration | π Global (Centralized) |
Pricing
Pricing in USD per 1M tokens, matching the provider's rates.
| Input | Cached input | Output |
|---|---|---|
| $0.30 | $0.08 | $2.50 |
Quick start
Call gemini-2.5-flash through Vivgrid's unified, OpenAI-compatible endpoint. Get an API key from the Vivgrid Console.
curl https://api.vivgrid.com/v1/chat/completions \
-H "Authorization: Bearer $VIVGRID_API_KEY" \
-H "Content-Type: application/json" \
-d '{
"model": "gemini-2.5-flash",
"messages": [
{ "role": "user", "content": "Say hello in English, Chinese and Spanish." }
],
"stream": true
}'Ideal use cases
- High-volume multimodal agents and pipelines
- Cost-sensitive summarization and extraction
- Media-rich ingestion at scale
- Fast assistants needing very large context
Related models
- gemini-3-flash-preview β next-gen flash
- gemini-2.5-pro β higher-quality sibling
- gemini-3.5-flash β latest flash model