Models

gemini-3.1-flash-lite-preview β€” API, Pricing & Context Window | Vivgrid

gemini-3.1-flash-lite-preview on Vivgrid: Google's lightweight multimodal model with a ~1M-token context window at a very low price.

gemini-3.1-flash-lite-preview is Google's lightweight, low-cost Gemini model, tuned for high-volume multimodal workloads. It keeps a ~1.05M-token context window and accepts text, image, video, audio, and PDF inputs.

On Vivgrid it is served as a globally centralized model through the same unified, OpenAI-compatible API used across the catalog.

Specifications

ProviderGoogle
Model IDgemini-3.1-flash-lite-preview
Best forGeneral-purpose
Context window1,048,576 tokens
Max output65,536 tokens
ModalitiesText, Image, Video, Audio, Pdf
Tool / function callingYes
Knowledge cutoff2025-01
Acceleration🌐 Global (Centralized)

Pricing

Pricing in USD per 1M tokens, matching the provider's rates.

InputCached inputOutput
$0.25β€”$1.50

Quick start

Call gemini-3.1-flash-lite-preview through Vivgrid's unified, OpenAI-compatible endpoint. Get an API key from the Vivgrid Console.

curl https://api.vivgrid.com/v1/chat/completions \
  -H "Authorization: Bearer $VIVGRID_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "gemini-3.1-flash-lite-preview",
    "messages": [
      { "role": "user", "content": "Say hello in English, Chinese and Spanish." }
    ],
    "stream": true
  }'

Ideal use cases

  • Very high-volume multimodal classification and extraction
  • Cost-sensitive media ingestion pipelines
  • Lightweight assistants needing large context
  • Bulk PDF, image, and audio triage

On this page