
Gemini 3.1 Flash Lite
Google Gemini 3.1 Flash Lite preview, lightweight multimodal, ultra cost-efficient
Context Window
1.0M
Input price / 1M tokens
$0.251M tokens
Output price / 1M tokens
$1.501M tokens
Cached input / 1M tokens
$0.031M tokens
Max Completion
66K
Input Modalities
text, image
Output Modalities
text
Function callingChatVisionStreaming
Description
Google Gemini 3.1 Flash Lite preview, lightweight multimodal, ultra cost-efficient
Available Providers
AllToken can route requests to the providers below based on route priority and policy.
ProviderContext LengthInput PriceOutput PriceCached / MLatency p50Throughput
Best For
Google Gemini 3.1 Flash Lite preview, lightweight multimodal, ultra cost-efficient
How To Use This Model
Use the exact model ID shown below. This is the safest way to avoid call failures, variant mismatches, or incorrect route assumptions.
curl https://api.alltoken.ai/v1/chat/completions \
-H "Authorization: Bearer sk-your-key" \
-H "Content-Type: application/json" \
-d '{
"model": "gemini-3.1-flash-lite-preview",
"messages": [
{"role": "user", "content": "Hello!"}
]
}'Supported Parameters
temperaturetop_pmax_tokenstoolsAPI Key Setup
Smart Routing
Let the platform choose the best provider path automatically.
Default Model
If a request does not specify a model, default the key to gemini-3.1-flash-lite-preview.
Forced Model
Always override incoming requests and lock the key to gemini-3.1-flash-lite-preview.