# Google: Gemini 3.1 Flash Lite Preview: AI model fact sheet

- **Provider:** google
- **Released:** 2026-03-03
- **Context window:** 1,048,576 tokens
- **API pricing:** $0.25 / 1M input, $1.50 / 1M output
- **OpenRouter ID:** google/gemini-3.1-flash-lite-preview
- **Capabilities:** conversation, reasoning, analysis, code-generation, data-extraction, translation, tool-use

Gemini 3.1 Flash Lite Preview is Google's high-efficiency model optimized for high-volume use cases. It outperforms Gemini 2.5 Flash Lite on overall quality and approaches Gemini 2.5 Flash performance across key capabilities. Improvements span audio input/ASR, RAG snippet ranking, translation, data extraction, and code completion. Supports full thinking levels (minimal, low, medium, high) for fine-grained cost/performance trade-offs. Delivers 2.5x faster Time to First Answer Token and 45% increased output speed compared to 2.5 Flash. Priced at half the cost of Gemini 3 Flash.

## Benchmarks

| Benchmark | Score |
| --- | --- |
| GPQA Diamond | 86.9% |
| MMMLU | 88.9% |
| LiveCodeBench | 72.0% |
| MMMU-Pro | 76.8% |
| Video-MMMU | 84.8% |

Source: real side-by-side outputs, pricing and specs at https://rival.tips/models/gemini-3.1-flash-lite-preview