# Google: Gemini 3.1 Flash Lite: AI model fact sheet

- **Provider:** google
- **Released:** 2026-05-07
- **Context window:** 1,048,576 tokens
- **API pricing:** $0.25 / 1M input, $1.50 / 1M output
- **OpenRouter ID:** google/gemini-3.1-flash-lite
- **Capabilities:** conversation, reasoning, analysis, code-generation, data-extraction, translation, tool-use

Gemini 3.1 Flash Lite is Google's GA high-efficiency multimodal model optimized for low-latency, high-volume workloads. It supports text, image, video, audio, and PDF inputs, and is designed for lightweight agentic workflows, simple data extraction, and applications where responsiveness and API cost are the primary constraints. Supports full thinking levels (minimal, low, medium, high) for fine-grained cost/performance trade-offs. Priced at half the cost of Gemini 3 Flash.

## Benchmarks

| Benchmark | Score |
| --- | --- |
| GPQA Diamond | 86.9% |
| MMMLU | 88.9% |
| LiveCodeBench | 72.0% |
| MMMU-Pro | 76.8% |
| Video-MMMU | 84.8% |

Source: real side-by-side outputs, pricing and specs at https://rival.tips/models/gemini-3.1-flash-lite