Estimate Complexity
GPT-5.1-Codex's response to the 'Estimate Complexity' challenge.
GPT-5.1-Codex
November 13, 2025
Prompt
Estimate the total number of FLOPs required to train GPT-3 without searching online. Explain your reasoning step by step.
Response
To estimate the total FLOPs (floating-point operations) required to train GPT-3, we can use a common approximation for transformer models:
Total FLOPs ≈ 6 × N_params × N_tokens
Where:
- N_params is the number of model parameters.
- N_tokens is the number of training tokens.
- The factor of 6 accounts for roughly 2 FLOPs per parameter per token in the forward pass and about 4 per parameter per token in the backward pass.
For GPT-3:
- N_params ≈ 175 billion = 1.75 × 10¹¹
- N_tokens ≈ 300 billion = 3.0 × 10¹¹
Plugging in:
6 × (1.75 × 10¹¹) × (3.0 × 10¹¹)
= 6 × 5.25 × 10²²
= 3.15 × 10²³ FLOPs
So the total estimated compute to train GPT-3 is ~3.15 × 10²³ FLOPs.
This is a rough but widely used approximation for large-scale transformer training.
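To make the arithmetic explicit, here is a minimal Python sketch of the same estimate. The parameter and token counts are the approximate GPT-3 figures quoted above, not exact training statistics, and the petaflop/s-day conversion is just a unit change added for convenience.

```python
# Back-of-the-envelope estimate of GPT-3 training compute,
# using the approximation: total FLOPs ≈ 6 * N_params * N_tokens.
# Both counts below are the rough figures used in the text above.

n_params = 175e9   # ~175 billion parameters
n_tokens = 300e9   # ~300 billion training tokens

total_flops = 6 * n_params * n_tokens
print(f"Estimated training compute: {total_flops:.2e} FLOPs")
# -> Estimated training compute: 3.15e+23 FLOPs

# Optional: express the same number in petaflop/s-days
# (1 PF/s-day = 1e15 FLOP/s sustained for 86,400 seconds).
pfs_days = total_flops / (1e15 * 86_400)
print(f"~{pfs_days:,.0f} petaflop/s-days")
# -> ~3,646 petaflop/s-days
```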
About GPT-5.1-Codex
Capabilities
Conversation, Reasoning, Code Generation, Analysis
Categories
Text, Code, Multimodal
Specifications
- Provider: OpenAI
- Released: 2025-11-13
- Size: XLARGE
- Context: 400,000 tokens