GPT-5.3 Codex's response to the 'Estimate Complexity' challenge.
Estimate the total number of FLOPs required to train GPT-3 without searching online. Explain your reasoning step by step.
A common back-of-the-envelope estimate for transformer training compute is:
[ \text{Training FLOPs} \approx 6 \times N \times D ]
where:
- N = number of model parameters
- D = number of training tokens

The factor of 6 comes from roughly 2 FLOPs per parameter per token in the forward pass and 4 in the backward pass.
For GPT-3 (175B), reasonable known assumptions are:
- N ≈ 175 billion parameters = 1.75 × 10^11
- D ≈ 300 billion training tokens = 3.0 × 10^11
Now plug in:
[ 6 \times (1.75 \times 10^{11}) \times (3.0 \times 10^{11}) = 6 \times 5.25 \times 10^{22} = 3.15 \times 10^{23} \text{ FLOPs} ]
[ \boxed{\text{Total training compute} \approx 3 \times 10^{23}\ \text{FLOPs}} ]
A reasonable range (depending on exact token count, sequence-length effects, optimizer/update overhead, and hardware efficiency) is roughly:
[ \boxed{(2.5\text{ to }4)\times10^{23}\ \text{FLOPs}} ]
So the headline answer is: on the order of (10^{23}) FLOPs, about (3.1\times10^{23}).
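The estimate above is simple enough to check directly. A minimal sketch in Python, using the N and D values assumed in the text:

```python
# Back-of-the-envelope transformer training compute: FLOPs ≈ 6 * N * D.
# N and D are the GPT-3 values assumed above; the factor of 6 covers
# the forward pass (~2 FLOPs/param/token) and backward pass (~4).

N = 1.75e11  # parameters (175 billion)
D = 3.0e11   # training tokens (300 billion)

flops = 6 * N * D
print(f"Estimated training FLOPs: {flops:.2e}")  # ~3.15e+23
```

Varying D within the uncertainty discussed above (exact token count, sequence-length effects, overheads) moves the result across the quoted 2.5–4 × 10^23 range without changing the order of magnitude.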