Estimate Complexity

Deepseek R1's response to the Tests an AI's ability to make educated estimates based on technical knowledge

DeepSeek R1
February 27, 2025
Text

?
Prompt

Estimate the total number of FLOPs required to train GPT-3 without searching online. Explain your reasoning step by step.

DeepSeek R1
Response

About DeepSeek R1

Capabilities

ConversationReasoningCode GenerationAnalysis

Categories

TextCodeMultimodal

Specifications

Provider
Deepseek
Released
February 2025
Size
LARGE
Parameters
671B total (37B active MoE)
Context
128,000 tokens