Grok 3 vs TNG R1T Chimera
Compare Grok 3 by xAI against TNG R1T Chimera by OpenRouter, tested across 28 shared challenges. Updated March 2026.
Compare Grok 3 by xAI against TNG R1T Chimera by OpenRouter, tested across 28 shared challenges. Updated March 2026.
28 fights queued
Tests an AI's ability to make educated estimates based on technical knowledge
To estimate the total number of FLOPs required to train GPT-3, we use the following reasoning:
FLOPs Formula: For transformer-based models, a common approximation is:
[
\text{FLOPs} \approx 6 \times \text{number of parameters} \times \text{number of tokens}
]
The factor of 6 accounts for 2 FLOPs per parameter in the forward pass (matrix multiplications) and 4 FLOPs per parameter in the backward pass (gradient calculations), totaling 6 FLOPs per parameter per token.
GPT-3 Parameters: The largest GPT-3 model has 175 billion parameters ((1.75 \times 10^{11})).
Training Tokens: GPT-3 was trained on approximately 300 billion tokens ((3 \times 10^{11})), as indicated in its original paper.
Calculation:
[
\text{FLOPs} \approx 6 \times (1.75 \times 10^{11}) \times (3 \times 10^{11}) = 3.15 \times 10^{23} , \text{FLOPs}.
]
Uncertainty: The exact token count and FLOPs factor can vary slightly, but this estimate aligns with widely cited values (e.g., ~(3.14 \times 10^{23}) FLOPs).
Final Estimate:
Training GPT-3 required approximately (3.1 \times 10^{23}) FLOPs.
Tests an AI's ability to understand game rules and strategy
Tests an AI's understanding of number representation
Tests an AI's randomness and creativity
Tests an AI's ability to generate vector graphics
Tests an AI's ability to create detailed SVG illustrations of gaming hardware
Tests an AI's humor and creative writing ability
Tests an AI's ability to generate a complete, working landing page
Recreate an interactive, classic Mario level in a single HTML file.
Tests an AI's ability to replicate an existing UI with Tailwind CSS
Tests an AI's ability to create smooth web animations
Tests an AI's ability to create interactive web elements
16+ head-to-head challenges. All of them judged by real people.
Test any model with your own prompts in Prompt Lab
5 free credits to start. No card required.
By continuing, you agree to Rival's Terms of Service and Privacy Policy