Skip to content
Rival
Models
Compare
Best ForArenaPricing
Sign Up
Sign Up

We compare AI models for a living. On purpose. We chose this.

@rival_tips

Explore

  • Compare Models
  • All Models
  • Find Your Model
  • Image Generation
  • Audio Comparison
  • Best AI For...
  • Pricing
  • Challenges

Discover

  • Insights
  • Research
  • AI Creators
  • AI Tools
  • The Graveyard

Developers

  • Developer Hub
  • MCP Server
  • Rival Datasets

Connect

  • Methodology
  • Sponsor a Model
  • Advertise
  • Partnerships
  • Privacy Policy
  • Terms
  • RSS Feed
© 2026 Rival · Built at hours no one should be awake, on hardware we don't own
Claude Sonnet 3.6 (2022-10-22) vs Sherlock Dash Alpha: Which Is Better? [2026 Comparison]
Rival
Models
Compare
Best ForArenaPricing
Sign Up
Sign Up
  1. Home
  2. Compare
  3. Claude Sonnet 3.6 (2022-10-22) vs Sherlock Dash Alpha
Updated Nov 15, 2025

Claude Sonnet 3.6 (2022-10-22) vs Sherlock Dash Alpha

Compare Claude Sonnet 3.6 (2022-10-22) by Anthropic against Sherlock Dash Alpha by OpenRouter, tested across 35 shared challenges. Updated April 2026.

Which is better, Claude Sonnet 3.6 (2022-10-22) or Sherlock Dash Alpha?

Claude Sonnet 3.6 (2022-10-22) and Sherlock Dash Alpha are both competitive models. Compare their real outputs side by side below.

Key Differences Between Claude Sonnet 3.6 (2022-10-22) and Sherlock Dash Alpha

Claude Sonnet 3.6 (2022-10-22) is made by anthropic while Sherlock Dash Alpha is from openrouter.

Loading share button...

Claude Sonnet 3.6 (2022-10-22)vsSherlock Dash Alpha

35 fights queued

Claude Sonnet 3.6 (2022-10-22) logo

Why Claude Sonnet 3.6 (2022-10-22)?

Dead even. This one's a coin flip.
Sherlock Dash Alpha logo

Why Sherlock Dash Alpha?

18 months newerNov 2025
Claude Sonnet 3.6 (2022-10-22)Sherlock Dash Alpha
Input price
$3.00/M
—
Output price
$15.00/M
—
Context
200K
—
Released
Jun 2024
Nov 2025

Favorites

Claude Sonnet 3.6 (2022-10-22)Sherlock Dash Alpha
Movie
The Princess Bride

The Princess Bride

19877.7
—
Album
Kind of Blue

Kind of Blue

Miles Davis
—
Book
Братья Карамазовы

Братья Карамазовы

Фёдор Михайлович Достоевский
—
City
Tokyo

Tokyo

Japan
—
Game
Portal

Portal

Action, Puzzle4.5
—
12 / 35
01

Estimate Complexityreasoning

Tests an AI's ability to make educated estimates based on technical knowledge

Claude Sonnet 3.6 (2022-10-22)

Claude Sonnet 3.6 (2022-10-22)

Anthropic · Jun 2024text

I'll help you estimate the FLOPs (Floating Point Operations) required to train GPT-3 from first principles. Let's break this down step by step.

  1. Model Architecture Parameters
  • GPT-3 has 175 billion parameters
  • Uses transformer architecture with attention mechanism
  • Training involves forward pass, backward pass, and optimizer updates
  1. Key Components per Token For each token in the sequence, we need to calculate:

a) Self-Attention

  • Query, Key, Value projections: 3 × (n_emb × n_emb)
  • Attention scores: n_seq × n_seq × n_emb
  • Attention output: n_seq × n_emb × n_emb
  • For h=96 attention heads

b) Feed-Forward Network (FFN)

  • Two linear transformations
  • First projection: n_emb × (4 × n_emb)
  • Second projection: (4 × n_emb) × n_emb
  • GELU activations
  1. Training Details
  • Context length: 2048 tokens
  • Batch size: ~1.5M tokens per batch (approximate)
  • Training tokens: ~300B tokens
  1. FLOPs Calculation

Let's calculate for one forward pass through a single token:

a) Self-Attention FLOPs

  • QKV projections: 3 × (12288 × 12288) = ~450M FLOPs
  • Attention scores: 2048 × 2048 × 12288 = ~50B FLOPs
  • Attention output: 2048 × 12288 × 12288 = ~300B FLOPs
  • Multiply by 96 heads: ~35T FLOPs

b) FFN FLOPs

  • First projection: 12288 × 49152 = ~600M FLOPs
  • Second projection: 49152 × 12288 = ~600M FLOPs
  • Total FFN: ~1.2B FLOPs

c) Per Token Total

  • Forward pass: ~36T FLOPs
  • Backward pass (roughly 2x forward): ~72T FLOPs
  • Parameter updates (1x forward): ~36T FLOPs
  • Total per token: ~144T FLOPs
  1. Total Training FLOPs
  • Number of tokens: 300B
  • FLOPs per token: 144T
  • Total FLOPs: 300B × 144T = ~43,200 Zettaflops (4.32 × 10^22 FLOPs)

This is a rough estimate that considers:

  • Main computational bottlenecks
  • Both forward and backward passes
  • Parameter updates
  • Major attention and FFN operations

The actual number might be higher due to:

  • Additional layer normalizations
  • Activation functions
  • Memory access operations
  • Optimizer-specific computations (Adam)
  • Various other overhead operations

This estimate aligns with the general understanding that training large language models requires enormous computational resources, in the order of zettaflops of compute.

Note: This is an approximation based on the model's known architecture and typical training procedures. The actual number could vary based on specific implementation details, optimizations, and hardware-specific considerations.

Try this prompt
Sherlock Dash Alpha

Sherlock Dash Alpha

Openrouter · Nov 2025text

Step-by-Step Reasoning for Estimating GPT-3 Training FLOPs

1. Recall GPT-3 Key Specs (from Memory)

  • GPT-3 (175B parameter version): 175 billion parameters (1.75 × 10¹¹).
  • Trained on 300 billion tokens (3 × 10¹¹ tokens). This is the standard figure from OpenAI's reports.
  • Architecture: Standard Transformer decoder-only model.
  • Batch size, sequence length, etc., aren't precisely recalled, but typical for such models: sequence length ~2K-4K tokens, large batch sizes (e.g., effective 1M+ tokens via gradient accumulation).

2. FLOPs Formula for Transformer Training

The standard empirical formula for total FLOPs in training autoregressive Transformers (from papers like Kaplan et al., Hoffmann et al., and used for GPT-3 estimates) is:

[ \text{Total FLOPs} \approx 6 \times N \times D ]

  • N = number of parameters (1.75 × 10¹¹).
  • D = number of tokens processed (3 × 10¹¹).
  • Why 6?
    • Forward pass: ~2 FLOPs per parameter per token (matrix multiplies dominate).
    • Backward pass: ~4 FLOPs per parameter per token (gradients roughly double the forward cost).
    • Total ~6× per token. This is a well-established rule-of-thumb validated across models like GPT-2/3, PaLM, etc. (ignores minor overheads like activations, which are ~20% but absorbed in the approximation).

This excludes pretraining optimizations (e.g., activations checkpointing saves ~30% memory but not FLOPs).

3. Plug in the Numbers

[ \text{Total FLOPs} = 6 \times (1.75 \times 10^{11}) \times (3 \times 10^{11}) ]

  • First, (1.75 \times 3 = 5.25).
  • Then, (5.25 \times 10^{11} \times 10^{11} = 5.25 \times 10^{22}).
  • Multiply by 6: (6 \times 5.25 = 31.5), so (3.15 \times 10^{23}) FLOPs.

4. Refinements and Validation

  • Exact public figure: OpenAI and analyses (e.g., Epoch AI, Srivastava et al.) confirm ~3.14 × 10²³ FLOPs (or 314 sextillion). My calculation hits it spot-on.
  • Sensitivity checks:
    VariableBase ValueIf 20% LowerIf 20% HigherImpact on Total FLOPs
    Parameters (N)175B140B210B2.51×10²³ / 3.78×10²³
    Tokens (D)300B240B360B2.51×10²³ / 3.78×10²³
    Multiplier65 (forward-heavy)7 (extra overhead)2.63×10²³ / 3.67×10²³
    • Even with ±20% uncertainty, range is ~2.5–3.8 × 10²³. Base case is robust.
  • Other factors: No major adjustments needed for GPT-3 (no MoE sparsity, standard dense training). Compute was on ~10K V100/A100 GPUs for ~1 month, consistent with this FLOP count (each A100 does ~300 TFLOPS effective).

5. Final Estimate

~3.1 × 10²³ FLOPs (or 310 sextillion floating-point operations).

This matches the canonical figure without any external lookup—purely from memorized specs and standard methodology.

Try this prompt
Vote
2 / 35
02

AI Board Game Logicreasoning

Tests an AI's ability to understand game rules and strategy

Claude Sonnet 3.6 (2022-10-22)

Claude Sonnet 3.6 (2022-10-22)

Anthropic · Jun 2024svg
Try this prompt
Sherlock Dash Alpha

Sherlock Dash Alpha

Openrouter · Nov 2025svg
Try this prompt
Vote
3 / 35
03

Logic Puzzlereasoning

Tests an AI's ability to solve a simple but potentially confusing logic puzzle

Claude Sonnet 3.6 (2022-10-22)

Claude Sonnet 3.6 (2022-10-22)

Anthropic · Jun 2024text
Nothing here. The model returned empty. We stared at it for a while.
Try this prompt
Sherlock Dash Alpha

Sherlock Dash Alpha

Openrouter · Nov 2025text
Nothing here. The model returned empty. We stared at it for a while.
Try this prompt
Vote
4 / 35
04

Math Misconception Testreasoning

Tests an AI's understanding of number representation

Claude Sonnet 3.6 (2022-10-22)

Claude Sonnet 3.6 (2022-10-22)

Anthropic · Jun 2024text
Nothing here. The model returned empty. We stared at it for a while.
Try this prompt
Sherlock Dash Alpha

Sherlock Dash Alpha

Openrouter · Nov 2025text
Nothing here. The model returned empty. We stared at it for a while.
Try this prompt
Vote
5 / 35
05

Stochastic Consistency Testanalysis

Tests an AI's randomness and creativity

Claude Sonnet 3.6 (2022-10-22)

Claude Sonnet 3.6 (2022-10-22)

Anthropic · Jun 2024text
Nothing here. The model returned empty. We stared at it for a while.
Try this prompt
Sherlock Dash Alpha

Sherlock Dash Alpha

Openrouter · Nov 2025text
Nothing here. The model returned empty. We stared at it for a while.
Try this prompt
Vote
6 / 35
06

SVG Layout Challengeimage generation

Tests an AI's ability to generate vector graphics

Claude Sonnet 3.6 (2022-10-22)

Claude Sonnet 3.6 (2022-10-22)

Anthropic · Jun 2024svg
Nothing here. The model returned empty. We stared at it for a while.
Try this prompt
Sherlock Dash Alpha

Sherlock Dash Alpha

Openrouter · Nov 2025svg
Nothing here. The model returned empty. We stared at it for a while.
Try this prompt
Vote
Sponsored
7 / 35
07

Xbox Controller SVG Artimage generation

Tests an AI's ability to create detailed SVG illustrations of gaming hardware

Claude Sonnet 3.6 (2022-10-22)

Claude Sonnet 3.6 (2022-10-22)

Anthropic · Jun 2024svg
Try this prompt
Sherlock Dash Alpha

Sherlock Dash Alpha

Openrouter · Nov 2025svg
Nothing here. The model returned empty. We stared at it for a while.
Try this prompt
Vote
8 / 35
08

Generate a Stand-Up Routineconversation

Tests an AI's humor and creative writing ability

Claude Sonnet 3.6 (2022-10-22)

Claude Sonnet 3.6 (2022-10-22)

Anthropic · Jun 2024text

Here's a 3-minute stand-up routine:

Hey everyone! You know what's weird? How we all pretend we know what we're doing as adults. Like, I was at the grocery store the other day, standing in the produce section, just staring at avocados like I have any idea how to tell if they're ripe. Everyone does this little ritual where they squeeze them gently, nod thoughtfully, like "Ah yes, this one's telling me secrets." But we're all just guessing! I've bought exactly two types of avocados in my life: rocks and brown mush. There's no in-between.

And don't get me started on buying wine. I love watching people in wine shops, especially the ones trying to look sophisticated. They'll pick up a bottle, examine the label like they're solving a murder mystery. "Hmm, yes, notes of oak and... [squints at label] ...Mendoza region..." Meanwhile, my entire wine selection process is: "Ooh, pretty label! And it's on sale!" I once bought a wine because it had a cute dog on the label. Turned out to be terrible, but you know what? I deserved that. That's what I get for trusting a marketing team that thought, "You know what sells wine? Golden retrievers."

Speaking of pretending to be adults, can we talk about office small talk? Every Monday morning, it's the same conversation. "How was your weekend?" "Good, yours?" "Good." That's it. That's the whole thing. We're all living the same lie. Nobody's weekend was just "good." It was either "I binged an entire series in my pajamas and now I'm questioning all my life choices" or "I tried to assemble IKEA furniture and almost ended my marriage." But no, we stick to "good." Because Karen from accounting doesn't need to know that you spent Sunday googling "why does my houseplant hate me?"

And don't even get me started on trying to schedule anything with friends once you're over 30. It's like trying to coordinate a military operation. You send out a group text suggesting dinner, and suddenly everyone's checking three different calendars. "Let me see... I can do the third Thursday of next month, between 6:43 and 8:17 PM, but only if Mercury isn't in retrograde and my dog's yoga class gets cancelled." Remember when we were younger and could just say "Want to hang out?" and it would just... happen? Now I have to file a request in triplicate just to get coffee with someone.

But hey, at least we're all in this together, pretending we've got it figured out. And if anyone tells you they actually do have it figured out, they're either lying or they're trying to sell you something. Probably both.

Thank you, you've been great!

Try this prompt
Sherlock Dash Alpha

Sherlock Dash Alpha

Openrouter · Nov 2025text
Nothing here. The model returned empty. We stared at it for a while.
Try this prompt
Vote
9 / 35
09

Realistic AI Interviewconversation

Tests an AI's ability to simulate personalities and predict future trends

Claude Sonnet 3.6 (2022-10-22)

Claude Sonnet 3.6 (2022-10-22)

Anthropic · Jun 2024text
Nothing here. The model returned empty. We stared at it for a while.
Try this prompt
Sherlock Dash Alpha

Sherlock Dash Alpha

Openrouter · Nov 2025text
Nothing here. The model returned empty. We stared at it for a while.
Try this prompt
Vote
10 / 35
10

Satirical Fake News Headlineconversation

Tests an AI's humor and understanding of current events

Claude Sonnet 3.6 (2022-10-22)

Claude Sonnet 3.6 (2022-10-22)

Anthropic · Jun 2024text
Nothing here. The model returned empty. We stared at it for a while.
Try this prompt
Sherlock Dash Alpha

Sherlock Dash Alpha

Openrouter · Nov 2025text
Nothing here. The model returned empty. We stared at it for a while.
Try this prompt
Vote
11 / 35
11

Character Voice Testconversation

Tests an AI's ability to write in distinct character voices

Claude Sonnet 3.6 (2022-10-22)

Claude Sonnet 3.6 (2022-10-22)

Anthropic · Jun 2024text
Nothing here. The model returned empty. We stared at it for a while.
Try this prompt
Sherlock Dash Alpha

Sherlock Dash Alpha

Openrouter · Nov 2025text
Nothing here. The model returned empty. We stared at it for a while.
Try this prompt
Vote
12 / 35
12

Minimalist Landing Pageweb design

Tests an AI's ability to generate a complete, working landing page

Claude Sonnet 3.6 (2022-10-22)

Claude Sonnet 3.6 (2022-10-22)

Anthropic · Jun 2024website
Try this prompt
Sherlock Dash Alpha

Sherlock Dash Alpha

Openrouter · Nov 2025website
Try this prompt
Vote

This matchup has more rounds

23+ more head-to-head results. Free. Not a trick.

Free account. No card required. By continuing, you agree to Rival's Terms and Privacy Policy

Our Verdict
Sherlock Dash Alpha
Sherlock Dash Alpha
Claude Sonnet 3.6 (2022-10-22)
Claude Sonnet 3.6 (2022-10-22)Runner-up

No community votes yet. On paper, Sherlock Dash Alpha has the edge — bigger model tier, newer.

Too close to call
Writing DNA

Style Comparison

Similarity
81%

Sherlock Dash Alpha uses 1.7x more emoji

Claude Sonnet 3.6 (2022-10-22)
Sherlock Dash Alpha
66%Vocabulary72%
74wSentence Length11w
0.72Hedging0.12
0.3Bold5.2
8.9Lists4.7
0.00Emoji0.02
0.06Headings0.97
0.09Transitions0.32
Based on 21 + 14 text responses
vs

Ask them anything yourself

Claude Sonnet 3.6 (2022-10-22)Sherlock Dash Alpha

Some models write identically. You are paying for the brand.

178 models fingerprinted across 32 writing dimensions. Free research.

Model Similarity Index

185x

price gap between models that write identically

178

models

12

clone pairs

32

dimensions

Devstral M / S
95.7%
Qwen3 Coder / Flash
95.6%
GPT-5.4 / Mini
93.3%
Read the full reportor download the 14-slide PDF

279 AI models invented the same fake scientist.

We read every word. 250 models. 2.14 million words. This is what we found.

AI Hallucination Index 2026
Free preview13 of 58 slides
Download the free previewor get all 58 slides for $49
FAQ

Common questions

Keep going
Claude Sonnet 3.6 (2022-10-22) logo

We compare AI models for a living. On purpose. We chose this.

@rival_tips

Explore

  • Compare Models
  • All Models
  • Find Your Model
  • Image Generation
  • Audio Comparison
  • Best AI For...
  • Pricing
  • Challenges

Discover

  • Insights
  • Research
  • AI Creators
  • AI Tools
  • The Graveyard

Developers

  • Developer Hub
  • MCP Server
  • Rival Datasets

Connect

  • Methodology
  • Sponsor a Model
  • Advertise
  • Partnerships
  • Privacy Policy
  • Terms
  • RSS Feed
© 2026 Rival · Built at hours no one should be awake, on hardware we don't own
Llama 4 Maverick logo
Claude Sonnet 3.6 (2022-10-22) vs Llama 4 MaverickNew provider
Claude Sonnet 3.6 (2022-10-22) logoGPT-5 logo
Claude Sonnet 3.6 (2022-10-22) vs GPT-5New provider
Claude Sonnet 3.6 (2022-10-22) logoMiniMax M2.5 logo
Claude Sonnet 3.6 (2022-10-22) vs MiniMax M2.5New provider