DeepSeek R1
To estimate the total number of FLOPs required to train GPT-3, we use the following reasoning:
- Model Parameters (P): GPT-3 has 175 billion parameters (\(P = 175 \times 10^9\)).
- Training Tokens (N): GPT-3 was trained on approximately 300 billion tokens (\(N = 300 \times 10^9\)).
- FLOPs per Token:
  - A forward pass requires ~\(2P\) FLOPs per token (each parameter involves a multiply-accumulate operation, which counts as 2 FLOPs).
  - A backward pass (for gradients) is roughly twice as expensive as the forward pass, requiring ~\(4P\) FLOPs per token.
  - Total FLOPs per token: \(2P + 4P = 6P\).
- Total Training FLOPs: Multiply the FLOPs per token by the total number of tokens:
  \[
  \text{FLOPs} = 6 \times P \times N = 6 \times 175 \times 10^9 \times 300 \times 10^9 = 3.15 \times 10^{23}\,\text{FLOPs}.
  \]
Final Answer:
The total number of FLOPs required to train GPT-3 is approximately \(3.15 \times 10^{23}\) FLOPs.
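
As a quick sanity check, the same estimate can be reproduced in a few lines of Python (a minimal sketch using the rounded parameter and token counts from above):

```python
# Back-of-the-envelope FLOPs estimate for training GPT-3,
# using the ~6 * P * N approximation described above.

P = 175e9   # model parameters (175 billion)
N = 300e9   # training tokens (300 billion)

flops_per_token = 6 * P            # ~2P forward + ~4P backward
total_flops = flops_per_token * N

print(f"Total training FLOPs: {total_flops:.2e}")  # -> 3.15e+23
```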






