Rival
Models · Compare · Best For · Arena · Pricing
Sign Up

We compare AI models for a living. On purpose. We chose this.

@rival_tips

Explore

  • Compare Models
  • All Models
  • Find Your Model
  • Image Generation
  • Audio Comparison
  • Leaderboard
  • Pricing
  • Challenges

Discover

  • Insights
  • Research
  • AI Creators
  • AI Tools
  • The Graveyard

Developers

  • Developer Hub
  • MCP Server
  • Rival Datasets

Connect

  • Methodology
  • Sponsor a Model
  • Advertise
  • Partnerships
  • Privacy Policy
  • Terms
  • RSS Feed
© 2026 Rival · Built at hours no one should be awake, on hardware we don't own
Rival Datasets

The AI comparison datasets

5,629 model outputs and 21,686 human preference votes. Same prompts, controlled conditions, structured JSONL. Built for researchers, ML engineers, and the sleep-deprived masses.

rival-model-responses-feb-2026.jsonl (5,629 lines)
{"model_id": "gpt-4.1", "model_name": "GPT 4.1", "provider": "OpenAI", "prompt_id": "gpt-4.1-joke", "prompt_title": "Programming Joke", "prompt_text": "Tell me a programming joke.", "prompt_category": "humor", "response_type": "text", "content": "Why do programmers prefer dark mode? Because light attracts bugs.", "date": "2025-04-15"}
{"model_id": "claude-3.7-sonnet", "model_name": "Claude 3.7 Sonnet", "provider": "Anthropic", "prompt_id": "claude-3.7-sonnet-minimalist-landing-page", "prompt_title": "Minimalist Landing Page", "prompt_text": "Generate a single-page landing page for a new AI startup...", "prompt_category": "web-design", "response_type": "website", "content": "<!DOCTYPE html><html lang=\"en\"><head>...</head><body>...</body></html>", "date": "2025-03-28"}
{"model_id": "gemini-2.5-pro-exp", "model_name": "Gemini 2.5 Pro", "provider": "Google", "prompt_id": "gemini-2.5-pro-exp-world-map-svg", "prompt_title": "World Map SVG", "prompt_text": "Create an SVG world map with interactive hover effects.", "prompt_category": "svg-generation", "response_type": "svg", "content": "<svg viewBox=\"0 0 1000 500\" xmlns=\"http://www.w3.org/2000/svg\">...</svg>", "date": "2025-04-02"}
... 5,626 more lines

Why this dataset

Most AI benchmarks test narrow tasks with synthetic grading. This one captures how models actually perform on real creative, technical, and analytical challenges. Same prompts, no cherry-picking, no vibes-only methodology.

Same prompt, every model

Every model gets the exact same prompt. No cherry-picking, no prompt engineering variance. Just cold, fair, reproducible chaos.

Actual human opinions

Community votes from AI duels. Real people picking winners, not a GPT-4 judge hallucinating quality scores.

14 flavors of output

Text, websites, SVGs, images, code. 14 categories from web design to philosophy. Your eval pipeline has never eaten this well.

JSONL. You're welcome.

Streams directly into eval frameworks, reward model training, and LLM-as-judge setups. No CSV wrangling. No Parquet drama.

What's inside

Each line is a complete model response with full metadata. JSONL format, one JSON object per line, stream-friendly. Your parser will be grateful.

rival-model-responses.jsonl
{"model_id": "gpt-4.1", "model_name": "GPT 4.1", "provider": "OpenAI", "prompt_id": "gpt-4.1-joke", "prompt_title": "Programming Joke", "prompt_text": "Tell me a programming joke.", "prompt_category": "humor", "response_type": "text", "content": "Why do programmers prefer dark mode? Because light attracts bugs.", "date": "2025-04-15"}
{"model_id": "claude-3.7-sonnet", "model_name": "Claude 3.7 Sonnet", "provider": "Anthropic", "prompt_id": "claude-3.7-sonnet-minimalist-landing-page", "prompt_title": "Minimalist Landing Page", "prompt_text": "Generate a single-page landing page for a new AI startup...", "prompt_category": "web-design", "response_type": "website", "content": "<!DOCTYPE html><html lang=\"en\"><head>...</head><body>...</body></html>", "date": "2025-03-28"}
{"model_id": "gemini-2.5-pro-exp", "model_name": "Gemini 2.5 Pro", "provider": "Google", "prompt_id": "gemini-2.5-pro-exp-world-map-svg", "prompt_title": "World Map SVG", "prompt_text": "Create an SVG world map with interactive hover effects.", "prompt_category": "svg-generation", "response_type": "svg", "content": "<svg viewBox=\"0 0 1000 500\" xmlns=\"http://www.w3.org/2000/svg\">...</svg>", "date": "2025-04-02"}
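
A minimal sketch of reading the file one line at a time, assuming the layout described above (one JSON object per line). The helper name iter_records is ours, not part of any Rival tooling, and the two inline records are shortened copies of the sample rows, used only so the snippet runs without the real file.

```python
import io
import json
from typing import Iterator, TextIO


def iter_records(stream: TextIO) -> Iterator[dict]:
    """Yield one dict per non-empty JSONL line without loading the whole file."""
    for line_number, line in enumerate(stream, start=1):
        line = line.strip()
        if not line:
            continue  # tolerate blank lines between records
        try:
            yield json.loads(line)
        except json.JSONDecodeError as exc:
            # Report which line failed so a bad row is easy to locate.
            raise ValueError(f"line {line_number}: invalid JSON ({exc.msg})") from exc


# Stand-in for the real file: two shortened records in the published shape.
SAMPLE = io.StringIO(
    '{"model_id": "gpt-4.1", "provider": "OpenAI", "response_type": "text", '
    '"content": "Why do programmers prefer dark mode? Because light attracts bugs."}\n'
    '{"model_id": "gemini-2.5-pro-exp", "provider": "Google", "response_type": "svg", '
    '"content": "<svg>...</svg>"}\n'
)

records = list(iter_records(SAMPLE))
for record in records:
    print(record["model_id"], record["response_type"], len(record["content"]))
```

With the purchased file, the same generator would presumably be fed from open("rival-model-responses.jsonl", encoding="utf-8") instead of the in-memory sample. Iterating lazily keeps memory flat even when individual website responses are large.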

Output types

  • Text: 2,506
  • Website: 1,832
  • SVG: 707
  • Image: 574
  • HTML: 8
  • Code: 2
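
The six counts above add up to 5,629, the stated size of the dataset. A tally like this can be reproduced from the response_type field; the sketch below is one way to do it. The function count_by and the four inline rows are illustrative only (the last row is a made-up duplicate so the counts differ), and real totals would come from running it over the full file.

```python
import json
from collections import Counter

# Illustrative rows only: three follow the sample records, the fourth is hypothetical.
LINES = [
    '{"model_id": "gpt-4.1", "response_type": "text"}',
    '{"model_id": "claude-3.7-sonnet", "response_type": "website"}',
    '{"model_id": "gemini-2.5-pro-exp", "response_type": "svg"}',
    '{"model_id": "gpt-4.1", "response_type": "text"}',
]


def count_by(lines, field):
    """Count records per value of `field`, bucketing rows that lack it."""
    counts = Counter()
    for line in lines:
        record = json.loads(line)
        counts[record.get(field, "<missing>")] += 1
    return counts


type_counts = count_by(LINES, "response_type")
for name, n in type_counts.most_common():
    print(f"{name}: {n:,}")
```

Passing "prompt_category" or "provider" as the field gives the other breakdowns with no further changes.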

Prompt categories

14 categories of creative, technical, and analytical tasks. The models didn't get to pick.

  • Web Design: 1,846
  • SVG Generation: 720
  • Image Generation: 574
  • Creative Writing: 454
  • Philosophy: 431
  • General: 366
  • Reasoning: 357
  • Analysis: 248

Get the dataset

February 2026 edition.

Free sample
$0

Metadata only, no response content

  • Model ID, name, provider
  • Prompt title and category
  • Response type and date
  • JSONL format
sample-metadata.jsonl
{"model_id": "gpt-4.1", "model_name": "GPT 4.1", "provider": "OpenAI", "prompt_id": "gpt-4.1-joke", "prompt_title": "Programming Joke", "prompt_category": "humor", "response_type": "text", "date": "2025-04-15"}
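
To see exactly what the free sample leaves out, the field sets of the two published example rows can be compared. This is a quick check based only on the rows shown on this page, not a schema guarantee. The variable names are ours.

```python
import json

# Full-dataset row, copied from the example on this page.
FULL_ROW = (
    '{"model_id": "gpt-4.1", "model_name": "GPT 4.1", "provider": "OpenAI", '
    '"prompt_id": "gpt-4.1-joke", "prompt_title": "Programming Joke", '
    '"prompt_text": "Tell me a programming joke.", "prompt_category": "humor", '
    '"response_type": "text", '
    '"content": "Why do programmers prefer dark mode? Because light attracts bugs.", '
    '"date": "2025-04-15"}'
)

# Free-sample row, copied from sample-metadata.jsonl as shown above.
SAMPLE_ROW = (
    '{"model_id": "gpt-4.1", "model_name": "GPT 4.1", "provider": "OpenAI", '
    '"prompt_id": "gpt-4.1-joke", "prompt_title": "Programming Joke", '
    '"prompt_category": "humor", "response_type": "text", "date": "2025-04-15"}'
)

full_fields = set(json.loads(FULL_ROW))
sample_fields = set(json.loads(SAMPLE_ROW))

# Fields present in the full row but absent from the metadata-only sample.
missing_from_sample = sorted(full_fields - sample_fields)
print(missing_from_sample)  # ['content', 'prompt_text']
```

In these two rows the sample drops both content and prompt_text, so code written against the free sample should not assume either key exists.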
Download free sample
Full dataset
$499

All 5,629 responses with full content

  • Full response content (HTML, SVG, text, code)
  • All metadata fields included
  • 5,629 responses from 200+ models
  • JSONL format, stream-friendly
  • Commercial license included
  • One-time purchase, February 2026 snapshot
Buy full dataset

February 2026 Edition

Start building

Free samples included. No account required. We won't even ask for your email.


See what we found in this data

The AI Hallucination Index 2026. 250 models analyzed. 40+ slides.
