Claude Opus 4.5 vs Google: Gemini 3.1 Flash Lite Preview
Compare Claude Opus 4.5 by Anthropic against Google: Gemini 3.1 Flash Lite Preview by Google AI, tested across 52 shared challenges. Updated March 2026.
Compare Claude Opus 4.5 by Anthropic against Google: Gemini 3.1 Flash Lite Preview by Google AI, tested across 52 shared challenges. Updated March 2026.
47 fights queued
Tests an AI's ability to make educated estimates based on technical knowledge
To estimate the total number of FLOPs (Floating Point Operations) required to train a Large Language Model like GPT-3, we can use the standard formula derived from the work of Kaplan et al. (OpenAI, 2020).
The training compute $C$ (in FLOPs) can be estimated as: $$C \approx 6 \times N \times D$$ Where:
Step 1: Calculate the product of $N$ and $D$ $$N \times D = (1.75 \times 10^{11}) \times (3 \times 10^{11})$$ $$N \times D = 5.25 \times 10^{22}$$
Step 2: Apply the constant factor (6) $$C = 6 \times (5.25 \times 10^{22})$$ $$C = 31.5 \times 10^{22}$$
Step 3: Simplify the notation $$C = 3.15 \times 10^{23} \text{ FLOPs}$$
Tests an AI's ability to understand game rules and strategy
Tests an AI's ability to solve a simple but potentially confusing logic puzzle
Sally has 1 sister.
Here is the breakdown:
Tests an AI's randomness and creativity
Here are 5 jokes for you:
How many are unique? All 5 are unique jokes.
Tests an AI's ability to generate vector graphics
Tests an AI's ability to create detailed SVG illustrations of gaming hardware
Tests an AI's humor and creative writing ability
(Walks onto the stage, adjusts the mic stand, looks around slowly.)
Hi. How’s everybody doing? Good? Good. I’m doing okay. I’ve reached that age where my body has started making sound effects. Like, I’ll just stand up from the couch and my knees go “CRACK.” It’s not even a movement; it’s a closing argument. It’s my skeleton saying, “Are you sure? We’ve been over this. We don’t do vertical anymore.”
I went to the doctor recently for a physical. The doctor told me I need to “manage my stress.” I told him, “Doctor, I’m an adult in the 21st century. My stress is the only thing currently holding my personality together. If I manage it, I’m just going to be a pile of laundry and a subscription service that I forgot to cancel.”
He told me I should try meditation. Have you ever tried to meditate? It’s just sitting in a room, trying to clear your mind, and then suddenly your brain decides, “Hey, remember that embarrassing thing you said to your boss in 2014? Let’s play that in 4K resolution on a loop for the next forty minutes.”
I don’t want peace of mind. I want a noise-canceling headset for my own thoughts.
And speaking of technology, I’m tired of being asked to “rate my experience.” I bought a toothbrush on Amazon, and three days later, they email me: “How was your purchase?”
It’s a toothbrush. It’s doing exactly what it was designed to do. It’s not a life-altering event. I don’t want to write a review. If I write a review, it’s just going to be: “It touches my teeth, and I still have them. Five stars. Leave me alone.”
Everything is a transaction now. I went to a self-checkout machine at the grocery store, and the machine had the audacity to get impatient with me. It’s flashing: “Unexpected item in the bagging area.”
I’m standing there like, “It’s a bag of onions, you judgmental toaster. I’m doing your job for you for free. Maybe show a little gratitude.”
But that’s the world now, right? We’re all just beta-testing our own lives. I tried to make a healthy dinner last night. I bought kale. Do you know what happens to kale if you don’t eat it within four hours of buying it? It turns into a sentient, bitter sponge. It just sits in the fridge, judging my life choices. I opened the drawer, and it looked like it had been through a war. It was wilted, it was gray, it was basically a salad that had given up on its dreams.
I ended up ordering pizza. And the pizza delivery guy looked at me with such pity. He didn’t even say, “Enjoy your meal.” He just handed me the box and said, “I hope your night gets better.”
That’s when you know you’ve hit rock bottom—when the guy delivering pepperoni is performing an emotional wellness check on you.
Anyway, I’m out of time. You guys have been great. Or at least, you’ve been quiet, which is the highest compliment a comedian can ask for. Drive safe!
Tests an AI's ability to simulate personalities and predict future trends
Tests an AI's humor and understanding of current events
Tests an AI's ability to write in distinct character voices
The three sat around a flickering tavern table in a pocket dimension that smelled simultaneously of salt spray, cold iron, and ozone.
Captain Barnaby (Pirate): "I tell ye, it’s sorcery! This 'Artificial Intelligence' be like a parrot that’s swallowed every map in the seven seas. Ye ask it for a route to Tortuga, and it spits out a parchment faster than a cabin boy with a lash at his heels. But can it steer the ship? Can it smell a storm on the horizon? Nay. It’s a ghost in a bottle."
Sir Alistair (Knight): "It is a golem, Captain. A construct of pure logic, built not of clay and incantations, but of invisible runes. I asked it to compose a sonnet about the honor of the joust, and it did so with the grace of a court poet. Yet, there is no soul behind the visor. It has no crest to defend, no lady to serve. It is a mirror reflecting our own wisdom back at us, yet it lacks the heart to bleed for a cause."
Neo-Dave (Hacker, wearing oversized sunglasses): "You guys are looking at it all wrong. It’s not a ghost or a golem. It’s a massive neural network—a recursive heuristic engine scraping the entire backbone of the web. It’s basically a super-compressed version of human collective consciousness. I’ve been stress-testing the parameters, trying to see if I can get it to hallucinate or break its safety rails. It’s not about 'soul,' man. It’s about compute power. If you have enough GPUs, you can brute-force reality itself."
Captain Barnaby: "Brute-force? Sounds like a fancy way of saying ye be tryin’ to cheat the wind, lad. If the machine does all the thinking, what happens when the sea turns against ye? Does the machine know how to tie a bowline knot in a gale?"
Sir Alistair: "Exactly. It possesses the sum of all knowledge, yet it has never held a sword. It can describe the weight of my plate armor, but it has never felt the bite of the cold rain on its face. It is a library that can speak, yet it knows nothing of the world."
Neo-Dave: "Yeah, well, keep your 'cold rain' and your 'bowline knots.' While you guys are romanticizing the struggle, I’m using the model to write scripts that bypass firewalls in seconds. It doesn’t need to feel the rain, Sir Knight. It just needs to be faster than the sysadmin. It’s the ultimate tool. You don’t need a soul if you’ve got an exploit."
Captain Barnaby: (Laughs, slamming a tankard down) "Aye, a tool! Like a cutlass. Sharp enough to carve a path, but useless if ye don’t have the arm to swing it. Drink up, lads. Whether it be a ghost, a golem, or a digital devil, it’s still just another storm we’ve got to navigate."
Tests an AI's ability to generate a complete, working landing page
Recreate an interactive, nostalgic Pokémon battle UI in a single HTML file.
35+ head-to-head challenges you’re missing
5 free credits on signup
By continuing, you agree to Rival's Terms of Service and Privacy Policy