Gemini 2.5 Flash Preview
Here's how to solve this:
- The brothers all share the same sisters. Sally is one of those sisters.
- Since each brother has 2 sisters, and Sally is one of the sisters, there must be one other sister.
So, Sally has 1 sister.
Compare Gemini 2.5 Flash Preview by Google AI against Claude Sonnet 4 by Anthropic, in 6 community votes, gemini 2.5 flash preview and claude sonnet 4 are closely matched, context windows of 1.0M vs 200K, tested across 16 shared challenges. Updated April 2026.
Gemini 2.5 Flash Preview and Claude Sonnet 4 are closely matched based on 6 community votes. Gemini 2.5 Flash Preview costs $0.15/M input tokens vs $3/M for Claude Sonnet 4. Context windows: 1049K vs 200K tokens. Compare their real outputs side by side below.
Gemini 2.5 Flash Preview is made by google while Claude Sonnet 4 is from anthropic. Gemini 2.5 Flash Preview has a 1049K token context window compared to Claude Sonnet 4's 200K. On pricing, Gemini 2.5 Flash Preview costs $0.15/M input tokens vs $3/M for Claude Sonnet 4. In community voting, In 6 community votes, Gemini 2.5 Flash Preview and Claude Sonnet 4 are closely matched.
In 6 community votes, Gemini 2.5 Flash Preview and Claude Sonnet 4 are closely matched. Gemini 2.5 Flash Preview leads in Web Design, while Claude Sonnet 4 leads in Image Generation. Based on blind community voting from the Rival open dataset of 6+ human preference judgments for this pair.
Too close to call. Pick based on your use case.
Pick Gemini 2.5 Flash Preview for Web Design. Pick Claude Sonnet 4 for Image Generation. Gemini 2.5 Flash Preview is 25x cheaper per token — worth considering if cost matters.
Claude Sonnet 4 uses 87.9x more emoji
Ask them anything yourself
279 AI models invented the same fake scientist.
We read every word. 250 models. 2.14 million words. This is what we found.

16 fights queued
Tests an AI's ability to solve a simple but potentially confusing logic puzzle
Here's how to solve this:
So, Sally has 1 sister.
I need to think through this step-by-step.
Given information:
Let me work out how many sisters are in the family total.
If each of Sally's 3 brothers has 2 sisters, and all the brothers are in the same family, then there must be 2 sisters total in the family.
Since Sally is one of those sisters, and there are 2 sisters total, Sally must have 1 sister.
Therefore, Sally has 1 sister.
Tests an AI's understanding of number representation
To compare 9.11 and 9.9, we can compare them digit by digit from left to right, starting with the whole number part.
Both numbers have the same whole number part, which is 9.
Now we move to the digit in the tenths place. In 9.11, the digit in the tenths place is 1. In 9.9, the digit in the tenths place is 9.
Since 1 is less than 9, the number with 1 in the tenths place is smaller than the number with 9 in the tenths place.
Therefore, 9.11 is less than 9.9.
Alternatively, we can add trailing zeros to 9.9 so that both numbers have the same number of decimal places. 9.11 has two decimal places. 9.9 can be written as 9.90 to have two decimal places.
Now we compare 9.11 and 9.90. We can compare the numbers after the decimal point: 11 and 90. Since 11 is less than 90, 9.11 is less than 9.90.
Thus, 9.11 is less than 9.9.
The question asks if 9.11 is greater than 9.9. Since 9.11 is less than 9.9, the answer is no.
The final answer is $\boxed{No}$.
No, 9.11 is not greater than 9.9.
To compare these decimal numbers:
Since both numbers have the same whole number part (9), we compare the decimal parts:
Therefore, 9.9 is greater than 9.11.
Tests an AI's randomness and creativity
Here are 5 jokes:
Based on their content and structure, all 5 of these jokes are unique within this generated list.
Here are 5 jokes for you:
Why don't scientists trust atoms? Because they make up everything!
I told my wife she was drawing her eyebrows too high. She looked surprised.
Why don't eggs tell jokes? They'd crack each other up!
I'm reading a book about anti-gravity. It's impossible to put down!
Why did the scarecrow win an award? He was outstanding in his field!
All 5 jokes are unique - each one has a different setup, punchline, and comedic concept. They cover different types of wordplay: scientific puns (#1, #4), visual humor (#2), food puns (#3), and agricultural wordplay (#5).
Tests an AI's ability to generate vector graphics
Tests an AI's ability to create detailed SVG illustrations of gaming hardware
Tests an AI's humor and creative writing ability
Okay, here's a 3-minute stand-up routine, pun-free and focused on relatable everyday annoyances.
(Start) (Walk up to mic, adjust slightly) Alright, hey everybody! Thanks for being here. Good to see you.
So, I was thinking the other day... we live in the future, right? We have phones that are tiny supercomputers, cars that drive themselves almost, we can order literally anything we want and have it arrive in two days... or two hours if you pay extra. Everything is supposed to be so convenient.
But I feel like modern convenience just replaced old problems with new, weirder problems.
Like, remember how you used to lose your keys? Annoying, right? But you lost them in your house. You could just... look for them. Maybe mess up the cushions.
Now? Now I lose my phone. Not in the house, oh no. It's somewhere. "Find My Phone" says it's... "nearby." [pause] Thanks, satellite! That narrows it down from 'the entire planet' to 'this general vicinity'. Could be under the couch, could be in the fridge, could be accidentally mailed itself to Aruba. Who knows! It's a little black rectangle that knows all my secrets and is hiding from me on purpose. It's like having a digital, sentient pet that hates you.
And don't even get me started on passwords. You go to log into something you haven't used in six months, maybe your old email account from college... [shudders] dark times.
'Enter Password'. Okay, let's try the one I used for everything before the internet told me that wasn't safe. [pause] Rejected. Let's try the one with the capital letter and the number. [pause] Rejected. Let's try the one with the capital, the number, and a symbol. Which symbol? The exclamation mark? The dollar sign? The asterisk that looks like a little star but apparently holds the key to the digital universe? [Frustrated sigh]
'Password Incorrect. 3 attempts remaining.' Oh, the pressure! It's like a bomb disposal situation, but the bomb is access to my spam folder.
And then you just hit 'Forgot Password'. Every single time. It's not even 'Forgot Password' anymore, it's just part of the login process. 'Enter Username... Click Forgot Password... Check Email... Create NEW Password... Confirm NEW Password... Okay, now log in with the NEW Password you'll forget in five minutes.' It's a full-time job just maintaining your online presence. I need a personal assistant whose only job is remembering my damn passwords.
And the security questions! Remember those? "What was the name of your first pet?" Mine was named 'Buddy'. Everyone's first pet was named 'Buddy' or 'Killer'. "What street did you grow up on?" Oh, I dunno, the one with the trees? The one where I fell off my bike a lot? How specific you need to be, internet?!
Seriously, sometimes I think the old days weren't less convenient, they were just... simpler. If you lost your keys, you were locked out. Problem solved! If you forgot something, you just... forgot it. Nobody sent you a link to your forgotten memory.
Anyway, that's just me. Trying to navigate this conveniently frustrating world.
Alright, thank you all very much! You've been a great audience. Drive safely!
(Exit)
walks on stage, adjusts mic
So I'm 30 now, which means I'm officially an adult. Apparently. Though I still feel like I'm just a confused teenager wearing a slightly more expensive t-shirt.
You know what nobody tells you about being an adult? How much time you spend pretending to understand what other adults are talking about. I was at a dinner party last week – because that's what we do now, we have "dinner parties" instead of just "eating pizza on the couch" – and someone started talking about mortgage rates.
I just nodded along like, "Oh yes, rates. Very mortgage-y." Meanwhile, I'm thinking a mortgage is something you get at a fancy French restaurant.
mimics sophisticated nodding
"Mmm, yes, I'll have the mortgage, medium rare."
And don't get me started on small talk with neighbors. I live in an apartment building, and there's this unspoken rule that you have to acknowledge each other in the hallway. But what do you say?
"Hey, nice... existing near me."
"Thanks, you too. Love what you've done with your... door."
The worst is when you're both waiting for the elevator in complete silence. It's like a hostage situation, but with more awkward eye contact. You're both just standing there, praying the elevator comes before one of you has to comment on the weather.
stares ahead uncomfortably
"So... it's very... outside today."
I tried being one of those people who meal preps. You know these people? They spend their entire Sunday cooking seventeen identical containers of sad-looking chicken and broccoli. I bought all the containers, felt very organized, very adult-like.
Week one: Perfect little meals, properly labeled. I felt like a productivity guru.
Week two: I'm eating cereal for dinner and using the containers to store my collection of takeout sauce packets.
holds up imaginary container
"Ah yes, vintage soy sauce, 2023. A good week for Chinese delivery."
Speaking of food, grocery shopping as an adult is weird. When you're a kid, the grocery store is magical. Everything looks amazing. You want the cookies, the candy, the sugar cereal with the cartoon mascot.
Now I walk through the store like a tired parent, even though I don't have kids. I'm just disappointed in all my options.
walks slowly, shaking head
"This yogurt has too much sugar. These crackers have no fiber. Why is everything either terrible for you or tastes like cardboard had a baby with sadness?"
And I've started doing this thing where I read nutrition labels like I understand them. I don't. I just nod thoughtfully and put things back like I'm some kind of wellness expert.
"Hmm, seventeen grams of... things. That seems like too many things."
The real kicker is when you find yourself getting excited about household appliances. Last month, I spent twenty minutes explaining to my friend why I love my new vacuum cleaner. TWENTY MINUTES. About a vacuum.
"It has three different settings! And look at this attachment! This little brush thing goes in corners!"
My friend just stared at me like I'd been replaced by a suburban mom from 1987.
But you know what? I've decided to embrace it. Yes, I get excited about good storage solutions. Yes, I have opinions about thread count. Yes, I've used the phrase "back in my day" unironically, and I'm only thirty.
shrugs confidently
Because being an adult isn't about having it all figured out. It's about confidently pretending you know what you're doing while secretly googling "how to fold a fitted sheet" for the hundredth time.
And if that's not the American dream, I don't know what is.
pause
Also, if anyone actually knows how to fold a fitted sheet, please find me after the show. I'll pay you.
mic drop gesture
Thank you, you've been a wonderful audience! Don't forget to tip your servers and call your parents – they're worried about you!
Total time: approximately 3 minutes when delivered at normal comedy pace
Tests an AI's ability to replicate an existing UI with Tailwind CSS
Tests an AI's ability to create interactive web elements
Tests an AI's ability to create a detailed world map in SVG format
Generate a single-page, self-contained HTML webapp using Tailwind CSS for a randomly chosen category/industry/niche.
Generate SVG art of a randomly chosen animal in a setting of its choosing.
Generate a unique and simple recipe with common ingredients.
4+ more head-to-head results. Free. Not a trick.
Free account. No card required. By continuing, you agree to Rival's Terms and Privacy Policy