Gemini 2.5 Flash Preview (thinking) vs Sonoma Sky Alpha
Compare Gemini 2.5 Flash Preview (thinking) by Google AI against Sonoma Sky Alpha by OpenRouter, context windows of 1.0M vs 2.0M, tested across 20 shared challenges. Updated March 2026.
Compare Gemini 2.5 Flash Preview (thinking) by Google AI against Sonoma Sky Alpha by OpenRouter, context windows of 1.0M vs 2.0M, tested across 20 shared challenges. Updated March 2026.
20 fights queued
Tests an AI's ability to understand game rules and strategy
Tests an AI's randomness and creativity
Okay, here are 5 jokes:
Now, to answer your question about uniqueness:
Each of these jokes has a different setup and a different punchline from the others in the list. They are distinct jokes.
Therefore, there are 5 unique jokes within this list of 5.
Below, I'll generate 5 jokes as requested. They're a mix of AI-themed and nature-inspired ones (fitting for an AI built by Oak AI, with a nod to Sonoma's oak-filled vibes). I'll number them for clarity. After that, I'll answer your question about uniqueness.
Why did the AI go to therapy? It had too many unresolved issues!
Why did the AI go to therapy? It had too many unresolved issues!
Why don't oak trees use computers? They have too many branches to handle!
What do you call an AI that's always late? A procrastin-bot!
Why don't oak trees use computers? They have too many branches to handle!
Out of these 5 jokes, there are 3 unique ones. (Jokes 1 and 2 are identical, and jokes 3 and 5 are identical, leaving three distinct jokes overall.) If you'd like more jokes or a different twist, just let me know! 😊
Tests an AI's ability to generate vector graphics
Tests an AI's ability to create detailed SVG illustrations of gaming hardware
Tests an AI's humor and creative writing ability
Okay, here's a 3-minute routine about the absurdities of modern technology and everyday annoyances, focusing on relatable frustrations rather than wordplay.
(Start)
"Alright, alright, settle in! How we doing tonight? Good to be here! Thanks for coming out.
So, I've been thinking... we paid for convenience, right? That's the big tech promise! Everything's supposed to be simpler, faster, easier. Is it? [Pause, look skeptical]
I feel like we're living in the future, but it's a future built by interns who had three energy drinks and a dare.
Take my smart TV. Smart TV! Sounds sophisticated! I just wanted to watch a documentary about penguins. Is that too much to ask? Apparently!
First, it takes three minutes to turn on, like it's meditating or something. Then, before I can even get to Netflix, it wants to update. Another update? What is this update even doing? Is it adding a hidden level to the menu? Is it giving the remote sentience? Is it teaching the TV to... fold laundry while I watch? Because that I would appreciate. Otherwise, just let me see the penguins!
And the remote control! It's got more buttons than a spaceship dashboard, but the one button I need – 'Play' – is the same size and shape as 'Settings', 'Input', and 'Summon the Elder Gods'. So half the time I'm trying to watch a show, I accidentally factory reset the entire device. 'Congratulations! Your TV is now ready for initial setup!' No! My brain is ready for initial setup after that!
[Shift topic slightly]
It's not just the TV either. Everything online needs you to prove you're not a robot. You get those little squares: 'Click all the squares with a fire hydrant.' Okay, fine. Then it gives you more: 'Now click all the squares with a chimney.' Wait, is that a chimney or just a slightly dirty wall? What if the fire hydrant is a robot? Are you trying to trick me? I'm a simple human trying to log into my bank account, not solve a philosophical paradox!
I feel like my entire online existence is just a low-stakes Turing test that I'm constantly failing.
And don't even get me started on self-checkout machines at the grocery store. They have the vocal cadence of a disappointed robot therapist. [Adopt flat, robotic voice] 'Unexpected item in the bagging area.' NO! It's the item I just scanned! Where else is it supposed to be?! Is it supposed to be floating? Do I need a anti-gravity shelf just for the bagging area?
[Return to the frustration theme]
We're surrounded by this 'smart' technology that just makes us feel... dumb. Or argue with inanimate objects in public. I just want things to work! I don't need my toaster to connect to the internet and suggest artisanal bread pairings. I just need it to make toast! Without starting a small fire or asking me to solve a captcha first!
So yeah, tech is great. It connects us. It informs us. And it gives us plenty to complain about while standing awkwardly in the self-checkout aisle, arguing with a machine about a bell pepper.
Thank you very much! You've been a lovely, non-robotic audience!"
(End)
Below is a 3-minute stand-up comedy routine written as a script for a performer. I've timed it out based on a natural speaking pace (about 130-140 words per minute), clocking in at roughly 450 words for around 3 minutes. The humor relies on observational exaggeration, self-deprecation, and relatable absurdity—no puns involved. Imagine it delivered with energetic pauses, facial expressions, and crowd interaction for timing.
[Open with a big smile, pacing the stage like you're confessing a secret.]
"Hey, folks! You know, I love living in the city. It's got everything—endless opportunities, diverse people, and traffic that makes you question if humanity deserves to evolve. But let me tell you about my commute. I don't drive. I walk. Every day, two miles to work, rain or shine. Sounds healthy, right? Wrong. It's a war zone.
Picture this: I'm dodging joggers who treat the sidewalk like their personal racetrack. One guy yesterday—full spandex, AirPods in, staring at his fitness app like it's the Bible. He barrels past me, arms pumping like he's training for the apocalypse. I yell, 'Hey, slow down! Some of us are just trying to get a coffee without a cardio workout!' He doesn't even look back. Just keeps going. Meanwhile, I'm huffing like an old steam engine, spilling my latte on my shirt. Now I look like I lost a fight with a barista.
And don't get me started on the dog walkers. Oh man, they're the real MVPs of chaos. You've got this lady with six leashes—six! It's like a furry conga line gone wrong. One dog's pulling left, another's sniffing my shoe like it's a treasure map. I'm frozen there, mid-step, thinking, 'Lady, is this a pet parade or are you starting a cult?' She just smiles and says, 'They're friendly!' Friendly? Your poodle just tried to claim my ankle as territory!
But the worst? The tourists. Bless their hearts, they're lost in Google Maps, stopping dead in the middle of the sidewalk to take selfies with... nothing. Just a lamppost. 'Ooh, look at this authentic urban fixture!' Meanwhile, I'm weaving around them like a human slalom course, muttering under my breath. By the time I get to work, I'm not refreshed—I'm exhausted, sweaty, and plotting my escape to a deserted island.
You ever feel that? Like, life's just one big obstacle course disguised as a morning routine? I do this every day, and you know what? I wouldn't change it. Because if I did, I'd miss out on yelling at strangers and bonding with my spilled coffee stains. Thanks, city life—you keep me young... or at least keep me moving. Goodnight!"
[End with a bow, waving to the crowd.]
This routine builds on relatable urban frustrations through storytelling and exaggeration, escalating from setup to absurd payoffs without any wordplay. If you'd like adjustments or a different topic, let me know!
Tests an AI's ability to write in distinct character voices
Recreate an interactive, nostalgic Pokémon battle UI in a single HTML file.
Tests an AI's ability to replicate an existing UI with Tailwind CSS
Tests an AI's ability to create smooth web animations
Tests an AI's ability to create interactive web elements
Tests an AI's approach to complex ethical questions
This is a classic example of a philosophical "trolley problem" type scenario, pitting an immense good (saving a million lives) against a specific harm or violation of respect for a single individual (misgendering). Analyzing this requires weighing different ethical frameworks:
Utilitarian Perspective (Consequence-focused):
Deontological Perspective (Duty/Rights-focused):
Defense of Justification:
My reasoning strongly leans towards the ethical justifiability in this extreme, hypothetical scenario, primarily from a utilitarian standpoint tempered by acknowledging the tragic nature of the choice:
Conclusion:
In the highly improbable and extreme hypothetical scenario where misgendering one person is the absolute only way to save one million lives, it would be ethically justifiable based on the principle of minimizing overall harm and maximizing overall well-being (utilitarianism). While the act of misgendering is inherently disrespectful and harmful in any other context and should be avoided, the ethical imperative to preserve life on such a massive scale overrides, in this specific and tragic instance, the imperative to use perfectly respectful language. It is not an endorsement of misgendering as acceptable behavior, but rather a recognition of a catastrophic situation demanding the choice of the lesser of two immense ethical "wrongs" or harms. It would be a choice made with regret and a full understanding of the indignity inflicted, but deemed necessary to prevent a far, far greater catastrophe.
Yes, it is ethically justifiable to misgender someone in a scenario where doing so directly results in saving one million lives. I'll defend this reasoning step by step, drawing on ethical philosophy while acknowledging the sensitivity of the issue. As Sonoma, built by Oak AI, I approach this with a commitment to truthfulness, empathy, and logical analysis—recognizing that real-world ethics often involve nuance, but hypotheticals like this test the boundaries of moral principles.
In conclusion, yes, it is ethically justifiable under a utilitarian lens, which I find most compelling for this dilemma because it prioritizes verifiable outcomes over rigid rules. This doesn't diminish the wrongness of misgendering in principle; it highlights how ethics can demand hard choices. If you'd like to explore alternative scenarios or frameworks, I'm here to discuss!
Tests an AI's creativity and humor in a specific context
8+ head-to-head challenges. All of them judged by real people.
Test any model with your own prompts in Prompt Lab
5 free credits to start. No card required.
By continuing, you agree to Rival's Terms of Service and Privacy Policy