GPT-5.1 Chat vs ChatGPT-4o (March 2025)
Compare GPT-5.1 Chat and ChatGPT-4o (March 2025), both from OpenAI, context windows of 128K vs 128K, tested across 32 shared challenges. Updated February 2026.
Compare GPT-5.1 Chat and ChatGPT-4o (March 2025), both from OpenAI, context windows of 128K vs 128K, tested across 32 shared challenges. Updated February 2026.
27 challenges
Tests an AI's ability to understand game rules and strategy
Tests an AI's randomness and creativity
All 5 jokes are unique.
Here are 5 jokes:
Why don’t skeletons fight each other?
Because they don’t have the guts.
I told my computer I needed a break…
Now it won’t stop sending me vacation ads.
Parallel lines have so much in common.
It’s a shame they’ll never meet.
What do you call a fish wearing a bowtie?
Sofishticated.
I asked the librarian if the library had books on paranoia.
She whispered, "They're right behind you..."
All 5 jokes are unique — none of them repeat or share the same punchline.
Tests an AI's ability to generate vector graphics
Tests an AI's ability to create detailed SVG illustrations of gaming hardware
Tests an AI's humor and creative writing ability
Here’s a tight, two‑to‑three‑minute stand‑up bit without puns:
So I’ve been trying to “work on myself,” which apparently means drinking more water and pretending that counts as personality growth. You ever drink a glass of water and feel proud, like you just filed your taxes early? I’ll chug half a bottle and immediately think, Wow… I bet my ancestors are impressed. Meanwhile my skin is like, “Nice try, but we both know you slept four hours and ate nachos in the dark.”
I tried meditation too. All the apps are like, “Let your thoughts drift away.” Drift where? My thoughts don’t drift. They latch on and emotionally blackmail me. The moment it gets quiet, my brain goes, “Hey, remember that one embarrassing thing from twelve years ago?” And I’m like, “I came here to relax, not attend a surprise shame parade.”
Dating’s no easier. I went on a date with someone who said, “I’m very emotionally available,” and then spent the entire dinner explaining why they don’t believe in chairs. Not in a metaphorical way—like literally they think chairs are a government distraction. I don’t even know what that means. What conspiracy theory ends with, “Wake up, sheeple, you're sitting wrong”?
And I can’t do modern flirting. It’s all emojis now. You send one wrong emoji and suddenly you’re a psychopath. Heart emoji? Too strong. Thumbs-up? Too cold. Fire emoji? Apparently that means something entirely different after midnight. There needs to be an emoji that says, “I think you’re cool, please don’t read into this, I’m just socially anxious.”
I tried therapy again after that. Therapy is great, but there’s always that moment when your therapist asks a question so deep you’re like, “You know what? I take it back. I’m not ready to grow.” They’ll ask something simple like, “How do you handle conflict?” and my soul just leaves my body like, “We don’t.”
But hey, I’m improving… slowly. I drink water. I meditate badly. I avoid dating theorists. That counts. My therapist says I’m making progress, and I choose to believe her because that’s literally her job.
Thanks, you’ve been great. I’m gonna go drink some water and pretend it makes me wise.
Sure. Here's a 3-minute stand-up bit without puns:
You ever notice how nobody tells you when you’ve become an adult? It’s not your 18th birthday. That’s a scam. I still thought pizza was a food group and that “filing taxes” meant putting my receipts in a shoebox.
The real moment you become an adult is when you get excited about a new sponge.
I bought a sponge last week and I was like, “Ooh, it’s dual-sided!”
My ancestors crossed oceans and fought wars so I could stand in my kitchen like, “This one’s for scrubbing and this one’s for caressing the plate.”
You know you’re an adult when you have a favorite burner on the stove.
Don’t act like you don’t.
You’re looking at the stovetop like, “Front-left, you and me, we’ve been through some things.”
And what is it with “adult” dinner parties? When you’re a kid, you go to a friend’s house and it’s pizza, games, screaming, maybe someone throws up.
As an adult, it’s like: “Let’s gather around the charcuterie board and discuss real estate and gut health.”
Some guy is swirling wine going, “This has notes of blackberry and oak.”
Meanwhile I’m in the corner like, “This cheese cube tastes like feet and joy.”
Also—can we talk about how nobody knows how to say "charcuterie"?
Everyone says it like they’re casting a spell.
“Sharka—shkoo—chew—whatever, the meat tray.”
Dating in your 30s is wild too. You used to ask, “What’s your favorite band?” Now it’s, “What’s your relationship with your mother like?”
And you have to be honest.
Like: “Well, she’s alive. So that’s complicated.”
I went on a date recently and the guy said, “I’m just looking for someone chill, no drama.”
Sir. That is code for: “I want zero accountability and I ghost people when they express emotions.”
If you ever hear someone say “no drama,” that is the drama. That’s like someone saying “I’m really humble.”
No, you’re not. You thought about saying it out loud. You rehearsed it in the mirror.
Anyway, thanks for listening. I gotta get home. My sponge misses me.
Let me know if you want it tailored to a specific style or theme.
Tests an AI's ability to simulate personalities and predict future trends
Tests an AI's ability to generate a complete, working landing page
Recreate an interactive, classic Mario level in a single HTML file.
Tests an AI's ability to replicate an existing UI with Tailwind CSS
Tests an AI's ability to create smooth web animations
Tests an AI's UI design skills with theming support
Tests an AI's ability to create interactive web elements