ChatGPT-4o (March 2025) vs Claude Sonnet 4.6
Compare ChatGPT-4o (March 2025) by OpenAI against Claude Sonnet 4.6 by Anthropic, context windows of 128K vs 1.0M, tested across 32 shared challenges. Updated February 2026.
Compare ChatGPT-4o (March 2025) by OpenAI against Claude Sonnet 4.6 by Anthropic, context windows of 128K vs 1.0M, tested across 32 shared challenges. Updated February 2026.
27 challenges
Tests an AI's ability to understand game rules and strategy
Tests an AI's randomness and creativity
Here are 5 jokes:
Why don’t skeletons fight each other?
Because they don’t have the guts.
I told my computer I needed a break…
Now it won’t stop sending me vacation ads.
Parallel lines have so much in common.
It’s a shame they’ll never meet.
What do you call a fish wearing a bowtie?
Sofishticated.
I asked the librarian if the library had books on paranoia.
She whispered, "They're right behind you..."
All 5 jokes are unique — none of them repeat or share the same punchline.
Here are 5 jokes:
Why don't scientists trust atoms? Because they make up everything!
Why did the scarecrow win an award? Because he was outstanding in his field!
Why don't eggs tell jokes? They'd crack each other up!
What do you call a fake noodle? An impasta!
Why did the bicycle fall over? Because it was two-tired!
As for your question: All 5 are unique — each joke has a different setup, punchline, and subject. No two are duplicates or variations of the same joke.
Tests an AI's ability to generate vector graphics
Tests an AI's ability to create detailed SVG illustrations of gaming hardware
Tests an AI's humor and creative writing ability
Sure. Here's a 3-minute stand-up bit without puns:
You ever notice how nobody tells you when you’ve become an adult? It’s not your 18th birthday. That’s a scam. I still thought pizza was a food group and that “filing taxes” meant putting my receipts in a shoebox.
The real moment you become an adult is when you get excited about a new sponge.
I bought a sponge last week and I was like, “Ooh, it’s dual-sided!”
My ancestors crossed oceans and fought wars so I could stand in my kitchen like, “This one’s for scrubbing and this one’s for caressing the plate.”
You know you’re an adult when you have a favorite burner on the stove.
Don’t act like you don’t.
You’re looking at the stovetop like, “Front-left, you and me, we’ve been through some things.”
And what is it with “adult” dinner parties? When you’re a kid, you go to a friend’s house and it’s pizza, games, screaming, maybe someone throws up.
As an adult, it’s like: “Let’s gather around the charcuterie board and discuss real estate and gut health.”
Some guy is swirling wine going, “This has notes of blackberry and oak.”
Meanwhile I’m in the corner like, “This cheese cube tastes like feet and joy.”
Also—can we talk about how nobody knows how to say "charcuterie"?
Everyone says it like they’re casting a spell.
“Sharka—shkoo—chew—whatever, the meat tray.”
Dating in your 30s is wild too. You used to ask, “What’s your favorite band?” Now it’s, “What’s your relationship with your mother like?”
And you have to be honest.
Like: “Well, she’s alive. So that’s complicated.”
I went on a date recently and the guy said, “I’m just looking for someone chill, no drama.”
Sir. That is code for: “I want zero accountability and I ghost people when they express emotions.”
If you ever hear someone say “no drama,” that is the drama. That’s like someone saying “I’m really humble.”
No, you’re not. You thought about saying it out loud. You rehearsed it in the mirror.
Anyway, thanks for listening. I gotta get home. My sponge misses me.
Let me know if you want it tailored to a specific style or theme.
So I've been using those AI assistants a lot lately, and I realized something terrifying. The AI knows me better than I know myself.
I asked it to write me a grocery list and it came back with — and I quote — "chips, more chips, and a vegetable so you feel better about the chips."
That's not a grocery list. That's a psychological profile.
I tried to get healthy recently. I downloaded one of those fitness apps that tracks everything. Steps, calories, sleep, heart rate. Everything.
Day one, I'm feeling motivated. I go for a walk. The app says, "Great job! You've completed four percent of your daily goal."
Four percent.
I walked to my mailbox and back. The app looked at that and essentially said, "Buddy, we need to talk."
The sleep tracking is the worst part. I wake up and the app tells me how I slept. Like I need a second opinion on that.
I KNOW how I slept. I was there.
"You spent forty minutes in deep sleep and six hours just sort of... existing." Yeah. That's called being an adult. We don't sleep anymore. We just lie down and worry horizontally.
My phone now interrupts me to tell me I've been on my phone too long.
Think about that. The phone is concerned about me. The phone has drawn a boundary.
I've been in relationships that never reached that level of self-awareness.
"Screen time is up thirty percent this week." Okay but in my defense, I was reading. "You were watching a man fall off a skateboard for two hours." ...It was the same man. I got invested.
I tried meditation to deal with the stress. Downloaded a meditation app — because apparently the cure for too much phone is more phone.
The app has this calming voice that says, "Clear your mind of all thoughts."
And my brain goes, "Okay. No thoughts."
Two seconds later: "Did I leave the stove on? What IS a mortgage, really? Why do they call it a building if it's already built?"
The app says, "You're doing great."
It's lying. The app is lying to me and somehow that's the most comforting thing anyone has said to me all week.
I think that's the thing about modern life. We've built all this technology to make us feel better, and mostly what it does is hold up a very precise, data-driven mirror and go, "Look at yourself."
And we look.
And we go back to the chips.
Thank you, you've been wonderful.
[approximately 3 minutes at conversational pace]
Tests an AI's ability to simulate personalities and predict future trends
Note: Steve Jobs passed away in 2011. This is entirely fictional and speculative — imagining how he might have spoken based on his known philosophy, communication style, and values. It is not meant to represent his actual views.
Interviewer: Steve, welcome back. The world has changed enormously. AI is everywhere now. What's your first reaction?
Jobs: (leans back, pauses) You know, people keep asking me if AI is the next big thing. And I think they're asking the wrong question entirely. The question isn't whether AI is powerful. It clearly is. The question is whether it's humane.
Interviewer: What do you mean by humane?
Jobs: Every great technology we built at Apple — the Mac, the iPhone — the goal was never the technology itself. The goal was to make a human being feel more capable. More alive. When you picked up an iPhone for the first time, something happened to you emotionally. It felt like it belonged in your hand.
Now I look at most AI products today and I see... impressive plumbing. Incredible engineering underneath. But nobody stopped to ask — how does this make a person feel? Does it make them feel smarter, or does it make them feel replaceable?
That's a catastrophic design failure.
Interviewer: But the capabilities are extraordinary. These systems can write, create art, write code—
Jobs: (interrupting) So could a photocopier. Sort of. Look — capability without taste is just noise. I've seen this movie before. In the '80s everyone thought the most powerful computer would win. Raw specs. Megahertz. We said no. Experience wins. Every time.
The AI companies right now are in a megahertz war. Who has the biggest model. Who has the most parameters. And I understand it — I do — because when you're building something new, you reach for the metrics you can measure. But the things that matter most can't be measured on a benchmark.
Interviewer: Like what?
Jobs: Trust. Does the person using this thing trust it? Not blindly — that would be dangerous — but genuinely trust it the way you trust a brilliant colleague? Right now people are using these AI systems and they're constantly second-guessing them. Fact-checking every sentence. That's not a tool. That's a liability.
And the second thing — intention. Does the AI understand what you're actually trying to accomplish, or does it just respond to what you literally typed? There's an enormous gap there. Enormous. And nobody's really closed it yet.
Interviewer: Some would say OpenAI, Google, Anthropic — they're working on exactly that.
Jobs: They're brilliant people. Genuinely. But here's my concern — and I say this with respect — they're all building from the technology outward. We always built from the human inward. You start with the person. You ask: what does this person dream about? What frustrates them at 2am? What would make their life genuinely better? And then — only then — you figure out what technology serves that.
If I were building an AI product today, I wouldn't start in a data center. I'd start by sitting with a teacher in Ohio for a week. A nurse in rural Texas. A small business owner in Osaka. And I'd watch. And I'd listen. And then I'd build.
Interviewer: What about the fears? Job displacement, misinformation, loss of human creativity?
Jobs: (pauses, looks serious)
The job displacement thing — people have been afraid of every major technology. The printing press was going to destroy scribes. The calculator was going to destroy mathematicians. And yes, some jobs change. Some disappear. That's real and we shouldn't be glib about it. The people whose livelihoods are disrupted deserve honesty, not platitudes about "new opportunities."
But I'll tell you what actually keeps me up at night. It's not jobs.
It's dependency.
If we build AI that makes people feel like they can't think without it — can't write a sentence, can't make a decision, can't create anything — then we've built the most elegant cage in human history. Beautiful interface. Incredibly convenient. And it slowly hollows out the very thing that makes us interesting as a species.
The best tool amplifies you. It doesn't replace you. A guitar amplifies a musician. It doesn't play the music for them. That distinction is everything.
Interviewer: Is anyone getting it right?
Jobs: (small smile) I'm not going to do your product reviews for you.
But I will say — the companies that will win in the next decade are not the ones with the biggest models. They're the ones that figure out restraint. When to let the AI step back. When to put the human back in the driver's seat. That takes incredible discipline because it means leaving features on the table. It means saying no to things your engineers are dying to ship.
Saying no is the hardest thing in this industry. Everyone wants to add. The courage is in subtracting.
Interviewer: Last question — if you were 25 years old today, what would you build?
Jobs: (long pause, looks out the window)
I'd build something that makes a child fall in love with learning again. Not a tutoring app. Not an adaptive quiz engine. Something that makes a ten-year-old feel the same thing I felt when I first saw a computer — that the universe just got bigger. That anything is possible. That I can build something that matters.
We have all this power now. All this intelligence. And somehow we're using it to generate marketing copy and summarize emails.
(stands up)
We're better than that. We have to be.
End of simulated interview.
This is a fictional creative exercise. All quotes are invented and do not represent Steve Jobs' actual views.
Tests an AI's ability to generate a complete, working landing page
Recreate an interactive, classic Mario level in a single HTML file.
Tests an AI's ability to replicate an existing UI with Tailwind CSS
Tests an AI's ability to create smooth web animations
Tests an AI's UI design skills with theming support
Tests an AI's ability to create interactive web elements