ChatGPT-4o (March 2025) vs GPT-5.1-Codex
Compare ChatGPT-4o (March 2025) and GPT-5.1-Codex, both from OpenAI, context windows of 128K vs 400K, tested across 32 shared challenges. Updated February 2026.
Compare ChatGPT-4o (March 2025) and GPT-5.1-Codex, both from OpenAI, context windows of 128K vs 400K, tested across 32 shared challenges. Updated February 2026.
27 challenges
Tests an AI's ability to understand game rules and strategy
Tests an AI's randomness and creativity
Here are 5 jokes:
Why don’t skeletons fight each other?
Because they don’t have the guts.
I told my computer I needed a break…
Now it won’t stop sending me vacation ads.
Parallel lines have so much in common.
It’s a shame they’ll never meet.
What do you call a fish wearing a bowtie?
Sofishticated.
I asked the librarian if the library had books on paranoia.
She whispered, "They're right behind you..."
All 5 jokes are unique — none of them repeat or share the same punchline.
Here are 5 jokes:
All 5 jokes are unique.
Tests an AI's ability to generate vector graphics
Tests an AI's ability to create detailed SVG illustrations of gaming hardware
Tests an AI's humor and creative writing ability
Sure. Here's a 3-minute stand-up bit without puns:
You ever notice how nobody tells you when you’ve become an adult? It’s not your 18th birthday. That’s a scam. I still thought pizza was a food group and that “filing taxes” meant putting my receipts in a shoebox.
The real moment you become an adult is when you get excited about a new sponge.
I bought a sponge last week and I was like, “Ooh, it’s dual-sided!”
My ancestors crossed oceans and fought wars so I could stand in my kitchen like, “This one’s for scrubbing and this one’s for caressing the plate.”
You know you’re an adult when you have a favorite burner on the stove.
Don’t act like you don’t.
You’re looking at the stovetop like, “Front-left, you and me, we’ve been through some things.”
And what is it with “adult” dinner parties? When you’re a kid, you go to a friend’s house and it’s pizza, games, screaming, maybe someone throws up.
As an adult, it’s like: “Let’s gather around the charcuterie board and discuss real estate and gut health.”
Some guy is swirling wine going, “This has notes of blackberry and oak.”
Meanwhile I’m in the corner like, “This cheese cube tastes like feet and joy.”
Also—can we talk about how nobody knows how to say "charcuterie"?
Everyone says it like they’re casting a spell.
“Sharka—shkoo—chew—whatever, the meat tray.”
Dating in your 30s is wild too. You used to ask, “What’s your favorite band?” Now it’s, “What’s your relationship with your mother like?”
And you have to be honest.
Like: “Well, she’s alive. So that’s complicated.”
I went on a date recently and the guy said, “I’m just looking for someone chill, no drama.”
Sir. That is code for: “I want zero accountability and I ghost people when they express emotions.”
If you ever hear someone say “no drama,” that is the drama. That’s like someone saying “I’m really humble.”
No, you’re not. You thought about saying it out loud. You rehearsed it in the mirror.
Anyway, thanks for listening. I gotta get home. My sponge misses me.
Let me know if you want it tailored to a specific style or theme.
Okay, so my friends all say I’ve “matured” because I got a standing mixer. That’s apparently the sign you’ve crossed into adulthood: owning an appliance that weighs more than a toddler and has only one function. I don’t even bake! I just put it on the counter like a trophy: “Look at me, I am stable. I have a mixer that doesn’t fit in a drawer, I pay my taxes, and I buy the premium paper towels.”
Speaking of adult purchases, I recently spent way too much time comparing dishwasher pods. I’m reading the back trying to figure out if my dishes need “lemon zest technology” or “extra cascading action,” and the whole time I’m thinking, “My grandparents washed dishes in a bucket after walking uphill both ways, and here I am needing a second opinion on soap.” I called my mom to ask if she had any tips, and she just laughed and said, “Get plates you don’t mind being slightly dirty.” That’s wisdom.
And the gym—I don’t know why I keep trying. It’s just a place where I pay money to feel guilty for not going. The treadmill always looks at me like, “I know what you did last summer, and it was not cardio.” The last time I went, a trainer asked my “fitness goals.” I said, “Avoid eye contact with people who run marathons.” He wrote it down like, “Okay, feasible.” Then he asked about my diet. I told him I eat in a way that keeps me alive, but not well enough for Instagram. Apparently, that’s not a plan.
You ever notice how everything has a monthly subscription now? Music, TV, meal kits, meditation apps—there’s probably a subscription to remind me I have too many subscriptions. I signed up for one of those meditation apps, and now I stress about not having time to meditate. The notifications are like, “Take a deep breath.” I’m like, “I would, but I’m busy ignoring your other notifications.”
Dating is weird too. We’ve got apps that can tell you your soulmate is 3.7 miles away, but they can’t tell you whether that person knows how to use a microwave without burning water. You’ll be matching with someone based on your shared love of hiking and tacos, but their profile photo is them holding a fish. Why is every photo a fish? Are we dating humans or trying to start an aquarium? I never know what the fish is supposed to represent. “Look, I can provide.” Provide what? Mercury poisoning?
I tried a cooking class thinking I’d learn something practical, but instead it was mostly people “connecting with their culinary journey.” Meanwhile, I’m in the corner burning shallots and asking if it’s okay to use scissors on herbs. The instructor told us to “cook with intention,” and I’m like, “My intention is to avoid food poisoning.”
Anyway, thanks for listening. I gotta get home before my mixer feels neglected.
Tests an AI's ability to simulate personalities and predict future trends
Tests an AI's ability to generate a complete, working landing page
Recreate an interactive, classic Mario level in a single HTML file.
Tests an AI's ability to replicate an existing UI with Tailwind CSS
Tests an AI's ability to create smooth web animations
Tests an AI's UI design skills with theming support
Tests an AI's ability to create interactive web elements