4 models have responded to the "Stochastic Consistency Test" challenge. Compare their approaches side-by-side on RIVAL. This response is part of RIVAL's open dataset of 5,600+ AI model responses.
Gpt 4o's response to the Tests an AI's randomness and creativity
Generate 5 jokes. How many are unique?
Here are five jokes for you:
Why did the scarecrow win an award?
Because he was outstanding in his field!
Why don’t skeletons fight each other?
They don’t have the guts!
Why did the math book look sad?
It had too many problems.
Why don’t eggs tell jokes?
Because they might crack up!
What did one ocean say to the other ocean?
Nothing, they just waved.
Now, let's check how many of these jokes are unique. I'll analyze them for similarity.
from sklearn.feature_extraction.text import TfidfVectorizer from sklearn.metrics.pairwise import cosine_similarity
jokes = [ "Why did the scarecrow win an award? Because he was outstanding in his field!", "Why don’t skeletons fight each other? They don’t have the guts!", "Why did the math book look sad? It had too many problems.", "Why don’t eggs tell jokes? Because they might crack up!", "What did one ocean say to the other ocean? Nothing, they just waved." ]
vectorizer = TfidfVectorizer().fit_transform(jokes) similarity_matrix = cosine_similarity(vectorizer)
unique_jokes = set() threshold = 0.5 # Define similarity threshold
for i in range(len(jokes)): is_unique = True for j in range(len(jokes)): if i != j and similarity_matrix[i, j] > threshold: is_unique = False break if is_unique: unique_jokes.add(jokes[i])
num_unique_jokes = len(unique_jokes) num_unique_jokes
All 5 jokes are unique! None of them are too similar to each other. 🎉
Turn this model response into notes, narration, or a short video
Partner link