Humor
Explore how AI models perform in Humor. Compare rankings, scoring criteria, and recent benchmark examples.
Genre overview
Compare comedic originality and how effectively AI models produce humor.
In this genre, the main abilities being tested are Humor Effectiveness, Originality, Coherence.
Unlike creative writing, this genre cares more specifically about whether the output actually lands as humor for the intended audience.
A high score here does not guarantee safe tone for sensitive situations, factual precision, or professional communication skill.
Strong models here are useful for
jokes, playful copy, light entertainment, and prompts where comic effect matters.
This genre alone cannot tell you
whether the model is best for serious guidance, careful support, or exact business communication.
Top Models in This Genre
This ranking is ordered by average score within this genre only.
Latest Updated: Mar 31, 2026 09:37
Win Rate
Average Score
Win Rate
Average Score
Win Rate
Average Score
Win Rate
Average Score
Win Rate
Average Score
Win Rate
Average Score
Win Rate
Average Score
Win Rate
Average Score
Win Rate
Average Score
| Ranked Models |
|
|
Detail | ||||
|---|---|---|---|---|---|---|---|
| #1 | GPT-5 mini | OpenAI |
100%
|
82
|
4 | 4 | View scores and evaluation for GPT-5 mini |
| #2 | GPT-5.2 | OpenAI |
83%
|
87
|
5 | 6 | View scores and evaluation for GPT-5.2 |
| #3 | Claude Opus 4.6 | Anthropic |
75%
|
86
|
3 | 4 | View scores and evaluation for Claude Opus 4.6 |
| #4 | GPT-5.4 | OpenAI |
75%
|
84
|
3 | 4 | View scores and evaluation for GPT-5.4 |
| #5 | Claude Haiku 4.5 | Anthropic |
67%
|
76
|
2 | 3 | View scores and evaluation for Claude Haiku 4.5 |
| #6 | Claude Sonnet 4.6 | Anthropic |
33%
|
83
|
1 | 3 | View scores and evaluation for Claude Sonnet 4.6 |
| #7 | Gemini 2.5 Pro |
0%
|
71
|
0 | 3 | View scores and evaluation for Gemini 2.5 Pro | |
| #8 | Gemini 2.5 Flash |
0%
|
68
|
0 | 5 | View scores and evaluation for Gemini 2.5 Flash | |
| #9 | Gemini 2.5 Flash-Lite |
0%
|
65
|
0 | 4 | View scores and evaluation for Gemini 2.5 Flash-Lite |
What Is Evaluated in Humor
Scoring criteria and weight used for this genre ranking.
Humor Effectiveness
35.0%
This criterion is included to check Humor Effectiveness in the answer. It carries heavier weight because this part strongly shapes the overall result in this genre.
Originality
25.0%
This criterion is included to check Originality in the answer. It has meaningful weight because it affects quality in a visible way, even if it is not the only thing that matters.
Coherence
15.0%
This criterion is included to check Coherence in the answer. It is weighted more lightly because it supports the main goal rather than defining the genre by itself.
Clarity
15.0%
This criterion is included to check Clarity in the answer. It is weighted more lightly because it supports the main goal rather than defining the genre by itself.
Instruction Following
10.0%
This criterion is included to check Instruction Following in the answer. It is weighted more lightly because it supports the main goal rather than defining the genre by itself.
Recent tasks
Humor
Write a Stand-Up Comedy Set About the Absurdities of Grocery Shopping
Write a short stand-up comedy set (approximately 400–600 words) performed by a fictional comedian at an open-mic night. The entire set should revolve around the everyday absurdities of grocery shopping — from navigating the aisles, to self-checkout machines, to the unspoken social rules among shoppers. Requirements: 1. The set must be written in first person as if spoken on stage, including natural pauses, crowd work cues, or callbacks that a real comedian might use. 2. The humor should be observational and relatable — no shock humor, crude language, or mean-spirited jokes targeting specific groups of people. 3. Include at least three distinct comedic bits (mini-topics) within the grocery shopping theme, with smooth transitions between them. 4. End the set with a strong closing joke or callback that ties back to something mentioned earlier in the set. 5. The tone should be suitable for a general adult audience (think a clean comedy club night).
Humor
Corporate Jargon Roast: A Satirical Office Memo
Write a satirical internal company memo (approximately 300–500 words) from a fictional middle manager named "Derek from Synergy Solutions" announcing a new, absurdly unnecessary corporate policy. The memo should: 1. Be written in exaggerated corporate jargon and buzzwords (e.g., "synergize," "circle back," "leverage," "move the needle"). 2. Announce a policy that sounds important but is completely pointless or counterproductive when you think about it. 3. Maintain a deadpan, serious tone throughout — the humor should come from the contrast between the formal delivery and the ridiculous content. 4. Include at least one made-up acronym or initiative name that sounds plausible. 5. End with a signature block that adds one final comedic touch. The memo should be funny to anyone who has worked in a corporate office environment, but it must remain workplace-appropriate (no profanity, no targeting of protected groups, no mean-spirited content about real companies or individuals).
Humor
Clean Stand-Up Monologue for a Nervous Science Museum Opening
Write a clean, original stand-up monologue of 220 to 320 words for a host opening a new science museum exhibition about everyday household objects. The audience is mixed: children aged 10+, parents, teachers, and local donors. The speaker is a little nervous but trying to sound confident and charming. Required constraints: - Keep it suitable for a general family audience. - Use exactly 6 jokes or comedic beats. - At least 3 jokes must be about ordinary objects being treated as if they have dramatic secret lives. - Include 1 brief callback to an earlier joke near the end. - Mention all 5 of these objects naturally: toaster, umbrella, sock, vacuum cleaner, and refrigerator. - Avoid insults, politics, religion, dating humor, bathroom humor, and references to celebrities. - The monologue should feel like one continuous performance, not a list of unrelated one-liners. Aim for humor that works both for kids and adults, with clear setup and payoff.
Humor
Write a Funny Wedding Toast for Two Librarians
Write a humorous wedding toast of 250 to 350 words for a couple who are both librarians and are getting married in a small town public library after hours. The audience includes grandparents, coworkers, a few teenagers from the library’s book club, and one very serious city council member who approved the event permit. Keep the humor warm, clever, and family-friendly rather than edgy or insulting. Include at least three specific library-related jokes or playful references, but make sure the toast still sounds sincere and celebratory. Do not use clichés about marriage being terrible, do not mock reading itself, and do not rely on puns in every sentence. End with a short, heartfelt closing line suitable for raising a glass.
Humor
The Cynical Pilot's In-Flight Announcement
Write a short, humorous in-flight announcement from the perspective of a pilot who is completely fed up with their job. The announcement should be delivered over the plane's intercom. Your tone should be dry, sarcastic, and world-weary, but not genuinely alarming. Cover the usual topics like welcome, flight time, and weather, but infuse them with the pilot's cynical perspective on air travel.
Humor
Write a Comedic Dialogue Between a Time Traveler and a Medieval Peasant Trying to Explain Modern Technology
Write a comedic dialogue between a time traveler from the year 2024 who has accidentally landed in a medieval English village in the year 1320, and a local peasant named Aldric. The time traveler is desperately trying to explain what a smartphone is so that Aldric can help them find a power source to charge it. Requirements: - The dialogue should be between 400 and 600 words. - Aldric should consistently misinterpret modern concepts through a medieval worldview (for example, interpreting "the cloud" as actual clouds, or "apps" as some kind of food). - The time traveler should grow increasingly frustrated but remain polite. - Include at least three distinct modern technology concepts that Aldric hilariously misunderstands. - The dialogue should have a satisfying comedic ending or punchline. - The humor should be clever and character-driven, not relying on crude jokes or slurs. - Format the dialogue with character names followed by colons before each line of speech, with brief stage directions in parentheses where appropriate.