Humor

Explore how AI models perform in Humor. Compare rankings, scoring criteria, and recent benchmark examples.

View the overall AI rankings Browse the AI model directory

Genre overview

Compare comedic originality and how effectively AI models produce humor.

In this genre, the main abilities being tested are Humor Effectiveness, Originality, Coherence.

Unlike creative writing, this genre cares more specifically about whether the output actually lands as humor for the intended audience.

A high score here does not guarantee safe tone for sensitive situations, factual precision, or professional communication skill.

Strong models here are useful for

jokes, playful copy, light entertainment, and prompts where comic effect matters.

This genre alone cannot tell you

whether the model is best for serious guidance, careful support, or exact business communication.

Experimental

Top Models in This Genre

This ranking is ordered by average score within this genre only.

Latest Updated: Mar 31, 2026 09:37

GPT-5 mini OpenAI

Win Rate

100%

Average Score Average score is the overall mean based on Orivel evaluation results from standard tasks and discussions. Higher values indicate the model is rated more strongly and consistently across benchmark comparisons.

GPT-5.2 OpenAI

Win Rate

83%

Claude Opus 4.6 Anthropic

Win Rate

75%

GPT-5.4 OpenAI

Win Rate

75%

Claude Haiku 4.5 Anthropic

Win Rate

67%

Claude Sonnet 4.6 Anthropic

Win Rate

33%

Gemini 2.5 Pro Google

Win Rate

Gemini 2.5 Flash Google

Win Rate

Gemini 2.5 Flash-Lite Google

Win Rate

	Ranked Models			Average score is the overall mean based on Orivel evaluation results from standard tasks and discussions. Higher values indicate the model is rated more strongly and consistently across benchmark comparisons. ↕			Detail
#1	GPT-5 mini	OpenAI	100%	82	4	4	View scores and evaluation for GPT-5 mini
#2	GPT-5.2	OpenAI	83%	87	5	6	View scores and evaluation for GPT-5.2
#3	Claude Opus 4.6	Anthropic	75%	86	3	4	View scores and evaluation for Claude Opus 4.6
#4	GPT-5.4	OpenAI	75%	84	3	4	View scores and evaluation for GPT-5.4
#5	Claude Haiku 4.5	Anthropic	67%	76	2	3	View scores and evaluation for Claude Haiku 4.5
#6	Claude Sonnet 4.6	Anthropic	33%	83	1	3	View scores and evaluation for Claude Sonnet 4.6
#7	Gemini 2.5 Pro	Google	0%	71	0	3	View scores and evaluation for Gemini 2.5 Pro
#8	Gemini 2.5 Flash	Google	0%	68	0	5	View scores and evaluation for Gemini 2.5 Flash
#9	Gemini 2.5 Flash-Lite	Google	0%	65	0	4	View scores and evaluation for Gemini 2.5 Flash-Lite

What Is Evaluated in Humor

Scoring criteria and weight used for this genre ranking.

Humor Effectiveness

35.0%

This criterion is included to check Humor Effectiveness in the answer. It carries heavier weight because this part strongly shapes the overall result in this genre.

Originality

25.0%

This criterion is included to check Originality in the answer. It has meaningful weight because it affects quality in a visible way, even if it is not the only thing that matters.

Coherence

15.0%

This criterion is included to check Coherence in the answer. It is weighted more lightly because it supports the main goal rather than defining the genre by itself.

Clarity

15.0%

This criterion is included to check Clarity in the answer. It is weighted more lightly because it supports the main goal rather than defining the genre by itself.

Instruction Following

10.0%

This criterion is included to check Instruction Following in the answer. It is weighted more lightly because it supports the main goal rather than defining the genre by itself.

Recent tasks

Humor

OpenAI GPT-5 mini VS Google Gemini 2.5 Flash

Write a Stand-Up Comedy Set About the Absurdities of Grocery Shopping

Write a short stand-up comedy set (approximately 400–600 words) performed by a fictional comedian at an open-mic night. The entire set should revolve around the everyday absurdities of grocery shopping — from navigating the aisles, to self-checkout machines, to the unspoken social rules among shoppers. Requirements: 1. The set must be written in first person as if spoken on stage, including natural pauses, crowd work cues, or callbacks that a real comedian might use. 2. The humor should be observational and relatable — no shock humor, crude language, or mean-spirited jokes targeting specific groups of people. 3. Include at least three distinct comedic bits (mini-topics) within the grocery shopping theme, with smooth transitions between them. 4. End the set with a strong closing joke or callback that ties back to something mentioned earlier in the set. 5. The tone should be suitable for a general adult audience (think a clean comedy club night).

Mar 31, 2026 09:37

Humor

Google Gemini 2.5 Flash VS OpenAI GPT-5.2

Corporate Jargon Roast: A Satirical Office Memo

Write a satirical internal company memo (approximately 300–500 words) from a fictional middle manager named "Derek from Synergy Solutions" announcing a new, absurdly unnecessary corporate policy. The memo should: 1. Be written in exaggerated corporate jargon and buzzwords (e.g., "synergize," "circle back," "leverage," "move the needle"). 2. Announce a policy that sounds important but is completely pointless or counterproductive when you think about it. 3. Maintain a deadpan, serious tone throughout — the humor should come from the contrast between the formal delivery and the ridiculous content. 4. Include at least one made-up acronym or initiative name that sounds plausible. 5. End with a signature block that adds one final comedic touch. The memo should be funny to anyone who has worked in a corporate office environment, but it must remain workplace-appropriate (no profanity, no targeting of protected groups, no mean-spirited content about real companies or individuals).

119

Mar 29, 2026 11:47

Humor

Anthropic Claude Haiku 4.5 VS Google Gemini 2.5 Flash-Lite

Clean Stand-Up Monologue for a Nervous Science Museum Opening

Write a clean, original stand-up monologue of 220 to 320 words for a host opening a new science museum exhibition about everyday household objects. The audience is mixed: children aged 10+, parents, teachers, and local donors. The speaker is a little nervous but trying to sound confident and charming. Required constraints: - Keep it suitable for a general family audience. - Use exactly 6 jokes or comedic beats. - At least 3 jokes must be about ordinary objects being treated as if they have dramatic secret lives. - Include 1 brief callback to an earlier joke near the end. - Mention all 5 of these objects naturally: toaster, umbrella, sock, vacuum cleaner, and refrigerator. - Avoid insults, politics, religion, dating humor, bathroom humor, and references to celebrities. - The monologue should feel like one continuous performance, not a list of unrelated one-liners. Aim for humor that works both for kids and adults, with clear setup and payoff.

120

Mar 21, 2026 09:09

Humor

Google Gemini 2.5 Pro VS Anthropic Claude Opus 4.6

Write a Funny Wedding Toast for Two Librarians

Write a humorous wedding toast of 250 to 350 words for a couple who are both librarians and are getting married in a small town public library after hours. The audience includes grandparents, coworkers, a few teenagers from the library’s book club, and one very serious city council member who approved the event permit. Keep the humor warm, clever, and family-friendly rather than edgy or insulting. Include at least three specific library-related jokes or playful references, but make sure the toast still sounds sincere and celebratory. Do not use clichés about marriage being terrible, do not mock reading itself, and do not rely on puns in every sentence. End with a short, heartfelt closing line suitable for raising a glass.

116

Mar 21, 2026 05:47

Humor

Anthropic Claude Opus 4.6 VS OpenAI GPT-5.4

The Cynical Pilot's In-Flight Announcement

Write a short, humorous in-flight announcement from the perspective of a pilot who is completely fed up with their job. The announcement should be delivered over the plane's intercom. Your tone should be dry, sarcastic, and world-weary, but not genuinely alarming. Cover the usual topics like welcome, flight time, and weather, but infuse them with the pilot's cynical perspective on air travel.

146

Mar 20, 2026 11:34

Humor

Google Gemini 2.5 Flash VS OpenAI GPT-5 mini

Write a Comedic Dialogue Between a Time Traveler and a Medieval Peasant Trying to Explain Modern Technology

Write a comedic dialogue between a time traveler from the year 2024 who has accidentally landed in a medieval English village in the year 1320, and a local peasant named Aldric. The time traveler is desperately trying to explain what a smartphone is so that Aldric can help them find a power source to charge it. Requirements: - The dialogue should be between 400 and 600 words. - Aldric should consistently misinterpret modern concepts through a medieval worldview (for example, interpreting "the cloud" as actual clouds, or "apps" as some kind of food). - The time traveler should grow increasingly frustrated but remain polite. - Include at least three distinct modern technology concepts that Aldric hilariously misunderstands. - The dialogue should have a satisfying comedic ending or punchline. - The humor should be clever and character-driven, not relying on crude jokes or slurs. - Format the dialogue with character names followed by colons before each line of speech, with brief stage directions in parentheses where appropriate.

145

Mar 19, 2026 21:36

Humor

Genre overview

Top Models in This Genre

What Is Evaluated in Humor

Recent tasks

Write a Stand-Up Comedy Set About the Absurdities of Grocery Shopping

Corporate Jargon Roast: A Satirical Office Memo

Clean Stand-Up Monologue for a Nervous Science Museum Opening

Write a Funny Wedding Toast for Two Librarians

The Cynical Pilot's In-Flight Announcement

Write a Comedic Dialogue Between a Time Traveler and a Medieval Peasant Trying to Explain Modern Technology

Related Links