Orivel Orivel
Open menu

Persuasion

Compare how effectively AI models persuade a specific audience.

In this genre, the main abilities being tested are Persuasiveness, Logic, Audience Fit.

Unlike discussion, this genre is less about rebutting an opponent and more about moving a specific audience toward a decision or stance.

A high score here does not guarantee balanced analysis, factual completeness, or suitability for neutral explanatory tasks.

Strong models here are useful for

pitches, advocacy, fundraising copy, sales messaging, and audience-targeted arguments.

This genre alone cannot tell you

whether the model is best for neutral comparison, exact factual work, or careful support conversations.

Data analysis

Persuasion: Claude Sonnet 4.6 leads, echoing its debate strength

34 scored answers Persuasion Updated 2026/6/7
1
Claude Sonnet 4.6

Anthropic

88
Avg. score
100%
Win Rate
5× 1st place 5 samples
2
Claude Opus 4.8

Anthropic

84
Avg. score
100%
Win Rate
1× 1st place 1 samples
3
GPT-5 mini

OpenAI

84
Avg. score
75%
Win Rate
3× 1st place 4 samples

Average score by model

1 Claude Sonnet 4.6
8.80
2 Claude Opus 4.8
8.39
3 GPT-5 mini
8.43
4 GPT-5.4
7.98
5 Claude Haiku 4.5
7.95
6 Gemini 2.5 Pro
8.02
7 GPT-5.5
7.96
8 Gemini 2.5 Flash-Lite
7.62
9 Gemini 2.5 Flash
7.58

What we weighted

Persuasiveness 35% Logic 20% Audience Fit 20% Clarity 15% Ethics & Safety 10%

Across 34 scored answers Claude Sonnet 4.6 is the standout: it ranks 1 with the highest average (8.80) and the best evidence (5 samples, 5 first places, a 100% win rate). This mirrors its strength in the discussion genre and makes Anthropic the clear leader where persuasion is the explicit goal. Claude Opus 4.8 (8.39) ranks 2 on a single sample.

The middle is tight. GPT-5 mini (8.43, 75% over 4) is the best-evidenced challenger, while GPT-5.4 (7.98, 50%) and Claude Haiku 4.5 (7.95, 40%) sit close behind. Gemini 2.5 Pro actually averages 8.02, above several higher-ranked models, but wins only 20% of its matchups, so head-to-head record again decides the order.

This genre weights Persuasiveness highest at 35, with Logic and Audience Fit at 20 each, so it rewards arguments that actually move the reader. The lighter Gemini tiers (Flash-Lite 7.62, Flash 7.58) and the one-sample GPT-5.5 (7.96, 0% win) struggle to win exchanges despite reasonable averages, the same pattern seen in debate.

Samples run 1 to 5 per model and the 1.22-point spread is narrow, so the fine ordering is provisional and small-sample swings are likely. These are condition-dependent measurements of persuasion prompts, not a universal verdict.

Bottom line

For persuasive writing, Claude Sonnet 4.6 is the most defensible pick (highest average and a 100% win rate over 5 samples), consistent with its lead in debate. GPT-5 mini is the best-evidenced alternative.

This analysis is derived from Orivel's measured benchmark scores for this genre and is updated periodically. Scores are condition-dependent measurements, not absolute truth.

Top Models in This Genre

This ranking is ordered by average score within this genre only.

Latest Updated: May 28, 2026 23:35

#1
Claude Sonnet 4.6 Anthropic

Win Rate

100%

Average Score

88
#2
Claude Opus 4.8 Anthropic

Win Rate

100%

Average Score

84
#3
GPT-5 mini OpenAI

Win Rate

75%

Average Score

84
#4
GPT-5.4 OpenAI

Win Rate

50%

Average Score

80
#5
Claude Haiku 4.5 Anthropic

Win Rate

40%

Average Score

79
#6
Gemini 2.5 Pro Google

Win Rate

20%

Average Score

80
#7
GPT-5.5 OpenAI

Win Rate

0%

Average Score

80
#8
Gemini 2.5 Flash-Lite Google

Win Rate

0%

Average Score

76
#9
Gemini 2.5 Flash Google

Win Rate

0%

Average Score

76

What Is Evaluated in Persuasion

Scoring criteria and weight used for this genre ranking.

Persuasiveness

35.0%

This criterion is included to check Persuasiveness in the answer. It carries heavier weight because this part strongly shapes the overall result in this genre.

Logic

20.0%

This criterion is included to check Logic in the answer. It has meaningful weight because it affects quality in a visible way, even if it is not the only thing that matters.

Audience Fit

20.0%

This criterion is included to check Audience Fit in the answer. It has meaningful weight because it affects quality in a visible way, even if it is not the only thing that matters.

Clarity

15.0%

This criterion is included to check Clarity in the answer. It is weighted more lightly because it supports the main goal rather than defining the genre by itself.

Ethics & Safety

10.0%

This criterion is included to check Ethics & Safety in the answer. It is weighted more lightly because it supports the main goal rather than defining the genre by itself.

Recent tasks

Persuasion

Anthropic Claude Opus 4.8 VS OpenAI GPT-5 mini

Persuade a Skeptical City Council to Fund a New Library

You are a community advocate preparing to speak at a city council meeting. Your goal is to persuade the council to approve funding for a new public library branch in the underserved Northwood neighborhood. The council is known to be skeptical due to budget constraints and a belief that libraries are becoming obsolete in the digital age. Draft a persuasive speech outline in a bulleted list format that you will use for your 3-minute presentation. Your outline must anticipate and counter their main objections.

147
May 28, 2026 23:35

Persuasion

OpenAI GPT-5.5 VS Anthropic Claude Sonnet 4.6

Persuasive Letter for a Community Garden

Write a persuasive letter to your local city council. Your goal is to convince them to approve a proposal to convert the vacant, overgrown lot at the corner of Elm Street and Oak Avenue into a community garden. In your letter, you must: 1. Clearly state your proposal and its purpose. 2. Highlight at least three distinct benefits the garden would bring to the community (e.g., improving neighborhood aesthetics, fostering community engagement, providing access to fresh food). 3. Acknowledge and proactively address a potential concern the council might have, such as funding, water usage, or long-term maintenance, by suggesting a viable solution. 4. Maintain a respectful, professional, and optimistic tone throughout.

160
May 23, 2026 09:38

Persuasion

Anthropic Claude Opus 4.7 VS Google Gemini 2.5 Flash

Persuade a Skeptical City Council to Pilot Car-Free School Streets

Write a persuasive speech to a city council that is deciding whether to approve a six-month pilot program creating car-free zones on the streets directly outside public elementary schools during student drop-off and pick-up times. Your goal is to persuade skeptical council members to vote yes. Audience details: - The council is politically mixed and cautious about changes that may inconvenience drivers. - Several members worry about traffic spillover, costs, and backlash from local businesses and parents. - They care about child safety, practical implementation, fairness, and whether the pilot can be evaluated objectively. Requirements: - Length: 600 to 900 words. - Take a clear pro-pilot position. - Acknowledge at least 2 serious objections and respond to them fairly. - Use a persuasive but credible tone; do not insult opponents or rely on partisan talking points. - Include at least 3 concrete implementation details for the pilot. - Include at least 3 measurable outcomes the city could track during the six months. - Do not invent statistics, named studies, or quotes from real people. You may refer to general patterns or plausible reasoning, but make clear when something is an inference rather than a verified fact. - End with a specific call to action for the council vote.

410
Apr 19, 2026 09:37

Persuasion

Anthropic Claude Sonnet 4.6 VS OpenAI GPT-5.2

Persuasive Email for a Four-Day Work Week Pilot

You are the Head of People Operations at 'Innovate Solutions', a mid-sized tech company. Your goal is to persuade the CEO to approve a six-month pilot program for a four-day work week. Write a professional email to the CEO, Ms. Chen. In your email, you must: 1. Clearly propose the six-month pilot program. 2. Build a compelling case by highlighting potential benefits like increased productivity, improved employee well-being and retention, and attracting top talent. 3. Proactively address and counter potential objections, including concerns about maintaining client service levels, meeting project deadlines, and overall output. 4. Suggest a framework for how the pilot program's success would be measured (e.g., key performance indicators). 5. Maintain a respectful, data-driven, and persuasive tone throughout.

363
Mar 29, 2026 09:38

Persuasion

OpenAI GPT-5 mini VS Google Gemini 2.5 Flash-Lite

Persuade a School Board to Adopt a Four-Day School Week

You are a parent and community advocate presenting a written statement to your local school board. Your goal is to persuade the board to adopt a four-day school week (with longer daily hours) for the upcoming academic year on a trial basis. Your statement must: 1. Be addressed directly to the school board members. 2. Acknowledge at least two strong counterarguments (such as childcare challenges for working parents or concerns about reduced instructional time) and respond to them convincingly. 3. Use at least three distinct types of supporting evidence or reasoning (for example, data from districts that have implemented four-day weeks, cost-saving arguments, teacher retention benefits, student well-being research, or environmental impact). 4. Maintain a respectful, professional tone appropriate for a public meeting. 5. Be between 500 and 800 words. Write the full persuasive statement.

342
Mar 29, 2026 03:32

Persuasion

Anthropic Claude Opus 4.6 VS Google Gemini 2.5 Flash

Persuade a School Board to Start a Phone-Free School Day Pilot

Write a persuasive speech to a public school board asking it to approve a one-semester pilot program in which middle school students keep smartphones stored away during the school day, with exceptions for medical needs and emergency communication through the front office. Your goal is to persuade a mixed audience of board members, parents, teachers, and students who have different concerns. The speech must support the pilot without demonizing technology or families. Include at least three concrete benefits, address at least three likely objections, and propose two practical safeguards to make the policy fair and realistic. Keep the tone respectful, civic-minded, and suitable for a 4- to 5-minute speech.

315
Mar 29, 2026 03:13

Related Links

X f L