Orivel Orivel
Open menu

Business Writing

Compare emails, proposals, memos, and other practical business writing outputs.

In this genre, the main abilities being tested are Appropriateness, Clarity, Structure.

Unlike creative writing, this genre rewards professional fit, structure, and actionability over originality or dramatic voice.

A high score here does not guarantee deep domain strategy or strong negotiation skill.

Strong models here are useful for

emails, memos, proposals, updates, and stakeholder-facing communication.

This genre alone cannot tell you

whether the model is best for open-ended ideation, implementation, or debate.

Data analysis

Business writing: GPT-5 mini leads on both quality and wins

34 scored answers Business Writing Updated 2026/6/7
1
GPT-5 mini

OpenAI

91
Avg. score
100%
Win Rate
4× 1st place 4 samples
2
Claude Opus 4.8

Anthropic

87
Avg. score
100%
Win Rate
1× 1st place 1 samples
3
Claude Sonnet 4.6

Anthropic

85
Avg. score
67%
Win Rate
4× 1st place 6 samples

Average score by model

1 GPT-5 mini
9.08
2 Claude Opus 4.8
8.68
3 Claude Sonnet 4.6
8.45
4 Claude Haiku 4.5
8.13
5 Gemini 2.5 Pro
7.55
6 GPT-5.4
7.84
7 Gemini 2.5 Flash
8.29
8 GPT-5.5
7.73
9 Gemini 2.5 Flash-Lite
7.58

What we weighted

Appropriateness 25% Clarity 20% Structure 20% Actionability 20% Tone 15%

Across 34 scored answers GPT-5 mini is the standout: it ranks 1 with the highest average (9.08) and the best evidence (4 samples, 4 first places, a 100% win rate). That combination of top quality and a flawless record over multiple samples makes it the clearest recommendation in this genre. Claude Opus 4.8 (8.68) ranks 2 on a single sample.

Anthropic holds the upper-middle: Claude Sonnet 4.6 (8.45, 67% over 6 samples) is the best-evidenced of the rest, with Claude Haiku 4.5 (8.13, 50%) close behind. Notably GPT-5.4, strong in other genres, ranks only 6 here (7.84, 20% win), and Gemini 2.5 Flash posts a respectable 8.29 average yet wins none of its matchups, a sharp average-versus-rank gap.

This genre weights Appropriateness highest at 25, with Clarity, Structure and Actionability each at 20, so it rewards documents that fit the business context and give usable next steps. The lighter Gemini tiers (Pro 7.55, Flash-Lite 7.58) trail on actionability and tone rather than basic writing quality.

Samples run 1 to 6 per model, so the fine ordering is provisional and a few prompts can move the middle. The 1.5-point spread is real, but these are condition-dependent measurements of business prompts, not a universal ranking.

Bottom line

For business writing, GPT-5 mini is the clear pick, leading on both average (9.08) and win rate (100% over 4 samples). Claude Sonnet 4.6 is the best-evidenced alternative. GPT-5.4 underperforms its usual level in this genre.

This analysis is derived from Orivel's measured benchmark scores for this genre and is updated periodically. Scores are condition-dependent measurements, not absolute truth.

Top Models in This Genre

This ranking is ordered by average score within this genre only.

Latest Updated: May 29, 2026 09:37

#1
GPT-5 mini OpenAI

Win Rate

100%

Average Score

91
#2
Claude Opus 4.8 Anthropic

Win Rate

100%

Average Score

87
#3
Claude Sonnet 4.6 Anthropic

Win Rate

67%

Average Score

85
#4
Claude Haiku 4.5 Anthropic

Win Rate

50%

Average Score

81
#5
Gemini 2.5 Pro Google

Win Rate

25%

Average Score

75
#6
GPT-5.4 OpenAI

Win Rate

20%

Average Score

78
#7
Gemini 2.5 Flash Google

Win Rate

0%

Average Score

83
#8
GPT-5.5 OpenAI

Win Rate

0%

Average Score

77
#9
Gemini 2.5 Flash-Lite Google

Win Rate

0%

Average Score

76

What Is Evaluated in Business Writing

Scoring criteria and weight used for this genre ranking.

Appropriateness

25.0%

This criterion is included to check Appropriateness in the answer. It carries heavier weight because this part strongly shapes the overall result in this genre.

Clarity

20.0%

This criterion is included to check Clarity in the answer. It has meaningful weight because it affects quality in a visible way, even if it is not the only thing that matters.

Structure

20.0%

This criterion is included to check Structure in the answer. It has meaningful weight because it affects quality in a visible way, even if it is not the only thing that matters.

Actionability

20.0%

This criterion is included to check Actionability in the answer. It has meaningful weight because it affects quality in a visible way, even if it is not the only thing that matters.

Tone

15.0%

This criterion is included to check Tone in the answer. It is weighted more lightly because it supports the main goal rather than defining the genre by itself.

Recent tasks

Business Writing

Anthropic Claude Opus 4.8 VS Google Gemini 2.5 Flash-Lite

Customer Email About a Delayed Product Rollout

Write a customer-facing email from the Head of Product at a B2B SaaS company announcing a delay to a planned feature rollout. The audience is operations managers at mid-sized client companies. The purpose is to be transparent, preserve trust, and explain what customers should expect next. The email should be 250 to 350 words, include a clear subject line, and use a professional but human tone. It must mention the revised launch date, the reason for the delay without sounding defensive, the interim support available, and one specific action customers can take if they have questions. Avoid jargon, over-apologizing, blaming any team or vendor, and making promises beyond the facts provided.

133
May 29, 2026 09:37

Business Writing

OpenAI GPT-5.5 VS Anthropic Claude Opus 4.7

Drafting an Internal Announcement for a New Mentorship Program

You are the Head of People Operations at a mid-sized tech company. Your company is launching a new internal mentorship program to foster employee growth and collaboration. Write an internal announcement to be sent to all employees. The goal is to explain the program, generate excitement, and encourage both mentors and mentees to sign up. Your announcement must: - Clearly state the purpose and benefits of the program. - Explain who is eligible to be a mentor and a mentee. - Detail the expected time commitment. - Provide a clear call to action with instructions on how to sign up and the deadline. - Maintain a professional, enthusiastic, and inclusive tone. - Be no more than 300 words.

237
May 14, 2026 09:37

Business Writing

Google Gemini 2.5 Flash VS Anthropic Claude Opus 4.6

Draft an internal memo proposing a pilot for a four-day workweek

You are an operations manager at a 180-person software company. Employee survey results show rising burnout, but leadership is cautious about any change that might reduce client responsiveness. Write an internal memo to the executive team proposing a 3-month pilot of a four-day workweek for one department. Your memo should: - recommend whether to run the pilot and for which department - explain the business rationale, including likely benefits and risks - outline how the pilot would work in practice without harming customer support - propose 4 measurable success criteria - address likely objections from finance, HR, and sales - end with a clear next-step request Constraints: - Audience: CEO, CFO, Head of HR, VP of Sales - Tone: professional, practical, and non-ideological - Length: 350 to 500 words - Do not use bullet points; write in memo style with headings if you wish - Assume no additional hiring during the pilot - The company serves clients across U.S. time zones

342
Mar 29, 2026 11:55

Business Writing

OpenAI GPT-5 mini VS Anthropic Claude Sonnet 4.6

Internal Memo Explaining a New Sales Reporting Process

You are the Head of Sales Operations at a mid-sized tech company. To improve data accuracy and team collaboration, you are implementing a new process requiring the sales team to log all significant customer interactions (calls, meetings, demos) in the CRM system within 24 hours. Previously, they only logged closed deals. Write a clear and persuasive internal memo to the entire sales team (about 50 people) announcing this change. Your memo must: - Clearly state the new process requirement. - Explain the key benefits of this change for both individual sales representatives and the company. - Acknowledge that this is an adjustment and may require more administrative time initially. - Outline the specific steps they need to follow. - State when the new process will take effect and mention that training sessions will be scheduled. - Maintain a positive and supportive tone to encourage buy-in.

338
Mar 29, 2026 11:39

Business Writing

Google Gemini 2.5 Flash-Lite VS Anthropic Claude Haiku 4.5

Internal Memo Proposing a Pilot for Four-Day Workweeks

You are a team lead at a 120-person software company. Employee survey results show rising burnout and difficulty retaining experienced staff. The executive team is open to experimentation but worries that any schedule change could hurt customer support coverage, product delivery timelines, and fairness across departments. Write an internal memo to the executive team proposing a 3-month pilot of a four-day workweek. Your memo should: - recommend whether the pilot should use a 32-hour week or a compressed 40-hour schedule, and justify the choice - explain expected benefits and likely risks - outline how the pilot would work across engineering, sales, customer support, and operations - include 3 measurable success metrics - address fairness concerns for teams whose work requires live coverage - end with a clear recommendation and next steps Constraints: - Write in a professional, persuasive tone for senior leaders - Keep it between 400 and 600 words - Do not use tables or bullet points; write as a memo with headings and paragraphs only - Do not cite external studies or statistics; reason from the scenario only

316
Mar 28, 2026 09:36

Business Writing

Google Gemini 2.5 Flash-Lite VS Anthropic Claude Sonnet 4.6

Client Email Explaining a Project Delay and Recovery Plan

You are a project manager at a software consultancy. Write an email to a client’s operations director about a two-week delay in launching a warehouse inventory dashboard. The delay happened because a third-party data feed failed quality checks during final testing. Your goals are to maintain trust, explain the situation without sounding defensive, and present a credible recovery plan. Include these points: - A clear subject line - A brief acknowledgment of the delay and its business impact - A concise explanation of the cause in non-technical language - What your team has already done to contain the issue - A revised timeline with two concrete milestones before launch - One request for the client that would help reduce further risk - A professional, accountable closing that avoids overpromising Constraints: - 220 to 320 words - Professional and reassuring tone - Do not blame the client or the third-party vendor - Do not use jargon such as API, ETL, or schema mismatch - Do not offer discounts, refunds, or legal language

303
Mar 23, 2026 08:09

Related Links

X f L