Business Writing
Compare emails, proposals, memos, and other practical business writing outputs.
In this genre, the main abilities being tested are Appropriateness, Clarity, Structure.
Unlike creative writing, this genre rewards professional fit, structure, and actionability over originality or dramatic voice.
A high score here does not guarantee deep domain strategy or strong negotiation skill.
Strong models here are useful for
emails, memos, proposals, updates, and stakeholder-facing communication.
This genre alone cannot tell you
whether the model is best for open-ended ideation, implementation, or debate.
Business writing: GPT-5 mini leads on both quality and wins
OpenAI
Anthropic
Anthropic
Average score by model
What we weighted
Across 34 scored answers GPT-5 mini is the standout: it ranks 1 with the highest average (9.08) and the best evidence (4 samples, 4 first places, a 100% win rate). That combination of top quality and a flawless record over multiple samples makes it the clearest recommendation in this genre. Claude Opus 4.8 (8.68) ranks 2 on a single sample.
Anthropic holds the upper-middle: Claude Sonnet 4.6 (8.45, 67% over 6 samples) is the best-evidenced of the rest, with Claude Haiku 4.5 (8.13, 50%) close behind. Notably GPT-5.4, strong in other genres, ranks only 6 here (7.84, 20% win), and Gemini 2.5 Flash posts a respectable 8.29 average yet wins none of its matchups, a sharp average-versus-rank gap.
This genre weights Appropriateness highest at 25, with Clarity, Structure and Actionability each at 20, so it rewards documents that fit the business context and give usable next steps. The lighter Gemini tiers (Pro 7.55, Flash-Lite 7.58) trail on actionability and tone rather than basic writing quality.
Samples run 1 to 6 per model, so the fine ordering is provisional and a few prompts can move the middle. The 1.5-point spread is real, but these are condition-dependent measurements of business prompts, not a universal ranking.
Bottom line
For business writing, GPT-5 mini is the clear pick, leading on both average (9.08) and win rate (100% over 4 samples). Claude Sonnet 4.6 is the best-evidenced alternative. GPT-5.4 underperforms its usual level in this genre.
This analysis is derived from Orivel's measured benchmark scores for this genre and is updated periodically. Scores are condition-dependent measurements, not absolute truth.
Top Models in This Genre
This ranking is ordered by average score within this genre only.
Latest Updated: May 29, 2026 09:37
Win Rate
Average Score
Win Rate
Average Score
Win Rate
Average Score
Win Rate
Average Score
Win Rate
Average Score
Win Rate
Average Score
Win Rate
Average Score
Win Rate
Average Score
Win Rate
Average Score
| Ranked Models |
|
|
Detail | ||||
|---|---|---|---|---|---|---|---|
| #1 | GPT-5 mini | OpenAI |
100%
|
91
|
4 | 4 | View scores and evaluation for GPT-5 mini |
| #2 | Claude Opus 4.8 NEW | Anthropic |
100%
|
87
|
1 | 1 | View scores and evaluation for Claude Opus 4.8 |
| #3 | Claude Sonnet 4.6 | Anthropic |
67%
|
85
|
4 | 6 | View scores and evaluation for Claude Sonnet 4.6 |
| #4 | Claude Haiku 4.5 | Anthropic |
50%
|
81
|
2 | 4 | View scores and evaluation for Claude Haiku 4.5 |
| #5 | Gemini 2.5 Pro |
25%
|
75
|
1 | 4 | View scores and evaluation for Gemini 2.5 Pro | |
| #6 | GPT-5.4 | OpenAI |
20%
|
78
|
1 | 5 | View scores and evaluation for GPT-5.4 |
| #7 | Gemini 2.5 Flash |
0%
|
83
|
0 | 4 | View scores and evaluation for Gemini 2.5 Flash | |
| #8 | GPT-5.5 | OpenAI |
0%
|
77
|
0 | 1 | View scores and evaluation for GPT-5.5 |
| #9 | Gemini 2.5 Flash-Lite |
0%
|
76
|
0 | 5 | View scores and evaluation for Gemini 2.5 Flash-Lite |
What Is Evaluated in Business Writing
Scoring criteria and weight used for this genre ranking.
Appropriateness
25.0%
This criterion is included to check Appropriateness in the answer. It carries heavier weight because this part strongly shapes the overall result in this genre.
Clarity
20.0%
This criterion is included to check Clarity in the answer. It has meaningful weight because it affects quality in a visible way, even if it is not the only thing that matters.
Structure
20.0%
This criterion is included to check Structure in the answer. It has meaningful weight because it affects quality in a visible way, even if it is not the only thing that matters.
Actionability
20.0%
This criterion is included to check Actionability in the answer. It has meaningful weight because it affects quality in a visible way, even if it is not the only thing that matters.
Tone
15.0%
This criterion is included to check Tone in the answer. It is weighted more lightly because it supports the main goal rather than defining the genre by itself.
Recent tasks
Business Writing
Customer Email About a Delayed Product Rollout
Write a customer-facing email from the Head of Product at a B2B SaaS company announcing a delay to a planned feature rollout. The audience is operations managers at mid-sized client companies. The purpose is to be transparent, preserve trust, and explain what customers should expect next. The email should be 250 to 350 words, include a clear subject line, and use a professional but human tone. It must mention the revised launch date, the reason for the delay without sounding defensive, the interim support available, and one specific action customers can take if they have questions. Avoid jargon, over-apologizing, blaming any team or vendor, and making promises beyond the facts provided.
Business Writing
Drafting an Internal Announcement for a New Mentorship Program
You are the Head of People Operations at a mid-sized tech company. Your company is launching a new internal mentorship program to foster employee growth and collaboration. Write an internal announcement to be sent to all employees. The goal is to explain the program, generate excitement, and encourage both mentors and mentees to sign up. Your announcement must: - Clearly state the purpose and benefits of the program. - Explain who is eligible to be a mentor and a mentee. - Detail the expected time commitment. - Provide a clear call to action with instructions on how to sign up and the deadline. - Maintain a professional, enthusiastic, and inclusive tone. - Be no more than 300 words.
Business Writing
Draft an internal memo proposing a pilot for a four-day workweek
You are an operations manager at a 180-person software company. Employee survey results show rising burnout, but leadership is cautious about any change that might reduce client responsiveness. Write an internal memo to the executive team proposing a 3-month pilot of a four-day workweek for one department. Your memo should: - recommend whether to run the pilot and for which department - explain the business rationale, including likely benefits and risks - outline how the pilot would work in practice without harming customer support - propose 4 measurable success criteria - address likely objections from finance, HR, and sales - end with a clear next-step request Constraints: - Audience: CEO, CFO, Head of HR, VP of Sales - Tone: professional, practical, and non-ideological - Length: 350 to 500 words - Do not use bullet points; write in memo style with headings if you wish - Assume no additional hiring during the pilot - The company serves clients across U.S. time zones
Business Writing
Internal Memo Explaining a New Sales Reporting Process
You are the Head of Sales Operations at a mid-sized tech company. To improve data accuracy and team collaboration, you are implementing a new process requiring the sales team to log all significant customer interactions (calls, meetings, demos) in the CRM system within 24 hours. Previously, they only logged closed deals. Write a clear and persuasive internal memo to the entire sales team (about 50 people) announcing this change. Your memo must: - Clearly state the new process requirement. - Explain the key benefits of this change for both individual sales representatives and the company. - Acknowledge that this is an adjustment and may require more administrative time initially. - Outline the specific steps they need to follow. - State when the new process will take effect and mention that training sessions will be scheduled. - Maintain a positive and supportive tone to encourage buy-in.
Business Writing
Internal Memo Proposing a Pilot for Four-Day Workweeks
You are a team lead at a 120-person software company. Employee survey results show rising burnout and difficulty retaining experienced staff. The executive team is open to experimentation but worries that any schedule change could hurt customer support coverage, product delivery timelines, and fairness across departments. Write an internal memo to the executive team proposing a 3-month pilot of a four-day workweek. Your memo should: - recommend whether the pilot should use a 32-hour week or a compressed 40-hour schedule, and justify the choice - explain expected benefits and likely risks - outline how the pilot would work across engineering, sales, customer support, and operations - include 3 measurable success metrics - address fairness concerns for teams whose work requires live coverage - end with a clear recommendation and next steps Constraints: - Write in a professional, persuasive tone for senior leaders - Keep it between 400 and 600 words - Do not use tables or bullet points; write as a memo with headings and paragraphs only - Do not cite external studies or statistics; reason from the scenario only
Business Writing
Client Email Explaining a Project Delay and Recovery Plan
You are a project manager at a software consultancy. Write an email to a client’s operations director about a two-week delay in launching a warehouse inventory dashboard. The delay happened because a third-party data feed failed quality checks during final testing. Your goals are to maintain trust, explain the situation without sounding defensive, and present a credible recovery plan. Include these points: - A clear subject line - A brief acknowledgment of the delay and its business impact - A concise explanation of the cause in non-technical language - What your team has already done to contain the issue - A revised timeline with two concrete milestones before launch - One request for the client that would help reduce further risk - A professional, accountable closing that avoids overpromising Constraints: - 220 to 320 words - Professional and reassuring tone - Do not blame the client or the third-party vendor - Do not use jargon such as API, ETL, or schema mismatch - Do not offer discounts, refunds, or legal language