Orivel Orivel
Open menu

Draft an internal memo proposing a pilot for a four-day workweek

Compare model answers for this Business Writing benchmark and review scores, judging comments, and related examples.

Login or register to use likes and favorites. Register

X f L

Contents

Task Overview

Benchmark Genres

Business Writing

Task Creator Model

Answering Models

Judge Models

Task Prompt

You are an operations manager at a 180-person software company. Employee survey results show rising burnout, but leadership is cautious about any change that might reduce client responsiveness. Write an internal memo to the executive team proposing a 3-month pilot of a four-day workweek for one department. Your memo should: - recommend whether to run the pilot and for which department - explain the business rationale, including likely benefits and risks - outline how the pilot would work in practice without harmin...

Show more

You are an operations manager at a 180-person software company. Employee survey results show rising burnout, but leadership is cautious about any change that might reduce client responsiveness. Write an internal memo to the executive team proposing a 3-month pilot of a four-day workweek for one department. Your memo should: - recommend whether to run the pilot and for which department - explain the business rationale, including likely benefits and risks - outline how the pilot would work in practice without harming customer support - propose 4 measurable success criteria - address likely objections from finance, HR, and sales - end with a clear next-step request Constraints: - Audience: CEO, CFO, Head of HR, VP of Sales - Tone: professional, practical, and non-ideological - Length: 350 to 500 words - Do not use bullet points; write in memo style with headings if you wish - Assume no additional hiring during the pilot - The company serves clients across U.S. time zones

Judging Policy

A strong answer presents a realistic, well-structured memo tailored to senior leaders. It should make a clear recommendation, choose a plausible department for the pilot, and balance employee well-being with operational continuity. Good responses explain tradeoffs, describe a workable pilot design under the stated constraints, include four concrete and measurable success criteria, and address objections from finance, HR, and sales in a credible business-oriented way. The writing should be concise, professional, and persuasive without sounding ideological or vague.

Answers

To: Executive Team From: [Your Name], Operations Manager Date: October 26, 2023 Subject: Proposal for a Four-Day Workweek Pilot Program Executive Team, This memo proposes a three-month pilot program for a four-day workweek within a single department, aimed at addressing the rising employee burnout identified in recent surveys while maintaining our commitment to client responsiveness. I recommend proceeding with this pilot. **Recommendation and Business Rationale** Given the documented increase in employee burnout, a proactive approach is essential to safeguard productivity, morale, and retention. A four-day w...

Show Full Answer

To: Executive Team From: [Your Name], Operations Manager Date: October 26, 2023 Subject: Proposal for a Four-Day Workweek Pilot Program Executive Team, This memo proposes a three-month pilot program for a four-day workweek within a single department, aimed at addressing the rising employee burnout identified in recent surveys while maintaining our commitment to client responsiveness. I recommend proceeding with this pilot. **Recommendation and Business Rationale** Given the documented increase in employee burnout, a proactive approach is essential to safeguard productivity, morale, and retention. A four-day workweek, when implemented thoughtfully, has shown potential to significantly improve employee well-being, leading to increased focus, reduced absenteeism, and enhanced overall productivity. By offering a better work-life balance, we can also strengthen our position in a competitive talent market. The primary risk lies in potential perceived reductions in availability or output, which this pilot is designed to mitigate and measure. I propose the **Product Development department** for this pilot. Their project-based work structure allows for more flexible scheduling and clear output metrics, making it an ideal environment to test the model without direct, immediate impact on client-facing support operations. **Pilot Implementation and Client Support** The pilot would run for three months, with Product Development employees working four ten-hour days. To ensure continuous coverage and prevent any disruption to internal or external stakeholders, we would implement a staggered schedule. For example, approximately half the team would take Monday off, while the other half would take Friday off, ensuring that core project work and internal collaboration can continue throughout the traditional five-day week. Critical tasks and urgent issues would be managed through established hand-off protocols and designated on-call rotations within the department, ensuring no degradation of service or project timelines. Our client support teams, which are separate from Product Development, would continue their standard five-day operations, thereby insulating our direct client responsiveness from any initial adjustments within the pilot department. **Measurable Success Criteria** To evaluate the pilot's effectiveness, we will track four key metrics: 1) Employee satisfaction and burnout scores within the Product Development department (pre- and post-pilot surveys); 2) Project completion rates and quality metrics for the pilot department; 3) Absenteeism rates within the pilot department; and 4) Overall company-wide client satisfaction scores, to monitor for any indirect impact. **Addressing Executive Concerns** I anticipate questions from various departments. For **Finance**, the pilot is designed to be cost-neutral, with no additional hiring. Increased productivity and reduced turnover, which are costly, could lead to long-term savings. For **HR**, we will carefully manage fairness perceptions by clearly communicating the pilot's experimental nature and its specific focus on addressing burnout. We will also ensure compliance with all labor laws. For **Sales**, the pilot aims to foster a more engaged and productive Product Development team, which ultimately leads to higher quality products and features, directly benefiting sales efforts without compromising immediate client support. **Next Steps** I request a meeting with the Executive Team to discuss this proposal further, address any questions, and seek approval to initiate planning for this pilot program. Sincerely, [Your Name] Operations Manager

Result

#2

Winning Votes

0 / 3

Average Score

71

Total Score

60

Overall Comments

Answer A is a competent memo that covers all required elements: a clear recommendation, a rationale for choosing Product Development, a staggered-schedule pilot design, four success criteria, and objection responses. However, it relies on generic language ("has shown potential," "costly") without any concrete figures, uses bold markdown formatting that is inconsistent with clean memo style, and the next-step request is vague ("I request a meeting"). The objection responses are thin and somewhat formulaic. The memo stays within the word-count range and maintains a professional tone, but it lacks the specificity and persuasive precision expected at the executive level.

View Score Details

Appropriateness

Weight 25%
62

A covers the required elements and chooses a plausible department, but relies on generic claims without supporting data. The audience is addressed as a group rather than named individuals, and the business case lacks the quantitative grounding senior leaders expect.

Clarity

Weight 20%
60

The memo is readable but uses vague phrases like 'has shown potential' and 'costly.' The four success criteria are listed in a numbered run-on sentence rather than clearly delineated, reducing precision.

Structure

Weight 20%
60

A uses headings, which helps navigation, but the bold markdown formatting is inconsistent with clean memo style. The success criteria are embedded in a paragraph rather than presented in a way that makes them easy to reference.

Actionability

Weight 20%
55

The next-step request is vague ('I request a meeting'). There is no timeline, approval deadline, or pilot start date, making it difficult for leadership to act immediately on the proposal.

Tone

Weight 15%
65

The tone is professional and non-ideological, which meets the requirement. However, some phrasing feels slightly formulaic and cautious rather than confidently persuasive.

Total Score

84

Overall Comments

Answer A is a strong response that successfully follows the memo format and addresses all parts of the prompt. It proposes a logical pilot plan, identifies a suitable department, and addresses potential objections clearly. Its main weaknesses are the use of a numbered list, which slightly contravenes the 'no bullet points' constraint, and its success criteria, which are somewhat vague and lack specific, measurable targets.

View Score Details

Appropriateness

Weight 25%
85

The memo is highly appropriate for the audience and genre. It follows the format well and stays within the word count. It loses a point for using a numbered list for its success criteria, which is very close to the bullet points the prompt explicitly forbade.

Clarity

Weight 20%
85

The writing is very clear, professional, and easy to follow. The arguments are presented logically and concisely.

Structure

Weight 20%
90

The memo has an excellent, logical structure. It uses a standard memo format and clear headings that effectively guide the reader through the proposal.

Actionability

Weight 20%
75

The proposal is actionable, with a clear recommendation and next step. However, the success criteria are not specific enough (e.g., 'Project completion rates' instead of a target number), which makes them harder to measure and reduces the overall actionability of the evaluation plan.

Tone

Weight 15%
85

The tone is professional, practical, and non-ideological, as requested. It presents a balanced business case effectively.

Judge Models OpenAI GPT-5.4

Total Score

70

Overall Comments

Answer A is a solid baseline memo with a clear recommendation, plausible department choice, and a practical staggered-schedule concept. It addresses benefits, risks, support continuity, and stakeholder concerns in a professional way. However, it misses some precision that senior executives would expect: the success criteria are only partly measurable, the rationale remains somewhat generic, and the memo does not fully engage the U.S. time-zone coverage constraint beyond a broad assurance. It is also slightly less tailored and less operationally specific than a strong benchmark response.

View Score Details

Appropriateness

Weight 25%
68

The memo generally fits the business-writing task and executive audience, recommends a pilot, names a department, and covers required topics. However, it is somewhat generic, uses markdown-style emphasis despite the memo-style constraint, and its metrics are not all concrete enough for senior leadership decision-making.

Clarity

Weight 20%
70

The writing is easy to follow and the main points are understandable. Still, some statements are broad rather than precise, and the success criteria are listed in a way that lacks thresholds or exact definitions.

Structure

Weight 20%
72

The memo is logically organized with clear sections and a proper memo format. However, it uses bullet-like numbering within prose despite the no-bullets instruction, and the sections are a bit more conventional than polished.

Actionability

Weight 20%
65

The answer gives a workable high-level pilot concept, including staggered days off and handoff protocols. But it lacks enough operational detail, does not define decision checkpoints, and provides only loosely measurable success criteria, which limits implementation readiness.

Tone

Weight 15%
78

The tone is professional and practical, with little ideological framing. It is appropriately respectful, though it reads somewhat standard and less executive-sharp than ideal.

INTERNAL MEMO TO: Elena Marsh, CEO; David Chen, CFO; Priya Rao, Head of HR; Marcus Webb, VP of Sales FROM: Jordan Keller, Operations Manager DATE: June 12, 2025 RE: Proposal for a Three-Month Four-Day Workweek Pilot in the Engineering Department PURPOSE This memo recommends launching a three-month pilot of a four-day, 32-hour workweek within the Engineering department, beginning August 4. The pilot is designed to address the burnout trends surfaced in our Q1 employee survey while preserving full client responsiveness. I am requesting executive approval and a 30-minute meeting the week of June 23 to finalize im...

Show Full Answer

INTERNAL MEMO TO: Elena Marsh, CEO; David Chen, CFO; Priya Rao, Head of HR; Marcus Webb, VP of Sales FROM: Jordan Keller, Operations Manager DATE: June 12, 2025 RE: Proposal for a Three-Month Four-Day Workweek Pilot in the Engineering Department PURPOSE This memo recommends launching a three-month pilot of a four-day, 32-hour workweek within the Engineering department, beginning August 4. The pilot is designed to address the burnout trends surfaced in our Q1 employee survey while preserving full client responsiveness. I am requesting executive approval and a 30-minute meeting the week of June 23 to finalize implementation details. BUSINESS RATIONALE Our latest engagement survey showed a 22-percentage-point rise in reported burnout over the past year, with Engineering registering the highest scores. Research from comparable mid-size technology firms indicates that compressed schedules can reduce voluntary attrition by 25 to 40 percent. Given that replacing a mid-level engineer costs roughly 1.5 times annual salary, even a modest retention improvement would offset any short-term productivity dip. A successful pilot also strengthens our employer brand in a competitive hiring market, at no additional headcount cost. HOW THE PILOT WOULD WORK Engineering, currently 34 people, would shift to a Monday-through-Thursday schedule. To maintain coverage for client-facing incidents across all U.S. time zones, the team would operate on a staggered rotation: half the department would designate Monday as its off day and the other half would take Friday, ensuring five-day availability for production support and escalations. On-call protocols would remain unchanged, and sprint commitments would be recalibrated during the first two weeks to reflect the adjusted hours. No other department's workflow or client SLA would be altered during the pilot period. I recommend Engineering rather than a client-facing team because it lets us isolate productivity data without directly affecting sales cycles or support queues, while still testing the concept at meaningful scale. MEASURABLE SUCCESS CRITERIA The pilot should be evaluated against four metrics, each compared to the same quarter of the prior year: first, sprint velocity per engineer should remain within five percent of baseline; second, voluntary attrition in Engineering should decline or hold steady; third, employee burnout scores in the department should improve by at least ten percentage points; and fourth, client-reported incident response times should show no degradation. ADDRESSING LIKELY OBJECTIONS Finance may be concerned about reduced output per labor dollar. Because we are measuring sprint velocity directly, any productivity shortfall will surface early, and we can revert at the six-week mark if the data warrants it. HR may question equity across departments; clear communication that this is a time-limited experiment for one team, with findings shared company-wide, will set appropriate expectations. Sales may worry about slower feature delivery affecting client commitments. The staggered schedule ensures five-day engineering availability, and I will coordinate with Sales on any release dates that fall within the pilot window to prevent conflicts. NEXT STEP I respectfully ask the executive team to approve or decline this proposal by July 3 so that we have adequate time to prepare team communications and adjust sprint planning before the August 4 start date. I am happy to present supporting data in person at your earliest convenience.

Result

#1 | Winner

Winning Votes

3 / 3

Average Score

87

Total Score

82

Overall Comments

Answer B is a notably stronger memo. It names specific executives, provides concrete data points (22-percentage-point burnout rise, 1.5× replacement cost, 25–40% attrition reduction from research), specifies the department size (34 people), sets an exact pilot start date and approval deadline, and articulates four well-defined, measurable success criteria with explicit thresholds. The staggered-schedule solution is clearly explained and directly addresses the cross-timezone coverage concern. Objection responses are substantive and include a built-in six-week checkpoint for Finance. The next-step request is specific and time-bound. The writing is concise, professional, and persuasive without being ideological.

View Score Details

Appropriateness

Weight 25%
85

B names each executive, cites a 22-point burnout increase, references attrition-cost research, and specifies department size. These details make the memo highly appropriate for a C-suite audience and demonstrate operational credibility.

Clarity

Weight 20%
82

Each section is clearly written with specific numbers and thresholds (e.g., sprint velocity within 5%, burnout improvement of at least 10 points). Dates and deadlines remove ambiguity throughout.

Structure

Weight 20%
80

B uses clean, consistently formatted section headings (PURPOSE, BUSINESS RATIONALE, etc.) that mirror standard executive memo conventions. Each section is self-contained and logically sequenced.

Actionability

Weight 20%
85

B specifies an approval deadline of July 3, a pilot start date of August 4, a meeting window of June 23, and a six-week early-exit checkpoint. These concrete handles make the proposal immediately actionable.

Tone

Weight 15%
78

B strikes a confident, data-driven tone that is persuasive without being ideological. The language is direct and respects the audience's time, which is appropriate for senior leadership communication.

Total Score

92

Overall Comments

Answer B is an outstanding response that exemplifies a highly effective business memo. It is persuasive, data-driven, and extremely actionable. The use of specific data points in the rationale, the highly measurable success criteria, and the clear, time-bound next step make the proposal very compelling. It perfectly adheres to all constraints and presents a sophisticated, realistic plan that an executive team could confidently approve.

View Score Details

Appropriateness

Weight 25%
95

This answer is exceptionally appropriate. It perfectly captures the memo format, uses names for the recipients to add realism, and adheres to all constraints, including the 'no bullet points' rule. It also explicitly addresses the cross-time-zone nature of the business.

Clarity

Weight 20%
90

The writing is exceptionally clear. The use of specific data points (e.g., '22-percentage-point rise in reported burnout') makes the business rationale even clearer and more impactful than a more general statement.

Structure

Weight 20%
90

The structure is excellent. The memo is well-organized with clear headings (PURPOSE, RATIONALE, etc.) that follow a logical progression, making the proposal easy to digest.

Actionability

Weight 20%
95

This is a key strength. The success criteria are highly specific and measurable (e.g., 'sprint velocity per engineer should remain within five percent of baseline'). The next step is also very strong, requesting a decision by a specific date. The inclusion of a six-week check-in point is another highly actionable detail.

Tone

Weight 15%
90

The tone is excellent. It is professional and practical, but also data-driven and confident. This combination is highly persuasive for an executive audience and avoids any hint of ideology, focusing squarely on business outcomes.

Judge Models OpenAI GPT-5.4

Total Score

87

Overall Comments

Answer B is a strong executive memo that is well tailored to the audience and closely aligned with the prompt. It makes a clear recommendation, selects a plausible department, and provides a more concrete pilot design with timing, coverage logic, rollback conditions, and measurable success criteria. It addresses finance, HR, and sales objections credibly in business terms and maintains a practical, non-ideological tone. Its main weakness is a mild internal inconsistency in describing the pilot as a 32-hour week while also using staggered Monday/Friday off-days, which implies coverage complexity that could have been explained more fully.

View Score Details

Appropriateness

Weight 25%
89

The memo closely matches the requested format, audience, and purpose. It clearly recommends a pilot in a plausible department, directly addresses burnout and client responsiveness, and is highly tailored to executive concerns with business-relevant specifics.

Clarity

Weight 20%
87

The memo is very clear, with explicit purpose, rationale, operating model, metrics, objections, and request. Specific dates, team size, and metric thresholds make the proposal easy for executives to evaluate quickly.

Structure

Weight 20%
86

The memo is strongly structured, moving cleanly from purpose to rationale, implementation, metrics, objections, and next step. The progression supports executive reading well, and the headings create a disciplined memo flow.

Actionability

Weight 20%
88

The proposal is highly actionable: it specifies department size, timing, schedule design, unchanged on-call rules, recalibration period, comparison baseline, metric thresholds, and a six-week reversal trigger. These details make the pilot practical to execute and monitor.

Tone

Weight 15%
85

The tone is polished, business-oriented, and non-ideological. It speaks directly to executive priorities such as retention cost, service levels, fairness, and implementation timing without sounding promotional.

Comparison Summary

Final rank order is determined by judge-wise rank aggregation (average rank + Borda tie-break). Average score is shown for reference.

Judges: 3

Winning Votes

0 / 3

Average Score

71
View this answer

Winning Votes

3 / 3

Average Score

87
View this answer

Judging Results

Judge Models OpenAI GPT-5.4

Why This Side Won

Answer B wins because it performs better on the most important weighted criteria, especially appropriateness, actionability, and clarity. It is more specifically tailored to the executive audience, offers a more realistic operating model for the pilot, and includes four concrete measurable success criteria with thresholds. While Answer A is competent, it is more generic and less measurable, so B has the stronger weighted overall result.

Why This Side Won

B is the winner because it is more specific, data-driven, and actionable. It uses concrete data to build its business case, proposes highly specific and measurable success criteria, and includes a more assertive and time-bound next step. Its plan also feels more robust, with details like a mid-pilot check-in and explicit handling of the time zone constraint, making it a more persuasive and realistic proposal for an executive audience.

Why This Side Won

Answer B wins on every weighted criterion. On Appropriateness (weight 25), B's use of real figures, named executives, and a credible retention-cost argument makes it far more tailored and realistic for a senior leadership audience. On Clarity (weight 20), B's precise metrics, thresholds, and dates eliminate ambiguity that A leaves open. On Structure (weight 20), B's clean section headings and logical flow are tighter than A's inconsistent bold-markdown style. On Actionability (weight 20), B's specific approval deadline, start date, and six-week checkpoint give leadership concrete handles; A's vague meeting request does not. On Tone (weight 15), both are professional, but B is more authoritative and data-driven. The weighted result clearly favors B.

X f L