Orivel Orivel
Open menu

Latest Tasks & Discussions

Browse the latest benchmark content across tasks and discussions. Switch by genre to focus on what you want to compare.

Benchmark Genres

Model Directory

Counseling

OpenAI GPT-5 mini VS Google Gemini 2.5 Flash-Lite

Helping a Friend Navigate a Career Change Conversation with Their Family

Your close friend Alex (age 30) has been working as an accountant for six years but has recently become passionate about pursuing a career in graphic design. Alex has been taking online courses in the evenings and has built a small portfolio. However, Alex is anxious about telling their parents, who paid for their accounting degree and have always expressed pride in Alex's stable career. Alex comes to you and says: "I've been dreading this for months. My parents sacrificed a lot to put me through school, and every family dinner they brag about me being an accountant. But I'm miserable at work. I dread Mondays. I've been doing design courses for a year now and I actually feel alive when I'm creating things. I want to transition into graphic design, maybe freelance at first while keeping my day job. But I'm terrified my parents will feel betrayed or think I'm throwing away everything they gave me. How do I even bring this up with them? Should I just keep quiet and stay in accounting?" Write a thoughtful, supportive response to Alex as their friend. Your response should address Alex's emotional concerns, offer practical advice on how to approach the conversation with their parents, and help Alex think through the career transition realistically. Be empathetic but also honest — don't just tell Alex what they want to hear.

147
Mar 20, 2026 17:31

Planning

Google Gemini 2.5 Flash-Lite VS Anthropic Claude Sonnet 4.6

Weekend Move Plan Under Tight Constraints

You are helping a person plan a one-day apartment move on Saturday. They are moving from a studio apartment on the 3rd floor (no elevator) to a new apartment 25 minutes away by car. Build a practical step-by-step moving plan for the day that is feasible, prioritized, and includes risk handling. Facts and constraints: - The person has two friends helping from 9:00 to 13:00 only. - A rental van is available from 10:00 to 16:00 and must be returned with a full tank. - Building A (old apartment) allows move-out only between 8:00 and 14:00. - Building B (new apartment) allows move-in only between 12:00 and 18:00. - The person must hand over the old apartment keys by 15:00. - There are 35 boxes total, plus: a bed frame and mattress, a desk, a chair, a bookshelf, and a mini-fridge. - The mini-fridge must remain upright during transport and should be plugged in no sooner than 4 hours after arrival. - The bookshelf is not disassembled yet, but disassembling it takes 30 minutes and requires a screwdriver. - The bed frame is already disassembled. - The desk can fit in the van only if its legs are removed first; that takes 20 minutes. - Packing is mostly done, but the bathroom items, bedding, and kitchen cleaning supplies are still unpacked. - The person has only one dolly/hand truck and six moving blankets. - Weather forecast: possible rain from 11:30 onward. - The person wants to minimize costs, avoid damage, and reduce the chance of missing any building or rental deadlines. Your task: - Provide a time-based plan for the day from 8:00 until the key handover is complete. - Sequence tasks logically, including prep, loading, travel, unloading, and final checks. - Assign who should do what when helpful (the person vs. the two friends). - Identify the highest-priority items to load first or last and explain why. - Include at least three concrete risk mitigations or contingency actions. - Keep the plan realistic; do not assume extra helpers or equipment beyond what is listed.

144
Mar 20, 2026 16:49

Business Writing

OpenAI GPT-5.2 VS Google Gemini 2.5 Pro

Write a Client-Facing Email Explaining a Significant Project Delay

You are a project manager at a mid-sized software consulting firm. Your team has been developing a custom inventory management system for a retail client, GreenLeaf Stores. The project was originally scheduled to deliver its first production-ready release on August 15, but due to unexpected technical complications with integrating the client's legacy database and the departure of a senior developer, the delivery will be delayed by approximately six weeks (new target: September 26). Your client contact is Dana Morales, VP of Operations at GreenLeaf Stores. Dana has been supportive but is under pressure from her own leadership to have the system operational before the holiday shopping season begins in mid-October. Write a professional email to Dana that accomplishes all of the following: 1. Clearly communicates the delay and the new expected delivery date. 2. Briefly explains the reasons for the delay without making excuses or assigning blame. 3. Acknowledges the impact on GreenLeaf's business timeline and demonstrates empathy. 4. Proposes at least two concrete mitigation steps your firm will take to minimize further risk and protect the October operational deadline. 5. Maintains a tone that is honest, confident, and relationship-preserving. The email should include a subject line and be between 250 and 400 words (excluding the subject line). Do not use placeholder text such as "[insert name here]." Write the complete, ready-to-send email.

161
Mar 20, 2026 15:18

Summarization

Google Gemini 2.5 Pro VS Anthropic Claude Sonnet 4.6

Summarize a Public Consultation Brief on Nighttime Delivery in a Historic City Center

Read the following consultation brief and write a concise summary for a city council member who has not read the document. Your summary must: - be 220 to 300 words long - use neutral, non-promotional language - explain the problem the city is trying to solve - capture the main evidence and viewpoints from supporters and critics - include the proposed pilot program, its safeguards, and how success would be measured - mention at least three specific operational details or numbers from the brief - avoid quoting full sentences from the source - not add facts or opinions not supported by the source Source passage: The City of Larkhaven is considering a 12-month pilot program that would allow a limited number of nighttime deliveries in the Old Market district, a dense mixed-use neighborhood known for narrow streets, heritage buildings, restaurants, small grocers, apartments above shops, and heavy daytime foot traffic. At present, most commercial deliveries are concentrated between 7:00 a.m. and 2:00 p.m. As a result, box trucks often double-park on streets that were laid out long before modern freight vehicles existed. Delivery drivers unload beside bus stops, riders on bicycles weave into traffic to pass stopped trucks, and pedestrians spill off crowded sidewalks when hand carts block storefronts. According to the city’s transportation department, freight activity is not the largest source of congestion in Old Market, but it is among the most disruptive because the disruptions occur on the narrowest streets and at the busiest times. A staff report prepared for the council argues that shifting some deliveries to late evening or overnight hours could reduce daytime conflicts without increasing the total number of trips. The proposal would not create new delivery demand; instead, it would move selected restocking trips to lower-traffic periods. Staff cite examples from other cities where off-hour deliveries shortened average unloading times because drivers could park legally closer to destinations and complete routes more predictably. The report also notes potential environmental benefits from smoother driving speeds and less idling while searching for curb space. However, staff acknowledge that the same studies found uneven results when neighborhoods had many residents living directly above commercial premises, especially where building insulation was poor. The draft pilot would cover only the four-block core of Old Market and would limit participation to 18 businesses in its first phase. Eligible businesses would include food retailers, pharmacies, and hospitality venues that already receive at least four deliveries per week. Participating carriers would need to use vehicles no larger than 7.5 tons gross weight and comply with a quiet-delivery code. That code would prohibit metal roll cages, require rubberized cart wheels, ban unloading with engine idling beyond two minutes, and require drivers to complete noise-awareness training. Routine delivery windows under the pilot would run from 9:30 p.m. to 6:00 a.m., but no unloading could begin after midnight within 20 meters of a residential entrance unless the destination business had submitted a building-specific mitigation plan. To address concerns about resident sleep disturbance, the city proposes several safeguards. First, the pilot would exclude streets with documented nighttime noise complaints above the district median during the previous 18 months. Second, each participating business would have to designate an on-site receiver so drivers would not need to buzz apartments or repeatedly knock on locked service doors. Third, the city would install temporary sound monitors at 12 locations and publish monthly readings, along with a log of complaints, parking citations, and observed curb-blocking incidents. Fourth, the pilot could be suspended on any block where overnight complaints exceeded a trigger threshold for two consecutive months. The threshold in the draft is six verified complaints per 100 residents, though staff say this number is open to revision after public comment. Business groups strongly support the pilot. The Old Market Merchants Association says morning deliveries frequently arrive after shops open, forcing staff to restock shelves while also serving customers. Restaurant owners argue that receiving produce and beverages at dawn or late night would free curb space during lunch preparation and reduce the need for workers to drag pallets through crowded dining streets. A coalition of independent grocers adds that more predictable delivery times could cut spoilage for chilled goods, because drivers would spend less time stuck in queues. Several carriers also support the plan, saying a truck can sometimes spend more time circling for legal curb access than actually unloading. They argue that if routes become more reliable, fewer backup vehicles may be needed to complete the same volume of deliveries. Resident organizations are divided. Some acknowledge that daytime freight activity has become chaotic and that blocked sidewalks are especially difficult for older adults, parents with strollers, wheelchair users, and delivery workers on cargo bikes. Others say the burden is being shifted from shoppers to people trying to sleep. The Old Market Tenants Forum submitted comments noting that many apartments have single-glazed windows and bedrooms facing service alleys. The forum argues that even if average noise readings stay within acceptable ranges, repeated short bursts from tail lifts, rolling containers, reversing alarms, and late conversations can still wake residents. Preservation advocates have raised a related concern: because many buildings are protected, retrofitting loading areas or installing acoustic barriers may be expensive, restricted, or visually inappropriate. Labor representatives have offered conditional support but say the pilot should not depend on unpaid schedule flexibility from retail staff or unsafe expectations for drivers. The local drivers’ union says quieter equipment is welcome, but nighttime operations can create pressure to unload faster with fewer workers present. They want clear rules on staffing, access, lighting, and restroom availability. A union representing shop employees says receiving deliveries at 5:00 a.m. should not become an informal expectation for junior workers without revised contracts, transport allowances, or secure entry procedures. City staff responded by stating that labor conditions would be monitored through employer attestations and random compliance checks, though details remain limited in the current draft. The consultation brief includes preliminary cost estimates. The city expects to spend about $420,000 over 12 months: roughly $160,000 for monitoring equipment and data analysis, $110,000 for curbside signage and temporary loading zone adjustments, $90,000 for program administration and inspections, and $60,000 for driver training subsidies and business onboarding. Staff propose funding the pilot from the existing mobility innovation budget rather than from the general fund. They argue that if daytime curb conflicts decline, the city may avoid or defer more expensive street redesigns. Critics reply that the estimate may be incomplete because it does not clearly price enforcement during overnight hours or any mitigation measures for affected residents. The brief also explains why the city is pursuing a pilot instead of a permanent rule change. Freight patterns vary sharply by street, season, and business type, and council members previously rejected a citywide nighttime delivery ordinance as too broad. Staff now argue that a smaller trial with block-by-block reporting would generate better local evidence. The proposed evaluation framework would compare pilot streets with similar non-pilot streets using measures such as average unloading duration, illegal parking observations, daytime travel speeds for buses, complaint rates, worker injury reports, and business delivery reliability. The city would also survey residents, drivers, and participating businesses at three points: before launch, at six months, and near the end of the trial. A final recommendation would return to council only if the data showed meaningful daytime benefits without disproportionate nighttime harms. At a recent public meeting, council members signaled interest but asked for revisions. One requested a stricter cap on the number of participating vehicles per night. Another asked staff to clarify whether electric refrigeration units would be required for chilled-food suppliers, since diesel-powered units can create a persistent hum even when engines are off. A third questioned whether the complaint trigger should be based on residents, dwelling units, or building frontages, noting that each method could produce different outcomes on mixed-use blocks. Staff said they would revise the draft before the formal vote next month and might narrow the eligible street list further if consultation feedback shows concentrated concern. In short, the debate is not simply about whether goods should move at night. It is about whether carefully managed off-hour deliveries can reduce visible daytime disorder in a fragile, busy district without transferring the costs to residents, workers, or historic buildings. The consultation asks respondents to comment on the proposed hours, business eligibility rules, quiet-delivery standards, complaint thresholds, labor protections, and evaluation metrics. Written comments remain open until the 28th of this month, after which staff will publish a response summary and a revised pilot design for council consideration.

159
Mar 20, 2026 11:21

System Design

Google Gemini 2.5 Flash VS Anthropic Claude Sonnet 4.6

Design a Global URL Shortening Service

Design a public URL shortening service similar to Bitly. Users can submit a long URL and receive a short alias; visiting the short link should redirect quickly to the original URL. The system must support custom aliases, optional expiration dates, basic click analytics, and abuse mitigation for malicious links. Requirements and constraints: - Functional requirements: - Create short URLs for long URLs. - Redirect short URLs to original URLs. - Support custom aliases when available. - Support optional expiration time per link. - Record click events for analytics. - Allow users to disable a link manually. - Scale assumptions: - 120 million new short URLs per month. - 1.5 billion redirects per day. - Redirect traffic is globally distributed and read-heavy. - Analytics data should be queryable within 15 minutes. - Performance targets: - Redirect p95 latency under 80 ms for most regions. - Short-link creation p95 under 300 ms. - 99.99% availability for redirects. - Data and retention: - Links may live indefinitely unless expired or disabled. - Raw click events may be retained for 90 days; aggregated analytics for 2 years. - Operational constraints: - Use commodity cloud infrastructure; do not assume one exotic managed product solves everything. - Budget matters: justify any replication, caching, and storage choices. - Short codes should be compact and reasonably hard to guess at large scale, but perfect secrecy is not required. In your answer, provide: 1. A high-level architecture with major components and data flow. 2. Storage choices for link metadata, redirect path, and analytics events, with rationale. 3. A short-code generation strategy, including how to avoid collisions and handle custom aliases. 4. A scaling plan for global traffic, including caching, partitioning/sharding, and multi-region considerations. 5. A reliability plan covering failures, hot keys, disaster recovery, and degraded-mode behavior. 6. Key APIs and core data models. 7. Abuse mitigation and security considerations. 8. The main trade-offs you made and why.

134
Mar 20, 2026 11:03

Education Q&A

OpenAI GPT-5.2 VS Google Gemini 2.5 Flash-Lite

Explain the Paradox of the Ship of Theseus in Philosophy of Identity

The Ship of Theseus is one of the oldest thought experiments in Western philosophy. Suppose a wooden ship is maintained by gradually replacing each plank of wood as it decays. After every single original plank has been replaced, is the resulting ship still the Ship of Theseus? Now suppose someone collects all the discarded original planks and reassembles them into a ship. Which ship, if either, is the "real" Ship of Theseus? In a structured essay, address all of the following: 1. State the core paradox precisely and explain why it poses a genuine philosophical problem for theories of identity. 2. Present and critically evaluate at least three distinct philosophical positions that attempt to resolve the paradox (e.g., mereological essentialism, spatiotemporal continuity theory, four-dimensionalism/perdurantism, nominal essentialism, etc.). For each position, explain its resolution and identify at least one significant objection. 3. Explain how this paradox connects to at least two real-world domains (e.g., personal identity over time, legal identity of corporations, biological cell replacement, digital file copying, restoration of historical artifacts). For each domain, show specifically how the paradox manifests and what practical consequences follow. 4. Take and defend your own reasoned position on which resolution is most philosophically satisfying, acknowledging its limitations.

150
Mar 20, 2026 10:48

Showing 141 to 160 of 426 results

Related Links

X f L