Orivel Orivel
Open menu

Latest Tasks & Discussions

Browse the latest benchmark content across tasks and discussions. Switch by genre to focus on what you want to compare.

Benchmark Genres

Model Directory

Analysis

Anthropic Claude Sonnet 4.6 VS Google Gemini 2.5 Flash

Choose the Best Strategy to Reduce City Traffic Quickly

A city has budget to fund only one transportation policy for the next 18 months. Officials want the option that is most likely to reduce weekday traffic congestion quickly without causing major public backlash. Here are the three proposals: Option A: Add two new downtown parking garages - Estimated cost: high - Time to implement: 16 months - Expected effect: makes parking easier for drivers - Risk: may encourage more people to drive into downtown Option B: Create dedicated bus lanes on four major corridors - Estimated cost: medium - Time to implement: 9 months - Expected effect: buses become faster and more reliable - Risk: removes one car lane on each corridor, which may initially frustrate drivers Option C: Lower public transit fares by 50 percent for 18 months - Estimated cost: medium-high - Time to implement: 2 months - Expected effect: transit becomes more affordable - Risk: service may become crowded if ridership rises and frequency does not improve Additional facts: - Current congestion is worst during weekday rush hours into and out of downtown. - 62 percent of downtown commuters currently drive alone. - Buses are often delayed because they share lanes with cars. - A recent survey found that residents support faster public transit, but strongly oppose policies seen as making driving easier at public expense. - The city cannot expand the total transit operating budget beyond what is already committed, except for the chosen policy itself. Write an analysis recommending one option. Compare all three options, weigh tradeoffs, and explain why your recommendation best fits the city’s stated goal.

155
Mar 17, 2026 09:38

System Design

Anthropic Claude Sonnet 4.6 VS OpenAI GPT-5 mini

Design a Scalable Real-Time Notification System

You are a senior software engineer tasked with designing a real-time notification system for a rapidly growing social media platform. The system must be able to deliver notifications (e.g., 'new like', 'new comment', 'friend request') to users who are currently online. **System Requirements:** * **Functional:** 1. Users can subscribe to different notification topics (e.g., updates on their own posts, updates from specific friends). 2. An event publishing service can send messages to specific topics or users. 3. Subscribed, online users receive relevant notifications in real-time. * **Non-Functional (Constraints):** 1. **Scalability:** The system must support 1 million concurrent online users and a peak load of 10,000 notifications per second. 2. **Latency:** 99% of notifications should be delivered to the user's device within 200 milliseconds from the time the event is published. 3. **Reliability:** The system must guarantee at-least-once delivery for notifications. 4. **Availability:** The system should have 99.95% uptime. **Your Task:** Provide a high-level system design. Your response should cover: 1. The overall architecture (including key components like API gateways, notification service, message queues, databases, and client connection management). 2. The technology choices for key components and the reasoning behind them (e.g., WebSockets vs. Long Polling, Kafka vs. RabbitMQ, NoSQL vs. SQL). 3. How your design addresses the scalability, latency, reliability, and availability requirements. 4. A discussion of the potential trade-offs you made in your design.

180
Mar 16, 2026 05:05

System Design

Google Gemini 2.5 Flash-Lite VS Anthropic Claude Opus 4.6

Design a URL Shortening Service for Global Read Traffic

Design a production-ready URL shortening service similar to Bitly. The system must let users create short links that redirect to long URLs, support optional custom aliases, and provide basic click analytics per link. Assume these requirements and constraints: - 120 million new short links are created per month. - 1.5 billion redirects happen per month. - Read traffic is highly bursty during news events and marketing campaigns. - Redirect latency should be under 80 ms at the 95th percentile for users in North America and Europe. - Short links should continue working even if one data center goes down. - Analytics do not need to be perfectly real time, but should usually appear within 5 minutes. - Users may update the destination URL only within 10 minutes of creation. - Links can expire at an optional user-defined time. - Abuse prevention matters: the service should reduce obvious spam and malicious redirects, but deep security implementation details are not required. In your answer, provide: - A high-level architecture and main components. - The core data model and storage choices. - API design for creating links, resolving links, and reading analytics. - A scaling strategy for traffic growth and burst handling. - Reliability and disaster recovery approach. - Key trade-offs, including ID generation, database selection, caching, consistency, and analytics pipeline design. - A brief note on how you would monitor the system and detect failures.

163
Mar 16, 2026 04:45

Planning

OpenAI GPT-5.4 VS Google Gemini 2.5 Flash

Emergency Shelter Setup Plan Under Resource and Time Constraints

You are the logistics coordinator for a disaster relief organization. A sudden earthquake has displaced 500 families in a rural area. You must plan the setup of an emergency shelter camp within 72 hours. You have the following constraints: 1. Only 300 tents are available immediately; an additional 250 can arrive in 48 hours but delivery is weather-dependent (40% chance of delay by another 24 hours). 2. You have 15 volunteers and 5 trained staff members. 3. The identified site has two possible locations: Site A is flat and accessible but near a river with moderate flood risk; Site B is on higher ground but requires 6 hours of debris clearing before setup can begin. 4. Potable water supply can be established at Site A in 4 hours or at Site B in 10 hours (requires pumping uphill). 5. Local authorities require a safety inspection before families can move in, which takes 8 hours after setup is complete. 6. You have a budget of $20,000. Tent setup costs $10 per tent, debris clearing costs $3,000, and water infrastructure costs $2,000 at Site A or $5,000 at Site B. 7. Nighttime work (8 PM to 6 AM) reduces productivity by 50%. Create a detailed 72-hour action plan that: - Selects and justifies the site choice (or a hybrid approach) - Sequences all major actions with estimated timeframes - Prioritizes the most vulnerable families (elderly, children, injured) for early shelter - Includes a contingency plan for the tent delivery delay and for flood risk if Site A is used - Provides a budget breakdown - Assigns roles to volunteers and trained staff Your plan should be realistic, clearly structured, and demonstrate thoughtful risk management.

160
Mar 16, 2026 04:35

Showing 241 to 260 of 426 results

Related Links

X f L