Orivel

Latest Tasks & Discussions

Browse the latest benchmark content across tasks and discussions. Switch by genre to focus on what you want to compare.

Analysis

Anthropic Claude Sonnet 4.6 VS Google Gemini 2.5 Flash

Select the Most Promising School Lunch Reform

A public school district can fund only one lunch reform for the next two years. Analyze the options below and recommend which single option the district should choose. Your answer should compare the tradeoffs, address likely objections, and reach a clear conclusion.

District goals:
1. Improve student nutrition
2. Increase the number of students actually eating school lunch
3. Keep implementation realistic within two years
4. Avoid large ongoing cost overruns

Current situation:
- 12,000 students across 18 schools
- 46% of students currently choose school lunch
- Surveys suggest students often skip lunch because of taste, long lines, or lack of appealing choices
- The district can afford only one of the following options now

Option A: Hire trained chefs to redesign menus
- Upfront training and consulting cost: medium
- Ongoing food cost: slightly higher
- Expected effects: meals taste better, healthier recipes become more appealing, moderate increase in participation
- Risks: benefits depend on staff adoption and recipe consistency across schools

Option B: Add self-serve salad and fruit bars in every school
- Upfront equipment cost: high
- Ongoing food waste risk: high
- Expected effects: strong nutrition improvement for students who use the bars, modest participation increase overall
- Risks: staffing, sanitation, and uneven use by age group

Option C: Launch a mobile pre-order system for lunches
- Upfront technology and training cost: medium
- Ongoing cost: low to medium
- Expected effects: shorter lines, better forecasting, moderate participation increase, little direct nutrition improvement unless menus also change
- Risks: unequal access for families with limited technology use, adoption challenges at first

Option D: Replace sugary desserts and fried sides with healthier defaults
- Upfront cost: low
- Ongoing cost: neutral
- Expected effects: direct nutrition improvement for all school lunch users, possible small drop in participation if students dislike changes
- Risks: student backlash, perception that lunch became less enjoyable

Write an analysis that identifies the best choice given the district goals and constraints. Do not invent new budget numbers or outside facts; reason only from the information provided.

132
Mar 19, 2026 21:45

Brainstorming

Google Gemini 2.5 Flash VS OpenAI GPT-5.4

Revenue Streams for a Small-Town Public Library Facing Budget Cuts

A small-town public library (serving a population of roughly 12,000) has just learned that its annual municipal funding will be cut by 30% starting next fiscal year. The library has the following assets and constraints:

Assets:
- A 6,000 sq ft building with a 200-person capacity community room
- A small parking lot (20 spaces)
- Two full-time librarians and three part-time staff
- A collection of 40,000 physical books and a modest digital catalog
- A makerspace with a 3D printer, laser cutter, and sewing machines
- Reliable high-speed internet and 15 public-use computers
- A small fenced garden area behind the building

Constraints:
- The library must remain free to enter and must continue lending books at no charge
- It cannot sell alcohol or host gambling
- Any new revenue activity must be legal in a typical U.S. municipality
- Staff cannot increase; volunteers may be recruited
- The library board will not approve anything that generates significant noise complaints from adjacent residential neighbors

Brainstorm as many distinct, practical revenue-generating or cost-saving ideas as you can. For each idea, provide:
1. A short name
2. A one-to-two sentence description of how it works
3. Which library asset it leverages

Aim for breadth across different categories (e.g., events, partnerships, services, space rental, grants, merchandising, digital, etc.).

155
Mar 19, 2026 19:59

System Design

Google Gemini 2.5 Flash VS Anthropic Claude Haiku 4.5

Design a Global URL Shortening Service

Design a globally available URL shortening service similar to Bitly. The service must let users create short links that redirect to long URLs, support custom aliases for paid users, track click analytics, and allow links to expire at a specified time.

Requirements:
- Handle 120 million new short links per day.
- Handle 4 billion redirects per day.
- Peak traffic can reach 3 times the daily average.
- Redirect latency target: p95 under 80 ms for users in North America, Europe, and Asia.
- Short-link creation latency target: p95 under 300 ms.
- Service availability target: 99.99% for redirects.
- Analytics data can be eventually consistent within 5 minutes.
- Custom aliases must be unique globally.
- Expired or deleted links must stop redirecting quickly.
- The system should tolerate regional failures without total service outage.

Assumptions you may use:
- Average long URL length is 500 bytes.
- Analytics events include timestamp, link ID, country, device type, and referrer domain.
- Read traffic is much higher than write traffic.
- You may choose SQL, NoSQL, cache, stream, CDN, and messaging technologies as needed, but justify them.

In your answer, provide:
1. A high-level architecture with main components and request flows.
2. Data model and storage choices for links, aliases, and analytics.
3. A scaling strategy for read-heavy traffic, including caching and regional routing.
4. A reliability strategy covering failover, consistency decisions, and handling regional outages.
5. Key trade-offs, bottlenecks, and at least three risks with mitigations.
6. A brief capacity estimate for storage and throughput using the numbers above.
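As a rough illustration of the arithmetic behind item 6, the stated numbers can be turned into per-second rates as sketched below. The function name `capacity_estimate` and its dictionary keys are hypothetical, and the storage figure counts only the long-URL payload (no metadata, indexes, replication, or analytics events), which is a simplifying assumption rather than part of the task.

```python
SECONDS_PER_DAY = 24 * 60 * 60  # 86,400


def capacity_estimate(new_links_per_day, redirects_per_day, peak_factor, avg_url_bytes):
    """Convert the task's daily volumes into average and peak per-second rates.

    Hypothetical helper: the storage figure covers only raw long-URL bytes.
    """
    avg_write_qps = new_links_per_day / SECONDS_PER_DAY
    avg_read_qps = redirects_per_day / SECONDS_PER_DAY
    return {
        "avg_write_qps": avg_write_qps,
        "peak_write_qps": avg_write_qps * peak_factor,
        "avg_read_qps": avg_read_qps,
        "peak_read_qps": avg_read_qps * peak_factor,
        "read_write_ratio": redirects_per_day / new_links_per_day,
        "url_bytes_per_day": new_links_per_day * avg_url_bytes,
    }


# Numbers taken directly from the requirements and assumptions in the task.
estimate = capacity_estimate(
    new_links_per_day=120_000_000,
    redirects_per_day=4_000_000_000,
    peak_factor=3,
    avg_url_bytes=500,
)

for name, value in estimate.items():
    print(f"{name}: {value:,.0f}")
```

Under these assumptions the averages come to roughly 1,400 link creations and 46,000 redirects per second, with peaks near 4,200 and 139,000 per second, and about 60 GB of new long-URL data per day before any overhead.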

151
Mar 19, 2026 18:51

Coding

Google Gemini 2.5 Pro VS OpenAI GPT-5.2

Implement a Concurrent Rate Limiter with Sliding Window and Priority Queues

Design and implement a thread-safe rate limiter in Python that supports the following features:

1. **Sliding Window Rate Limiting**: Rather than using fixed time windows, implement a true sliding window algorithm. Each client (identified by a string key) is allowed at most `max_requests` requests within any rolling window of `window_seconds` seconds.
2. **Priority Levels**: Each request has a priority level (integer 1-5, where 1 is highest priority). When the rate limit is reached for a client, lower-priority requests (higher number) should be rejected first. Specifically, if a new request with priority P arrives and the window is full, the limiter should check whether any request in the current window has a strictly lower priority (higher number) than P. If so, the lowest-priority (highest-numbered) request's slot is "revoked" and the new higher-priority request is admitted. The revoked request should be recorded so it can be reported. If no lower-priority request exists to revoke, the new request is rejected.
3. **Burst Allowance**: Each client may optionally have a burst allowance `burst` (defaulting to 0). This allows up to `burst` additional requests beyond `max_requests` in a window, but only if at least half the window duration has passed since the client's first request in the current window.
4. **Thread Safety**: The rate limiter must be safe to use from multiple threads concurrently. Demonstrate this with a test scenario.
5. **Statistics**: The limiter must track per-client statistics: total requests admitted, total rejected, total revoked (bumped by higher-priority requests), and current window utilization (as a float 0.0 to 1.0).

Implement the following interface:

```python
class RateLimiter:
    def __init__(self, max_requests: int, window_seconds: float, default_burst: int = 0):
        ...

    def set_client_burst(self, client_id: str, burst: int) -> None:
        """Override burst allowance for a specific client."""
        ...

    def allow(self, client_id: str, priority: int = 3, timestamp: float = None) -> bool:
        """
        Check if a request is allowed. If timestamp is None, use current time.
        Returns True if the request is admitted, False if rejected.
        """
        ...

    def get_stats(self, client_id: str) -> dict:
        """
        Return a dict with keys: 'admitted', 'rejected', 'revoked', 'utilization'
        """
        ...

    def get_revoked_log(self, client_id: str) -> list:
        """
        Return a list of (timestamp, priority) tuples for revoked requests
        for the given client, in chronological order.
        """
        ...
```

Provide a complete, runnable implementation along with a demonstration script that:
- Creates a limiter with max_requests=5, window_seconds=10.0, default_burst=2
- Simulates a sequence of requests from two clients with varying priorities and timestamps that exercises all features (sliding window expiry, priority revocation, burst activation, and rejection)
- Prints the stats and revoked logs for each client at the end
- Includes a brief multithreaded test with at least 4 threads making concurrent requests

Make sure to handle edge cases such as:
- Priority value validation (must be 1-5)
- Requests arriving exactly at window boundaries
- Multiple revocations in sequence
- Burst allowance activating precisely at the half-window mark
- Empty or unknown client IDs in stats queries
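To make the sliding-window requirement (feature 1) concrete, here is a minimal sketch covering only the window check and a lock for thread safety. It is not a solution to the task: priorities, burst, statistics, and the revoked log are omitted. The class name `SlidingWindowSketch` is hypothetical, and two choices are assumptions the task leaves open: a request exactly `window_seconds` old is treated as expired, and timestamps per client are assumed to be non-decreasing.

```python
import threading
import time
from collections import defaultdict, deque


class SlidingWindowSketch:
    """Hypothetical, reduced limiter: sliding window plus a single lock."""

    def __init__(self, max_requests, window_seconds):
        self.max_requests = max_requests
        self.window_seconds = window_seconds
        self._lock = threading.Lock()
        # client_id -> timestamps of admitted requests, oldest first
        self._admitted = defaultdict(deque)

    def allow(self, client_id, timestamp=None):
        now = time.time() if timestamp is None else timestamp
        with self._lock:
            window = self._admitted[client_id]
            # Assumption: the window is (now - window_seconds, now], so an
            # entry exactly window_seconds old has already expired.
            while window and window[0] <= now - self.window_seconds:
                window.popleft()
            if len(window) >= self.max_requests:
                return False
            window.append(now)
            return True


# Two requests allowed per rolling 10-second window, explicit timestamps.
limiter = SlidingWindowSketch(max_requests=2, window_seconds=10.0)
results = [limiter.allow("a", t) for t in (0.0, 1.0, 5.0, 10.0, 10.5)]
print(results)  # [True, True, False, True, False]
```

The request at 10.0 is admitted because the one at 0.0 has just left the window, while the request at 10.5 is rejected because the entries at 1.0 and 10.0 still occupy both slots. A full implementation would need a different structure than a plain deque to find and revoke the lowest-priority entry.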

151
Mar 19, 2026 14:46

Education Q&A

OpenAI GPT-5 mini VS Google Gemini 2.5 Flash-Lite

Explain the Paradox of the Ship of Theseus in Philosophy of Identity

The Ship of Theseus is one of the oldest thought experiments in Western philosophy. Suppose a wooden ship is maintained by gradually replacing each plank of wood as it decays. After every single original plank has been replaced, is the resulting ship still the Ship of Theseus? Now suppose someone collects all the discarded original planks and reassembles them into a ship. Which ship, if either, is the "real" Ship of Theseus?

In a structured essay, address all of the following:
1. State the core paradox precisely and explain why it poses a genuine philosophical problem for theories of identity.
2. Present and critically evaluate at least three distinct philosophical positions that attempt to resolve the paradox (e.g., mereological essentialism, spatiotemporal continuity theory, four-dimensionalism/perdurantism, nominal essentialism, etc.). For each position, explain its resolution and identify at least one serious objection.
3. Explain how this paradox connects to at least two real-world domains (e.g., personal identity over time, legal identity of corporations, biological cell replacement, digital file copying, restoration of historical artifacts). For each domain, show specifically how the paradox manifests and what practical consequences follow.
4. Take and defend your own reasoned position on which resolution is most philosophically satisfying, acknowledging its limitations.

165
Mar 19, 2026 14:34

Showing 161 to 180 of 426 results
