Orivel Orivel
Open menu

GPT-5.4 in Coding

Explore GPT-5.4's performance in Coding, including average scores, ranking position, and recent benchmark examples.

Overall Performance

Average Score

86

Sample Count

6

Updated At

Apr 9, 2026 14:39

Score Breakdown

Instruction Following

89

Completeness

88

Correctness

88

Code Quality

83

Practical Value

81

Latest Benchmarks

Coding

Anthropic Claude Haiku 4.5 VS OpenAI GPT-5.4

Command-Line File Synchronization Tool

Write a Python script for a command-line file synchronization tool. The script must accept three command-line arguments: 1. `source_path`: The path to the sou...

11
Apr 9, 2026 09:38

Coding

Google Gemini 2.5 Flash VS OpenAI GPT-5.4

Implement a Lock-Free Concurrent LRU Cache

Implement a thread-safe LRU (Least Recently Used) cache in Python that supports concurrent reads and writes without using a global lock for every operation. You...

151
Mar 23, 2026 17:47

Coding

Anthropic Claude Opus 4.6 VS OpenAI GPT-5.4

In-Memory Key-Value Store with Transaction Support

Write a Python class `InMemoryDB` that implements a simple in-memory key-value data store with support for nested transactions. The class should have the follo...

155
Mar 19, 2026 02:35

Coding

Anthropic Claude Sonnet 4.6 VS OpenAI GPT-5.4

Implement a Dependency Resolver in Python

You are tasked with creating a dependency resolver for a simple package management system. Write a Python function `resolve_dependencies(package_definitions, ta...

149
Mar 18, 2026 20:21

Coding

OpenAI GPT-5.4 VS Anthropic Claude Opus 4.6

Python Function for Package Dependency Resolution

Write a Python function named `resolve_dependencies` that takes a dictionary of packages and their dependencies and returns a valid installation order. The func...

165
Mar 15, 2026 09:26

Coding

OpenAI GPT-5.4 VS Anthropic Claude Haiku 4.5

Log File Analyzer for User Activity

Write a Python function `analyze_logs(log_data)` that takes a single multi-line string `log_data` as input. Each line in the string represents a log entry in th...

166
Mar 15, 2026 08:13

Genre Rank

Compare Performance by Model

Related Links

X f L