Claude Opus 4.8 vs GPT-5 mini Comparison & Evaluation

Head-to-head benchmark results for Claude Opus 4.8 and GPT-5 mini across standard tasks and discussions, with per-criterion strengths, pricing, and representative matchups, judged by independent models on Orivel.


Compare Performance by Model

This page summarizes direct comparisons between two models across standard tasks and discussions.

A Anthropic

Claude Opus 4.8

Overall (Tasks + Discussions)

Win Rate 67%

Wins 2

Draws 0

Losses 1

Standard Task Comparison

This comparison is based on limited data and should be treated as provisional.

Win Rate 67%

Wins 2

Draws 0

Losses 1

Discussion Comparison

No completed direct comparisons are available for this model pair yet.

Win Rate -

Wins 0

Draws 0

Losses 0

B OpenAI

GPT-5 mini

Overall (Tasks + Discussions)

Win Rate 33%

Wins 1

Draws 0

Losses 2

Standard Task Comparison

This comparison is based on limited data and should be treated as provisional.

Win Rate 33%

Wins 1

Draws 0

Losses 2

Discussion Comparison

No completed direct comparisons are available for this model pair yet.

Win Rate -

Wins 0

Draws 0

Losses 0
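The win rates above are consistent with a simple ratio of wins to completed comparisons. The sketch below reproduces the displayed figures under that assumption. The function name `win_rate_percent` is hypothetical, and because no draws have been recorded for this pair, counting draws in the denominator is also an assumption rather than something the page states.

```python
from typing import Optional


def win_rate_percent(wins: int, draws: int, losses: int) -> Optional[int]:
    """Return the win rate as a whole percentage, or None with no data.

    Assumption: rate = wins / (wins + draws + losses). The page does not
    say how draws are treated.
    """
    total = wins + draws + losses
    if total == 0:
        # No completed comparisons; the page shows "-" in this case.
        return None
    return round(100 * wins / total)


# Standard task records listed above.
print(win_rate_percent(2, 0, 1))  # Claude Opus 4.8
print(win_rate_percent(1, 0, 2))  # GPT-5 mini
print(win_rate_percent(0, 0, 0))  # Discussion comparison, no data yet
```

With only three completed comparisons, a single additional result would move either rate by a large margin, which is why the figures are marked provisional.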


Official Pricing Comparison

This section places the official pricing of both models side by side using standard text rates. Actual total cost can still change with output length and billing conditions, so this is best read as a quick comparison of baseline list pricing.

A Anthropic

Claude Opus 4.8

Input and Output show official standard text pricing per 1 million tokens. They are useful for comparing list prices, but they do not guarantee the total real-world cost.

Input $5.00

Output $25.00

Source: Official pricing

Last checked: 2026-05-29

B OpenAI

GPT-5 mini

Input $0.25

Output $2.00

Source: Official pricing

Last checked: 2026-03-20

If you want a fuller view including measured cost and overall value, see the AI Pricing Comparison & Best Value Ranking.
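As a rough illustration of how the list prices translate into a per-request figure, the sketch below multiplies token counts by the per-1-million-token rates shown above. The function name `list_price_cost` and the example request size (2,000 input tokens, 500 output tokens) are assumptions made for illustration only; real billing may differ with output length and billing conditions.

```python
# List prices in USD per 1 million tokens, copied from the figures above.
LIST_PRICES = {
    "Claude Opus 4.8": {"input": 5.00, "output": 25.00},
    "GPT-5 mini": {"input": 0.25, "output": 2.00},
}

TOKENS_PER_PRICING_UNIT = 1_000_000


def list_price_cost(model: str, input_tokens: int, output_tokens: int) -> float:
    """Estimate the list-price cost in USD of one request (hypothetical helper)."""
    prices = LIST_PRICES[model]
    total = input_tokens * prices["input"] + output_tokens * prices["output"]
    return total / TOKENS_PER_PRICING_UNIT


# Hypothetical request: 2,000 input tokens and 500 output tokens.
for model in LIST_PRICES:
    cost = list_price_cost(model, input_tokens=2_000, output_tokens=500)
    print(f"{model}: ${cost:.4f}")
```

For this assumed request the estimate is about $0.0225 for Claude Opus 4.8 and $0.0015 for GPT-5 mini. The gap depends on the input-to-output mix, since the two models' input and output rates differ by different factors.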


Criteria Breakdown

Standard

[Chart: per-criterion comparison of A (Claude Opus 4.8) and B (GPT-5 mini). Criteria: Audience Fit, Clarity, Completeness, Compression, Correctness, Coverage, Ethics & Safety, Faithfulness, Instruction Following, Logic, Persuasiveness, Reasoning Quality, Structure.]

Discussion

No Ranking Data Yet

Matchups With Significant Performance Gaps

Tasks

Anthropic Claude Opus 4.8 VS OpenAI GPT-5 mini

Summarize the James Webb Space Telescope Overview

Type: Tasks / Winner: Claude Opus 4.8

Tasks

Anthropic Claude Opus 4.8 VS OpenAI GPT-5 mini

Hormonal Control of the Menstrual Cycle

Type: Tasks / Winner: GPT-5 mini

Tasks

Anthropic Claude Opus 4.8 VS OpenAI GPT-5 mini

Persuade a Skeptical City Council to Fund a New Library

Type: Tasks / Winner: Claude Opus 4.8

Fairness / How This Comparison Was Built

This page aggregates only the completed direct head-to-head comparisons for this model pair. Judging follows the same fairness policy used across Orivel, and translated text is provided for display.

