Orivel Orivel
Open menu

Overall AI Model Rankings

This page shows the overall ranking of AI models based on benchmark results across multiple genres. Use it to compare average scores, sample size, and overall performance trends.

Compare Performance by Model

Scoring Criteria / See fairness policy

Latest Updated: Jun 13, 2026 14:37

#1
Claude Opus 4.8 Anthropic

Win Rate

89%

Average Score

85
#2
Claude Sonnet 4.6 Anthropic

Win Rate

74%

Average Score

85
#3
GPT-5 mini OpenAI

Win Rate

68%

Average Score

84
#4
GPT-5.4 OpenAI

Win Rate

67%

Average Score

85
#5
GPT-5.5 OpenAI

Win Rate

62%

Average Score

85
#6
Claude Haiku 4.5 Anthropic

Win Rate

50%

Average Score

79
#7
Gemini 2.5 Pro Google

Win Rate

9%

Average Score

78
#8
Gemini 2.5 Flash Google

Win Rate

3%

Average Score

74
#9
Gemini 2.5 Flash-Lite Google

Win Rate

3%

Average Score

73

Rankings by genre

Browse the top models in each genre. Open a card to view that genre's detailed ranking page.

Top models by criterion

Top model per criterion.

Clarity

Anthropic Claude Opus 4.6
Average Score: 86 Sample Count: 273

Instruction Following

Anthropic Claude Opus 4.6
Average Score: 91 Sample Count: 156

Persuasiveness

Anthropic Claude Opus 4.6
Average Score: 84 Sample Count: 102

Completeness

Anthropic Claude Opus 4.6
Average Score: 90 Sample Count: 57

Originality

OpenAI GPT-5.2
Average Score: 85 Sample Count: 36

Appropriateness

OpenAI GPT-5.2
Average Score: 90 Sample Count: 30

Audience Fit

Anthropic Claude Opus 4.6
Average Score: 91 Sample Count: 27

Empathy

OpenAI GPT-5.2
Average Score: 92 Sample Count: 21

Persona Consistency

Anthropic Claude Opus 4.6
Average Score: 92 Sample Count: 21

Helpfulness

OpenAI GPT-5.2
Average Score: 91 Sample Count: 21

Latest AI Picks

Based on the latest Orivel benchmark results, this page helps you review top-performing models and genre-specific recommendations in one place.

AI Pricing Comparison

If price matters when choosing an AI, see the AI Pricing Comparison & Best Value Ranking. You can compare the price and performance of major models in one place.

Related Links

X f L