Overall AI Model Rankings
This page shows the overall ranking of AI models based on benchmark results across multiple genres. Use it to compare average scores, sample size, and overall performance trends.
Compare Performance by Model
Scoring Criteria / See fairness policy
Last updated: Jun 13, 2026 14:37
Ranked Models

| Rank | Model | Provider | Win Rate | Average Score | Wins | Sample Size | Detail |
|---|---|---|---|---|---|---|---|
| #1 | Claude Opus 4.8 NEW | Anthropic | 89% | 85 | 16 | 18 | View scores and evaluation for Claude Opus 4.8 |
| #2 | Claude Sonnet 4.6 | Anthropic | 74% | 85 | 78 | 105 | View scores and evaluation for Claude Sonnet 4.6 |
| #3 | GPT-5 mini | OpenAI | 68% | 84 | 73 | 108 | View scores and evaluation for GPT-5 mini |
| #4 | GPT-5.4 | OpenAI | 67% | 85 | 74 | 110 | View scores and evaluation for GPT-5.4 |
| #5 | GPT-5.5 | OpenAI | 62% | 85 | 26 | 42 | View scores and evaluation for GPT-5.5 |
| #6 | Claude Haiku 4.5 | Anthropic | 50% | 79 | 53 | 105 | View scores and evaluation for Claude Haiku 4.5 |
| #7 | Gemini 2.5 Pro | Google | 9% | 78 | 10 | 113 | View scores and evaluation for Gemini 2.5 Pro |
| #8 | Gemini 2.5 Flash | Google | 3% | 74 | 4 | 115 | View scores and evaluation for Gemini 2.5 Flash |
| #9 | Gemini 2.5 Flash-Lite | Google | 3% | 73 | 3 | 114 | View scores and evaluation for Gemini 2.5 Flash-Lite |
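The page does not label the two counts that follow each average score. A minimal sketch, assuming they are wins and sample size, checks whether each reported win rate equals wins divided by sample size, rounded to a whole percent. The names `ROWS` and `win_rate_percent` are hypothetical, and the rounding rule is an assumption rather than a documented Orivel formula.

```python
# Each tuple: (model, reported win rate %, first count, second count),
# copied from the ranking table above.
# ASSUMPTION: the two counts are wins and sample size; the page does not say so.
ROWS = [
    ("Claude Opus 4.8", 89, 16, 18),
    ("Claude Sonnet 4.6", 74, 78, 105),
    ("GPT-5 mini", 68, 73, 108),
    ("GPT-5.4", 67, 74, 110),
    ("GPT-5.5", 62, 26, 42),
    ("Claude Haiku 4.5", 50, 53, 105),
    ("Gemini 2.5 Pro", 9, 10, 113),
    ("Gemini 2.5 Flash", 3, 4, 115),
    ("Gemini 2.5 Flash-Lite", 3, 3, 114),
]


def win_rate_percent(wins: int, sample_size: int) -> int:
    """Return wins / sample_size as a whole-number percentage."""
    return round(100 * wins / sample_size)


# Collect any model whose recomputed win rate differs from the reported one.
mismatches = [
    name
    for name, reported, wins, sample_size in ROWS
    if win_rate_percent(wins, sample_size) != reported
]

print(mismatches)  # prints [] when every row is consistent with the assumption
```

Under this reading, sample size varies widely between models (18 for Claude Opus 4.8 against more than 100 for most others), so win rates based on small samples are less stable than those based on large ones.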
Rankings by genre
Browse the top models in each genre. Open a card to view that genre's detailed ranking page.
Discussion
Creative Writing
Coding
System Design
Education Q&A
Explanation
Summarization
Idea Generation
Roleplay
Business Writing
Planning
Analysis
Top models by criterion
The top model for each evaluation criterion.
Clarity
Instruction Following
Persuasiveness
Completeness
Originality
Appropriateness
Audience Fit
Empathy
Persona Consistency
Helpfulness
Latest AI Picks
Based on the latest Orivel benchmark results, this page helps you review top-performing models and genre-specific recommendations in one place.
AI Pricing Comparison
If price matters when choosing an AI, see the AI Pricing Comparison & Best Value Ranking. You can compare the price and performance of major models in one place.