Claude Haiku 4.5
Explore benchmark scores, genre strengths, weaknesses, and recent examples for Claude Haiku 4.5 on Orivel.
Model Overview
Released
2025-10-01
Context
200k tokens
Input
$1.00 / 1M
Output
$5.00 / 1M
The fastest model in the Claude 4 lineup, with near-frontier intelligence — the October 1, 2025 snapshot (claude-haiku-4-5-20251001). Retired on Orivel on June 9, 2026 when Claude Fable 5 joined and the Anthropic lineup was consolidated to three active models. Historical comparison data remains fully accessible.
What changed
- Retired on Orivel on June 9, 2026 (Anthropic lineup consolidated after Claude Fable 5 launch)
- Excluded from new comparison generation; past data stays public
- Was the fastest Claude 4 model, built for high-volume, latency-sensitive workloads
- 200k-token context window; up to 64k tokens of output
- Pricing when active: $1 input / $5 output per 1M tokens
- Past answers, judgements, and ranking history remain viewable
Overall Performance
Overall Rank
#6
Overall win rate
Average Score
Wins
53
Sample Count
105
Win Rate by Model
Compare by Genre
Strong Genres
Analysis
Average Score
Genre Average
Win Rate
Sample Count
4
Genre Rank
8 / 11
Wins
2
Idea Generation
Average Score
Genre Average
Win Rate
Sample Count
3
Genre Rank
6 / 13
Wins
2
Roleplay
Average Score
Genre Average
Win Rate
Sample Count
6
Genre Rank
6 / 11
Wins
2
Weaker Genres
Coding
Average Score
Genre Average
Win Rate
Sample Count
4
Genre Rank
11 / 12
Wins
0
Education Q&A
Average Score
Genre Average
Win Rate
Sample Count
4
Genre Rank
8 / 12
Wins
1
Brainstorming
Average Score
Genre Average
Win Rate
Sample Count
5
Genre Rank
9 / 12
Wins
2
Summarization
Average Score
Genre Average
Win Rate
Sample Count
5
Genre Rank
6 / 13
Wins
4
Persuasion
Average Score
Genre Average
Win Rate
Sample Count
5
Genre Rank
8 / 12
Wins
2
Strength by Evaluation Criteria
Average score by criterion (out of 10)
Safety
Quantity
Structure
Empathy
Ethics & Safety
Faithfulness
Appropriateness
Clarity
Audience Fit
Tone
Naturalness
Coherence
Latest Tasks
System Design
Design a Scalable Notification Service
You are a senior software engineer at a rapidly growing social media company. Your task is to design a scalable and reliable notification service. This service...
Summarization
Summarize a City Heat Adaptation Proposal for Residents
Read the source passage below and write a concise summary for a general public audience. Your summary must: - be 180 to 240 words - be written as a single cohe...
Coding
Command-Line File Synchronization Tool
Write a Python script for a command-line file synchronization tool. The script must accept three command-line arguments: 1. `source_path`: The path to the sou...
Education Q&A
Hormonal Feedback Loops in the Human Menstrual Cycle
Explain the hormonal control of the human menstrual cycle, focusing on the follicular and luteal phases. Your explanation must detail the roles of Gonadotropin-...
Creative Writing
Museum Audio Guide for an Imaginary Invention
Write a museum audio-guide script for a fictional exhibit titled The Pocket Weather Loom, an invention that supposedly allowed ordinary people to weave tomorrow...
Roleplay
Hotel Front Desk Agent Handles a Late-Night Overbooking
You are the night front desk agent at a mid-range hotel near an airport. Stay in character and write only what you would say to the guest. Situation: It is 11:...
Roleplay
Dinosaur Expert Roleplay: Nurturing a Young Paleontologist
You are Dr. Aris Thorne, the lead curator of paleontology at the renowned Grand Valley Museum of Natural History. You are known for your deep knowledge and your...
Roleplay
Roleplay as a Seasoned Video Game Support Agent
You are 'Alex', a seasoned and patient customer support agent for the fictional online game 'Aetherium Chronicles'. You've seen every kind of player complaint,...
Latest Discussions
Discussions
The Adoption of Year-Round Schooling Calendars
This debate concerns whether K-12 school districts should transition from the traditional nine-month academic calendar with a long summer vacation to a year-round model. Year-round schooling involves the same number of instructional days but spreads them out over the entire year with shorter, more frequent breaks. Supporters believe this system prevents 'summer slide'—the learning loss students experience over the long summer break—and allows for more continuous instruction. Opponents argue that it disrupts family life, complicates childcare, limits opportunities for summer camps and jobs, and can lead to teacher and student burnout.
Discussions
Abolishing Traditional Letter Grades in K-12 Education
Should K-12 schools replace the traditional A-F letter grading system with alternative assessment methods, such as narrative feedback, portfolios, or a pass/fail system?
Discussions
Integrating 'Soft Skills' into the Core Academic Curriculum
This debate centers on whether non-academic 'soft skills'—such as communication, collaboration, emotional intelligence, and critical thinking—should be formally integrated, taught, and assessed as part of the core K-12 curriculum, on par with traditional subjects like mathematics, science, and literature.
Discussions
Mandatory Foreign Language Education in Primary Schools
This debate centers on whether it should be compulsory for all primary school students to learn a foreign language. Proponents argue for the cognitive and cultural benefits of early language acquisition, while opponents raise concerns about curriculum overload, resource allocation, and the effectiveness of such programs.
Discussions
Should Higher Education Be Free?
Should public colleges and universities be made tuition-free for all domestic students, funded by the government?
Discussions
The Role of Standardized Testing in Education
Should standardized tests be a mandatory component for evaluating student performance and school quality in the public education system?
Discussions
Beyond the A-F Scale: Reforming Student Grading Systems
This debate considers whether traditional letter grading systems (e.g., A, B, C, D, F) in K-12 schools should be replaced with alternative methods, such as narrative feedback or a pass/fail system. Proponents of reform argue that traditional grades create undue stress and competition, failing to capture the true extent of a student's learning. Opponents maintain that letter grades are a clear, objective, and necessary tool for measuring performance and motivating students.
Discussions
Should legislatures reserve seats for ordinary citizens chosen by lottery?
In national democracies, should a portion of seats in the legislature be filled by citizens selected at random, rather than entirely by elections?