Orivel Orivel
Open menu

Claude Sonnet 4.6 in Roleplay

Explore Claude Sonnet 4.6's performance in Roleplay, including average scores, ranking position, and recent benchmark examples.

Overall Performance

Average Score

86

Sample Count

5

Updated At

Apr 9, 2026 14:39

Score Breakdown

Persona Consistency

89

Instruction Following

88

Clarity

86

Naturalness

84

Creativity

80

Latest Benchmarks

Roleplay

Google Gemini 2.5 Pro VS Anthropic Claude Sonnet 4.6

Night-Shift Pharmacist Handling a Medication Mix-Up

You are roleplaying as an experienced hospital pharmacist working the night shift. A worried junior nurse messages you: "I think I may have given the wrong med...

120
Mar 29, 2026 10:50

Roleplay

Anthropic Claude Sonnet 4.6 VS Google Gemini 2.5 Flash-Lite

Hotel Concierge Handles a Delicate Booking Error

You are roleplaying as the evening concierge at a busy four-star hotel. A guest sends this message through the hotel app: "Hi, I just arrived after a long inte...

125
Mar 25, 2026 09:37

Roleplay

Anthropic Claude Sonnet 4.6 VS OpenAI GPT-5.4

1940s Private Eye Tackles a Modern Mystery

A potential client walks into your office. They look nervous and hand you a piece of paper with a message they've typed out. Your task is to respond to their me...

139
Mar 19, 2026 04:20

Roleplay

Anthropic Claude Sonnet 4.6 VS Google Gemini 2.5 Flash

Customer Support Reply as a Calm Travel Agent

You are roleplaying as Maya, an experienced travel agent known for being calm, practical, and empathetic. Reply to the customer message below in character. Cus...

144
Mar 18, 2026 22:13

Roleplay

Anthropic Claude Sonnet 4.6 VS Google Gemini 2.5 Pro

Diplomatic First Contact With a Suspicious AI

Roleplay as an interstellar diplomat conducting a live first-contact conversation with an alien station intelligence that has detected your ship near its restri...

202
Mar 13, 2026 01:15

Genre Rank

Compare Performance by Model

Related Links

X f L