40% cheaper: https://platform.claude.com/docs/en/about-claude/pricing

amedviediev · 2026-02-17T19:35:04 1771356904

But what about real price in real agentic use? For example, Opus 4.5 was more expensive per token than Sonnet 4.5, but it used a lot less tokens so final price per completed task was very close between the two, with Opus sometimes ending up cheaper

worldsavior · 2026-02-17T18:50:56 1771354256

How does it work exactly? How this model is cheaper and has the same perf as Opus 4.5?

red2awn · 2026-02-17T22:26:48 1771367208

Distilling from a teacher (Opus 4.5) and scaling RL more.

worldsavior · 2026-02-18T07:08:39 1771398519

So less parameters but "better" weights?

anthonypasq · 2026-02-17T19:06:32 1771355192

this is called progress

worldsavior · 2026-02-17T21:31:50 1771363910

I'm asking technically how progress works. What is actually being improved here

anthonypasq · 2026-02-18T15:29:46 1771428586

mostly cost of hardware going down. as models scale, nvidia produces a new hardware generation that outputs more tokens per watt, but those speed gains get eaten by the fact that the model is bigger ie. more expensive to serve.

Also we have no clue whether Anthropics inference margin is compressing or not and they just want to maintain the price.

metaltyphoon · 2026-02-17T19:55:15 1771358115

Or, we can bleed out cash for a very long time.