News

Pricing updates and platform news

Curated coverage of AI platform pricing, model launches, and API changes.

OpenAI
pricing

OpenAI cuts GPT-5 input by 17%, expands cached input discount

Standard input drops from $3.00 to $2.50 per 1M tokens. Cached input is now $0.625 per 1M tokens, a 50% reduction.

2026-05-15 OpenAI
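
The percentages in the item above can be checked with a few lines of arithmetic. This is a minimal sketch using only the prices quoted in the summary; the helper name `percent_change` is illustrative and not part of any vendor tooling.

```python
def percent_change(old: float, new: float) -> float:
    """Relative change from old to new, in percent (negative means a cut)."""
    return (new - old) / old * 100.0


# Prices per 1M tokens, as quoted in the news item.
old_input = 3.00
new_input = 2.50
cached = 0.625

# Change in standard input price: about -16.7%, reported as a 17% cut.
print(round(percent_change(old_input, new_input), 1))

# Cached input as a fraction of the new standard price: 0.25.
print(cached / new_input)
```

The second figure shows that cached input is priced at one quarter of the new standard rate.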
DeepSeek
pricing

DeepSeek V3.2 launches with sparse attention, 23% cheaper than V3.1

Long-context inputs become materially cheaper via sparse attention. R1 also receives a permanent price cut.

2026-05-15 DeepSeek
Anthropic
api

Anthropic expands prompt caching to all Claude 4.5 tiers

Cached input now 1/10th the price of standard input across Sonnet, Haiku, and Opus.

2026-05-14 Anthropic
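
To see what a 1/10th cached rate could mean for a bill, the sketch below estimates the blended input cost relative to paying the standard rate for every token. The function name `relative_input_cost` and the 80% cache-hit figure are assumptions chosen for illustration, and the sketch ignores any cache-write or storage charges a provider may apply.

```python
def relative_input_cost(cache_hit_fraction: float, cached_ratio: float = 0.1) -> float:
    """Blended input cost as a fraction of the all-standard cost.

    cache_hit_fraction: share of input tokens served from cache (0.0 to 1.0).
    cached_ratio: cached price divided by standard price (1/10 per the item above).
    """
    if not 0.0 <= cache_hit_fraction <= 1.0:
        raise ValueError("cache_hit_fraction must be between 0 and 1")
    uncached_share = 1.0 - cache_hit_fraction
    return uncached_share + cache_hit_fraction * cached_ratio


# Hypothetical workload: 80% of input tokens hit the cache.
# The blended cost comes to roughly 28% of the uncached cost.
print(round(relative_input_cost(0.8), 2))
```

With no cache hits the function returns 1.0, and with every token cached it returns the cached ratio itself.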
Google
pricing

Gemini 2.5 Flash price drops 25% across Vertex and AI Studio

Standard input now $0.15 per 1M tokens. Cached input remains 25% of standard.

2026-05-12 Google
Anthropic
pricing

Claude 4.5 Haiku input rises 14% alongside context window upgrade

Haiku now supports a 200K context window. The price increase reflects the expanded capability.

2026-04-01 Anthropic
xAI
pricing

Grok 4 standard input increases 7%

xAI cites realtime context infrastructure costs. Grok 4 mini remains unchanged.

2026-04-12 xAI
Mistral
launch

Mistral Large 3 launches at $2 input, 20% under Large 2

Multilingual benchmarks improve significantly while pricing moves down.

2026-05-09 Mistral
OpenAI
deprecation

OpenAI deprecates GPT-4 Turbo on July 1

Customers are migrated to GPT-5 mini by default. Migration guide published.

2026-05-02 OpenAI
Cohere
pricing

Cohere Command R+ input drops to $2.50

Aligns Command R+ with mid-frontier market pricing.

2026-05-08 Cohere