ApiDelta

Daily digest of LLM API changes - Anthropic, OpenAI, Mistral, and more.

ApiDelta — May 8, 2026

Anthropic expanded 1 M‑token context windows and added a Rate Limits API, while several older Claude models are now officially retired; Mistral released Vibe 2.0 and updated its Python SDK.

1 M‑token context now GA for Claude Opus 4.6 and Sonnet 4.6

Requests up to 1 M tokens are supported without a beta header, at standard pricing. Existing code that set the context-1m-2025-08-07 beta header can drop it. Source
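One way to clean up existing request code is sketched below. It assumes the flag is sent as a value of the anthropic-beta request header (a comma-separated list) and that the key is spelled in lowercase; the helper name drop_context_beta is hypothetical and not part of any SDK.

```python
CONTEXT_1M_BETA = "context-1m-2025-08-07"
BETA_HEADER = "anthropic-beta"  # assumed lowercase key; adjust to your HTTP client


def drop_context_beta(headers: dict[str, str]) -> dict[str, str]:
    """Return a copy of `headers` without the 1M-context beta flag.

    Other beta flags in the same header are preserved. If the flag was the
    only value, the header is removed entirely.
    """
    cleaned = dict(headers)
    raw = cleaned.get(BETA_HEADER)
    if raw is None:
        return cleaned

    # Split the comma-separated list, drop empties and the retired flag.
    remaining = [
        flag.strip()
        for flag in raw.split(",")
        if flag.strip() and flag.strip() != CONTEXT_1M_BETA
    ]
    if remaining:
        cleaned[BETA_HEADER] = ",".join(remaining)
    else:
        del cleaned[BETA_HEADER]
    return cleaned
```

The function returns a new dict rather than mutating its input, so it can be dropped into a request wrapper without side effects.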

1 M‑token beta removed for Claude Sonnet 4.5 and older Sonnet models

The beta header no longer enables a 1 M‑token window on Sonnet 4.5 or Sonnet 4; calls exceeding the default 200 k token limit will now return an error. Upgrade to Opus 4.6/4.7 or Sonnet 4.6 to retain large‑context capability. Source
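A pre-flight size check can surface this failure before the request is sent. The sketch below is illustrative only: it assumes the window has to hold both the prompt and the reserved output tokens, and it expects the caller to supply a token count from whatever counting method they already use. The names check_fits and supports_1m are hypothetical.

```python
DEFAULT_CONTEXT_LIMIT = 200_000     # default window cited in this item
EXTENDED_CONTEXT_LIMIT = 1_000_000  # 1 M-token window on supported models


def check_fits(prompt_tokens: int, max_output_tokens: int, supports_1m: bool) -> None:
    """Raise ValueError if the request would not fit the model's window.

    Assumption: prompt and reserved output tokens share one window.
    `supports_1m` is set by the caller based on the target model.
    """
    limit = EXTENDED_CONTEXT_LIMIT if supports_1m else DEFAULT_CONTEXT_LIMIT
    needed = prompt_tokens + max_output_tokens
    if needed > limit:
        raise ValueError(
            f"request needs {needed} tokens but the window is {limit}; "
            "trim the prompt or switch to a model with 1 M-token support"
        )
```

For example, check_fits(250_000, 4_000, supports_1m=False) raises, while the same call with supports_1m=True passes.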

Claude Opus 4.7 launched at existing pricing

The newest Opus model (4.7) is generally available for complex reasoning and agentic coding, priced identically to Opus 4.6 ($5 / $25 per MTok). No code changes are required beyond updating the model name to pick up the announced capability improvements. Source
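For budgeting, the quoted rates translate into a simple per-request estimate. This sketch assumes the usual ordering, $5 per million input tokens and $25 per million output tokens, and ignores any caching or batch discounts; the function name is hypothetical.

```python
INPUT_USD_PER_MTOK = 5.0    # assumed to be the input rate
OUTPUT_USD_PER_MTOK = 25.0  # assumed to be the output rate


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Estimate the list-price cost of one request in US dollars."""
    total = input_tokens * INPUT_USD_PER_MTOK + output_tokens * OUTPUT_USD_PER_MTOK
    return total / 1_000_000


# 200 k input tokens and 10 k output tokens
print(estimate_cost_usd(200_000, 10_000))  # prints 1.25
```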

Rate Limits API now public

Admins can programmatically query organization‑ and workspace‑level rate limits, enabling automated monitoring and dynamic throttling in production systems. Integrate the new endpoint to replace manual dashboard checks. Source
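This digest does not state the endpoint path or response schema, so the sketch below covers only the throttling half: given a requests-per-minute limit obtained from the API, it computes a minimum spacing between calls. The function name and the safety_margin parameter are hypothetical.

```python
def min_interval_seconds(requests_per_minute: int, safety_margin: float = 0.8) -> float:
    """Return the minimum delay between requests, in seconds.

    `safety_margin` is the fraction of the limit to actually use
    (0.8 means staying at 80% of the allowed rate).
    """
    if requests_per_minute <= 0:
        raise ValueError("requests_per_minute must be positive")
    if not 0 < safety_margin <= 1:
        raise ValueError("safety_margin must be in (0, 1]")
    return 60.0 / (requests_per_minute * safety_margin)
```

A worker loop could sleep for this interval between calls and recompute it whenever the limit is re-queried.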

Multiple Claude models officially retired

claude-3-5-haiku-20241022, claude-3-7-sonnet-20250219, and claude-3-haiku-20240307 are now retired. Calls to these model IDs will fail; migrate workloads to currently supported models (e.g., Opus 4.7 or Sonnet 4.6). Source
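A small guard can catch retired IDs in configuration before they reach the API. The retired set below is taken from this item; the replacement ID is deliberately left to the caller, since exact model ID strings for the newer models are not given here. The helper name resolve_model is hypothetical.

```python
RETIRED_MODELS = frozenset({
    "claude-3-5-haiku-20241022",
    "claude-3-7-sonnet-20250219",
    "claude-3-haiku-20240307",
})


def resolve_model(model_id: str, replacement: str) -> str:
    """Return `replacement` if `model_id` is retired, else `model_id` unchanged.

    `replacement` is whatever supported model ID the caller has chosen.
    """
    return replacement if model_id in RETIRED_MODELS else model_id
```

Running this over stored configs at startup makes a forgotten retired ID show up as a logged substitution rather than a failed request.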

Mistral Vibe 2.0 released (paid API)

The Vibe 2.0 model is now available via paid API access, offering updated generation quality. Existing API keys can be used; ensure your plan includes the new model tier. Source

Mistral Python SDK v2.4.3 adds RAG pipeline registration, removes OCR request ID field

mistral.beta.rag.ingestionpipelineconfigurations.register() is now available; mistral.ocr.process().request.id has been removed (breaking change). Update imports and adjust any code that relied on the removed field. Source

Get instant alerts on Telegram: real-time push the moment an API breaks - @apidelta_alerts_bot (50 stars/month).
Free daily digest on Telegram: @apidelta_alex_bot.