ApiDelta — May 8, 2026
Anthropic made 1 M‑token context generally available on Claude Opus 4.6 and Sonnet 4.6, launched Claude Opus 4.7, opened a Rate Limits API, and retired several older Claude models; Mistral released Vibe 2.0 and updated its Python SDK.
1 M‑token context now GA for Claude Opus 4.6 and Sonnet 4.6
Requests of up to 1 M tokens now work without a beta header and are billed at standard pricing. Code that still sets the context-1m-2025-08-07 beta flag can drop it. Source
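For callers that build request headers by hand, removing the flag might look like the sketch below. It assumes the flag travels in a comma-separated anthropic-beta header stored under a lowercase key; the helper name drop_beta_flag is illustrative and not part of any SDK.

```python
def drop_beta_flag(headers: dict[str, str],
                   flag: str = "context-1m-2025-08-07") -> dict[str, str]:
    """Return a copy of `headers` with `flag` removed from the anthropic-beta list."""
    cleaned = dict(headers)
    raw = cleaned.get("anthropic-beta")  # assumption: key is stored lowercase
    if raw is None:
        return cleaned
    # Keep every other beta flag that was listed alongside the 1M-context one.
    remaining = [f.strip() for f in raw.split(",")
                 if f.strip() and f.strip() != flag]
    if remaining:
        cleaned["anthropic-beta"] = ",".join(remaining)
    else:
        # No flags left, so drop the header entirely.
        del cleaned["anthropic-beta"]
    return cleaned
```

The function returns a new dict rather than mutating its argument, so shared default headers are left untouched.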
1 M‑token beta removed for Claude Sonnet 4.5 and older Sonnet models
The beta header no longer enables a 1 M‑token window on Sonnet 4.5 or Sonnet 4; calls exceeding the default 200 k token limit will now return an error. Upgrade to Opus 4.6/4.7 or Sonnet 4.6 to retain large‑context capability. Source
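One way to avoid the new error is a pre-flight check that routes oversized prompts to a model that still supports the large window. The sketch below is illustrative only: the model names are passed in by the caller because this digest does not list exact model IDs, and the token count is assumed to come from whatever counting method you already use.

```python
DEFAULT_LIMIT = 200_000    # default context limit cited above, in tokens
LARGE_LIMIT = 1_000_000    # 1 M-token window

def pick_model(token_count: int, small_model: str, large_model: str) -> str:
    """Choose a model name based on an estimated prompt size.

    `small_model` and `large_model` are caller-supplied placeholders.
    """
    if token_count < 0:
        raise ValueError("token_count must be non-negative")
    if token_count <= DEFAULT_LIMIT:
        return small_model
    if token_count <= LARGE_LIMIT:
        return large_model
    raise ValueError(f"{token_count} tokens exceeds the {LARGE_LIMIT}-token window")
```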
Claude Opus 4.7 launched at existing pricing
The newest Opus model (4.7) is generally available for complex reasoning and agentic coding, priced identically to Opus 4.6 ($5 input / $25 output per MTok). The only change needed to pick up the announced capability improvements is the model name in your requests. Source
Rate Limits API now public
Admins can programmatically query organization‑ and workspace‑level rate limits, enabling automated monitoring and dynamic throttling in production systems. Integrate the new endpoint to replace manual dashboard checks. Source
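As a rough illustration of dynamic throttling, the sketch below turns a requests-per-minute limit into a minimum delay between calls. The endpoint path and response shape are not given in this digest, so the limit is passed in as a plain number; the function name and the headroom parameter are assumptions made for the example.

```python
def min_interval_seconds(requests_per_minute: int, headroom: float = 0.8) -> float:
    """Minimum spacing between requests to stay under a per-minute limit.

    `headroom` is the fraction of the limit to actually use (hypothetical knob),
    leaving the rest as a safety margin for other clients in the workspace.
    """
    if requests_per_minute <= 0:
        raise ValueError("requests_per_minute must be positive")
    if not 0 < headroom <= 1:
        raise ValueError("headroom must be in (0, 1]")
    return 60.0 / (requests_per_minute * headroom)
```

A scheduler could re-read the limit periodically and recompute the interval instead of hard-coding a value copied from the dashboard.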
Multiple Claude models officially retired
claude-3-5-haiku-20241022, claude-3-7-sonnet-20250219, and claude-3-haiku-20240307 are now retired. Calls to these model IDs will fail; migrate workloads to currently supported models (e.g., Opus 4.7 or Sonnet 4.6). Source
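A quick startup audit can flag any configured model that is on the retired list before requests begin to fail. The sketch below uses the three IDs named above; the function name find_retired is illustrative, and choosing replacements is left to the caller.

```python
# Model IDs listed as retired in this digest.
RETIRED_MODELS = frozenset({
    "claude-3-5-haiku-20241022",
    "claude-3-7-sonnet-20250219",
    "claude-3-haiku-20240307",
})

def find_retired(configured_models: list[str]) -> list[str]:
    """Return the configured model IDs that appear on the retired list, in order."""
    return [m for m in configured_models if m in RETIRED_MODELS]
```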
Mistral Vibe 2.0 released (paid API)
The Vibe 2.0 model is now available via paid API access, offering updated generation quality. Existing API keys can be used; ensure your plan includes the new model tier. Source
Mistral Python SDK v2.4.3 adds RAG pipeline registration, removes OCR request ID field
mistral.beta.rag.ingestionpipelineconfigurations.register() is now available; mistral.ocr.process().request.id has been removed (breaking change). Update imports and adjust any code that relied on the removed field. Source
Free daily digest on Telegram: @apidelta_alex_bot.