Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Was this page helpful?
For release notes on Claude Apps, see the Release notes for Claude Apps in the Claude Help Center.
For updates to Claude Code, see the complete CHANGELOG.md in the claude-code repository.
claude-sonnet-4-20250514) and the Claude Opus 4 model (claude-opus-4-20250514). All requests to these models will now return an error. We recommend upgrading to Claude Sonnet 4.6 and Claude Opus 4.8 respectively. Researchers can request ongoing access through the External Researcher Access Program.GET /v1/environments/{id}/work endpoint, which lists pending work for a self-hosted sandbox, is now available on Claude Platform on AWS. See IAM actions for Claude Platform on AWS for the GetEnvironment action that authorizes it.claude-fable-5), our most capable widely released model, alongside Claude Mythos 5 (claude-mythos-5) for Project Glasswing participants. Both models support a 1M token context window by default, 128k max output tokens, and always-on adaptive thinking. See Introducing Claude Fable 5 and Claude Mythos 5 for capabilities, API changes, and availability.model: "claude-fable-5" to measure your prompts under the new tokenizer.stop_reason: "refusal". You are not billed for a request refused before any output is generated. An opt-in fallbacks parameter (in beta on the Claude API and Claude Platform on AWS; not supported on the Message Batches API) re-runs refused requests on another model, billed at the fallback model's rates. See Handling stop reasons.claude-opus-4-1-20250805), with retirement on the Claude API scheduled for August 5, 2026. We recommend migrating to Claude Opus 4.8. Read more in model deprecations.max_tokens parameter to cap the advisor model's output per call, reducing latency and output token cost for workloads that don't need full-length advisor responses. Set tools[].max_tokens on the advisor tool definition; see Capping advisor output.stop_reason: "refusal" without Claude having generated any output. See Streaming refusals for detecting and handling refusals.AnthropicSelfHostedEnvironmentAccess managed policy.role: "system" messages after a user turn (subject to placement rules) in the messages array, preserving prompt cache hits when instructions change during a long-running session. No beta header is required.stop_details field on refusal responses is now publicly documented; it returns a category (cyber, bio, or null) and a human-readable explanation, so your application can route different classes of refusal to the right next step. No beta header is required.usage.output_tokens_details.thinking_tokens, reporting how many of the billed output tokens were extended thinking. When streaming, the breakdown appears only on the final message_delta event. No beta header is required.agent_toolset and MCP tools exceeding 100K tokens are now automatically spilled to a file in the sandbox. The model receives a truncated preview with the file path and can read the full content from there.diagnostics.previous_message_id on a Messages request and the API reports a cache_miss_reason explaining where the prompt cache prefix diverged from the previous turn. Include the cache-diagnosis-2026-04-07 beta header in your requests.speed: "fast" with model: "claude-opus-4-7" and the fast-mode-2026-02-01 beta header for significantly faster output token generation at premium pricing. Pricing, rate limits, and access are the same as for Opus 4.6 fast mode; interested customers should join the waitlist.managed-agents-2026-04-01 beta header.mcp_oauth credentials. See Authenticate with vaults.context-1m-2025-08-07) for Claude Sonnet 4.5 and Claude Sonnet 4. The beta header now has no effect on these models, and requests exceeding the standard 200k-token context window return an error. To use the 1M context window, migrate to Claude Sonnet 4.6 or Claude Opus 4.6, where it's generally available at standard pricing with no beta header required.managed-agents-2026-04-01 header. See Using agent memory for the full integration guide.claude-3-haiku-20240307). All requests to this model will now return an error. We recommend upgrading to Claude Haiku 4.5./anthropic/v1/messages, in 27 AWS regions with global and regional endpoints.claude-sonnet-4-20250514) and the Claude Opus 4 model (claude-opus-4-20250514), with retirement on the Claude API scheduled for June 15, 2026. We recommend migrating to Claude Sonnet 4.6 and Claude Opus 4.8 respectively. Read more in model deprecations.advisor-tool-2026-03-01 in your requests.managed-agents-2026-04-01 beta header. Learn more in Claude Managed Agents overview.ant CLI, a command-line client for the Claude API that enables faster interaction with the Claude API, native integration with Claude Code, and versioning of API resources in YAML files. Learn more in the CLI quickstart./anthropic/v1/messages uses the same request shape as the first-party Claude API and runs on AWS-managed infrastructure with zero operator access. Available in us-east-1; contact your Anthropic account executive to request access. Learn more in Claude in Amazon Bedrock.max_tokens cap to 300k on the Message Batches API for Claude Opus 4.6 and Sonnet 4.6. Include the output-300k-2026-03-24 beta header to generate longer single-turn outputs for long-form content, structured data, and large code generation tasks.context-1m-2025-08-07 beta header will have no effect on these models, and requests that exceed the standard 200k-token context window will return an error. To continue using 1M context windows, migrate to Claude Sonnet 4.6 or Claude Opus 4.6, which support the full 1M token context window at standard pricing with no beta header required.GET /v1/models and GET /v1/models/{model_id} now return max_input_tokens, max_tokens, and a capabilities object. Query the API to discover what each model supports.display field for extended thinking, letting you omit thinking content from responses for faster streaming. Set thinking.display: "omitted" to receive thinking blocks with an empty thinking field and the signature preserved for multi-turn continuity. Billing is unchanged. Learn more in Controlling thinking display.cache_control field to your request body and the system automatically caches the last cacheable block, moving the cache point forward as conversations grow. No manual breakpoint management required. Works alongside existing block-level cache control for fine-grained optimization. Available on the Claude API and Microsoft Foundry (preview). Learn more in Prompt caching.claude-3-7-sonnet-20250219) and the Claude Haiku 3.5 model (claude-3-5-haiku-20241022). All requests to these models will now return an error. We recommend upgrading to Claude Sonnet 4.6 and Claude Haiku 4.5 respectively. Researchers can request ongoing access through the External Researcher Access Program.claude-3-haiku-20240307), with retirement scheduled for April 20, 2026. We recommend migrating to Claude Haiku 4.5. Read more in Model deprecations.speed parameter. Fast mode is up to 2.5x as fast at premium pricing. Interested customers should join the waitlist.thinking: {type: "adaptive"}); manual thinking (type: "enabled" with budget_tokens) is deprecated. Opus 4.6 does not support prefilling assistant messages. Learn more in What's new in Claude 4.6.budget_tokens for controlling thinking depth on new models.inference_geo parameter. US-only inference is available at 1.1x pricing for models released after February 1, 2026.output_format parameter has moved to output_config.format. Existing beta users can continue using the beta header during the transition period. Structured outputs remain in public beta on Amazon Bedrock and Microsoft Foundry.console.anthropic.com now redirects to platform.claude.com. The Claude Console has moved to its new home as part of our Claude brand consolidation. Existing bookmarks and links will continue working via automatic redirect. For more details, see the September 16, 2025 announcement.claude-3-opus-20240229). All requests to this model will now return an error. We recommend upgrading to Claude Opus 4.5, which offers significantly improved intelligence at a third of the cost. Researchers can request ongoing access to Claude Opus 3 on the API through the External Researcher Access Program.tool_runner.structured-outputs-2025-11-13.clear_thinking_20251015), enabling automatic management of thinking blocks. Learn more in Context editing.skills-2025-10-02 beta), a new way to extend Claude's capabilities. Skills are organized folders of instructions, scripts, and resources that Claude loads dynamically to perform specialized tasks. The initial release includes:
/v1/skills endpoints) to package domain expertise and organizational workflowsmodel_context_window_exceeded that allows you to request the maximum possible tokens without calculating input size. Learn more in Handling stop reasons.request-id header. Learn more in Errors.claude-3-5-sonnet-20240620 and claude-3-5-sonnet-20241022). These models will be retired on October 28, 2025. We recommend migrating to Claude Sonnet 4.5 (claude-sonnet-4-5-20250929) for improved performance and capabilities. Read more in Model deprecations.rate_limit_error) errors following a sharp increase in API usage due to acceleration limits on the API. Previously, 529 (overloaded_error) errors would occur in similar scenarios.search-results-2025-06-09 is no longer required. Learn more in Search results.* - Opus 4.1 does not allow both temperature and top_p parameters to be specified. Please use only one.
text_editor_20250728, an updated text editor tool that fixes some issues from the previous versions and adds an optional max_characters parameter that allows you to control the truncation length when viewing large files.search-results-2025-06-09.fine-grained-tool-streaming-2025-05-14.signature field of thinking block output.interleaved-thinking-2025-05-14.content block of tool_result and document.source. For backwards compatibility, if cache control is detected on the last block in tool_result.content or document.source.content, it will be automatically applied to the parent block instead. Cache control on any other blocks within tool_result.content and document.source.content will result in a validation error.none option to the tool_choice parameter in the Messages API that prevents Claude from calling any tools. Additionally, you're no longer required to provide any tools when including tool_use and tool_result blocks.bash_20250124: Same functionality as previous version but is independent from computer use. Does not require a beta header.text_editor_20250124: Same functionality as previous version but is independent from computer use. Does not require a beta header.anthropic-organization-id response header to all API responses. This header provides the organization ID associated with the API key used in the request.The following features are now generally available in the Claude API:
We also released new official SDKs:
user/assistant turns in our Messages API. Consecutive user/assistant messages will be combined into a single message instead of erroring, and we no longer require the first input message to be a user message.disable_parallel_tool_use: true in the tool_choice field to ensure that Claude uses at most one tool. Read more in Parallel tool use.dangerouslyAllowBrowser: true in the SDK instantiation to enable this feature.anthropic-beta: max-tokens-3-5-sonnet-2024-07-15 header."reasoning_extraction""cyber""bio"thinking: {"type": "disabled"} is not supported, and manual extended thinking budgets and assistant prefill are not supported (both return a 400 error). See Migrating from Claude Mythos Preview to Claude Mythos 5.thinking.display defaults to "omitted", the same as Claude Opus 4.8, Claude Opus 4.7, and Claude Mythos Preview; set display: "summarized" to receive readable thinking summaries. The raw chain of thought is never returned; pass thinking blocks back unchanged in multi-turn conversations on the same model. See Thinking output on Claude Fable 5 and Claude Mythos 5.session.thread_* webhook events now include a session_thread_id field identifying the multi-agent thread that triggered the event.high across all surfaces, including Claude Code and the Messages API.temperature, top_p, or top_k to a non-default value returns a 400 error on Claude Opus 4.8, same as on Claude Opus 4.7. See the migration guide for details.top_p nucleus sampling parameter in the Messages API from 0.999 to 0.99 for all models. To revert this change, set top_p to 0.999.
Additionally, when extended thinking is enabled, you can now set top_p to values between 0.95 and 1.computer_20250124: Updated computer use tool with new command options including "hold_key", "left_mouse_down", "left_mouse_up", "scroll", "triple_click", and "wait". This tool requires the "computer-use-2025-01-24" anthropic-beta header.
Learn more in Tool use with Claude.