for AWS partners →Get AWS credits to run Claude →

claude on amazon bedrock · pricing, access, setup · 2026

Claude on Amazon Bedrock — pricing, access & setup.

A complete, neutral reference for running Anthropic's Claude models on Amazon Bedrock in 2026: the Claude family (Opus, Sonnet, Haiku) and which model fits which job; why run Claude through Bedrock instead of the Anthropic API direct (AWS-native IAM and VPC, consolidated billing, data residency — and AWS credits that apply); model IDs and how to enable model access; a per-model pricing table; the capabilities that matter (vision, tool use, long context, prompt caching, extended thinking); a minimal Converse API snippet; use cases per model; a brief Claude-vs-Nova-vs-GPT view; and how AWS credits make running Claude $0.

Get AWS credits to run Claude →→ jump to the per-model pricing table

models

Opus · Sonnet · Haiku

access via

one AWS API

data privacy

stays in your account

cost with credits

TL;DR

Claude runs natively on Amazon Bedrock as one of the providers behind Bedrock's single API. You get the current Claude generation — Opus (deepest reasoning), Sonnet (the balanced workhorse), and Haiku (fast and cheap) — accessed through the same Converse API and IAM/VPC controls as every other Bedrock model, with your prompts and data staying in your AWS account and region.
Running Claude via Bedrock instead of the Anthropic API direct buys you AWS-native security (IAM, VPC/PrivateLink, KMS, CloudTrail), consolidated billing on your existing AWS invoice, data-residency control by region — and, the decisive one for startups, AWS credits apply to Claude usage. Capabilities on Bedrock include vision, tool use, long context, prompt caching, and extended thinking.
Pricing is per-token and per-model: Haiku costs cents per million tokens, Sonnet is the mid workhorse, Opus-class is dollars per million for the hardest reasoning. The same workload can be 10×+ cheaper on the right Claude model. AWS credits (Activate up to $100K, Bedrock/GenAI POC $10K–$50K, GenAI Accelerator up to $1M) cover Claude inference entirely — CloudRoute routes you to the credit pool and a vetted AWS partner, so you pay $0.

the models

IThe Claude family on Amazon Bedrock

Anthropic's Claude is available natively on Amazon Bedrock — it is one of the foundation-model providers behind Bedrock's single managed API, alongside Amazon's own Nova and Titan, Meta Llama, Mistral, Cohere, and others. On Bedrock the Claude lineup follows a clear three-tier shape, and choosing the right tier is the most important cost-and-quality decision you will make.

The Claude family is organized as a ladder of three tiers, each a deliberate trade-off between intelligence, speed, and cost. As of 2026 the current generation on Bedrock spans: Claude Opus — the most capable tier, built for the hardest reasoning, complex multi-step analysis, and agentic work where quality dominates; Claude Sonnet — the balanced workhorse that handles the large majority of production traffic with strong reasoning at a fraction of Opus cost and latency; and Claude Haiku — the fast, low-cost tier for high-throughput, latency-sensitive, or simpler tasks. The exact version names and identifiers advance over time as Anthropic ships new generations; this page describes the durable tier structure and points you to the model catalog for current IDs.

The practical discipline is the same one that governs all Bedrock cost: match the model to the task. Use Haiku for the easy, high-volume requests; use Sonnet as the default for real work; reserve Opus for the genuinely hard requests where its extra reasoning earns its higher price. Many production systems route across all three — a cheap model triages and handles the bulk, escalating only the hard cases to a stronger one. That single pattern routinely cuts Claude spend several-fold with little quality loss.

Because all three are served through the same Bedrock API, switching between them is usually a one-line change to the model ID — which makes the route-and-escalate pattern easy to build and easy to tune. The capabilities (vision, tool use, long context, prompt caching, extended thinking) and the security model are consistent across the family, so you design once and choose the tier per request.

One caveat, stated once and meant throughout: exact model version names, model IDs, regional availability, context-window sizes, and per-token prices all change frequently as Anthropic ships new Claude generations and AWS updates Bedrock. The figures and identifiers here are representative as of 2026 to convey the structure and relative cost. Always confirm the current model IDs in the Bedrock model catalog and current rates on the AWS Bedrock pricing page before you build or budget.

the three-tier shape

Opus = deepest reasoning, highest cost — reserve for hard problems. Sonnet = the balanced workhorse — your sensible default for production. Haiku = fast and cheap — high-volume, latency-sensitive, simpler tasks. Switching between them is a one-line model-ID change, which is why tiered routing is the standard cost pattern.

the positioning question

IIWhy run Claude via Bedrock instead of the Anthropic API direct?

Claude is available both directly from Anthropic's own API and through Amazon Bedrock. The model is the same Claude either way — so the choice is about everything around the model: security posture, billing, data control, and funding. For teams already on AWS, Bedrock usually wins on all four.

This is the central question this page exists to answer, and the honest framing is that it is an operational decision, not a model-quality one. You are choosing where Claude runs and how it is governed, billed, and paid for. Here is what running Claude through Bedrock buys you over the direct API:

AWS-native security and identity — Calls are authenticated and authorized with IAM — the same roles, policies, and least-privilege model you already use across AWS. You can keep traffic on your private network with VPC endpoints (PrivateLink), encrypt with your own KMS keys, and get a full audit trail in CloudTrail. No separate API key to provision, rotate, and secure outside your AWS account.
Consolidated billing — Claude usage lands on your existing AWS invoice, in the same Cost Explorer and budgets as the rest of your stack — no separate vendor bill, no separate payment relationship, no extra procurement. For finance and FinOps that single-invoice consolidation is a real simplification.
Data residency and privacy — You choose the AWS region Claude runs in, so prompts and responses stay in the jurisdiction you select — important for GDPR, data-residency, and regulated workloads. As with all of Bedrock, your inputs and outputs are not used to train the base model and stay within your account and region.
One API across many models — Through Bedrock's Converse API, Claude sits behind the same interface as Nova, Llama, Mistral, Cohere, and Titan. You can build a model-agnostic application and switch or mix models — including routing easy requests to a cheaper model and hard ones to Claude — without rewriting your integration.
The decisive one — AWS credits apply — Claude usage on Bedrock draws down AWS credits (Activate, Bedrock/GenAI POC, the GenAI Accelerator) like any other AWS spend. The direct Anthropic API does not. For a startup with a credit pool, running Claude via Bedrock can be effectively $0 during the build — which is exactly why this is CloudRoute's tightest fit.

When is the direct API the better pick? If you are not on AWS, want the very newest Claude release the day it ships (new generations sometimes appear on the direct API first), or rely on an Anthropic-specific feature before it lands on Bedrock, going direct can make sense. But for the large population of teams already building on AWS — and especially any startup eligible for AWS credits — Bedrock is usually the stronger home for Claude.

getting in

IIIModel IDs and how to enable model access

Before you can call Claude on Bedrock, you have to do one small but mandatory thing: request model access in your account. Foundation models on Bedrock are off by default; turning Claude on is a one-time, no-cost step in the console.

Enabling access. In the Bedrock console, open Model access, find the Claude models you want, and request access. For most Claude models this is granted effectively immediately; some models prompt for brief use-case details. There is no charge for enabling access — you only pay when you actually call a model. Access is per-account and per-region, so if you operate in several regions, enable Claude in each one you will call from. This is also where cross-region inference profiles come in: they let Bedrock route your Claude calls across a set of regions for better availability and throughput (see the amazon-bedrock-cross-region-inference sibling).

Model IDs. Every model on Bedrock is invoked by a model ID — a string identifying the provider, model, and version (Claude IDs are namespaced under Anthropic, e.g. an identifier of the shape anthropic.claude-…, with a version suffix). You pass this ID to the API to choose which model and tier answers a request, so moving a request from Haiku to Sonnet to Opus is just a change of model-ID string. Because IDs advance with each Claude generation, do not hard-code a guessed value — read the current ID from the Bedrock model catalog (console) or list it via the API/CLI, and treat it as configuration rather than a literal in your code.

Permissions. The IAM principal making the call needs permission for the relevant Bedrock invoke actions (and, if you use cross-region inference profiles, permission on the profile). A least-privilege policy scoped to the specific Claude model ARNs you intend to use is the recommended posture. Once access is granted and IAM is in place, you are ready to call Claude — the next section shows the minimal request.

Open the Bedrock console → Model access → request access to the Claude models you need (free; usually instant).
Enable access in each region you will call from; consider a cross-region inference profile for availability.
Get the current model ID from the model catalog or via the API — do not hard-code a guessed version string.
Attach an IAM policy granting the Bedrock invoke actions on the specific Claude model ARNs (least privilege).
You are billed only on invocation — enabling access costs nothing.

what it costs

IVClaude on Bedrock — per-model pricing

Claude on Bedrock is billed per token: a rate per 1,000 input tokens (everything you send) and a higher rate per 1,000 output tokens (everything Claude generates), with output typically priced several times higher than input. The rate depends entirely on the tier — and the spread across tiers is wide enough that model choice is the dominant cost lever.

The table below gives representative 2026 on-demand rates for the three Claude tiers, shown per 1,000 and per 1,000,000 tokens (the per-million column is simply the per-1K figure × 1,000; providers increasingly quote per-million). Use it to rank the tiers by cost and sanity-check a budget — not as an audited price sheet. Two cost levers sit on top of these rates and are not shown in the table: Batch (submit non-interactive work as an async job for roughly half the on-demand price) and prompt caching (stop re-paying full input price for a repeated prefix like a long system prompt). Both can substantially lower the effective rate — see amazon-bedrock-pricing and amazon-bedrock-prompt-caching.

representative on-demand Claude-on-Bedrock pricing · per 1K and per 1M tokens · 2026

Claude tier	Input / 1K	Output / 1K	Input / 1M	Output / 1M	Cost position
Claude Haiku	$0.00025	$0.00125	$0.25	$1.25	Cheapest — high-volume / fast
Claude Sonnet	$0.003	$0.015	$3.00	$15.00	Mid — the workhorse default
Claude Opus-class	$0.015	$0.075	$15.00	$75.00	Highest — hardest reasoning

Representative 2026 figures for relative comparison only — confirm current rates on the AWS Bedrock pricing page (they change with each generation and vary by region). Output is typically ~5× input. Batch (~50% off) and prompt caching (discounted repeated context) lower the effective rate further. Opus is roughly 60× Haiku's input rate — which is why tiered routing matters.

what it can do

VCapabilities: vision, tool use, long context, prompt caching, extended thinking

Claude on Bedrock is not just text-in/text-out. The current generation brings a set of capabilities that materially expand what you can build — and several of them are also cost or quality levers. Availability of any given capability can vary by Claude tier and version, so confirm specifics for your chosen model.

Vision (multimodal input)

Claude models on Bedrock can accept images alongside text in a request and reason about them — reading charts and diagrams, extracting data from screenshots, interpreting documents and photos, and answering questions about visual content. This turns a large class of document-understanding and visual-QA problems into a single Converse call, no separate OCR or vision pipeline required.

Tool use (function calling)

Claude supports tool use: you describe tools (functions, APIs, database queries) and Claude decides when to call them and with what arguments, then incorporates the results into its answer. This is the foundation of agentic systems — letting Claude look things up, take actions, and ground its responses in live data. On Bedrock it is exposed through the Converse API's tool fields and underpins Bedrock Agents.

Long context

Claude models offer a large context window — room for very long inputs (lengthy documents, large codebases, extended conversation history, many retrieved chunks) in a single request. Long context simplifies RAG and document workflows because you can fit more relevant material in one call. It is also a cost consideration: input is billed per token, so a big context costs more — which is exactly where prompt caching earns its keep.

Prompt caching

When many requests share a large common prefix — a long system prompt, a fixed instruction set, a reference document, or tool definitions — prompt caching lets Bedrock cache that prefix so subsequent requests are not billed full input price for it again. On chatbots and RAG with a large fixed context, this can cut the input portion of the bill by a large fraction. It is one of the most effective Claude cost levers; see the amazon-bedrock-prompt-caching sibling for the mechanics.

Extended thinking

Newer Claude models support extended thinking — an explicit reasoning mode in which the model spends additional internal steps working through a hard problem before answering, improving quality on complex math, multi-step analysis, and difficult coding. You can typically control how much thinking budget to allow. It trades some latency and output cost for accuracy on genuinely hard tasks — best reserved for the requests that need it rather than turned on for everything.

calling it

VIA minimal Converse API call

The recommended way to call Claude (and any chat model) on Bedrock is the <strong>Converse API</strong> — a single, model-agnostic interface for multi-turn messages, system prompts, tool use, and multimodal input. Because it is model-agnostic, the same code calls Haiku, Sonnet, or Opus by changing only the model ID.

A minimal text request with the AWS SDK looks like the snippet below (Python / boto3). You create a Bedrock Runtime client, call converse with a model ID and a list of messages, and read the reply from the response. Swapping modelId between the Haiku, Sonnet, and Opus IDs is the only change needed to move a request across tiers — which is what makes tiered routing a one-line decision.

import boto3
client = boto3.client("bedrock-runtime", region_name="us-east-1")
resp = client.converse(
  modelId="anthropic.claude-<tier>-<version>", # from the model catalog
  messages=[{"role": "user", "content": [{"text": "Summarize this contract clause: ..."}]}],
  system=[{"text": "You are a concise legal assistant."}],
  inferenceConfig={"maxTokens": 512, "temperature": 0.2},
)
print(resp["output"]["message"]["content"][0]["text"])

That is the whole pattern for a basic call. From here you add multi-turn history (append assistant and user messages), tool use (a toolConfig describing your functions, with a streaming or multi-step loop to feed results back), vision (image blocks in the message content), and streaming (the converse_stream variant for token-by-token output). The same shape holds throughout — the API surface barely changes as you add capabilities, which is the point of Converse. The exact model ID string must come from the Bedrock model catalog; the placeholder above is illustrative, not a literal value.

why Converse

The Converse API is model-agnostic: one interface for messages, system prompts, tool use, and images across every Bedrock model. Switching Claude tiers — or swapping Claude for Nova or Llama — is a change to modelId, not a rewrite. Build once, route per request.

matching tier to job

VIIUse cases — which Claude for which job

The clearest way to think about the family is by mapping common production workloads to the cheapest tier that does them well. Start a request on the smallest tier that clears your quality bar and only escalate when it does not.

Claude Haiku — high-volume, latency-sensitive, simpler tasks — Classification, routing and triage, data extraction, short-form generation, real-time chat where speed matters, the cheap first stage of a tiered router, and bulk processing (especially via Batch). When good-enough quality at the lowest cost and fastest response is the goal, start here.
Claude Sonnet — the production workhorse — The sensible default for most real work: RAG knowledge assistants, customer-support agents, content generation, coding assistance, document analysis, and the reasoning behind most Bedrock Agents. Strong quality at a fraction of Opus cost and latency — where the majority of production traffic should live.
Claude Opus — hardest reasoning and complex agents — Reserve for the genuinely difficult: deep multi-step reasoning, complex analysis, hard coding and refactoring, research-style synthesis, and agentic workflows where a wrong step is expensive. Pricier and a bit slower — worth it only for the requests that actually need its depth, ideally reached via escalation rather than as a default.
Tiered routing — use all three — The highest-leverage pattern: a cheap model triages and handles the easy majority, escalating only hard cases to Sonnet or Opus. Because switching tiers is a one-line model-ID change on the Converse API, this is straightforward to build — and it routinely cuts Claude spend several-fold with little quality loss.

the field on Bedrock

VIIIClaude vs Amazon Nova vs GPT on Bedrock — briefly

Claude is one strong choice among several on Bedrock. A quick, honest orientation versus the two other names people ask about — Amazon's own Nova family, and OpenAI's GPT models — without turning this into a full shootout.

Claude vs Amazon Nova. Nova is Amazon's own foundation-model family on Bedrock (Micro / Lite / Pro / Premier for text, plus Canvas for images and Reel for video), engineered for low cost and low latency. At the cheap end, Nova Micro and Lite undercut even Haiku and are excellent for very high-volume, simple, latency-sensitive work. Claude tends to be the pick when you want the strongest reasoning and the specific Claude behaviour and capability profile, particularly Sonnet and Opus for harder tasks. A common pattern is to mix them: Nova for the cheapest bulk path, Claude for the quality path — trivial to do behind one Converse API. See the amazon-nova sibling.

Claude vs GPT. The Bedrock-specific point is availability: Bedrock's value is its catalog of providers, and which exact third-party frontier models are offered changes over time, by region, and by provider agreement. Where multiple frontier families are available on Bedrock, the right choice is workload-specific — benchmark the candidates on your task and prompts rather than on leaderboard headlines, since relative strengths shift with each generation. The structural advantages of running on Bedrock (IAM/VPC, consolidated billing, data residency, and AWS credits) apply regardless of which model you land on. For a fuller treatment, see amazon-bedrock-vs-openai.

The meta-point: Bedrock lets you defer and revisit this choice cheaply. Because every model sits behind the same API, you can start on Claude, A/B a Nova or other model on part of your traffic, and re-tier as prices and capabilities move — without re-plumbing your application.

how it becomes $0

IXHow AWS credits make running Claude $0

Everything above prices Claude on Bedrock if you pay AWS directly. For most startups and many companies the relevant number is different — because AWS will frequently fund the build with credits, and Claude usage on Bedrock draws those credits down before it ever touches your card. This is the single tightest fit in CloudRoute's whole offer.

Claude inference on Bedrock is ordinary AWS spend, so it is fully credit-eligible and credits apply automatically against your bill until exhausted — covering Claude tokens, any Batch and prompt-caching usage, plus the supporting services (Knowledge Bases, vector store, S3, logging). The relevant pools: AWS Activate (general startup credits, commonly up to $100K for institutionally-funded startups); a dedicated Bedrock / Generative-AI POC pool ($10K–$50K) aimed at proving out a GenAI use case; and the competitive Generative AI Accelerator (awards up to $1M for a small cohort of AI-first startups). This is precisely the advantage Bedrock has over the direct Anthropic API for a funded startup: credits apply to Claude on Bedrock; they do not apply to the direct API.

The practical mechanic is that most of these pools are partner-filed — requested through the AWS Partner Network (the ACE program), not a public self-serve form — which is why teams route through an AWS partner rather than applying alone. That is the gap CloudRoute fills. CloudRoute matches you to the right credit pool for your stage and to a vetted AWS DevOps/ML partner who both files the credit application and helps build the Claude workload — the tiered model router, the RAG pipeline behind Knowledge Bases, the agent with tool use, prompt caching on the fixed context. The customer pays $0 — AWS funds the credit pool, AWS pays the partner through engagement-funding programs, and the partner pays CloudRoute a routing commission. You never see an invoice.

Put together with the tiered-routing and caching levers above, the picture for a startup is: build on the Claude tier each request actually needs, cache the repeated context, and run the whole thing on a $25K–$100K (or larger) credit pool while you find product-market fit — paying real money only once usage, and ideally revenue, has scaled past the credits. Related: AWS credits for generative-AI startups and Bedrock POC funding for the full credit mechanics.

pick a tier

Claude Opus vs Sonnet vs Haiku on Bedrock — cost, speed, use

The core decision in one place: the three Claude tiers compared on intelligence, speed, cost, and the work each is suited to. Match the request to the cheapest tier that clears the bar and escalate from there. Representative 2026 figures for relative comparison, not quotes.

Tier	Intelligence	Speed	Relative cost (input/1M)	Best for	Avoid for
Claude Haiku	Good	Fastest	~$0.25 (lowest)	High-volume, latency-sensitive, simple tasks; tier-1 of a router; Batch	Hard multi-step reasoning
Claude Sonnet	Strong	Fast	~$3 (mid)	The production default: RAG, agents, support, coding, content	Throwaway bulk where Haiku suffices
Claude Opus-class	Deepest	Moderate	~$15 (highest)	Hardest reasoning, complex analysis, high-stakes agentic steps	High-volume simple work (wasteful)

Roughly an order-of-magnitude cost step between each tier (Opus input ≈ 60× Haiku). That spread is why tiered routing — cheap model triages, hard cases escalate — is the standard cost pattern. Batch (~50% off) and prompt caching lower every tier further. Switching tiers is a one-line model-ID change on the Converse API.

the tightest fit in the whole offer

Credits apply to Claude on Bedrock (not the direct API) — get the pool + a partner to build it ($0)

Get matched in 24h →

a recent match

Claude moved from the direct API onto Bedrock — and onto $0 — anonymized

inquiry · Series-A B2B SaaS, Austin

Series-A B2B SaaS, 24 people, already shipping a Claude-powered feature on the direct Anthropic API

Situation: The product already used Claude via the Anthropic API direct and the bill was climbing as usage grew — paid out of runway, on a separate vendor invoice, with every request hitting a frontier tier. They were already an AWS customer for the rest of their stack and wanted (a) to bring Claude under their AWS security and billing, and (b) to stop paying for it out of pocket.

What CloudRoute did: CloudRoute matched them in under 24 hours to a US-East AWS partner with GenAI experience. The partner (1) migrated the Claude calls to Bedrock's Converse API — IAM auth, VPC endpoints, one consolidated bill; (2) introduced a tiered router (Haiku for the easy majority, Sonnet for real work, Opus only for the hard cases); (3) turned on prompt caching for the fixed system prompt; and (4) filed a Bedrock POC credit application plus an Activate Portfolio application to fund the workload.

Outcome: Claude now runs on Bedrock under the team's existing AWS IAM and billing, and the tiered router plus caching cut the modeled per-request cost substantially — but the decisive change was that the spend now draws down AWS credits instead of runway, so the team pays $0 during the build and early scale. CloudRoute's commission was paid by the partner from AWS engagement funding, not by the customer.

moved: direct API → Bedrock Converse · pattern: tiered routing + prompt caching · credits secured: POC + Activate · out-of-pocket: $0

faq

Common questions

Is Claude available on Amazon Bedrock?

Yes. Anthropic's Claude runs natively on Amazon Bedrock as one of the foundation-model providers behind Bedrock's single managed API, alongside Amazon Nova and Titan, Meta Llama, Mistral, Cohere, and others. As of 2026 the current generation spans three tiers — Claude Opus (deepest reasoning), Claude Sonnet (the balanced workhorse), and Claude Haiku (fast and cheap) — all accessed through the same Converse API and IAM/VPC controls. You enable access per account and region in the Bedrock console.

Should I run Claude on Bedrock or use the Anthropic API directly?

The model is the same Claude either way, so it is an operational choice. Bedrock gives you AWS-native security (IAM, VPC/PrivateLink, KMS, CloudTrail), consolidated billing on your existing AWS invoice, data-residency control by region, one API across many models — and, decisively for startups, AWS credits apply to Claude usage. The direct API can make sense if you are not on AWS, want the newest Claude release the day it ships, or need an Anthropic-specific feature before it reaches Bedrock. For teams already on AWS, and any startup with AWS credits, Bedrock usually wins.

How much does Claude cost on Bedrock?

It is billed per token, per tier: representative 2026 on-demand rates are roughly $0.25 / $1.25 per million input/output tokens for Haiku, $3 / $15 for Sonnet, and $15 / $75 for Opus-class. Output is priced about 5× input, and Opus input is roughly 60× Haiku's — so model choice is the dominant cost lever. Batch (~50% off) and prompt caching lower the effective rate further. These are representative figures for relative comparison; confirm current rates on the AWS Bedrock pricing page, as they change with each generation and vary by region.

How do I enable access to Claude on Bedrock?

In the Bedrock console, open Model access, find the Claude models you want, and request access — it is free and usually granted immediately (some models ask for brief use-case details). Access is per-account and per-region, so enable Claude in each region you will call from, and consider a cross-region inference profile for availability. Then attach an IAM policy granting the Bedrock invoke actions on the specific Claude model ARNs. You are billed only when you invoke a model; enabling access costs nothing.

What is the Claude model ID on Bedrock?

Each Claude model is invoked by a model ID — a string identifying the provider, model, and version, namespaced under Anthropic (of the shape anthropic.claude-… with a version suffix). You pass it to the API to pick the tier, so moving a request between Haiku, Sonnet, and Opus is just a change of model-ID string. Because IDs advance with each generation, do not hard-code a guessed value — read the current ID from the Bedrock model catalog in the console or list it via the API/CLI, and treat it as configuration.

What capabilities does Claude support on Bedrock?

The current generation supports vision (image input reasoning), tool use / function calling (the basis for agents), a large context window for long documents and history, prompt caching (discounting repeated context like a long system prompt), and extended thinking on newer models (an explicit deeper-reasoning mode for hard tasks). All are exposed through the Converse API, and Claude underpins Bedrock Agents and Knowledge Bases. Availability of a given capability can vary by tier and version, so confirm specifics for your chosen model.

Which Claude model should I use — Opus, Sonnet, or Haiku?

Match the model to the task and escalate only when needed. Use Haiku for high-volume, latency-sensitive, or simple work (classification, extraction, routing, the cheap first stage of a router). Use Sonnet as the production default for most real work (RAG, agents, support, coding, content). Reserve Opus for the genuinely hard requests (deep reasoning, complex analysis, high-stakes agentic steps). Because switching tiers is a one-line model-ID change on the Converse API, tiered routing — cheap model triages, hard cases escalate — typically cuts spend several-fold with little quality loss.

How do I call Claude on Bedrock in code?

Use the Converse API — a single, model-agnostic interface for messages, system prompts, tool use, and images. With the AWS SDK you create a bedrock-runtime client, call converse with a modelId and a list of messages, and read the reply from the response; converse_stream gives token-by-token streaming. Switching Claude tiers (or swapping Claude for another Bedrock model) is just a change to modelId, not a rewrite. The exact model ID must come from the Bedrock model catalog rather than a hard-coded guess.

Can AWS credits cover Claude usage on Bedrock?

Yes — and this is the key advantage over the direct Anthropic API. Claude on Bedrock is ordinary AWS spend, so it is fully credit-eligible and credits apply automatically against your bill, covering Claude tokens, Batch and prompt-caching usage, and supporting services. The relevant pools are AWS Activate (up to $100K), a Bedrock/GenAI POC pool ($10K–$50K), and the GenAI Accelerator (up to $1M). These are largely partner-filed via the AWS Partner Network. CloudRoute routes you to the right pool and a vetted AWS partner who files the application and builds the Claude workload — customer pays $0, AWS funds it.

Run Claude on AWS's budget, not your runway

The direct Anthropic API bills your card; Claude on Bedrock draws down AWS credits — under your existing IAM, VPC, and billing. CloudRoute routes you to the right credit pool (Activate up to $100K, Bedrock POC $10K–$50K, GenAI Accelerator up to $1M) and a vetted AWS partner who moves Claude onto Bedrock, builds the tiered router, and turns on caching. Customer pays $0.

Get matched in 24h →→ see the AI-team persona detail

matched within< 24h

GenAI credit ceilingup to $1M

cost to you$0