
Feat/anthropic extended ttl #6205


Open
wants to merge 10 commits into main

Conversation

@md2k commented Jun 19, 2025

Description

Implements granular per-message-type caching for Anthropic models to improve token efficiency in Agent mode. Adds new CacheBehavior options to specify how many of each message type to cache (user messages, tool results, assistant tool calls, etc.) instead of only caching the last 2 user messages.
This is related to issue #6135.
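
For illustration, a minimal sketch of what such a configuration could look like (the per-type field names are assumptions based on the options discussed in the review below; `useExtendedCacheTtlBeta` and `cacheTtl` come from this PR's schema changes):

```
// Hypothetical cacheBehavior configuration with per-message-type limits.
const cacheBehavior = {
  cacheSystemMessage: true,      // existing option: cache the system message
  cacheConversation: true,       // existing option: cache conversation history
  cacheUserMessages: 2,          // cache the last 2 user messages
  cacheToolResults: 2,           // cache the last 2 tool-result messages
  cacheAssistantMessages: 1,     // cache the last assistant tool-call message
  useExtendedCacheTtlBeta: true, // opt in to Anthropic's extended caching beta
  cacheTtl: "1h",                // "5m" or "1h"
};
```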

Checklist

  • I've read the contributing guide
  • The relevant docs, if any, have been updated or created
  • The relevant tests, if any, have been updated or created

Screenshots

N/A - Backend caching enhancement with no visual changes.

Tests

Added a comprehensive test suite, `core/llm/llms/Anthropic.enhanced-caching.test.ts`, with 6 test cases covering:

  • Tool result message caching
  • Assistant tool call message caching
  • Per-type caching limits validation
  • Disabled caching behavior
  • Fallback TTL handling
  • Core shouldCacheMessage logic

All tests pass and validate the new per-type caching functionality while maintaining backward compatibility.
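
As an illustration of the per-type limit logic, a Jest-style sketch (the `shouldCacheMessage` signature below is an assumption made for the example, not the actual code in the test file):

```
// Assumed signature, declared here only so the sketch is self-contained.
declare function shouldCacheMessage(
  message: { role: string },
  behavior: { cacheToolResults?: number },
  cachedSoFar: { toolResults: number },
): boolean;

test("caches only the configured number of tool results", () => {
  const behavior = { cacheToolResults: 1 };
  // Walking backward from the newest message, tool results within the
  // configured limit are cached; anything beyond it is not.
  expect(shouldCacheMessage({ role: "tool" }, behavior, { toolResults: 0 })).toBe(true);
  expect(shouldCacheMessage({ role: "tool" }, behavior, { toolResults: 1 })).toBe(false);
});
```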

[screenshot: test results]

md2k added 5 commits June 19, 2025 22:52
Added extra optional parameters to `cacheBehaviorSchema`:
```
  // Opt in to Anthropic's beta extended prompt caching.
  useExtendedCacheTtlBeta: z.boolean().optional(),
  // Cache time-to-live: "5m" (Anthropic's default) or "1h" (beta).
  cacheTtl: z.enum(["5m", "1h"]).optional(),
```
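
A minimal sketch of how these additions behave at config-load time (the two pre-existing fields are taken from the `CacheBehavior` interface in this PR's diff; the exact shape of `cacheBehaviorSchema` is otherwise assumed):

```
import { z } from "zod";

// Assumed full shape of the schema after this commit.
const cacheBehaviorSchema = z.object({
  cacheSystemMessage: z.boolean().optional(),
  cacheConversation: z.boolean().optional(),
  useExtendedCacheTtlBeta: z.boolean().optional(),
  cacheTtl: z.enum(["5m", "1h"]).optional(),
});

// Valid: opts in to the beta and requests the 1-hour TTL.
cacheBehaviorSchema.parse({ useExtendedCacheTtlBeta: true, cacheTtl: "1h" });

// Invalid: "2h" is not in the enum, so parsing fails at config load.
console.log(cacheBehaviorSchema.safeParse({ cacheTtl: "2h" }).success); // false
```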
md2k requested a review from a team as a code owner June 19, 2025 22:54
md2k requested review from sestinj and removed request for a team June 19, 2025 22:54

netlify bot commented Jun 19, 2025

👷 Deploy request for continuedev pending review.

Visit the deploys page to approve it.

🔨 Latest commit: e3140dd

dosubot bot added the size:L label (This PR changes 100-499 lines, ignoring generated files.) Jun 19, 2025

github-actions bot commented Jun 19, 2025

All contributors have signed the CLA ✍️ ✅
Posted by the CLA Assistant Lite bot.


recurseml bot commented Jun 19, 2025

😱 Found 3 issues. Time to roll up your sleeves! 😱

@md2k (Author) commented Jun 19, 2025

I have read the CLA Document and I hereby sign the CLA

@md2k (Author) commented Jun 19, 2025

Some details on how a long session with a big context looks, and what it costs, with the 5-minute cache vs. the 1-hour cache:
5-minute cache:
[screenshots: session token usage and cost with the 5-minute cache]

@md2k (Author) commented Jun 19, 2025

1-hour TTL:
[screenshots: session token usage and cost with the 1-hour cache]

dosubot bot added the size:XL label (This PR changes 500-999 lines, ignoring generated files.) and removed the size:L label Jun 19, 2025
@sestinj (Contributor) left a comment

This is a great PR as far as the code goes. I kind of want to step back, though, to better understand whether you think this could be a sensible default rather than a configuration option. I'm wary of too many options, and if everyone would benefit from the way you are configuring your Anthropic models, maybe we should just ship that as the default. (I repeated all this in a comment below.)

@@ -62,18 +62,40 @@ Anthropic currently does not offer any reranking models.

Anthropic supports [prompt caching with Claude](https://docs.anthropic.com/en/docs/build-with-claude/prompt-caching), which allows Claude models to cache system messages and conversation history between requests to improve performance and reduce costs.

> **NOTE:** As part of their `Beta` support, Anthropic offers [Extended caching](https://docs.anthropic.com/en/docs/build-with-claude/prompt-caching#1-hour-cache-duration) with a 1-hour cache duration.
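
For reference, a minimal sketch (not this PR's code) of how the beta is exercised against the raw Messages API: each `cache_control` block gains an optional `ttl`, and the request carries Anthropic's beta header (header name per Anthropic's docs at the time of writing; treat it as an assumption):

```
// Sketch of a Messages API call opting in to the 1-hour cache TTL beta.
async function callWithExtendedCache(apiKey: string): Promise<Response> {
  return fetch("https://api.anthropic.com/v1/messages", {
    method: "POST",
    headers: {
      "x-api-key": apiKey,
      "anthropic-version": "2023-06-01",
      "anthropic-beta": "extended-cache-ttl-2025-04-11", // beta opt-in
      "content-type": "application/json",
    },
    body: JSON.stringify({
      model: "claude-3-5-sonnet-latest",
      max_tokens: 1024,
      system: [
        {
          type: "text",
          text: "You are a helpful coding assistant.",
          // Without "ttl", cached blocks default to the 5-minute TTL.
          cache_control: { type: "ephemeral", ttl: "1h" },
        },
      ],
      messages: [{ role: "user", content: "Hello" }],
    }),
  });
}
```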
Review comment (Contributor):
The docs here feel a bit extensive since they take up most of this page now. I think we should either make a collapsible block or create a dedicated page for prompt caching. If possible, I think the collapsible would be the better option.

@@ -927,6 +927,13 @@ export interface RequestOptions {
export interface CacheBehavior {
  cacheSystemMessage?: boolean;
  cacheConversation?: boolean;
  useExtendedCacheTtlBeta?: boolean;
Review comment (Contributor):

I'm coming at this review with the lens of "if we add it now, we'll have to support it forever (or go through a deliberate deprecation process)". I'm worried there are a large number of options here that aren't going to be relevant forever, or that they might not be the final form of this configuration.

It would be helpful to better understand whether all of the `cacheUserMessages`, `cacheAssistantMessages`, etc. options are truly necessary for people to customize, or whether we just need to set a more sensible default. For example, I'd be curious what values you set here and whether you think we should just ship those as the defaults for everyone. Usage patterns in Continue are probably similar across a variety of users. Not that we couldn't eventually also allow this customization, but it might save a lot of maintenance (and give many users money back without needing to configure anything).
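
For context, a minimal sketch of the two shapes under discussion (all names and default values here are illustrative, not a final API):

```
// Option A (this PR): user-configurable per-message-type limits.
interface GranularCacheBehavior {
  cacheUserMessages?: number;      // how many recent user messages to cache
  cacheToolResults?: number;       // how many recent tool results to cache
  cacheAssistantMessages?: number; // how many recent assistant tool calls to cache
  useExtendedCacheTtlBeta?: boolean;
  cacheTtl?: "5m" | "1h";
}

// Option B (the reviewer's suggestion): ship fixed, sensible defaults so
// every user benefits without any configuration.
const DEFAULT_CACHE_LIMITS = {
  cacheUserMessages: 2, // mirrors the previous "last 2 user messages" behavior
  cacheToolResults: 2,
  cacheAssistantMessages: 1,
} as const;
```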

@chezsmithy (Contributor) commented:

@sestinj maybe we align with this previous PR: #5371

It introduced a single caching setting that controls all the options. Whatever we do here, I should likely bring it to Bedrock as well.

Watching.

@sestinj (Contributor) commented Jun 23, 2025

Agreed, @chezsmithy! Thanks for linking the PR here; that's what I had in mind.

Labels
size:XL This PR changes 500-999 lines, ignoring generated files.
Projects
Status: Todo
3 participants