Rendered at 17:46:58 GMT+0000 (Coordinated Universal Time) with Cloudflare Workers.
nithiink 1 days ago [-]
How do you handle prompt caching? A lot of cost savings for a single model chat come from cache hits on the conversation context, and switching models invalidates that cache — the new model has to reprocess everything at full input price.
2 days ago [-]
patch_dev 1 days ago [-]
What does this solve that well used subagents doesn't solve already?
FrancescoMassa 1 days ago [-]
On our tests subagents & well used workflows are 20-30% more expensive for context & token efficiency