| Coding agents |
Strong fit for complex coding and agent loops; test cost and latency first. |
Good default for daily coding and repo chat when quality and cost both matter. |
| Cost-sensitive automation |
Use only when higher answer quality is worth the higher public token row. |
Balanced option; compare expected input, output, and cache use in the calculator. |
| Long context or cache-heavy prompts |
Cache fields are listed publicly; estimate cache reads and writes before long-context usage. |
Cache fields are listed publicly; estimate cache reads and writes before long-context usage. |
| OpenAI-compatible tools |
Can work through compatible routes where supported, but check whether your tool expects OpenAI-style or Anthropic-style requests. |
Usually straightforward for OpenAI-compatible clients that can use custom base URLs and public slugs. |
| Quality-sensitive reasoning |
Best suited when quality matters more than lowest listed cost. |
Balanced choice for mixed chat, coding, writing, and analysis. |