| Coding agents |
Strong fit for complex coding and agent loops; test cost and latency first. |
Useful for lighter coding support, extraction, and automation; validate quality before larger refactors. |
| Cost-sensitive automation |
Use only when higher answer quality is worth the higher public token row. |
Often the better starting point for cost-sensitive, repetitive, or lower-risk tasks. |
| Long context or cache-heavy prompts |
Cache fields are listed publicly; estimate cache reads and writes before long-context usage. |
Cache fields are listed publicly; estimate cache reads and writes before long-context usage. |
| OpenAI-compatible tools |
Can work through compatible routes where supported, but check whether your tool expects OpenAI-style or Anthropic-style requests. |
Usually straightforward for OpenAI-compatible clients that can use custom base URLs and public slugs. |
| Quality-sensitive reasoning |
Best suited when quality matters more than lowest listed cost. |
Pilot first for general usage and compare output quality against the alternative row. |