---
title: Model Selection Router
category: product
entity_type: skill
price: $15
canonical: https://forgehouse.ai/skills/model-selection-router/
lang: en
hreflang_alt: https://forgehouse.ai/tr/skiller/model-selection-router/
last_updated: 2026-06-20
---

# Model Selection Router

> Lock every AI call to the most capable Opus model (no downgrade) and route cost savings through prompt caching, batch APIs and context engineering instead of cutting quality.

An enforcement skill that locks every AI call in your system to the most capable Opus model, with no downgrade to cheaper or faster models permitted. Instead of cutting quality to cut cost, it routes savings through prompt caching, batch APIs, and context engineering, so output quality stays consistent across every agent, script, and report.

## Use cases
- Auditing agent frontmatter to confirm every dispatch uses the Opus alias
- Reviewing new skill files to block hidden cheaper-model declarations
- Catching scripts that hardcode a frozen model ID instead of the alias
- Rejecting cost-cutting proposals to switch to a lighter model
- Setting up prompt caching to recover cost without dropping quality
- Configuring batch processing for non-realtime report and audit workloads

## Benefits
- Eliminates the hidden revision loops that downgraded outputs cause downstream
- Keeps every client-facing deliverable at one predictable quality bar
- Cuts spend through caching and batching rather than quality compromise
- Future-proofs every agent so new model releases upgrade automatically via the alias

## What’s included
- Total-cost-of-iteration reasoning that counts revision rounds, not just per-call price
- Prompt caching pattern that recovers up to 90 percent on repeated static context
- Batch API pattern for asynchronous workloads at half the cost
- Context-budget engineering to trim irrelevant tokens before each call
- A ten-point anti-pattern catalog covering every common downgrade excuse
- A model-audit output template for agents, skills, and scripts

## Who it’s for
Teams running multi-agent or automated pipelines who want consistent top-tier output and disciplined cost control without trading quality for a cheaper model.

## How it runs
Downgrading to a cheaper model rarely saves money once revision rounds are priced in. This skill audits the whole fleet against that math and enforces a single top-model standard.
1. Scans every agent frontmatter for the model field: missing or downgraded entries get flagged, and the alias form is enforced so new model releases upgrade the whole fleet automatically with zero code changes.
2. Scans the skill library for explicit lower-tier model declarations and scripts for hardcoded model ID strings, replacing frozen versions with the alias plus an environment override so no file pins itself to a stale model.
3. Intercepts downgrade proposals at the argument level: when a 'cheaper model is enough for this task' or 'complex work on the big model, fast iterations on the small one' hierarchy appears, it gets rejected with the total-cost-of-iteration math, because revision rounds, re-prompting time and reputation risk make the downgrade more expensive than it looks.
4. Routes the cost pressure to the legitimate levers instead: prompt caching for up to 90 percent input discount on repeated static prefixes, the asynchronous batch API at 50 percent for non-realtime workloads, and context engineering that cuts irrelevant chunks out of the token budget.
5. Re-checks consistency across the whole pipeline, because internal automation output feeds customer-facing reports: a downgrade at the start of a chain becomes a quality loss at the end of it, so the standard applies to every stage including tool-call reasoning.
6. Produces a structured audit brief: total agents and how many carry the correct model field, skills with explicit downgrades to fix, scripts with hardcoded IDs to fix, the correction actions taken and a final verified count.

## FAQ
### Does it audit scripts and cron jobs too, or just interactive agents?
All three: it audits agent frontmatter, reviews skill files for hidden cheaper-model declarations, and catches scripts that hardcode a frozen model ID instead of the alias. The audit output template covers agents, skills, and scripts in one pass.

### If downgrading is banned, how does the bill stay under control?
Savings come from mechanics, not quality cuts: prompt caching recovers up to 90 percent on repeated static context, the batch API runs asynchronous workloads at half cost, and context-budget engineering trims irrelevant tokens before each call. The total-cost reasoning also counts the revision rounds that downgraded output causes.

### Can I make an exception for one low-stakes task?
No. The ten-point anti-pattern catalog exists to reject exactly that argument, because exceptions are where quality drift starts. If a task is genuinely cheap, batching and caching make it cheap at full quality.

## Price
$15, one-time, no subscription. VAT included.

Related guide: [AI and LLM engineering](https://forgehouse.ai/guides/ai-llm-engineering/)
