MCP At Scale FAQ
This page is a living FAQ. It collects answers to the most common questions about MCP: token bloat, lazy loading, tool discovery, registries, and governance.
Token bloat
What is token bloat in MCP?
Token bloat in MCP happens when too many tool definitions (schemas + descriptions) are loaded into the model context, inflating prompt size before reasoning starts.
How do you prevent token bloat when your catalog has thousands of tools?
Use search-based tool discovery, rank to a shortlist, enforce context budgets, filter by policy (role/tenant), and expand schemas on demand (summaries first, full schema later).
What is a context budget for MCP tools?
A context budget is a hard limit on tool/schema tokens per request. It keeps cost and latency predictable and prevents runaway prompts.
Discovery & lazy loading
What is lazy loading for MCP tools?
Lazy loading means loading tool schemas only after the agent identifies relevant tools (usually via search/ranking), rather than preloading the entire catalog into context.
What is tool discovery in MCP?
Tool discovery is querying a registry (or authority layer) to retrieve a small, ranked set of candidate tools for a task, then loading only those tools into context.
Registry & governance
What is an MCP registry?
An MCP registry is a catalog of tools/servers and their metadata used for discovery, versioning, ownership, and governance.
What is shadow MCP?
Shadow MCP is ungoverned MCP tooling that appears without centralized discovery, policy enforcement, or auditing. It often shows up first as tool sprawl and token bloat.