Question 1

What is token bloat in MCP?

Accepted Answer

Token bloat in MCP happens when too many tool definitions (schemas + descriptions) are loaded into the model context, inflating prompt size before reasoning starts.

Question 2

How do you prevent token bloat when your catalog has thousands of tools?

Accepted Answer

Use search-based tool discovery, rank to a shortlist, enforce context budgets, filter by policy (role/tenant), and expand schemas on demand (summaries first, full schema later).

Question 3

What is a context budget for MCP tools?

Accepted Answer

A context budget is a hard limit on tool/schema tokens per request. It keeps cost and latency predictable and prevents runaway prompts.

Question 4

What is lazy loading for MCP tools?

Accepted Answer

Lazy loading means loading tool schemas only after the agent identifies relevant tools (usually via search/ranking), rather than preloading the entire catalog into context.

Question 5

What is tool discovery in MCP?

Accepted Answer

Tool discovery is querying a registry (or authority layer) to retrieve a small, ranked set of candidate tools for a task, then loading only those tools into context.

Question 6

What is an MCP registry?

Accepted Answer

An MCP registry is a catalog of tools/servers and their metadata used for discovery, versioning, ownership, and governance.

Question 7

What is shadow MCP?

Accepted Answer

Shadow MCP is ungoverned MCP tooling that appears without centralized discovery, policy enforcement, or auditing. It often shows up first as tool sprawl and token bloat.

MCP At Scale FAQ

Token bloat

Discovery & lazy loading

Registry & governance