github.com

Releases · PrefectHQ/fastmcp - GitHub

3/14/2026 · Updated 3/18/2026

Excerpt

## Code Mode

Standard MCP has two scaling problems. The entire tool catalog loads into context upfront: with a large server, that's tens of thousands of tokens before the LLM reads a single word of the user's request. And every tool call is a round-trip: the LLM calls a tool, the result flows back through the context window, the LLM reasons about it, calls another tool, and so on. Intermediate results that only exist to feed the next step still burn tokens every time.

…

## Under the Hood

Heavy imports are now lazy-loaded, meaningfully reducing startup time for servers that don't use every feature. `fastmcp run` and `dev inspector` gain a `-m`/`--module` flag for module-style invocation, `MCPConfigTransport` now correctly persists sessions across tool calls, and `search_result_serializer` gives you a hook to customize how search results are serialized for markdown output. Eight new contributors, and the usual round of fixes.

Source URL

https://github.com/PrefectHQ/fastmcp/releases

Related Pain Points