I don’t understand. LLMs cannot connect to MCP servers directly; they always need a client (like a chat app or agent) to call the servers. Where are you calling your LLMs from?
Yes indeed. I prefer my APIs to be decoupled when working on a larger project. Somewhere near the end I mention building GraphQL schema-first or code-first, including generators for GraphQL schemas like StepZen.