How to create Multi-Agent MCP servers calling multiple agents using Python, with enterprise features
Overview
In October 2025, Microsoft announced their Agent Framework a few days before OpenAI announced their version 5 LLM. Google has its Agent Development toolkit.
PROTIP: I think betting our time to get paid work working on MCP from Microsoft has better payback than OpenAI or others. OpenAI gets a lot of attention, but I think Microsoft has been the one to pull in profits from AI. That’s because if the battle for AI supremacy is about infra. lock-in and assured revenue for vendor, throughtout its history, Microsoft has developed masterey of enterprise with their co-pilot-infused Office 365 and Azure ecosystems with centralized authentication capabilities.
https://www.youtube.com/watch?v=CoGO6s7bS3A The truth about AI-assited
A framework for building, orchestrating and deploying AI agents and multi-agent workflows with support for Python and .NET.
The “unified” in “Unified Microsoft Agent Framework” unifies and extends capabilities from two previous Microsoft projects so teams no longer have to choose between experimentation and production:
-
Semantic Kernel provides a stable SDK for AI foundations of connectors into enterprise systems, content moderation, and telemetry. Supports MCP, A2A, OpenAPI.
-
AutoGen, a multi-agent orchestration & collaboration (from Microsoft Research), with added workflows which mix agents with business processes to provide concensus report generation and human-in-the-loop approvals.
Workflow processing run several agents in parallel, unlike traditional Agent-based orchestration:
https://aka.ms/AgentFramework reroutes to the SDK at
https://github.com/microsoft/agent-framework
The demo4.py workflow parses a VC pptx pitch deck file to JSON, than approves or rejects each:
References:
- https://github.com/webmaxru/awesome-microsoft-agent-framework/ by Maxim Salnikov
- Announcement video oct 1, 2025
- https://learn.microsoft.com/en-us/agent-framework/tutorials/overview”>Tutorials</a>
- https://devblogs.microsoft.com/foundry/introducing-microsoft-agent-framework-the-open-source-engine-for-agentic-ai-apps/
- https://aka.ms/AgentFramework/Docs reroutes to
https://learn.microsoft.com/en-us/agent-framework/overview/agent-framework-overview - See it in action: Watch demos on AI Show
https://aka.ms/AgentFramework/AIShow “for Next-Gen Multi-Agent” with Product Manager Elijah Straight in San Francisco. - VIDEO: “Agent Framework: Building Blocks for the Next Generation of AI Agents” rounted from “Open at Microsoft” https://aka.ms/AgentFramework/OpenAtMicrosoft Learn step by step: Microsoft Learn modules for Agent Framework and AI Agents for Beginners
- https://discord.com/channels/1113626258182504448/1422947050441543861 after joining Azure AI Foundry Discord to connect with developers and product groups, sharpen your AI skills, and stay inspired through real-time community.
- AMA Tuesday 7th October 9am PST
- https://learn.microsoft.com/en-us/agent-framework/user-guide/overview
- https://www.youtube.com/watch?v=jyIepE19_0M”>”Getting Started with Microsoft Agent Framework (Semantic Kernel + AutoGen)”</a> by Designing With AI
Training
LEARN: https://learn.microsoft.com/en-us/training/paths/develop-ai-agents-on-azure/ “Develop AI agents on Azure”
Get the “Microsoft Certified: Azure AI Engineer Associate” by passing the $165USD AI-102 “Designing and Implementing a Microsoft Azure AI Solution” exam in 100 minutes.
“Design and implement an Azure AI solution using Azure AI services, Azure AI Search, and Azure Open AI”.
Industry Use Cases for MCP
Industry MCP Application Example Healthcare Secure EHR access, patient summaries (Gartner) Finance Real-time risk monitoring, automated reporting Legal Contract screening, compliance audit (Nasuni) Sales/CRM Auto-fetch customer histories, sales insights HR Resume screening, employee query automation Manufacturing Predictive maintenance, supply chain workflows Retail Inventory management, live fraud checks
Get API keys on websites:
- In a internet browser, sign in to the Azure Portal (https://portal.azure.com).
- Use your email and credit card to create a global billing account.
- Use my program TODO:_____ to create users with minimal permissions.
- Use my program __ to identify the Region you should use based on geography, service for the location, price, speed.
-
Create a Resource Group such as “ai-westus3-251008a”
- DOCS: In the “Search resources,…” type “Azure OpenAI” to manually “Create Azure OpenAI” resources.
Alternately, run my program ___
- Use my Naming Conventions ???
- Use the “Standard SO” Pricing Tier.
- Selected networks
- Add tags for billing analytics.
- Open your .env file to edit environment variables.
AZURE_LOCATION="westus3" AZURE_RESOURCE_GROUP="westus3-251008a"
- Click “Click here to view endpoints.”
- Click to copy the URL and edit your .env file to add this environment variable:
AZURE_OPENAI_ENDPOINT="https://ai-westus3-251008a.openai.azure.com/"
endpoint=os.environ[“AZURE_OPENAI_ENDPOINT”],
- Click “Click here to manage keys”. If you’re using AzureCliCredential, create:
AZURE_OPENAI_API_KEY="12345678C..."
api_key=os.environ[“AZURE_OPENAI_API_KEY”],
-
At https://ai.azure.com/?cid=learnDocs = portal.azure.com > AI Foundary Azure OpenAI, click “Go to Azure AI Foundry portal”. - At https://ai.azure.com/resource/deployments = “Model deployments”, click the blue “+ Deploy model”.
-
Click “Deploy base model”. See ??? TODO: How to select an LLM model from the model catalog and Quality Leadershoard benchmarks.
“The gpt-35-turbo (also known as ChatGPT) is the most capable and cost-effective model in the gpt-3.5 family which has been optimized for chat using the Chat Completions API. It is a language model designed for conversational interfaces and the model behaves differently than previous gpt-3 models. Previous models were text-in and text-out, meaning they accepted a prompt string and returned a completion to append to the prompt. However, the ChatGPT model is conversation-in and message-out. The model expects a prompt string formatted in a specific chat-like transcript format and returns a completion that represents a model-written message in the chat. Learn more at https://learn.microsoft.com/azure/cognitive-services/openai/concepts/models
- Select a model by Search because some models in the docs are not really available (such as Mistral, Phi, Cohere). The lowest cost “Chat completion” model “gpt-35-turbo”.
- Click “Confirm” and select “Deployment type” select “Standard”. [DOC]
- Type the deployment (LLM) name in the .env file and click “Deploy”.
AZURE_OPENAI_RESPONSES_DEPLOYMENT_NAME="gpt-35-turbo"
deployment_name=os.environ[“AZURE_OPENAI_RESPONSES_DEPLOYMENT_NAME”],
- DOCS:
AZURE_OPENAI_API_VERSION="2023-03-15-preview" or "2024-10-01-preview" or "2024-06-01"
>api_version=os.environ["AZURE_OPENAI_API_VERSION"],</tt>
TARGET_URL="https://ai-westus3-251008a.openai.azure.com/openai/deployments/gpt-35-turbo/chat/completions?api-version=2025-01-01-preview"
https://learn.microsoft.com/en-us/azure/developer/python/sdk/examples/azure-sdk-example-list-resource-groups?tabs=bash
Agent Python code
FIXME: ai-service is gone from curl -s “https://raw.githubusercontent.com/MicrosoftDocs/azure-docs/main/articles/ai-services/openai/reference.md” | grep -i “api.*version” | head -10
https://learn.microsoft.com/en-us/agent-framework/tutorials/agents/run-agent?pivots=programming-language-python
References:
- https://learn.microsoft.com/en-us/azure/ai-foundry/openai/api-version-lifecycle?tabs=python#api-evolution
Multi-agent
MCP
A2A
OpenAPI
https://www.linkedin.com/pulse/from-prototype-production-introducing-unified-microsoft-leon-gordon-eudze/?trackingId=ihrrFL6keBkE71ORRXOrJw%3D%3D
https://aka.ms/kpmgagentframework KPMG is leveraging the framework to power its KPMG Clara AI platform, connecting specialised agents to enterprise data with built-in safeguards.
https://azure.microsoft.com/en-us/blog/introducing-microsoft-agent-framework/https://www.linkedin.com/in/yinaa/ By Yina Arenas, Corporate Vice President, Azure AI Foundry https://azure.microsoft.com/en-us/blog/author/yina-arenas/