The Model Context Protocol (MCP) is becoming a key component in the agentic AI space, enabling models to interact with external tools and data. The project's 2026 roadmap focuses on addressing challenges for production deployment. Key priorities include improving scalability by evolving the transport and session model, clarifying agent communication and task lifecycle management, maturing governance structures for wider community contribution, and preparing for enterprise requirements like audit trails and authentication. The roadmap also highlights ongoing exploration of areas like event-driven updates and security.
The New Stack encourages its readers to contribute to Towards Data Science, a leading platform for data science and AI. Recognizing the increasing convergence of cloud infrastructure, DevOps, and AI engineering, the article invites practitioners to share their experiences with building and deploying AI systems. Successful TDS submissions are technically detailed, timely, and specific. Authors can also benefit from editorial support, promotion, and potential payment opportunities, while building their reputation within the AI community.
This article explores the emerging category of AI-powered operations agents, comparing AI DevOps engineers and AI SRE agents, how cloud providers are responding, and what engineers should consider when evaluating these tools.
Late last year, startup Platform Engineering Labs made waves in the world of Infrastructure as Code (IaC) by introducing a new IaC platform, called Formae, available initially on Amazon Web Services. This week, Platform Engineering Labs‘ platform gets (beta) support from additional cloud platforms, including Google Cloud Platform, Microsoft Azure, Oracle Cloud Infrastructure, and OVHcloud. The company has also released new AI-enhanced software for managing infrastructure tooling, called the Platform for Infrastructure Builders.
>When deployed strategically, agents can empower SREs to offload low-risk, toilsome tasks so they can focus on the most critical matters.
Agents in practice include:
* **Contextual Information:** Providing SREs with details from previously resolved incidents involving the same service, including responder notes.
* **Root Cause Analysis:** Suggesting potential origins of an issue and identifying recent configuration changes that might be responsible.
* **Automated Remediation:** Handling low-risk, well-defined issues without human intervention, with SRE review of after-action reports.
* **Diagnostic Suggestions:** Nudging SREs towards running specific diagnostics for partially understood incidents and supplying them automatically.
* **Runbook Generation:** Automatically creating and updating runbooks based on successful remediation steps, preventing recurring issues.
.
Tap these Model Context Protocol servers to supercharge your AI-assisted coding tools with powerful devops automation capabilities.
* **GitHub MCP Server:** Enables interaction with repositories, issues, pull requests, and CI/CD via GitHub Actions.
* **Notion MCP Server:** Allows AI access to notes and documentation within Notion workspaces.
* **Atlassian Remote MCP Server:** Connects AI tools with Jira and Confluence for project management and collaboration. (Currently in beta)
* **Argo CD MCP Server:** Facilitates interaction with Argo CD for GitOps workflows.
* **Grafana MCP Server:** Provides access to observability data from Grafana dashboards.
* **Terraform MCP Server:** Enables AI-driven Terraform configuration generation and management. (Local use only currently)
* **GitLab MCP Server:** Allows AI to gather project information and perform operations within GitLab. (Currently in beta, Premium/Ultimate customers only)
* **Snyk MCP Server:** Integrates security scanning into AI-assisted DevOps workflows.
* **AWS MCP Servers:** A range of servers for interacting with various AWS services.
* **Pulumi MCP Server:** Enables AI interaction with Pulumi organizations and infrastructure.
Plural is bringing AI into the DevOps lifecycle with a new release that leverages a unified GitOps platform as a RAG engine. This provides AI-powered troubleshooting, natural language infrastructure querying, autonomous upgrade assistance, and agentic workflows for infrastructure modification, all with enterprise-grade guardrails.
TraceRoot accelerates the debugging process with AI-powered insights. It integrates seamlessly into your development workflow, providing real-time trace and log analysis, code context understanding, and intelligent assistance. It offers both a cloud and self-hosted version, with SDKs available for Python and JavaScript/TypeScript.
The Azure MCP Server implements the MCP specification to create a seamless connection between AI agents and Azure services. It allows agents to interact with various Azure services like AI Search, App Configuration, Cosmos DB, and more.