Awesome Agentic DevOps



Awesome Agentic DevOps is a curated map for DevOps, Cloud, SRE, and Platform Engineering agents across Claude Code, Gemini ADK, OpenAI Agents SDK, and MCP.
This repository evaluates which agents are safe, useful, and production-adjacent for infrastructure automation.
Contents
Why this exists
Most agent lists stop at discovery. Infrastructure teams need more than discovery: they need to know whether an agent can touch production, whether it has approval gates, whether it preserves evidence, and whether it can be rebuilt as a safe reference workflow.
This repo is intentionally operator-grade. Each entry is scored by real infrastructure usefulness, operational risk, human approval gates, tracing/evidence, maturity, and Gemini compatibility.
Who it is for
- DevOps and platform engineers evaluating AI automation.
- SREs designing incident-response copilots.
- Cloud engineers comparing agent frameworks.
- Security reviewers assessing infrastructure-agent risk.
- Builders creating portfolio-grade reference agents.
- Interview candidates who want a practical DevOps AI artifact.
Safety-first disclaimer
These agents may touch infrastructure. Prefer read-only or proposal mode first. Require human approval before write actions. Never expose secrets to model context. Use least-privilege credentials, dry runs, Terraform plans before applies, and CI checks before merge.
Evaluation labels
| Label |
Meaning |
| 🟢 |
production-adjacent OSS |
| 🟡 |
useful prototype |
| 🔵 |
MCP/server integration |
| 🛡️ |
has approval/safety controls |
| 📊 |
has tracing/evidence/evals |
| 💎 |
Gemini-friendly workload |
| ⚠️ |
write-capable; review before use |
Categories
- Official cloud MCP servers and agent toolkits
- Official DevOps and source-control MCP servers
- Official CI/CD and GitOps MCP servers
- Official security, code-quality, and agent-security resources
- Official IaC MCP servers
- Official SRE and observability MCP servers
- Official agent skills and agent frameworks
- Official platform agent toolkits
- Official MCP SDKs, references, registries, and governance platforms
- Official diagramming and architecture MCP tools
- Official data platform MCP servers
- Community discovery and skill references
Top picks by use case
| Use case |
Start with |
Why |
| AWS agentic cloud automation |
aws/agent-toolkit-for-aws |
Official AWS-supported MCP, skills, plugins, IAM-aware controls, CloudWatch metrics, and CloudTrail auditability. |
| Azure cloud automation |
microsoft/mcp and microsoft/azure-skills |
Official Microsoft MCP and skills/plugin sources for Azure resource workflows. |
| Google Cloud automation |
google/mcp, googleapis/gcloud-mcp, and google/skills |
Official Google MCP and skills sources for GCP, Cloud Run, GKE, observability, and storage workflows. |
| Source-control DevOps |
github/github-mcp-server, GitLab MCP server, and atlassian/atlassian-mcp-server |
Official MCP tool surfaces for repos, issues, PRs, Jira, Bitbucket, and related delivery workflows. |
| Terraform and IaC |
hashicorp/terraform-mcp-server and Pulumi MCP Server |
Official IaC MCP sources for Terraform Registry/HCP Terraform and Pulumi Cloud automation. |
| SRE incident response |
grafana/mcp-grafana, datadog-labs/mcp-server, and PagerDuty/pagerduty-mcp-server |
Official observability and incident-management MCPs for metrics, logs, traces, alerts, incidents, and on-call context. |
| Security and code quality |
SonarSource/sonarqube-mcp-server, okta/okta-mcp-server, Snyk Studio MCP docs, and Wiz WIN MCP Server docs |
Official MCPs and security resources for code quality, application security, identity-aware workflows, agent security, and cloud-security posture. |
| CI/CD and GitOps |
jenkinsci/mcp-server-plugin and argoproj-labs/mcp-for-argocd |
Official Jenkins and Argo Project resources for pipeline, build, deployment, and GitOps workflows. |
| MCP development and governance |
modelcontextprotocol/python-sdk, modelcontextprotocol/typescript-sdk, modelcontextprotocol/registry, and Docker MCP Catalog and Toolkit |
Official SDKs, registry, and Docker governance surfaces for building, packaging, and controlling DevOps MCP servers. |
| Agent frameworks and templates |
google/adk-python and GoogleCloudPlatform/agent-starter-pack |
Official Google agent framework and production templates with CI/CD, evaluation, and observability. |
Curated catalog
The source of truth is data/repos.yaml. The list below is a readable index of the current official and community research-backed entries.
Official Cloud MCP Servers and Agent Toolkits
| Repo |
Labels |
Operator note |
| aws/agent-toolkit-for-aws |
🟢 🔵 🛡️ 📊 💎 ⚠️ |
Official AWS-supported toolkit that bundles MCP server configuration, skills, plugins, and DevSecOps agent workflows. |
| awslabs/mcp |
🟢 🔵 🛡️ 💎 ⚠️ |
Official AWS Labs MCP server collection; useful legacy/source reference while AWS transitions capabilities into Agent Toolkit. |
| microsoft/mcp |
🟢 🔵 🛡️ 💎 ⚠️ |
Official Microsoft MCP catalog, including Azure cloud and infrastructure MCP references. |
| google/mcp |
🟢 🔵 🛡️ 💎 ⚠️ |
Official Google MCP repository listing managed and open-source MCP servers for Google and Google Cloud. |
| googleapis/gcloud-mcp |
🟢 🔵 🛡️ 💎 ⚠️ |
Official Google API repository for gcloud, observability, storage, and backup/disaster-recovery MCP servers. |
| GoogleCloudPlatform/cloud-run-mcp |
🟢 🔵 🛡️ 💎 ⚠️ |
Official Google Cloud Platform MCP server for deploying apps to Cloud Run. |
| cloudflare/mcp-server-cloudflare |
🟢 🔵 🛡️ 💎 ⚠️ |
Official Cloudflare MCP servers for account, Workers, and edge configuration workflows. |
| vercel/vercel-mcp-overview |
🟢 🔵 🛡️ 💎 ⚠️ |
Official public overview of Vercel's hosted MCP server for project and deployment context. |
Official DevOps MCP Servers
| Repo |
Labels |
Operator note |
| microsoft/azure-devops-mcp |
🟢 🔵 🛡️ 💎 ⚠️ |
Official Azure DevOps MCP server with remote-first onboarding and local server option. |
| github/github-mcp-server |
🟢 🔵 🛡️ 💎 ⚠️ |
Official GitHub MCP server for repository, issue, pull request, code, and workflow automation. |
| GitLab MCP server |
🟢 🔵 🛡️ 💎 ⚠️ |
Official GitLab MCP server is documented as a GitLab-hosted endpoint rather than a standalone GitHub repo. |
| atlassian/atlassian-mcp-server |
🟢 🔵 🛡️ 💎 ⚠️ |
Official Atlassian Rovo MCP server for Jira, Confluence, Jira Service Management, Bitbucket, and Compass. |
| docker/mcp-registry |
🟢 🔵 🛡️ 📊 💎 |
Official Docker MCP registry and catalog source for verified containerized MCP servers. |
| kubernetes-sigs/mcp-lifecycle-operator |
🟡 🔵 🛡️ 💎 ⚠️ |
Official Kubernetes SIG operator for declaratively deploying and rolling out MCP servers, not a general kubectl MCP server. |
Official Security and Code-Quality MCP Servers
| Repo |
Labels |
Operator note |
| SonarSource/sonarqube-mcp-server |
🟢 🔵 🛡️ 💎 |
Official SonarQube MCP server for code quality and security insights in AI agents. |
| okta/okta-mcp-server |
🟢 🔵 🛡️ 💎 ⚠️ |
Official Okta self-hosted MCP server for connecting agents to Okta identity workflows. |
| Snyk Studio MCP docs |
🟢 🔵 🛡️ 📊 💎 |
Official Snyk documentation for Snyk Studio agentic security workflows and Snyk MCP Server usage. |
| snyk/agent-scan |
🟢 🛡️ 📊 💎 |
Official Snyk security scanner for AI agents, MCP servers, and agent skills. |
| Wiz WIN MCP Server docs |
🟢 🔵 🛡️ 📊 💎 |
Official Wiz documentation for the WIN MCP server, adding CNAPP and cloud-security coverage. |
Official CI/CD and GitOps MCP Servers
| Repo |
Labels |
Operator note |
| jenkinsci/mcp-server-plugin |
🟢 🔵 🛡️ 💎 ⚠️ |
Official Jenkins plugin that enables Jenkins to act as an MCP server for LLM-powered clients. |
| argoproj-labs/mcp-for-argocd |
🟡 🔵 🛡️ 💎 ⚠️ |
Argo Project Labs MCP server implementation for Argo CD, filling the GitOps/CD gap in the catalog. |
| CircleCI-Public/mcp-server-circleci |
🟢 🔵 🛡️ 📊 💎 |
Official CircleCI MCP server for build failure logs, pipeline status, flaky test detection, and usage analysis. |
Official MCP SDKs, Reference Implementations, Registries, and Governance Platforms
Official CloudOps Agent Samples
Official IaC MCP Servers
| Repo |
Labels |
Operator note |
| hashicorp/terraform-mcp-server |
🟢 🔵 🛡️ 📊 💎 ⚠️ |
Official HashiCorp Terraform MCP server with registry, HCP Terraform, Terraform Enterprise, and OTel support. |
| Pulumi MCP Server |
🟢 🔵 🛡️ 💎 ⚠️ |
Official Pulumi hosted MCP server for Pulumi Cloud resources, registry lookup, policies, and Pulumi Neo workflows. |
| hashicorp/vault-mcp-server |
🟢 🔵 🛡️ 💎 ⚠️ |
Official HashiCorp Vault MCP server (beta) for secrets and mount management alongside Terraform-driven IaC workflows. |
Official SRE MCP Servers
| Repo |
Labels |
Operator note |
| grafana/mcp-grafana |
🟢 🔵 🛡️ 📊 💎 ⚠️ |
Official Grafana MCP server for Grafana and surrounding observability ecosystem access. |
| getsentry/sentry-mcp |
🟢 🔵 🛡️ 📊 💎 ⚠️ |
Official Sentry MCP service focused on human-in-the-loop coding agents and debugging workflows. |
| datadog-labs/mcp-server |
🟢 🔵 🛡️ 📊 💎 |
Official Datadog MCP server documentation and examples for connecting AI agents to Datadog observability. |
| Datadog MCP Server setup docs |
🟢 🔵 🛡️ 📊 💎 |
Official Datadog MCP setup documentation, including the ChatGPT setup path. |
| Splunk MCP Server |
🟢 🔵 🛡️ 📊 💎 |
Splunkbase listing for the Splunk-supported MCP Server for Splunk Platform, Enterprise, and Cloud customers. |
| PagerDuty/pagerduty-mcp-server |
🟢 🔵 🛡️ 💎 ⚠️ |
Official PagerDuty MCP server for incidents, services, schedules, event orchestrations, and embedded incident UIs. |
| newrelic/mcp-server |
🟢 🔵 🛡️ 📊 💎 |
Official New Relic MCP server for APM, dashboard, and NRQL-based observability context. |
| elastic/mcp-server-elasticsearch |
🟢 🔵 🛡️ 💎 |
Official Elastic MCP server for Elasticsearch search and mapping access; deprecated in favor of Elastic Agent Builder MCP. |
Official Agent Skills and Frameworks
| Repo |
Labels |
Operator note |
| microsoft/azure-devops-skills |
🟢 💎 ⚠️ |
Official Microsoft Azure DevOps skill examples; approval gates should be verified per skill. |
| microsoft/skills |
🟢 🔵 🛡️ 💎 |
Official Microsoft skills, MCP configurations, custom agents, and AGENTS.md guidance for coding agents. |
| microsoft/azure-skills |
🟢 🔵 🛡️ 💎 ⚠️ |
Official Azure skills plugin with Azure skills, Azure MCP tools, and Foundry MCP coverage. |
| google/skills |
🟢 🛡️ 💎 |
Official Google Agent Skills repository for Google products and technologies. |
| google/adk-python |
🟢 🛡️ 📊 💎 |
Official Google Agent Development Kit for building, evaluating, and deploying agents. |
| GoogleCloudPlatform/agent-starter-pack |
🟢 🛡️ 📊 💎 |
Official Google Cloud starter pack for shipping agents with CI/CD, evaluation, observability, and security. |
Official Platform Agent Toolkits
| Repo |
Labels |
Operator note |
| databricks-solutions/ai-dev-kit |
🟢 🔵 🛡️ 📊 💎 ⚠️ |
Databricks field-engineering AI Dev Kit with Databricks MCP server, Databricks skills, tools core, and builder app support. |
| kubeflow/mcp-server |
🟢 🔵 🛡️ 💎 ⚠️ |
Official Kubeflow MCP server for AI-assisted development with Kubeflow tools. |
| backstage/backstage (mcp-actions-backend) |
🟢 🔵 🛡️ 💎 ⚠️ |
Official CNCF Backstage plugin that exposes Internal Developer Portal actions as MCP tools for AI agents. |
Official Diagramming and Architecture MCP Tools
| Repo |
Labels |
Operator note |
| jgraph/drawio-mcp |
🟢 🔵 🛡️ 💎 |
draw.io MCP server and Claude Code plugin for generating, opening, and exporting draw.io diagrams with shape search. |
Official Data Platform MCP Servers
| Repo |
Labels |
Operator note |
| mongodb-js/mongodb-mcp-server |
🟢 🔵 🛡️ 💎 ⚠️ |
Official MongoDB MCP server (public preview) connecting agents to MongoDB Community, Enterprise, and Atlas deployments. |
Community Discovery and Skills
How to contribute
Start with CONTRIBUTING.md. New entries should update data/repos.yaml, include a real operational use case, classify action level, and explain safety or approval behavior.
Run validation before opening a PR:
python scripts/validate_repos_yaml.py
pytest -q
python scripts/run_mock_eval_scenarios.py
python scripts/audit_github_repos.py --workers 12 --fail-on-unreachable