Microsoft

Chill Agent: Navigating Chaos with GeekWire

Published

1 hour ago

June 23, 2026

BTI Team

an agent that keeps its cool when everything falls apart – GeekWire

Microsoft Introduces Azure Copilot Observability Agent to Aid Engineers in Cloud Glitches

Brendan Burns, Microsoft technical fellow and a co-founder of Kubernetes. (Microsoft Photo)

In an effort to alleviate the burden on engineers facing late-night outages and cloud glitches, Microsoft has unveiled the Azure Copilot Observability Agent. This innovative agent draws on Microsoft’s extensive experience running Azure to swiftly diagnose issues and suggest potential solutions.

One key advantage of this automated agent is its ability to operate without the limitations of stress, fatigue, or tunnel vision that often plague human responders working on minimal sleep.

Brendan Burns, a Microsoft technical fellow and co-founder of Kubernetes, highlighted the impartial nature of agents, devoid of the pressure faced by individuals when pressed for rapid problem resolution.

The Azure Copilot Observability Agent, which has been in preview since last year, was officially launched on Tuesday. It functions by analyzing incidents across a company’s systems, connecting various data points such as logs, metrics, and traces to pinpoint potential causes of issues.

While the agent is capable of investigating and triaging alerts independently through its autonomous operations feature, it does not execute fixes on its own. Instead, it leaves the final decision-making and implementation to human engineers.

Microsoft’s entry into this space comes amidst a competitive landscape, with other industry players like Datadog and AWS also offering similar AI-driven solutions. Pricing for the Azure Copilot Observability Agent is based on usage, aligning with the model adopted by AWS for its DevOps Agent.

Recognized observability providers such as Dynatrace, Splunk, New Relic, and Grafana, along with a new wave of AI-focused startups, are rapidly embracing this trend towards AI-powered incident resolution.

Burns emphasized Microsoft’s unique advantage in this space, citing the company’s comprehensive view of a customer’s software ecosystem, ranging from GitHub to Azure deployments. This holistic perspective enables the agent to effectively trace issues back to their root causes.

Having co-created Kubernetes over a decade ago, Burns highlighted the platform’s self-repair capabilities but noted its limitations in hypothesis testing and solution investigation. This gap is where AI tools like the Azure observability agent step in, offering a more analytical and proactive approach to issue resolution.

While full autonomy for the agent to execute fixes independently is still on the horizon, Burns described the current launch as a step towards “agentic operations” that leverage AI reasoning to address system signals.

Despite the agent’s advanced capabilities, human oversight remains crucial in the decision-making process. Burns reflected on past experiences, expressing how such technology would have significantly eased the challenges faced during intense on-call shifts.