AMD EPYC 9354 Servers —from €299/month or €0.42/hour ⭐ 32 cores 3.25GHz / 768GB RAM / 2x3.84TB NVMe / 10Gbps 100TB
EN
Currency:
EUR – €
Choose a currency
  • Euro EUR – €
  • United States dollar USD – $
VAT:
OT 0%
Choose your country (VAT)
  • OT All others 0%

29.11.2022

Hostkey Windows Infrastructure Monitoring Architecture

server one
HOSTKEY
Rent dedicated and virtual servers with instant deployment in reliable TIER III class data centers in the Netherlands and the USA. Free protection against DDoS attacks included, and your server will be ready for work in as little as 15 minutes. 24/7 Customer Support.

Author: Senior Devops. Hostkey Lead Infrastructure Specialist Nikita Zubarev

Earlier we talked about how Linux systems are monitored. Now let's look at how to set up Windows Server.

In one of the previous articles, we discussed integrating FreeIPA with Active Directory. In the process of developing the solution described in that article, we needed to note the metrics from the servers using Windows infrastructure. In this article, we will look at how you can set up monitoring for Windows servers using an existing system built on a combination of Prometheus + Grafana + Alertmanager. We will collect general system metrics: cpu, ram, hdd.
The WMI exporter can also be used to monitor IIS sites and applications, network interfaces, services, and even local temperature.

Detailed information about the WMI exporter can be found on GitHub.
After installing the necessary exporter, go to the "Windows Services" panel (just type "Services" in the Windows search menu) and find the WMI exporter in the list. Next, you need to check that the service is working correctly.

If the exporter is running, it should start providing metrics at http://localhost:9182/metrics, similar to the Node Exporter. All you need to do is activate the collection of metrics in Prometheus, add a dashboard and set up your alerts:

Add to prometheus.yml:

  - job_name: 'Windows_Infra'
	static_configs:
		- targets: ['”localhost”:9182'] 

Add a Grafana dashboard. It monitors the display of the dashboard, connects and optimizes the output of a detailed summary of resources. It provides a visual panel for each host:

What you have to do now is describe the rule_files by adding the necessary metrics:

 groups:
	- name: Windows_Infra
		rules:
		- alert: InstanceDown
			expr: up{job="Windows_Infra"} == 0
			for: 5m
			labels:
				severity: page
			annotations:
				summary: "Instance {{ .instance }} down"
				description: "{{ .instance }} of job {{ .job }} has been down for more than 5 minutes."
	
		- alert: DiskSpaceUsage
			expr: 100.0 - 100 * (windows_logical_disk_free_bytes{job="Windows_Infra"} / windows_logical_disk_size_bytes{job="Windows_Infra"}) > 90
			for: 10m
			labels:
				severity: high
			annotations:
				summary: "Disk Space Usage (instance {{ $labels.instance }})"
				description: "Disk Space on Drive is used more than 90%\n  VALUE = {{ $value }}\n  LABELS: {{ $labels }}"

Add any other necessary parameters in the same way. When Alertmanager is configured, notifications will automatically be sent to connected systems.

As you can see, monitoring Windows servers can be done easily with Prometheus and Grafana. This solution greatly simplifies the process of managing a large fleet of machines and allows you to quickly solve problems that arise when administering server equipment with our company. In the following articles, we will consider the task of connecting Windows services.

Rent dedicated and virtual servers with instant deployment in reliable TIER III class data centers in the Netherlands and the USA. Free protection against DDoS attacks included, and your server will be ready for work in as little as 15 minutes. 24/7 Customer Support.

Other articles

19.05.2026

How to Connect to S3 Storage: A Step-by-Step Guide with Examples

A complete practical guide to connecting and working with S3-compatible object storage. Learn how to configure AWS CLI, Rclone, boto3, Cyberduck, S3 Browser, s3cmd ands3fs for backups, file management, synchronization and application integration.

15.05.2026

India Wanted to Buy a Supercomputer. They Were Denied. So They Built Their Own

In the late 1980s, India attempted to purchase a Cray Y-MP supercomputer, but the US refused to issue an export license. Instead, the country established C-DAC and built its own PARAM 8000 supercomputer within three years. We analyze how this was achieved and why the rejection by Cray ultimately worked in India's favor.

13.05.2026

OpenWebUI: System Prompt vs. Skills vs. MCP Tools — A Live URL Validation Case Study

The System Prompt defines what the model says; Skills define how to verify it; MCP Tools enable real-world execution. We break down this three-tier architecture using a support bot that validates URLs.

12.05.2026

What is Cloud Object Storage S3 - Amazon Simple Storage Service?

This article explains how S3 object storage works, where it fits best and what limitations you should understand before using it in production.

23.04.2026

Server Price Increases in 2026: Forecasts, Causes, and Recommendations

The server market is heating up again: memory prices are rising by tens of percent, GPUs are in short supply, and cloud providers are preparing to raise rates. We break down what's happening in 2026 and how to avoid overpaying.

Upload