29.11.2022

Hostkey Windows Infrastructure Monitoring Architecture

HOSTKEY
Rent dedicated and virtual servers with instant deployment in reliable TIER III class data centers in the Netherlands and the USA. Free protection against DDoS attacks included, and your server will be ready for work in as little as 15 minutes. 24/7 Customer Support.

Author: Nikita Zubarev, Senior DevOps, HOSTKEY Lead Infrastructure Specialist

Earlier we talked about how Linux systems are monitored. Now let's look at how to set up monitoring for Windows Server.

In one of the previous articles, we discussed integrating FreeIPA with Active Directory. While developing the solution described in that article, we needed to collect metrics from the servers in our Windows infrastructure. In this article, we will look at how you can set up monitoring for Windows servers using an existing system built on a combination of Prometheus + Grafana + Alertmanager. We will collect general system metrics: CPU, RAM, and disk.
The metrics are exposed by the WMI exporter (renamed windows_exporter in later releases, hence the windows_ prefix on the metric names used in this article). Besides the general metrics, it can also monitor IIS sites and applications, network interfaces, services, and even local temperature.

Detailed information about the WMI exporter can be found on GitHub.
After installing the necessary exporter, go to the "Windows Services" panel (just type "Services" in the Windows search menu) and find the WMI exporter in the list. Next, you need to check that the service is working correctly.

If the exporter is running, it should start providing metrics at http://localhost:9182/metrics, similar to the Node Exporter. All you need to do is activate the collection of metrics in Prometheus, add a dashboard and set up your alerts:

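Add to the scrape_configs section of prometheus.yml (replace localhost with the address of the Windows server if Prometheus runs on a different machine):

  - job_name: 'Windows_Infra'
    static_configs:
      - targets: ['localhost:9182']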
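For more than one Windows server, the same job can list several targets. The fragment below is a minimal sketch, not the author's configuration: the host names, the scrape interval, and the environment label are all hypothetical placeholders.

```yaml
scrape_configs:
  - job_name: 'Windows_Infra'
    scrape_interval: 30s                    # assumed value; tune to your needs
    static_configs:
      - targets:
          - 'win-host-1.example.com:9182'   # hypothetical host
          - 'win-host-2.example.com:9182'   # hypothetical host
        labels:
          environment: 'production'         # hypothetical label, attached to every series from these targets
```

Each target then appears as its own instance label value, so dashboards and alert rules can tell hosts apart.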

Add a Grafana dashboard. It reads the collected metrics from Prometheus and displays a detailed summary of resources, with a visual panel for each host:

Now describe the alerting rules for the metrics you need in a rules file listed under rule_files in prometheus.yml:

groups:
  - name: Windows_Infra
    rules:
      # Fires when Prometheus has been unable to scrape the exporter for 5 minutes
      - alert: InstanceDown
        expr: up{job="Windows_Infra"} == 0
        for: 5m
        labels:
          severity: page
        annotations:
          summary: "Instance {{ $labels.instance }} down"
          description: "{{ $labels.instance }} of job {{ $labels.job }} has been down for more than 5 minutes."

      # Fires when a logical disk has been more than 90% full for 10 minutes
      - alert: DiskSpaceUsage
        expr: 100.0 - 100 * (windows_logical_disk_free_bytes{job="Windows_Infra"} / windows_logical_disk_size_bytes{job="Windows_Infra"}) > 90
        for: 10m
        labels:
          severity: high
        annotations:
          summary: "Disk Space Usage (instance {{ $labels.instance }})"
          description: "More than 90% of disk space is used\n  VALUE = {{ $value }}\n  LABELS: {{ $labels }}"
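For Prometheus to load these rules and forward firing alerts, prometheus.yml has to reference both the rules file and Alertmanager. A minimal sketch follows, assuming the rules are saved as windows_rules.yml next to prometheus.yml (a hypothetical file name) and that Alertmanager listens on its default port 9093 on the same machine (an assumed address):

```yaml
# prometheus.yml (fragment)
rule_files:
  - "windows_rules.yml"                 # hypothetical file name for the rules above

alerting:
  alertmanagers:
    - static_configs:
        - targets: ['localhost:9093']   # assumed Alertmanager address
```

Before reloading Prometheus, the files can be validated with promtool check rules windows_rules.yml and promtool check config prometheus.yml.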

Add rules for any other metrics you need in the same way. Once Alertmanager is configured, notifications are sent automatically to the connected systems.
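As an illustration of such additional rules, the sketch below covers the CPU and RAM metrics mentioned at the start of the article. It is an assumption-laden example, not the author's rule set: the alert names, thresholds, and severities are hypothetical, and the metric names differ between exporter versions, so check them against the output of http://localhost:9182/metrics before use.

```yaml
groups:
  - name: Windows_Infra_Resources       # hypothetical group name
    rules:
      # CPU busy percentage = 100 minus the average idle share over 5 minutes
      - alert: HighCpuUsage             # hypothetical alert name
        expr: 100 - (avg by (instance) (rate(windows_cpu_time_total{job="Windows_Infra", mode="idle"}[5m])) * 100) > 90
        for: 10m
        labels:
          severity: high
        annotations:
          summary: "CPU Usage (instance {{ $labels.instance }})"
          description: "CPU usage is above 90%\n  VALUE = {{ $value }}"

      # Memory metric names are taken from older exporter releases
      # (os and cs collectors); newer releases may expose different names.
      - alert: HighMemoryUsage          # hypothetical alert name
        expr: 100 - 100 * (windows_os_physical_memory_free_bytes{job="Windows_Infra"} / windows_cs_physical_memory_bytes{job="Windows_Infra"}) > 90
        for: 10m
        labels:
          severity: high
        annotations:
          summary: "Memory Usage (instance {{ $labels.instance }})"
          description: "Memory usage is above 90%\n  VALUE = {{ $value }}"
```

These rules can be appended to the existing rules list or kept in a separate file, in which case that file also has to be listed under rule_files.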

As you can see, Windows servers can be monitored easily with Prometheus and Grafana. This solution greatly simplifies managing a large fleet of machines and lets you quickly resolve problems that arise when administering server equipment. In the following articles, we will look at monitoring Windows services.
