Description
In a Windows cluster environment with a quorum disk, the Invoke-IcingaCheckDiskHealth plugin throws the following error after a node reboot and failback of the quorum disk to the rebooted host:
Icinga Plugin execution error was thrown during API request:
Exception calling "Value" with "0" argument(s): "Exception calling "Value" with "0" argument(s): "Exception calling "NextValue" with "0" argument(s): "Instance '1' does not exist in the specified Category."""
The error only occurs in the Icinga Web GUI. If we execute the command locally on the host, there is no error.
I assume it has to do with the fact that the quorum disk is always owned by exactly one cluster node. If you reboot this node, the quorum disk fails over to another node. After the node has rebooted, the quorum disk is still owned by one of the other cluster nodes.
Please note: At this point there is no error!
But when you fail the quorum disk back to the rebooted cluster node, the check goes into an UNKNOWN state and throws the error mentioned above. This is probably because the performance counters for the quorum disk are missing: at the time Icinga started on the rebooted cluster node, the quorum disk wasn't owned by it.
A manual restart of Icinga is needed to make it work again.
But if so, why doesn't the error occur when you execute the check locally on the rebooted node?
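To help narrow this down, a rough diagnostic like the one below could be run in a freshly started PowerShell session on the rebooted node after the failback. It only uses the .NET `System.Diagnostics.PerformanceCounterCategory` class. The category name `PhysicalDisk` and the instance name `1` are assumptions derived from the error message, not something confirmed from the plugin source.

```powershell
# Sketch only: category and instance name are assumptions taken from the
# error message ("Instance 1 ..."); adjust them to your environment.
$CategoryName = 'PhysicalDisk';
$InstanceName = '1';

$Category = New-Object -TypeName System.Diagnostics.PerformanceCounterCategory -ArgumentList $CategoryName;

# List every instance of the category that this new process can see
$Category.GetInstanceNames() | Sort-Object;

# Returns $true if the instance named in the error message currently exists
$Category.InstanceExists($InstanceName);
```

If the instance is listed here while the check triggered through the API still fails, that would be consistent with the long-running Icinga process holding on to counter information from before the failback, whereas a local run starts a new process each time. This is only a guess at this point.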