For the past few weeks we have been having an issue where roughly once a week the HA status to our secondary server switches to not alive but when I check the secondary server PMP is still running. We do get email alerts from both servers that HA is not alive.
We have been able to fix the issue by restarting PMP or the server.
When I check the pmp0 log file on the secondary server I see the following four entries repeated a number of times at the time of failure.
[20:47:54:892]|[05-09-2023]|[com.adventnet.passtrix.ProcessUtil]|[INFO]|[11750]: Cannot run program "ps": CreateProcess error=2, The system cannot find the file specified|
[20:47:54:892]|[05-09-2023]|[com.adventnet.passtrix.utils.HAUtils]|[INFO]|[11750]: Going to start rubyrep replication process...|
[20:47:54:970]|[05-09-2023]|[com.adventnet.passtrix.ProcessUtil]|[INFO]|[71]: Going to kill process for ResourceId 8273|
[20:47:54:970]|[05-09-2023]|[com.adventnet.passtrix.ProcessUtil]|[INFO]|[71]: No Process List for 8,273|
I would appreciate any advice you can give me.