Over the past few days, there have been numerous instances of servers going down with icinga returning "No data received from host".
This included almost every server with any sort of web access, with the exception of cp* and graylog121. So it included, mw*, reports121, matomo101, mon111, puppet111, mwtask111, test101, mail121, and phab121.
It leads to temporary visible user-facing outages, usually lasting no more then 20-30 seconds. But it happens fairly frequently lately, sometimes 2-3 times in a day. It self-recovers, usually taking a few minutes to fully recover, however sometimes it will fully recover, and less than 10 minutes later, will repeat the same issues, and recover again, this time usually staying up.
But the issues have become to frequent to make it a rare occurrence, and definitely should be investigated ASAP.
i think if everything is affected then it's because of the network
I'm leaving this UBN, since the outage is currently still on-going (affecting only some of the services at this point), and should be investigated ASAP. It can be lowered if necessary.