We've had a number of issues in the last few weeks caused by traffic surges, which have led to mounts unmounting and out-of-memory (OOM) conditions (T5889 is today's example).
This has caused user-impacting downtime. We need to work out whether there are ways to tweak the config to reduce the risk of OOMs, and to ensure that if mounts such as varnish/gluster disconnect or break, they can be repaired safely and automatically.