We have a lot of pages lately, and at the current noise level it’s hard to properly track all of them. So it’s important to reduce that noise. On one side, the noisiest genuine alerts are being addressed (the main one currently being the forums issue) – but on the other side, it would be good to also do something about one of the most frequent false positive: the maintenance operation.
There are often operations that can generate downtimes - and in these cases, currently we let the alert happen, comment on the Mattermost thread, and close the alert (hopefully). Could we instead prevent the alert from being generated at all in these cases? If the downtime is expected, it isn’t actually something to page about.
I haven’t tested it, but there seem to now be a supported feature to schedule one-time or recurring monitor downtimes:
Could we start using that when performing a maintenance operation?