I've gone ahead and deputized Justin Garrisson and Chris Smith to respond to things for the next few days until Phil is back online.
I'm sure they'd appreciate help, as they've never touched any of our infra, Drupal, or Chef code. Please find them on the SCALE Slack if you have a moment.
On Fri, Oct 6, 2023 at 9:54 PM Ilan Rabinovitch <ilan@linuxfests.org> wrote:
Hi folks,
We're increasingly seeing outages on the website this year. So far we've gotten by on KC or me noticing and rebooting things. While we obviously need to stop the outages from happening at all, we probably should still have some plan in place for how we respond to them - especially during critical times of the year (e.g. today, for reg, sponsorship, and CFP).
I know this isn't anyone's "job", but given how critical the infra is to the conference, should we set up an on-call rotation for it? We seem to be getting caught at various times when nobody with access is available to respond.
Thoughts?
Ilan