Skip to main content

ALZ Monitoring Alerts Process

This page is a guide for ALZ Engineers to review and act on the Azure monitoring alerts within Azure Landing zone teams channel. Alerts are only generated for the Spokes we support end-to-end: Prod Hub and Prod Shared Services.

ALZ Monitoring Rota

As an ALZ engineer you will use the current OOH rota (provided by service level management) as a guide for who will monitor the Azure Monitoring alerts channel in the day time. Please note the rota needs to be updated:

  • When Providing Holiday or Sickness cover
  • When we have new starter\leaver
  • When it’s start of a new year

Below is a snapshot of the weekly rota:

Week 1 Week 2 Week 3 Week 4
Alan Ravi Noor Kokulan
06/01/2025 13/01/2025 20/01/2025 27/01/2025
03/02/2025 10/02/2025 24/02/2025 17/02/2025
03/03/2025 10/03/2025 17/03/2025 24/03/2025
31/03/2025 07/04/2025 14/04/2025 21/04/2025
28/04/2025 05/05/2025 12/05/2025 19/05/2025
26/05/2025 02/06/2025 09/06/2025 16/06/2025
23/06/2025 30/06/2025 07/07/2025 14/07/2025
21/07/2025 28/07/2025 04/08/2025 11/08/2025
18/08/2025 25/08/2025 01/09/2025 08/09/2025
15/09/2025 22/09/2025 29/09/2025 06/10/2025
13/10/2025 20/10/2025 27/10/2025 03/11/2025
10/11/2025 17/11/2025 24/11/2025 01/12/2025
08/12/2025 15/12/2025 22/12/2025 29/12/2025
05/01/2026 12/01/2026 19/01/2026 26/01/2026

Alert Notifications

We have configured Azure Monitor to generate alerts within our Monitoring Alerts MS Teams channel.

Teams Channel:

Monitoring Alerts Channel

Example Alert:

Monitoring Alerts

Reviewing Alert Notifications

ALZ Engineers will be responsible for reviewing the Alerts within the Monitoring Alerts channel each day. When reviewing the alert the engineer needs to determine whether it’s an Incident or not:

If the Alert is not severe enough to be an incident we should at least follow Moderate Incident type as guide to a less involved process.

All alerts must be acknowledged by an ALZ Engineer Regardless a incident or not to show that the alert as been reviewed and then resolved. For now, this can be done by just replying to the Alert message that is created in the Teams chat (There will be slicker process around this in the future)

ALZ Incident Management

ALZ Incident Management Approach

This page was last reviewed on 24 January 2024. It needs to be reviewed again on 24 April 2024 .
This page was set to be reviewed before 24 April 2024. This might mean the content is out of date.