Importance of Secondary On-Call
Posted: Wed Feb 12, 2025 8:50 am
Automate Manual Toil
On-call often involves executing the same manual steps repeatedly. Look for opportunities to automate these repeated tasks. This could be as simple as a runbook script or a more sophisticated auto-remediation system. The more you can automate, the easier on-call becomes.
Foster an On-Call-Friendly Culture
Improving on-call is not just a technical challenge but also a japan whatsapp number data cultural one. Work to develop a culture emphasizing the importance of a healthy on-call experience. This means giving engineers time to work on alert hygiene, sharing best practices across teams, and celebrating alert reduction wins.
It’s also very important that teams maintain an on-call set-up with primary and secondary on-call engineers. The specific roles and responsibilities of the primary and secondary on-call engineers can vary depending on the team’s needs. Some teams use the secondary on-call as a backup for any pages that the primary might miss, while others assign the primary to handle only urgent pages and assign low-priority tickets to the secondary.
On-call often involves executing the same manual steps repeatedly. Look for opportunities to automate these repeated tasks. This could be as simple as a runbook script or a more sophisticated auto-remediation system. The more you can automate, the easier on-call becomes.
Foster an On-Call-Friendly Culture
Improving on-call is not just a technical challenge but also a japan whatsapp number data cultural one. Work to develop a culture emphasizing the importance of a healthy on-call experience. This means giving engineers time to work on alert hygiene, sharing best practices across teams, and celebrating alert reduction wins.
It’s also very important that teams maintain an on-call set-up with primary and secondary on-call engineers. The specific roles and responsibilities of the primary and secondary on-call engineers can vary depending on the team’s needs. Some teams use the secondary on-call as a backup for any pages that the primary might miss, while others assign the primary to handle only urgent pages and assign low-priority tickets to the secondary.