Remote Site Reliability Engineer – Azure & Microsoft 365 Automation (Remote Opportunity)
Please let Zealogics.com know you found this job on RemoteYeah. This helps us grow 🌱.
Description:
The Site Reliability Engineer will lead the investigation and resolution of critical, recurring, or high-impact incidents across Azure and Microsoft 365 automation workflows.
This role involves deep-diving into PowerShell, Bicep, and YAML scripts to identify logic errors, misconfigurations, or scalability limitations within automated provisioning workflows.
The engineer will debug and optimize .NET (C#) components within Azure Functions or related application layers used in workflow orchestration.
A key responsibility is analyzing usage patterns and telemetry data from Azure Monitor, Application Insights, and Log Analytics to identify systemic issues and opportunities for automation enhancement.
The engineer will implement fixes and design improvements to automation logic that reduce manual intervention and improve workflow reliability, such as auto-remediation scripts and retry logic.
They will own and evolve the automation framework for Teams and SharePoint Online lifecycle operations, such as create/delete, external sharing restrictions, and role/ownership changes.
Collaboration with product owners and architects to introduce new automation use cases or extend existing workflows is expected.
Conducting post-incident reviews (PIRs) for high-severity incidents, driving root cause analysis (RCA), and implementing corrective actions are essential tasks.
The engineer will mentor L1 and L2 engineers, conduct knowledge-sharing sessions, and support the onboarding of new team members.
Staying updated with changes in Azure, Microsoft 365 APIs, and automation tooling, such as PowerShell modules and Bicep schema updates, is crucial.
Providing guidance on architecture and best practices for automation reliability is also part of the role.
Requirements:
Candidates must have 12+ years of experience in cloud platform engineering, DevOps, or site reliability engineering (SRE) roles with a focus on automation and operational excellence.
Proficiency in PowerShell scripting is required, including writing reusable modules, automation logic, and error handling for production workloads.
Extensive experience with Infrastructure as Code using Bicep is necessary, including authoring, debugging, and deploying templates for complex Azure resources.
A strong understanding of CI/CD processes and YAML pipelines, with hands-on experience in automating build/release workflows in Azure DevOps, is essential.
Proficiency in .NET (C#) is required, especially for debugging Azure Functions or working on backend components integrated into Microsoft 365 automation flows.
In-depth knowledge of the Microsoft 365 platform, including API usage, Teams & SharePoint Online provisioning, governance, and permissions management, is necessary.
Proven ability to troubleshoot and optimize Azure-native services such as API Management, Azure Functions, Storage, Service Bus, Key Vault, and Container Apps is required.
Candidates should be skilled in telemetry and observability, leveraging Azure Monitor, Log Analytics, Kusto queries, and custom logging to proactively identify issues.
Experience conducting root cause analysis, post-incident reviews, and implementing system-wide improvements to reduce incident frequency and MTTR is essential.
Experience in mentoring support engineers, contributing to runbook creation, and improving team capability over time is required.
Strong analytical, documentation, collaboration, and stakeholder communication skills are necessary.
Benefits:
The position is remote, allowing flexibility in work location.
Employees will have the chance to work with cutting-edge technologies in cloud platforms and automation.
There are opportunities for professional growth and development through mentoring and knowledge-sharing sessions.
The role provides a chance to influence and improve automation reliability and operational excellence within the organization.
Employees will be part of a collaborative team environment, working closely with product owners and architects on innovative projects.