Remote Site Reliability Engineer – Azure & Microsoft 365 Automation

Description:

  • The Site Reliability Engineer will lead the investigation and resolution of critical, recurring, or high-impact incidents across Azure and Microsoft 365 automation workflows.
  • This role involves deep-diving into PowerShell, Bicep, and YAML scripts to identify logic errors, misconfigurations, or scalability limitations within automated provisioning workflows.
  • The engineer will debug and optimize .NET (C#) components within Azure Functions or related application layers used in workflow orchestration.
  • The engineer will analyze usage patterns and telemetry data from Azure Monitor, Application Insights, and Log Analytics to identify systemic issues and opportunities for automation enhancement.
  • The engineer will implement fixes and design improvements to automation logic that reduce manual intervention and improve workflow reliability, such as auto-remediation scripts and retry logic.
  • They will own and evolve the automation framework for Teams and SharePoint Online lifecycle operations, including creation and deletion, external sharing restrictions, and role/ownership changes.
  • Collaboration with product owners and architects to introduce new automation use cases or extend existing workflows is expected.
  • Conducting post-incident reviews (PIRs) for high-severity incidents, driving root cause analysis (RCA), and implementing corrective actions are essential tasks.
  • The engineer will mentor L1 and L2 engineers, conduct knowledge-sharing sessions, and support the onboarding of new team members.
  • Staying updated with changes in Azure, Microsoft 365 APIs, and automation tooling, such as PowerShell modules and Bicep schema updates, is crucial.
  • Providing guidance on architecture and best practices for automation reliability is also part of the role.

Requirements:

  • Candidates must have 12+ years of experience in cloud platform engineering, DevOps, or site reliability engineering (SRE) roles with a focus on automation and operational excellence.
  • Proficiency in PowerShell scripting is required, including writing reusable modules, automation logic, and error handling for production workloads.
  • Extensive experience with Infrastructure as Code using Bicep is necessary, including authoring, debugging, and deploying templates for complex Azure resources.
  • A strong understanding of CI/CD processes and YAML pipelines, with hands-on experience in automating build/release workflows in Azure DevOps, is essential.
  • Proficiency in .NET (C#) is required, especially for debugging Azure Functions or working on backend components integrated into Microsoft 365 automation flows.
  • In-depth knowledge of the Microsoft 365 platform, including API usage, Teams & SharePoint Online provisioning, governance, and permissions management, is necessary.
  • Proven ability to troubleshoot and optimize Azure-native services such as API Management, Azure Functions, Storage, Service Bus, Key Vault, and Container Apps is required.
  • Candidates should be skilled in telemetry and observability, leveraging Azure Monitor, Log Analytics, Kusto queries, and custom logging to proactively identify issues.
  • Experience conducting root cause analysis, post-incident reviews, and implementing system-wide improvements to reduce incident frequency and MTTR is essential.
  • Experience in mentoring support engineers, contributing to runbook creation, and improving team capability over time is required.
  • Strong analytical, documentation, collaboration, and stakeholder communication skills are necessary.

Benefits:

  • The position is remote, allowing flexibility in work location.
  • Employees will have the chance to work with cutting-edge technologies in cloud platforms and automation.
  • There are opportunities for professional growth and development through mentoring and knowledge-sharing sessions.
  • The role provides a chance to influence and improve automation reliability and operational excellence within the organization.
  • Employees will be part of a collaborative team environment, working closely with product owners and architects on innovative projects.