Zealogics LLC Logo

Zealogics LLC

Site Reliability Engineer – Azure & Microsoft 365 Automation (Remote Opportunity)

Job Posted 6 Days Ago Reposted 6 Days Ago
Be an Early Applicant
Remote
Hiring Remotely in India
Expert/Leader
Remote
Hiring Remotely in India
Expert/Leader
Lead incident resolution in Azure and Microsoft 365 automation, optimize scripts, mentor engineers, and enhance automation reliability. Conduct post-incident reviews and implement improvements.
The summary above was generated by AI

Key Responsibilities: 

  • Lead investigation and resolution of critical, recurring, or high-impact incidents across Azure and Microsoft 365 automation workflows. 

  • Deep-dive into PowerShell, Bicep, and YAML scripts to identify logic errors, misconfigurations, or scalability limitations within automated provisioning workflows. 

  • Debug and optimize .NET (C#) components within Azure Functions or related application layers used in workflow orchestration. 

  • Analyze usage patterns and telemetry data from Azure Monitor, Application Insights, and Log Analytics to identify systemic issues or opportunities for automation enhancement. 

  • Implement fixes and design improvements to automation logic that reduce manual intervention and improve workflow reliability (e.g., auto-remediation scripts, retry logic). 

  • Own and evolve the automation framework for Teams and SPO lifecycle operations — including operations like create/delete, external sharing restrictions, and role/ownership changes. 

  • Collaborate with product owners and architects to introduce new automation use cases or extend existing workflows. 

  • Conduct post-incident reviews (PIRs) for high-severity incidents, drive root cause analysis (RCA), and implement corrective actions. 

  • Mentor L1 and L2 engineers, conduct knowledge-sharing sessions, and support onboarding of new team members. 

  • Stay updated with changes in Azure, Microsoft 365 APIs, and automation tooling (PowerShell modules, Bicep schema updates, etc.) 

  • Provide guidance on architecture and best practices for automation reliability 

Required Skills & Experience: 

  • 12+ years of experience in cloud platform engineering, DevOps, or site reliability engineering (SRE) roles with a focus on automation and operational excellence. 

  • Proficiency in PowerShell scripting, including writing reusable modules, automation logic, and error handling for production workloads. 

  • Extensive experience with Infrastructure as Code using Bicep, including authoring, debugging, and deploying templates for complex Azure resources. 

  • Strong understanding of CI/CD processes and YAML pipelines, with hands-on experience in automating build/release workflows in Azure DevOps. 

  • Proficient in .NET (C#) — especially for debugging Azure Functions or working on backend components integrated into M365 automation flows. 

  • In-depth knowledge of Microsoft 365 platform, including API usage, Teams & SharePoint Online provisioning, governance, and permissions management. 

  • Proven ability to troubleshoot and optimize Azure-native services such as API Management, Azure Functions, Storage, Service Bus, Key Vault, and Container Apps. 

  • Skilled in telemetry and observability — leveraging Azure Monitor, Log Analytics, Kusto queries, and custom logging to proactively identify issues. 

  • Experience conducting root cause analysis, post-incident reviews, and implementing system-wide improvements to reduce incident frequency and MTTR. 

  • Experience in mentoring support engineers, contributing to runbook creation, and improving team capability over time. 

  • Strong analytical, documentation, collaboration and stakeholder communication skills 

Top Skills

.Net
Application Insights
Azure
Azure Devops
Azure Monitor
Bicep
C#
Log Analytics
Microsoft 365
Powershell
Yaml

Similar Jobs

4 Hours Ago
Remote or Hybrid
Hyderabad, Telangana, IND
Senior level
Senior level
Artificial Intelligence • Cloud • HR Tech • Information Technology • Productivity • Software • Automation
Responsible for enhancing performance, scalability, and reliability of the ServiceNow Platform. Involves architecting performance testing projects, analyzing system performance, and collaborating with engineering teams.
Top Skills: ElkGrafanaInfluxdbJavaJava ScriptJmeterKibanaLoad RunnerPrometheusPythonShell ScriptsSplunk
4 Hours Ago
Remote or Hybrid
Hyderabad, Telangana, IND
Senior level
Senior level
Artificial Intelligence • Cloud • HR Tech • Information Technology • Productivity • Software • Automation
The role involves maintaining automation test frameworks, collecting quality metrics, designing testing strategies, and supporting troubleshooting in engineering teams.
Top Skills: EclipseGitJavaJavaScriptJenkinsJunitMavenSeleniumTestng
4 Hours Ago
Remote or Hybrid
Hyderabad, Telangana, IND
Senior level
Senior level
Artificial Intelligence • Cloud • HR Tech • Information Technology • Productivity • Software • Automation
Lead and manage software engineering teams to deliver high-quality solutions on time, leveraging AI and Agile practices while mentoring engineers.
Top Skills: AIC++GraphQLJavaJavaScriptReduxRubyShell

What you need to know about the Chennai Tech Scene

To locals, it's no secret that South India is leading the charge in big data infrastructure. While the environmental impact of data centers has long been a concern, emerging hubs like Chennai are favored by companies seeking ready access to renewable energy resources, which provide more sustainable and cost-effective solutions. As a result, Chennai, along with neighboring Bengaluru and Hyderabad, is poised for significant growth, with a projected 65 percent increase in data center capacity over the next decade.
By clicking Apply you agree to share your profile information with the hiring company.

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account