Zuora Jobs

Senior Site Reliability Engineer

Zuora

Senior Site Reliability Engineer

Reposted 4 Days Ago

Be an Early Applicant

In-Office

Chennai, Tamil Nadu, IND

Senior level

In-Office

Chennai, Tamil Nadu, IND

Senior level

The Senior Site Reliability Engineer at Zuora will lead reliability architecture, design AI-driven automation, and enhance cloud infrastructure while mentoring other engineers.

The summary above was generated by AI

About Zuora

At Zuora, we help businesses grow smarter and adapt faster. Our platform powers modern business models — from subscriptions and usage-based pricing to AI-driven and outcome-based offerings — helping companies launch new products, automate complex billing, and unlock predictable, recurring revenue.

We’ve led the Subscription Economy for more than a decade. Now we’re evolving again by building the definitive platform for quote to cash and helping companies monetize their products and services with an adaptable, AI-ready foundation.

The Opportunity

We’re hiring a Senior Site Reliability Engineer to lead reliability strategy and drive AI-powered automation at scale. This role involves owning complex systems, shaping architecture, and influencing cross-functional teams.

You’ll:

Define and evolve SLOs, SLIs, and resilience patterns
Build AI-driven automation for detection, remediation, and forecasting
Lead cloud infrastructure and Kubernetes platforms
Drive incident response and operational excellence
Mentor engineers and influence org-wide reliability practices

About You

8+ years of hands-on experience in Site Reliability Engineering, DevOps, or large-scale production operations.
Advanced expertise in AWS, including architecture design across services such as EC2, EKS, VPC, IAM, RDS, S3, and CloudWatch.
Deep experience with Infrastructure-as-Code using Terraform, including complex modules, state management, and governance.
Strong programming and automation skills using Python and Shell; experience building production-grade automation systems.
Expert-level Linux systems knowledge, including performance tuning, security hardening, and deep troubleshooting.
Proven experience operating distributed systems and data streaming platforms such as Kafka in high-throughput environments.
Demonstrated ability to work independently on complex, ambiguous problems with broad organizational impact.
Proven technical leadership experience driving large, cross-team reliability or infrastructure initiatives, including setting technical direction, influencing design decisions, and mentoring engineers to deliver measurable outcomes at scale.

AI & Automation Expertise

Practical experience designing or implementing AI/ML-driven automation in operations, reliability, or platform engineering.
Experience integrating AI capabilities into monitoring, alerting, incident response, or workflow automation systems.
Strong understanding of how AI can be safely and effectively applied in production environments.

Nice to haves:

Experience with advanced observability platforms (Prometheus, Grafana, ELK, or similar) enhanced with AI-driven insights.
Familiarity with predictive analytics, anomaly detection, or AIOps platforms.
Experience influencing architectural decisions at a platform or product level.
Prior experience operating in a 24/7, global, high-availability SaaS environment.

The Team

Zuora's Cloud Engineering team ensures the reliability, scalability, and operational excellence of global SaaS platforms, operating 24x7 with a follow-the-sun model. The team collaborates closely with Engineering, Security,Product, and Support, leveraging technologies like AWS,Kubernetes, Kafka,Terraform, and Python, with a strong focus on AI-driven operations and automation.

Benefits

Zuora offers a comprehensive total rewards package designed to support ZEOs’ wellbeing, growth, and flexibility. While specific offerings may vary by country, we typically provide:

Competitive compensation, variable bonus and performance-based reward opportunities, and retirement programs
Medical, dental, and vision insurance
Generous, flexible time off, plus paid holidays, wellness days, and a company-wide year-end break
Paid parental leave (including fully paid leave for eligible ZEOs, subject to local policy)
Learning & development stipend to support ongoing growth
Opportunities to volunteer and give back, including charitable donation matching where available
Mental wellbeing resources and support

*Benefits may vary by location; details will be shared during the interview process

#ZEOLife at Zuora

ZEOs (our employees) are empowered to take ownership, challenge the status quo, and make a real impact. We:

Collaborate deeply across teams and regions
Learn constantly and iterate often
Build an inclusive, high-performance culture where people feel inspired, connected, and valued

Our Commitment to an Inclusive Workplace

Think, be and do you.
At Zuora, different perspectives, experiences, and contributions matter — everyone counts.

Zuora is proud to be an Equal Opportunity Employer committed to creating an inclusive environment for all. We do not discriminate on the basis of, and consider individuals seeking employment with Zuora without regard to, race, religion, color, national origin, sex (including pregnancy, childbirth, reproductive health decisions, or related medical conditions), sexual orientation, gender identity, gender expression, age, status as a protected veteran, status as an individual with a disability, genetic information, political views or activity, or other applicable legally protected characteristics.

We encourage candidates from all backgrounds to apply. Applicants in need of special assistance or accommodation during the interview process or in accessing our website may contact us by sending an email to [email protected] (or local equivalent, where applicable).

9th floor, Tower B, OMR, 9:13 Brigade World trade centre, SH 49A, Vijayendra Nagar, Perungudi, Chennai, Tamil Nadu, India, 600096

Similar Jobs

NVIDIA

Senior Site Reliability Engineer

19 Days Ago

In-Office or Remote

India

Senior level

Artificial Intelligence • Computer Vision • Hardware • Robotics • Metaverse

Operate and improve the reliability, availability, and performance of large-scale GeForce NOW services. Participate in incident triage and on-call rotations, build automation and tooling, enhance observability (metrics/logs/traces), drive SLO/SRI practices, run postmortems, and design/operate Kubernetes-based services across cloud and datacenter environments.

Top Skills: AWSAzureBashContainerizationElk/OpensearchGCPGoGrafanaKubernetesMicroservicesOpentelemetryPrometheusPython

Akamai Technologies

Senior Site Reliability Engineer

20 Days Ago

In-Office or Remote

India

Senior level

Cloud • Security • Software • Cybersecurity

Design, implement, and maintain reliable, scalable infrastructure for large distributed content delivery systems. Define and measure SLIs/SLOs, monitor availability and performance, troubleshoot incidents, and implement corrective actions. Develop automation to reduce manual work, participate in design reviews, and collaborate with product and engineering teams to improve system reliability and performance.

Top Skills: AdbmsBashCloud ComputingDatadogGrafanaJavaScriptOracle SqlPrometheusPythonUnix/Linux

Weekday, Inc.

Senior Site Reliability Engineer

23 Days Ago

In-Office

Chennai, Tamil Nadu, IND

Senior level

Artificial Intelligence • HR Tech • Professional Services • Software

Lead deployment, updates, and operational support for cloud environments. Ensure availability, scalability, performance, and reliability; define SLAs/SLOs/SLIs; drive IaC and automation; improve monitoring, observability, and security; perform RCAs and cost optimizations; collaborate with cross-functional teams to deliver reliable services.

Top Skills: AnsibleAWSAzure MonitorBashCi/CdCloudwatchDnsElkGrafanaHelmKubernetesLinuxLoad BalancingMariadbMongoDBPrometheusPythonShellTerraform

What you need to know about the Chennai Tech Scene

To locals, it's no secret that South India is leading the charge in big data infrastructure. While the environmental impact of data centers has long been a concern, emerging hubs like Chennai are favored by companies seeking ready access to renewable energy resources, which provide more sustainable and cost-effective solutions. As a result, Chennai, along with neighboring Bengaluru and Hyderabad, is poised for significant growth, with a projected 65 percent increase in data center capacity over the next decade.

Zuora

Senior Site Reliability Engineer

Zuora Chennai, Tamil Nadu, IND Office

Similar Jobs

Senior Site Reliability Engineer

Senior Site Reliability Engineer

Senior Site Reliability Engineer

What you need to know about the Chennai Tech Scene