Clearwater Analytics (CWAN) Logo

Clearwater Analytics (CWAN)

Senior Site Reliability Engineer

Posted 4 Hours Ago
Be an Early Applicant
Hybrid
Mumbai, Maharashtra
Senior level
Hybrid
Mumbai, Maharashtra
Senior level
Build automation and internal tools in Python to provision, monitor, and support a growing fleet of client deployments across AWS and Azure. Standardize environments, improve observability, convert runbooks into automated remediation and self-service tools, and extend provisioning pipelines (Terraform) to speed onboarding. Partner with onboarding, support, and client success teams to eliminate operational toil.
The summary above was generated by AI
About the Team

Beacon by Clearwater is the AI-powered risk analytics and modeling arm of the Clearwater platform, giving institutional investors the tools to test scenarios and evaluate portfolio exposures in real time.

As Clearwater brings Beacon to more clients, the number of client environments we provision, monitor, and support grows with it and the only way that works is through standardization and automation. This team builds the tooling that keeps a growing fleet of client deployments consistent, observable, and supportable: automating away repetitive operational work, turning incident learnings into permanent platform fixes, and giving client-facing teams the self-service tools they need to onboard and support clients without engineering escalations.

What You’ll Do
  • Build internal tools and automation primarily in Python to monitor, diagnose, and support a fleet of client deployments across AWS and Azure.
  • Drive standardization across client environments: detect and remediate configuration and infrastructure drift, converge legacy deployments onto golden paths, and make “the standard way” the easy way.
  • Improve fleet-wide observability: build monitoring, alerting, and dashboards that surface problems across all client deployments before clients notice them.
  • Turn runbooks into code; converting the manual diagnostic and remediation steps support engineers perform today into automated checks, self-healing jobs, and one-click tools.
  • Extend the client provisioning and deployment pipeline (Terraform, configuration generation) to make onboarding new clients faster and more repeatable.
  • Work directly with client-facing teams (onboarding, support, client success) to find where operational toil lives.
What We’re Looking For
  • 7-10 years of experience in software engineering, site reliability engineering, DevOps, or platform engineering.
  • Strong programming skills in Python (our platform core and tooling language); comfort writing production-quality code with tests, not just scripts.
  • Hands-on experience with at least one major cloud provider (AWS or Azure): networking (VPCs/VNets, subnets, security groups, load balancers, VPN), IAM/RBAC, storage, and compute.
  • Working knowledge of infrastructure-as-code, ideally Terraform, and what it means to manage many environments from shared modules and per-environment configuration.
  • Solid Linux fundamentals: you can read logs, trace a process, debug a service that won’t start, and automate what you did, so no one must do it by hand again.
  • An automation reflex: when you solve a problem twice, your instinct is to build a tool.
  • A collaborative, service-oriented mindset: your customers are internal teams, and your success is measured by how much easier you make their jobs.
Nice to Have
  • Experience operating multi-tenant or fleet-style environments (many similar deployments managed as one).
  • Observability stack experience (metrics, log aggregation, alerting, dashboards).
  • Formal incident management experience (on-call, postmortems, blameless RCA culture).
  • Exposure to financial services, fintech, or other regulated environments.
Why This Role
  • Direct, visible impact: every tool you ship makes onboarding the next client faster and supporting every existing client cheaper. This team is a force multiplier for the entire Beacon business.
  • Breadth: you’ll touch cloud infrastructure, a large Python platform codebase, deployment pipelines, and the human workflows of support and onboarding teams.
  • Growth: you’ll work across nearly every layer of a sophisticated financial-engineering platform, alongside experts in cloud infrastructure, quantitative finance, and large-scale SaaS operations.

Similar Jobs at Clearwater Analytics (CWAN)

Senior level
Fintech • Software • Financial Services
Build Python automation and internal tools to provision, monitor, and support many client deployments across AWS and Azure. Standardize environments, remediate drift, improve observability with monitoring/alerts/dashboards, convert runbooks into automated tooling, extend Terraform-based provisioning pipelines, and collaborate with onboarding and support teams to reduce operational toil.
Top Skills: AlertingAWSAzureDashboardsIamLinuxLoad BalancersLog AggregationMonitoringPythonRbacSecurity GroupsTerraformVnetVpcVpn
Senior level
Fintech • Software • Financial Services
Build Python automation and internal tools to provision, monitor, and support many client deployments across AWS and Azure. Standardize environments, remediate drift, improve observability with monitoring/alerts/dashboards, convert runbooks into automated tooling, extend Terraform-based provisioning pipelines, and collaborate with onboarding and support teams to reduce operational toil.
Top Skills: AlertingAWSAzureDashboardsIamLinuxLoad BalancersLog AggregationMonitoringPythonRbacSecurity GroupsTerraformVnetVpcVpn
Mid level
Fintech • Software • Financial Services
Build automation and internal tooling (primarily Python) to provision, monitor, and support a fleet of client deployments across AWS and Azure. Standardize environments, detect and remediate drift, improve observability with monitoring/alerts/dashboards, convert runbooks into automated remediation and self-service tools, extend Terraform-based provisioning pipelines, and collaborate with onboarding and support teams to reduce operational toil.
Top Skills: AlertingAWSAzureComputeDashboardsIamLinuxLoad BalancersLog AggregationMetricsPythonRbacSecurity GroupsStorageSubnetsTerraformVnetVpcVpn

What you need to know about the Chennai Tech Scene

To locals, it's no secret that South India is leading the charge in big data infrastructure. While the environmental impact of data centers has long been a concern, emerging hubs like Chennai are favored by companies seeking ready access to renewable energy resources, which provide more sustainable and cost-effective solutions. As a result, Chennai, along with neighboring Bengaluru and Hyderabad, is poised for significant growth, with a projected 65 percent increase in data center capacity over the next decade.

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account