NatWest Group Logo

NatWest Group

Site Reliability Engineer

Reposted 2 Days Ago
Be an Early Applicant
In-Office
3 Locations
Senior level
In-Office
3 Locations
Senior level
Maintain and improve availability, performance, monitoring, security, incident response, and capacity for production and non-production environments. Support releases, incident communication, risk assessment, and continuous reliability improvements across services.
The summary above was generated by AI

Join us as a Site Reliability Engineer

  • In this key role, you’ll support the improvement of non-functional and operational characteristics such as availability, performance, efficiency, change management, monitoring, security, incident response, and capacity planning of our products and services
  • You’ll enjoy significant stakeholder interaction, working in collaboration with engineers to ensure a principled approach to deliver change in a safe and secure way
  • This is a chance to join an inclusive team with a collaborative ethos and a commitment to innovation and professional development
  • We're offering this role at associate level
What you'll do

As our Site Reliability Engineer, you’ll work alongside colleagues and feature team members to meet defined service level objectives and continually improve systems and environments. You’ll proactively contribute new ideas and innovations to meet short term and longer term goals whilst at the same time balancing and managing risk.

You’ll also be accountable for the day-to-day health of both production and non-production environments, responding to incidents as required.

A typical day will involve:

  • Providing structure and supporting release processes, suggesting and making improvements where possible
  • Supporting the clear communication and frequent update of incident status to other teams and customers
  • Providing technical expertise and input to establish the risk tolerance of products and services
  • Supporting the maintenance of services once they are live by measuring and monitoring availability, latency, and overall system health
The skills you'll need

We’re looking for someone with strong knowledge of reliability systems thinking and experience of software engineering. You’ll need experience of Agile and DevOps. We’ll also look for financial services knowledge, and the ability to identify wider business impact, risk and opportunity, and make connections across key outputs and processes.

You'll also need:

  • Good knowledge and experience of incident management, change management, problem management and root cause analysis
  • At least five years of experience in AWS, Python, MongoDB, FASTAPI and ReactJS
  • Strong knowledge of GitLab, cloud environment and Microservices deployment using CI/CD pipeline
  • Experience in Core Java, SQL, PL/SQL, Splunk, Autosys along with understanding of shell scripting
  • Strong communication skills with the ability to proactively engage with a wide range of stakeholders

Hours

45

Job Posting Closing Date:

03/03/2026

Top Skills

Autosys
AWS
Ci/Cd
Fastapi
Gitlab
Java
Microservices
MongoDB
Pl/Sql
Python
React
Shell Scripting
Splunk
SQL

NatWest Group Chennai, Tamil Nadu, IND Office

Kosmo One, Plot No 14 3rd Main Road, Ambattur Industrial Estate, Chennai, Tamil Nadu, India, 600 058

Similar Jobs

8 Days Ago
In-Office
Chennai, Tamil Nadu, IND
Senior level
Senior level
Cloud • Software
Build and maintain reliable, scalable cloud services through automation, monitoring, CI/CD, IaC, incident response, capacity planning, cost optimization, and collaboration with platform and engineering teams.
Top Skills: AnsibleAWSAzureDockerElkGCPGitGithub ActionsGitlab Ci/CdGrafanaHelmJavaKubernetesNode.jsPrometheusPythonShellTerraform
13 Days Ago
In-Office
Chennai, Tamil Nadu, IND
Mid level
Mid level
Hardware
Build and operate production ML/LLM systems: design MLOps platform capabilities, standardized pipelines, model registry and artifact lineage; deploy using containers/Kubernetes and CI/CD; implement observability, SLOs, incident response, model/data monitoring, security controls, and enable teams with documentation and best practices.
Top Skills: Python,Pytorch,Tensorflow,Scikit-Learn,Kubernetes,Containers,Rest,Grpc,Ci/Cd,Infrastructure As Code,Azure,Aws,Gcp,Mlflow,Kubeflow,Airflow,Sagemaker,Vertex Ai,Azure Machine Learning,Feast,Great Expectations,Dvc,Evidently,Whylogs,Model Registry,Artifact Storage,Metadata Lineage
15 Days Ago
In-Office
Chennai, Tamil Nadu, IND
Senior level
Senior level
Artificial Intelligence • Information Technology • Software
Senior DevOps & SRE to automate deployments and infrastructure, manage AWS components, implement Terraform/Ansible automation, build CI/CD pipelines and monitoring, support demos/PoCs, handle upgrades, and serve on-call responsibilities.
Top Skills: AnsibleAWSCi/CdIpPythonShellSocketsTcpTerraformUdp

What you need to know about the Chennai Tech Scene

To locals, it's no secret that South India is leading the charge in big data infrastructure. While the environmental impact of data centers has long been a concern, emerging hubs like Chennai are favored by companies seeking ready access to renewable energy resources, which provide more sustainable and cost-effective solutions. As a result, Chennai, along with neighboring Bengaluru and Hyderabad, is poised for significant growth, with a projected 65 percent increase in data center capacity over the next decade.

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account