Zafin Logo

Zafin

Cloud Site Reliability Engineer I

Posted 2 Days Ago
Be an Early Applicant
Chennai, Tamil Nadu
Senior level
Chennai, Tamil Nadu
Senior level
The Cloud Site Reliability Engineer I will ensure the seamless operation and maintenance of Zafin's cloud infrastructure, enhancing system reliability and performance. Responsibilities include providing technical support for cloud issues, conducting incident management, optimizing cloud infrastructure, and developing automation scripts, while collaborating with various internal teams for operational enhancements.
The summary above was generated by AI

Who we are

Founded in 2002, Zafin offers a SaaS product and pricing platform that simplifies core modernization for top banks worldwide. Our platform enables business users to work collaboratively to design and manage pricing, products, and packages, while technologists streamline core banking systems. 

With Zafin, banks accelerate time to market for new products and offers while lowering the cost of change and achieving tangible business and risk outcomes. The Zafin platform increases business agility while enabling personalized pricing and dynamic responses to evolving customer and market needs. 

Zafin is headquartered in Vancouver, Canada, with offices and customers around the globe including ING, CIBC, HSBC, Wells Fargo, PNC, and ANZ. Zafin is proud to be recognized as a top employer and certified Great Place to Work® in Canada, India and the UK.  

Job Summary

Zafin, a global leader in financial technology solutions, is seeking a Cloud Site Reliability Engineer I (CSRE I) to join our dynamic team. Reporting directly to the VP of Cloud Services, this role is pivotal in ensuring the seamless operation, support, and maintenance of Zafin's cloud infrastructure and applications. As a CSRE I, you will leverage your expertise to enhance system reliability, scalability, and performance, collaborating with cross-functional teams to ensure exceptional service delivery to clients and stakeholders.

The ideal candidate will have a strong foundation in cloud platforms, incident management, and proactive operational practices, with a continuous improvement mindset to adapt to advancing technologies.

Key Responsibilities

  • Act as a level-3 technical support expert for Zafin products and Azure cloud issues.
  • Collaborate with Product, Platform Engineering, and DevOps teams to introduce operational enhancements and resiliency measures.
  • Conduct Root Cause Analysis (RCA) for Severity 1 and 2 incidents, ensuring timely communication with stakeholders.
  • Participate in external client escalation calls, providing technical insights and solutions.
  • Optimize cloud infrastructure for scalability, performance, and cost-effectiveness.
  • Manage container orchestration platforms such as Azure Kubernetes Service (AKS) or OpenShift to ensure optimal workload distribution.
  • Enhance monitoring and tracking tools (e.g., Azure Monitor, ELK, Log Analytics) to proactively detect and resolve issues.
  • Collaborate with internal teams to implement best practices for Azure cloud deployment and configuration.
  • Develop automation scripts for routine operational tasks, incident responses, and cloud cost optimization.
  • Maintain detailed documentation of processes, incidents, and cloud architecture.
  • Participate in a rotating on-call schedule to ensure 24/7 availability for critical incidents.

Qualifications

  • Bachelor’s degree in Computer Science, Engineering, or a related field.
  • 8+ years of experience in cloud support, operations, or a related role.
  • Hands-on experience with Microsoft Azure (preferred) or other cloud platforms.
  • Proficiency in container orchestration platforms like AKS or OpenShift.
  • Expertise in automated deployment pipelines, particularly Azure DevOps.
  • Familiarity with enterprise monitoring platforms such as Azure Insights, Grafana, or Site24/7.
  • Proficiency in scripting languages like PowerShell or Python.
  • Proven experience in incident management and maintaining SLAs for critical production environments.
  • Knowledge of Postgres databases.

Preferred Qualifications

  • Certifications in cloud platforms (e.g., Microsoft Azure Administrator).
  • Familiarity with ITSM tools (e.g., Zendesk, ServiceNow).
  • Knowledge of compliance and security best practices in cloud environments.

Soft Skills

  • Strong problem-solving and analytical abilities.
  • Excellent communication and collaboration skills.
  • Attention to detail and a proactive mindset.
  • Innovative and forward-thinking approach to operational challenges.

What’s in it for you

Joining our team means being part of a culture that values diversity, teamwork, and high-quality work. We offer competitive salaries, annual bonus potential, generous paid time off, paid volunteering days, wellness benefits, and robust opportunities for professional growth and career advancement. Want to learn more about what you can look forward to during your career with us? Visit our careers site and our openings: zafin.com/careers

Zafin welcomes and encourages applications from people with disabilities. Accommodations are available on request for candidates taking part in all aspects of the selection process. 

Zafin is committed to protecting the privacy and security of the personal information collected from all applicants throughout the recruitment process. The methods by which Zafin contains uses, stores, handles, retains, or discloses applicant information can be accessed by reviewing Zafin’s privacy policy at https://zafin.com/privacy-notice/. By submitting a job application, you confirm that you agree to the processing of your personal data by Zafin described in the candidate privacy notice.

Top Skills

Azure
Powershell
Python

Zafin Chennai, Tamil Nadu, IND Office

TVH Agnitio Park 2nd Floor 141, Rajiv Gandhi Salai, Perungudi, Chennai, Tamil Nadu, India, 600096

Similar Jobs

2 Days Ago
Chennai, Tamil Nadu, IND
Senior level
Senior level
Fintech • Payments • Software
The Cloud Site Reliability Engineer II will manage complex technical issues in Zafin's cloud environment and enhance operational reliability. Responsibilities include conducting root cause analysis, optimizing cloud infrastructure, mentoring junior engineers, and automating operational processes while driving strategic initiatives across cross-functional teams.
Top Skills: AksAzureOpenshiftPostgresPowershellPython
3 Days Ago
Chennai, Tamil Nadu, IND
Junior
Junior
Hardware • Information Technology • Other • Software • Analytics
As a Site Reliability Engineer II, you will design, maintain, and optimize high-availability systems in the cloud, emphasizing automation through Infrastructure as Code and monitoring systems. You'll handle incident response, capacity planning, and promote best practices while collaborating with development teams. Mentorship of junior engineers and continuous improvement will be key aspects of your role.
Top Skills: Python
7 Days Ago
Chennai, Tamil Nadu, IND
Expert/Leader
Expert/Leader
Information Technology • Software
The Lead Software Engineer (Cloud DevOps) will design and implement cloud infrastructure, manage CI/CD pipelines, and provide leadership for DevOps practices. They will mentor a team of engineers, work closely with product teams, ensure reliable service delivery, and improve existing systems using modern cloud technologies.
Top Skills: JavaPythonRuby

What you need to know about the Chennai Tech Scene

To locals, it's no secret that South India is leading the charge in big data infrastructure. While the environmental impact of data centers has long been a concern, emerging hubs like Chennai are favored by companies seeking ready access to renewable energy resources, which provide more sustainable and cost-effective solutions. As a result, Chennai, along with neighboring Bengaluru and Hyderabad, is poised for significant growth, with a projected 65 percent increase in data center capacity over the next decade.

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account