The DevOps Engineer will design and maintain cloud infrastructure, automate deployment processes, monitor system performance, and ensure compliance with security practices.
Who are we looking for?
We are seeking a skilled and experienced DevOps Engineer to join our team. You will play a critical role in designing, implementing and maintaining high performing, secure, and scalable systems in our cloud infrastructure. This role involves collaborating closely with other engineers, product and sales personnel, and other stakeholders to ensure that our product’s availability and performance comply with customer SLAs. It also requires you to be on-call, as a first line of defense, for a period of one-week (rotating).
You will be responsible for:
- Design, implement, and maintain scalable, secure, and cost-effective cloud infrastructure in Azure.
- Build and optimize automated CI/CD pipelines to ensure seamless and efficient deployment of ML models, APIs, and other applications.
- Automation of manual processes and workflows.
- Monitoring and troubleshooting to ensure uptime, performance, and reliability of our systems.
- Responding quickly to incidents and troubleshoot infrastructure or application-related issues.
- Close collaboration with Software Engineers, Data Scientists, and Product teams to ensure infrastructure meets the needs of AI and ETL workloads, and can handle high volumes of data and computation.
- Adherence to security best practices in system architecture and deployment pipelines to ensure data privacy, security, and compliance with relevant regulations (SOC 2, CCPA).
- Continuous evaluation and optimization for system performance, cost management and scalability of AI-related and ETL workloads.
- Create and maintain clear documentation on deployment processes, infrastructure setup, and troubleshooting guides to support internal teams and reduce downtime.
Requirements:
- 3+ years of hands-on experience in a DevOps role.
- Strong proficiency with Azure and AWS.
- Experience with containerization and orchestration: Docker and Kubernetes.
- Experience with monitoring and logging tools (Prometheus, Grafana, Loki, etc.)
- Experience building and maintaining workflows with GitHub Actions.
- Scripting skills in Python, Bash, or similar language is encouraged.
- Experience with git and git-based workflows.
- Familiarity with collaboration development tools like Linear, Slack, and Confluence.
Top Skills
AWS
Azure
Bash
Docker
Github Actions
Grafana
Kubernetes
Loki
Prometheus
Python
Similar Jobs
Fintech • Financial Services
The Senior DevOps Engineer will build and maintain data systems, collaborate on machine learning models, and influence decision-making across teams, ensuring secure data handling and operational effectiveness.
Top Skills:
AWSChefLinuxTerraformUnixWindows
Fintech • Financial Services
The Senior Dev Ops Engineer SRE will ensure system reliability and scalability, develop automation tools, and collaborate with teams on best practices and incident response.
Top Skills:
AWSAzureCi/CdGCPGitGradleJava 8+JavafxLinuxMavenOraclePl/SqlRestful ApisSolace Pubsub+Spring FrameworkSQLSwingUnix
Software
As a Lead Dev Ops Engineer, you'll build and maintain SaaS cloud infrastructure, manage CI/CD pipelines, ensure security, and support AI technologies across microservices architecture.
Top Skills:
AnsibleAWSAzureAzure AiAzure DevopsBashBitbucketDockerElasticsearchElk StackGithub ActionsGoogle Cloud PlatformGrafanaJenkinsKubernetesLarge Language ModelsPostgresPowershellPrometheusPythonTerraform
What you need to know about the Chennai Tech Scene
To locals, it's no secret that South India is leading the charge in big data infrastructure. While the environmental impact of data centers has long been a concern, emerging hubs like Chennai are favored by companies seeking ready access to renewable energy resources, which provide more sustainable and cost-effective solutions. As a result, Chennai, along with neighboring Bengaluru and Hyderabad, is poised for significant growth, with a projected 65 percent increase in data center capacity over the next decade.