Design, maintain, and improve resilient build, test, and deployment infrastructure. Automate provisioning and deployments, manage lifecycle updates and security patches, deploy monitoring/alerting, troubleshoot performance bottlenecks, and produce technical documentation while collaborating with engineering and QA teams.
This role is for one of the Weekday's clients
Min Experience: 10 years
Location: Chennai
JobType: full-time
Requirements
Key Responsibilities:
- Collaborate with engineering, QA, and internal technology teams to design and sustain a resilient, scalable, and secure build and testing environment.
- Improve and maintain tools utilized for test execution, orchestration, and scheduling throughout development workflows.
- Regularly evaluate system performance, detect bottlenecks, and apply enhancements to ensure system stability and efficiency.
- Oversee infrastructure lifecycle tasks, including performing updates, upgrades, and applying security patches.
- Deploy and manage monitoring solutions to observe infrastructure health, system behavior, and application performance.
- Lead automation efforts using configuration management tools to optimize provisioning and deployment processes.
- Create and update comprehensive technical documentation detailing system architecture, procedures, and troubleshooting methods.
Required Skills:
Essential Qualifications:
- Excellent interpersonal and stakeholder management skills, with the capability to work effectively across engineering and IT teams.
- Bachelor’s degree in Computer Science, Information Systems, or a related field, or equivalent hands-on experience.
- About 10 or more years of experience in DevOps, Site Reliability Engineering, or comparable roles.
- Minimum of 2 years’ experience in leading engineers or managing technical projects.
- Strong proficiency in Linux system administration and shell scripting.
- Advanced programming skills in Python.
- Practical experience with configuration management and automation tools such as Ansible, Puppet, Chef, or SaltStack.
- Knowledge of container ecosystems and tools like Docker, Kubernetes, or equivalent platforms.
- Experience with monitoring, logging, and alerting systems.
- Excellent analytical thinking and problem-solving skills.
DevOps
Linux
Python
Good-to-have skillsConfiguration Management
Docker
Kubernetes
Similar Jobs
Cloud • Computer Vision • Information Technology • Sales • Security • Cybersecurity
Design, build, and operate agentic LLM-powered workflows, autonomous agents, and RAG/vector retrieval systems. Own end-to-end delivery including CI/CD, IaC, observability, DevSecOps, and Salesforce integrations to productionize enterprise AI for GTM applications.
Top Skills:
AgentcoreAgentforceAutogenAws BedrockCdkCopadoCrewaiDastDockerGithub ActionsGitopsJavaScriptJenkinsKubernetesLangchainLanggraphLightning Web ComponentsModel Context Protocols (Mcp)PgvectorPineconePlatform EventsPythonSalesforce ApexSastSemantic KernelService MeshSlackTerraformTypescriptVertex AiWeaviate
Cloud • Computer Vision • Information Technology • Sales • Security • Cybersecurity
Lead a distributed SRE team owning CI/CD platform reliability, automation, observability, and data infrastructure. Provide people management, technical direction, architecture input, operational excellence, and cross-team collaboration while driving automation, monitoring, and AI-assisted workflows.
Top Skills:
AnsibleApache AirflowSparkAWSAzureBashBazelBitbucketChefDatadogGCPGitGithub ActionsGitlabGitlab CiGoGrafanaHumio/LogscaleJenkinsKafkaKubernetesNasNfsObject StorageOpensearchOraclePostgresPowershellPrometheusPulsarPuppetPythonRedisRedpandaSanSli/SloSplunkTerraformValleyVarnish
Cloud • Computer Vision • Information Technology • Sales • Security • Cybersecurity
Lead Salesforce Engineer responsible for designing and developing Apex, LWC, Aura and Visualforce solutions, building integrations, leading Quote-to-Cash initiatives, troubleshooting L3 production issues, enforcing Salesforce best practices, participating in Agile sprints and CI/CD deployments, mentoring developers, and optimizing org and commerce process performance.
Top Skills:
AgileApexAuraCi/CdIntegrationsLightning Web ComponentsQuote-To-CashSalesforceSalesforce CpqSalesforce Revenue CloudVisualforce
What you need to know about the Chennai Tech Scene
To locals, it's no secret that South India is leading the charge in big data infrastructure. While the environmental impact of data centers has long been a concern, emerging hubs like Chennai are favored by companies seeking ready access to renewable energy resources, which provide more sustainable and cost-effective solutions. As a result, Chennai, along with neighboring Bengaluru and Hyderabad, is poised for significant growth, with a projected 65 percent increase in data center capacity over the next decade.

