DevRev Logo

DevRev

Site Reliability Engineer

Reposted 6 Days Ago
Be an Early Applicant
Easy Apply
In-Office
Chennai, Tamil Nadu
Mid level
Easy Apply
In-Office
Chennai, Tamil Nadu
Mid level
As a Site Reliability Engineer, you will design and maintain cloud infrastructure, automate processes, and ensure system reliability and scalability across platforms.
The summary above was generated by AI

About DevRev

At DevRev, we're building the future of work with Computer – your AI teammate. Unlike traditional tools, Computer unifies all your data sources, tools, and workflows into a single AI-ready platform, giving employees real-time insights, proactive suggestions, and powerful agentic actions. It extends your existing software with AI-native apps and agents that work alongside your teams and customers – updating workflows, coordinating across teams, and eliminating repetitive work. We call this Team Intelligence: human-AI collaboration that breaks down silos, brings people back together, and frees you to solve bigger problems. Backed by Khosla Ventures and Mayfield with $150M+ raised, DevRev is trusted by global companies across industries.

About the Role:

We are seeking an experienced Site Reliability Engineer / Platform Engineer to join our team and help build and maintain a resilient, scalable infrastructure supporting our applications across multiple cloud providers. In this role, you will design and implement infrastructure solutions, automate operational processes, and work closely with development teams to ensure reliable, efficient systems that scale with our business.

What You'll Do:
  • Design, build, and maintain infrastructure across AWS, GCP, and Azure using Infrastructure as Code (IaC) principles.
  • Implement and optimize CI/CD pipelines using tools like Argo and CircleCI to enable rapid, reliable deployments.
  • Manage and scale Kubernetes clusters in production environments, ensuring high availability and optimal resource utilization.
  • Administer and optimize cloud databases including MongoDB, Redis, RDS, and other data stores for performance and reliability.
  • Develop monitoring, alerting, and observability solutions to identify and resolve issues before they impact users.
  • Automate routine operational tasks to reduce manual toil and improve system reliability.
  • Conduct incident response and post-mortem analysis to drive continuous improvement.
  • Collaborate with development teams to design systems with reliability, scalability, and operational excellence in mind.
  • Document infrastructure architecture, runbooks, and operational procedures.
  • Evaluate and implement new tools and technologies to improve platform capabilities.
What You'll Bring:
  • 3+ years of experience in Site Reliability Engineering, DevOps, or Platform Engineering.
  • Strong hands-on experience with at least two major cloud providers (AWS, GCP, Azure).
  • Proficiency with Kubernetes for container orchestration and management.
  • Demonstrated expertise with IaC tools (Terraform, CloudFormation, Pulumi, or similar).
  • Experience with CI/CD platforms, particularly Argo and/or CircleCI.
  • Solid understanding of database technologies including MongoDB, Redis, and relational databases (RDS).
  • Proficiency in at least one programming or scripting language (Python, Go, Bash, Typescript, etc.).
  • Experience with monitoring and observability tools (e.g., Prometheus, Grafana, ELK, CloudWatch).
  • Experience implementing and managing OpenTelemetry (OTEL) for distributed tracing, metrics, and logging.
  • Strong understanding of networking, security, and infrastructure best practices.
Nice to Have
  • Experience managing multi-cloud or hybrid cloud environments.
  • Familiarity with service mesh technologies (Istio, Linkerd).
  • Knowledge of security hardening and compliance in cloud environments.
  • Experience with cost optimization in cloud infrastructure.
  • Contributions to open-source infrastructure or DevOps projects.
  • Certifications from major cloud providers.

DevRev is an equal opportunity employer and does not discriminate on the basis of race, gender, sexual orientation, gender identity/expression, national origin, disability, age, genetic information, veteran status, marital status, pregnancy or related condition, or any other basis protected by law.

Top Skills

Argo
AWS
Azure
Bash
CircleCI
CloudFormation
Cloudwatch
Elk
GCP
Go
Grafana
Kubernetes
MongoDB
Opentelemetry
Prometheus
Python
Rds
Redis
Terraform
Typescript

Similar Jobs

Yesterday
In-Office or Remote
2 Locations
Junior
Junior
Cloud • Security • Software • Cybersecurity
As a Site Reliability Engineer II, you will ensure system reliability, oversee deployment and maintenance of platforms, improve performance, and troubleshoot issues while collaborating cross-functionally.
Top Skills: AnsibleDockerGoGrafanaJenkinsKubernetesLinuxPrometheusPythonTerraform
Yesterday
In-Office or Remote
2 Locations
Mid level
Mid level
Cloud • Security • Software • Cybersecurity
As a Site Reliability Engineer II, you'll manage cloud platforms and ensure reliability through automation, configuration management, and monitoring solutions.
Top Skills: AnsibleChefGrafanaJenkinsPrometheusPythonSalt StackShellTerraformYaml
Yesterday
Hybrid
Chennai, Tamil Nadu, IND
Senior level
Senior level
Automotive
The SRE Cloud Engineer will design, implement, and manage cloud infrastructure while automating deployments and ensuring high availability. Responsibilities include collaborating with teams, maintaining cloud solutions, strengthening observability capabilities, and resolving infrastructure issues.
Top Skills: Apache ActivemqApache KafkaApigeeArgocdCephDockerGCPGitGoGrafanaHdfsHelmJavaJavaScriptKubernetesKustomizeNfsNode.jsPrometheusPythonRabbitMQRestS3Terraform

What you need to know about the Chennai Tech Scene

To locals, it's no secret that South India is leading the charge in big data infrastructure. While the environmental impact of data centers has long been a concern, emerging hubs like Chennai are favored by companies seeking ready access to renewable energy resources, which provide more sustainable and cost-effective solutions. As a result, Chennai, along with neighboring Bengaluru and Hyderabad, is poised for significant growth, with a projected 65 percent increase in data center capacity over the next decade.

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account