DeepIntent

Site Reliability Engineer

Reposted 18 Days Ago
In-Office
Pune, Maharashtra
Mid level
The Site Reliability Engineer will manage production systems with a focus on reliability and performance, automate processes, and maintain cloud services.
The summary above was generated by AI

DeepIntent is leading the healthcare advertising industry with data-driven solutions built for the future. From day one, our mission has been to improve patient outcomes through the artful use of advertising, data science, and real-world clinical data. For more information, visit www.DeepIntent.com or find us on LinkedIn.

We are seeking a skilled and experienced Site Reliability Engineer (SRE) to join our dynamic team. The ideal candidate will have a minimum of 3 years of hands-on experience in setting up, optimizing, and securing distributed analytical data stores such as ClickHouse, Druid, or similar distributed database systems. Intermediate DBA skills are required. As an SRE at DeepIntent, you will play a crucial role in ensuring the stability and efficiency of our infrastructure and will contribute to the development of automation and monitoring tools.
 
Responsibilities:
  • Mandatory: set up, optimize, and secure distributed analytical data stores such as ClickHouse, Druid, or similar distributed database systems. Intermediate DBA skills are required.
  • Deploy, configure, and maintain Kubernetes clusters for our microservices architecture.
  • Utilize Git and Helm for version control and deployment management.
  • Implement and manage monitoring solutions using Prometheus and Grafana.
  • Work on continuous integration and continuous deployment (CI/CD) pipelines.
  • Containerize applications using Docker and manage orchestration.
  • Manage and optimize AWS services, including but not limited to EC2, S3, RDS, and AWS CDN.
  • Maintain and optimize MySQL databases, Airflow, and Redis instances.
  • Write automation scripts in Bash or Python for system administration tasks.
  • Perform Linux administration tasks and troubleshoot system issues.
  • Utilize Ansible and Terraform for configuration management and infrastructure as code.
  • Demonstrate knowledge of networking and load-balancing principles.
  • Collaborate with development teams to ensure applications meet reliability and performance standards.
Additional Skills (Good to Know):
  • Experience with ClickHouse and Druid for data storage and analytics.
  • Experience with Jenkins for continuous integration.
  • Basic understanding of Google Cloud Platform (GCP) and data center operations.
Qualifications:
  • Minimum 3 years of experience in a Site Reliability Engineer role or similar.
  • Proven experience with Kubernetes, Git, Helm, Prometheus, Grafana, CI/CD, Docker, and microservices architecture.
  • Strong knowledge of AWS services (including AWS CDN), MySQL, Airflow, and Redis.
  • Proficient in scripting languages such as Bash or Python.
  • Hands-on experience with Linux administration.
  • Familiarity with Ansible and Terraform for infrastructure management.
  • Understanding of networking principles and load balancing.
Education:
Bachelor's degree in Computer Science, Information Technology, or a related field.

We believe great work starts with great support. That’s why DeepIntent offers a competitive, holistic benefits package designed to empower you both professionally and personally. Here’s what you can expect when you join our team:

Competitive base salary plus performance-based bonus or commission; comprehensive medical, dental, and vision coverage; 401(k) match program; generous PTO policy and paid holidays; remote-friendly culture with flexible work options; career development and advanced education support; WFH and internet stipends; plus many more perks and benefits!

DeepIntent is committed to bringing together individuals from different backgrounds and perspectives. We strive to create an inclusive environment where everyone can thrive, feel a sense of belonging, and do great work together.

DeepIntent is an Equal Opportunity Employer, providing equal employment and advancement opportunities to all individuals. We recruit, hire and promote into all job levels the most qualified applicants without regard to race, color, creed, national origin, religion, sex (including pregnancy, childbirth and related medical conditions), parental status, age, disability, genetic information, citizenship status, veteran status, gender identity or expression, transgender status, sexual orientation, marital, family or partnership status, political affiliation or activities, military service, immigration status, or any other status protected under applicable federal, state and local laws. If you have a disability or special need that requires accommodation, please let us know in advance.

DeepIntent’s commitment to providing equal employment opportunities extends to all aspects of employment, including job assignment, compensation, discipline and access to benefits and training.

Top Skills

Airflow
Ansible
AWS
Bash
CI/CD
Docker
Git
Grafana
Helm
Kubernetes
MySQL
Prometheus
Python
Redis
Terraform
