Keywords Studios Logo

Keywords Studios

DevOps Engineer SE II - GCP & AI

Posted 15 Days Ago
Be an Early Applicant
In-Office
Pune, Mahārāshtra
Senior level
In-Office
Pune, Mahārāshtra
Senior level
The DevOps Engineer will manage GCP infrastructure, build AI deployment pipelines, implement security measures, and optimize costs while ensuring system observability.
The summary above was generated by AI
Responsibilities:
  • Infrastructure Ownership: Own Helpshift production services and ensure complete monitoring coverage, troubleshoot and fix production issues.
  • Infrastructure as Code (IaC): Design and maintain scalable GCP infrastructure using Terraform o
  • AI Orchestration & LLMOps: Build deployment pipelines for AI agents, managing vector databases (e.g., Vertex AI Search, Pinecone, Weaviate, ElasticSearch) and model endpoints.
  • Security (DevSecOps): Implement "Security-by-Design," including IAM least-privilege access, secret management (Secret Manager), and automated vulnerability scanning for AI workloads.
  • CI/CD Excellence: Architect high-velocity pipelines for both traditional microservices and AI model prompts/configurations. Design, implement, and maintain secure CI/CD pipelines for automating deployment, configuration, and testing processes.
  • Observability: Set up comprehensive monitoring for system health and LLM-specific metrics (latency, token usage, and cost)
  • Cloud Governance: Optimise GCP costs and manage resource quotas, especially for GPU/TPU-intensive AI tasks.
  • Cross Cloud Deployment: Establish & Optimise the connectivity among apps deployed in different cloud environments (AWS <> GCP)

RequirementsRequirements
  • Relevant experience of 6+ years and above
  • Expert-level Google Cloud Platform (GCP) administration skills: GKE, Cloud Run, Vertex AI, GCS, NEG etc
  • Experience deploying Vector Databases (Pinecone, Weaviate, ElasticSearch or Vertex Search) and managing API rate limits/throttling for LLM providers.
  • Setting up Cloud Monitoring/Logging specifically for AI metrics: token consumption, inference latency, and model error rates.
  • In-depth knowledge of running/managing UNIX-like operating systems (we use Ubuntu)
  • Strong knowledge of networking protocols, security architectures, and identity and access management (IAM) principles.
  • Experience with containerisation technologies (e.g., Docker, Kubernetes) and securing containerised environments.
  • Proficiency in Python and Bash
  • Experience in designing and building solutions that are highly scalable, fault tolerant and cost-effective
  • Experience with IaaC tools like Ansible, Terraform.
  • Ability to analyse bottlenecks in architecture and quickly debug to reach a resolution for issues
  • Have an automation mindset and ability to reason and work with complex systems.
  • Excellent communication and documentation skills
  • Quick learner and good mentor for junior team members

Top Skills

Ai Orchestration
Ansible
Bash
Ci/Cd
Docker
Elasticsearch
Google Cloud Platform
Kubernetes
Pinecone
Python
Terraform
Vector Databases
Vertex Ai
Weaviate

Similar Jobs

3 Hours Ago
Hybrid
Senior level
Senior level
Big Data • Fintech • Information Technology • Business Intelligence • Financial Services • Cybersecurity • Big Data Analytics
The role involves leading cross-functional technical projects, mentoring junior associates, and ensuring high-quality software delivery using Agile and DevOps methodologies.
Top Skills: AspAzureC#C/C++HTMLJavaScriptPythonRSQLUnix Command
3 Hours Ago
In-Office
Chennai, Tamil Nadu, IND
Senior level
Senior level
Cloud • Information Technology • Internet of Things • Machine Learning • Software • Cybersecurity • Infrastructure as a Service (IaaS)
The Solution Architect will lead technical solutions for Core Network projects, ensuring timely delivery and compliance with business objectives while automating processes and coordinating with stakeholders.
Top Skills: 4G5GKubernetesLinuxOpenstackPythonShellUnix
3 Hours Ago
Remote or Hybrid
India
Mid level
Mid level
Fintech • Professional Services • Consulting • Energy • Financial Services • Cybersecurity • Generative AI
The role involves business analysis and gathering requirements in wholesale credit risk, focusing on model development, regulatory compliance, and credit risk systems management.
Top Skills: Basel IiiCrrEbaEcbPra

What you need to know about the Chennai Tech Scene

To locals, it's no secret that South India is leading the charge in big data infrastructure. While the environmental impact of data centers has long been a concern, emerging hubs like Chennai are favored by companies seeking ready access to renewable energy resources, which provide more sustainable and cost-effective solutions. As a result, Chennai, along with neighboring Bengaluru and Hyderabad, is poised for significant growth, with a projected 65 percent increase in data center capacity over the next decade.

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account