Fractal Logo

Fractal

GCP DevOps Architect

Posted 12 Days Ago
Be an Early Applicant
In-Office
Chennai, Tamil Nadu, IND
Senior level
In-Office
Chennai, Tamil Nadu, IND
Senior level
Lead a team of DevOps & cloud engineers to build and manage GCP infrastructure for AI workloads, focusing on security, governance, and cost optimization.
The summary above was generated by AI

It's fun to work in a company where people truly BELIEVE in what they are doing!

We're committed to bringing passion and customer focus to the business.

Key Responsibilities
Lead and mentor a small team of DevOps, MLOps, and cloud engineers; promote high performance, knowledge sharing, and continuous learning.
Architect and maintain scalable GCP infrastructure for AI agentic workloads (Vertex AI, GKE, Cloud Run, Cloud Functions, Artifact Registry, etc.).
Own DevOps and MLOps pipelines, including:
CI/CD with Google Cloud Build (primary) and integration/experience with Jenkins for hybrid/legacy workflows.
Artifact management using Artifact Registry (primary) with exposure to Nexus Repository for proxying or migration scenarios.
IaC (Terraform preferred), container orchestration, and Python scripting/automation for custom tooling, Glue jobs, or pipeline extensions.
Lead API endpoint security strategy, leveraging:
Apigee for enterprise-grade API management, policy enforcement, quota, monetization (if applicable), advanced security, and analytics.
Complementary GCP-native tools: Identity-Aware Proxy (IAP), Cloud Armor (WAF/DDoS), IAM least-privilege, OAuth 2.0/JWT/mTLS, Secret Manager, VPC Service Controls.
Zero-trust/BeyondCorp principles and threat protection for AI agent communications and customer-facing APIs.

Champion FinOps practices on GCP:
Implement cost monitoring (Cloud Billing, FinOps Hub), optimization recommendations (Recommender, Active Assist), commitment-based discounts (CUDs), budget alerts, and idle resource cleanup.
Drive cost allocation, forecasting, and cross-team accountability for high-cost AI workloads (e.g., model training/inference).
Collaborate with AI/ML engineers to productionize agentic workflows with secure, governed access to models/data.
Ensure high observability (Cloud Operations Suite, Prometheus/Grafana), resilience, and SRE practices (incident response, post-mortems).
Establish cloud governance, compliance, and disaster recovery aligned with business needs.

Required Qualifications & Experience
8+ years in DevOps, cloud engineering, or infrastructure roles, with 4+ years deep hands-on Google Cloud Platform (GCP).
Proven people leadership (5+ reports) in agile/fast-paced environments.
Strong expertise securing APIs/services on GCP, preferably with Apigee (enterprise API management, policies, analytics) alongside IAP, Cloud Armor, IAM, and mTLS.

Hands-on experience with:
CI/CD: Google Cloud Build + integration with Jenkins.
Artifact management: Artifact Registry + familiarity with Nexus Repository.
GKE/Cloud Run, monitoring/logging.
Python for automation, scripting, and tooling in DevOps/MLOps contexts.
Solid understanding of networking (VPC, Private Service Connect), security, and compliance.
Experience with AI/ML platforms (Vertex AI, Agent Builder) and MLOps for agentic systems is highly desirable.

Preferred Skills
Google Cloud certifications: Professional DevOps Engineer, Cloud Architect, Cloud Security Engineer (Apigee-related knowledge a plus).
FinOps experience or certification; familiarity with GCP FinOps Hub, Recommender, and commitment management.
Exposure to agentic AI patterns (multi-agent orchestration, RAG) and their infra requirements.
Experience in high-security or regulated environments.

If you like wild growth and working with happy, enthusiastic over-achievers, you'll enjoy your career with us!

Not the right fit?  Let us know you're interested in a future opportunity by clicking Introduce Yourself in the top-right corner of the page or create an account to set up email alerts as new job postings become available that meet your interest!

Top Skills

Apigee
Artifact Registry
Cloud Armor
Cloud Operations Suite
Cloud Run
GCP
Gke
Google Cloud Build
Grafana
Iam
Identity-Aware Proxy
Jenkins
Jwt
Mtls
Oauth 2.0
Prometheus
Python
Secret Manager
Terraform
Vertex Ai
Vpc Service Controls

Fractal Chennai, Tamil Nadu, IND Office

Chennai, Tamil Nadu, India, 600034

Similar Jobs

An Hour Ago
Hybrid
Chennai, Tamil Nadu, IND
Senior level
Senior level
Big Data • Fintech • Information Technology • Business Intelligence • Financial Services • Cybersecurity • Big Data Analytics
The RTA Manager leads global real-time operations, ensuring optimal performance and adherence. They monitor performance, oversee team development, implement process improvements, and collaborate with multiple departments to enhance service levels.
Top Skills: Bi ToolsExcelPower BISQLTableauWorkforce Management Tools
An Hour Ago
Remote or Hybrid
Chennai, Tamil Nadu, IND
Senior level
Senior level
Big Data • Fintech • Information Technology • Business Intelligence • Financial Services • Cybersecurity • Big Data Analytics
Lead development, scaling, governance, and adoption of enterprise process capabilities (BPM, BPI, BPR). Manage tools, training, standards, integrations, and reusable assets to enable transformation, collaborate with cross-functional stakeholders, and drive delivery enablement and capability maturity.
Top Skills: Business Process Intelligence (Bpi)Business Process Management (Bpm)Business Process Reengineering (Bpr)Performance AnalyticsProcess Modeling ToolsSignavio
An Hour Ago
Hybrid
Chennai, Tamil Nadu, IND
Mid level
Mid level
Big Data • Fintech • Information Technology • Business Intelligence • Financial Services • Cybersecurity • Big Data Analytics
The Governance Reporting and Communication Specialist will support the GRC team and oversee internal communication strategies, reporting, and stakeholder engagement within TransUnion.
Top Skills: Power BIPowerPointSharepoint

What you need to know about the Chennai Tech Scene

To locals, it's no secret that South India is leading the charge in big data infrastructure. While the environmental impact of data centers has long been a concern, emerging hubs like Chennai are favored by companies seeking ready access to renewable energy resources, which provide more sustainable and cost-effective solutions. As a result, Chennai, along with neighboring Bengaluru and Hyderabad, is poised for significant growth, with a projected 65 percent increase in data center capacity over the next decade.

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account