3Pillar Global Logo

3Pillar Global

Cloud Data Architect (With AI experience)

Posted 5 Days Ago
Be an Early Applicant
Remote
Hiring Remotely in India
Expert/Leader
Remote
Hiring Remotely in India
Expert/Leader
Architect and own an enterprise AI-ready data platform: design lakehouse/data-mesh models, ingestion (streaming and batch), semantic/knowledge layers, retrieval/vector infra, LLMOps, governance, observability, and CI/CD to serve conversational AI, agents, BI copilots and ML models.
The summary above was generated by AI
3Pillar is an AI transformation partner on a mission to help enterprises build the AI-native products and intelligent agents that will define the next era of business. With teams across North America, Europe, Latin America, and Asia, we work with the most ambitious companies in financial services, healthcare, media, and technology — helping them move faster, modernize boldly, and compete on their own terms. Our HelixAI platform and Helix Pods delivery model put our engineers at the center of real agentic transformation — doing work that is open, portable, and built to last. We are building the future of enterprise AI
 
We are looking for an AI Data Architect to design, build, govern, and evolve the single source of truth that powers every AI initiative in our organization.
This platform will serve as the foundational nervous system for conversational AI assistants, dashboard intelligence, autonomous AI agents, RAG-powered applications, predictive ML models, and any AI product we build today or in the future.
The resource will architect the system, drive implementation, own the data contracts that agents and AI applications depend on, enforce security and access governance for both human and agent consumers, and continuously monitor and improve the accuracy and reliability of AI outputs that flow from this platform.

Key Responsibilities:

    AI-Ready Data Platform — The Single Source of Truth
    Architect and own the enterprise AI data platform — the unified, governed layer that ingests, transforms, stores, and serves all data consumed by AI systems across the organisation.
    Design multi-domain data models (lakehouse, data mesh, event-driven) that are structured from day one to serve AI workloads: clean lineage, versioned schemas, well-documented contracts, and low-latency serving APIs.
    Own the full data stack: real-time streaming (Kafka, Spark Structured Streaming), batch processing (Databricks, PySpark, Delta Lake), cloud storage and compute (AWS, Azure), and data quality /metadata management.
    Ensure this platform is the single, authoritative data source for all downstream consumers —conversational AI, dashboard assistants, autonomous agents, ML models, and reporting —eliminating data silos and conflicting truths.
    Drive modernisation of legacy pipelines (on-prem ETL, batch DWH) to cloud-native, AI-ready architectures with measurable improvements in cost, latency, and delivery velocity.
     
    Semantic Models & Knowledge Layer
    Design the semantic layer that sits above raw data — business-aligned ontologies, entity relationships, domain taxonomies, and knowledge graphs — so AI systems understand context, not just tokens.
    Build and maintain knowledge graphs (Neo4j or equivalent) that capture relationships between business entities, policies, KPIs, hierarchies, and domain rules — enabling structured reasoning alongside unstructured retrieval.
    Define and govern a feature store and semantic data contracts that serve both classical ML models and LLM-based applications from a single, well-versioned, trusted source.
    Cisco Confidential
    Own metadata management, data lineage, and audit trails across the semantic layer — ensuring every AI system can trace its outputs back to source data with full accountability.
     
    RAG, Vector & Retrieval Infrastructure
    Design the retrieval infrastructure that powers RAG-based AI applications: embedding pipelines, vector stores (Pinecone, FAISS, ChromaDB, OpenSearch), chunking strategies, and hybrid retrieval layers combining semantic search with structured queries.
    Define the data contracts between the AI data platform and retrieval consumers — ensuring consistent, freshness-guaranteed, well-indexed data surfaces to RAG pipelines, conversational AI, and agent tools.
    Architect retrieval systems that balance precision, recall, latency, and cost — with clear evaluation benchmarks, not just infrastructure defaults.
     
    ML/LLMOps Infrastructure
    Own the ML and LLMOps data infrastructure: training data curation pipelines, feature engineering, model registry, experiment tracking (MLflow), automated evaluation, and production monitoring.
    Build CI/CD pipelines for AI systems: automated data validation, model quality gates, deployment automation, rollback mechanisms, and production health dashboards.
    Design data infrastructure for LLM fine-tuning workflows — training corpus curation, data quality filtering, RLHF pipelines, and adapter management — ensuring models trained on this platform reflect accurate, governed, domain-specific knowledge.
    Establish LLMOps best practices across the organisation: versioning, A/B evaluation, shadow deployments, and canary releases for AI model updates.
     
    Multi-Consumer AI Serving Architecture
    Conversational AI Platforms — low-latency, context-rich data APIs that power chatbots, voice assistants, and enterprise copilots with accurate, fresh, source-grounded responses.
    Dashboard Assistants & BI Copilots — semantic query layers and text-to-SQL infrastructure that allow natural language interfaces to query structured business data accurately and safely.
    Autonomous AI Agents — structured tool APIs, function-calling schemas, and memory/state data stores that agents depend on for context retrieval, action execution, and multi-step reasoning.
    Predictive ML Models — feature pipelines, training datasets, and real-time feature serving for classification, forecasting, anomaly detection, and propensity models.
    Cisco Confidential
    Ad-hoc AI Experimentation — governed sandbox environments where data scientists and AI engineers can access production-equivalent data safely for research and prototyping.
     
    Governance, Security & Access Control
    Design and enforce a comprehensive data governance model that governs access for both human users and AI agents — with role-based access control (RBAC), attribute-based policies, and agent-specific permission scopes that prevent privilege escalation. Implement data security controls across the platform: PII detection and masking, data classification, encryption at rest and in transit, audit logging, and compliance alignment (SOX, GDPR, SOC 2,AML/KYC, APAC regulations).
    Define agent data access boundaries — what data an autonomous agent can read, write, modify, or delete — and enforce those boundaries at the platform layer, not just at the application layer.
    Build data contracts and schema governance that prevent upstream changes from silently breaking downstream AI applications, with automated breaking-change detection and versioned migration paths.
    Own regulatory and compliance readiness for all AI data pipelines — ensuring audit trails, explainability artefacts, and data provenance are available on demand.
     
    Agentic Behaviour Observability & Output Accuracy
    Own the observability stack for AI agent behaviour: instrument agents to capture inputs, retrieved context, tool calls, reasoning traces, and outputs — creating a complete audit trail of every agentic action driven by platform data.
    Design and operate evaluation frameworks that continuously measure AI output quality: factual accuracy, context faithfulness, retrieval relevance, hallucination rates, and task completion success— across all AI consumers of the platform.
    Establish feedback loops between evaluation signals and platform improvements: when agent outputs degrade, trace the root cause to data freshness, retrieval failures, schema drift, or model issues — and own the remediation.
    Define SLAs for AI output quality and data freshness; build alerting and escalation frameworks that surface platform-driven AI degradation before end users notice. Implement human-in-the-loop review workflows for high-stakes agent actions — ensuring critical decisions have appropriate oversight, audit records, and rollback capability.
    Cisco Confidential
     
    Architecture Standards & Engineering Enablement
    Define and maintain the reference architecture for the AI data platform — documenting design patterns, data contracts, integration standards, and decision records (ADRs) that all engineering teams follow.
    Establish data engineering standards: pipeline testing frameworks, code review practices, CI/CD automation, infrastructure-as-code (Terraform), reusable component libraries, and observability instrumentation.
    Serve as the senior technical reviewer for all data system designs that interact with the AI platform— ensuring consistency, security, and quality across every integration point.
    Run internal architecture workshops, design reviews, and enablement sessions to embed AI-ready data platform best practices across data engineering and AI teams.

Qualification:

    • 15+ years of hands-on data engineering and architecture experience, with 3–5+ years building production AI/ML and LLM-era data infrastructure.
    • Proven experience designing enterprise-scale AI data platforms that serve multiple AI consumers —not just one application or pipeline.
    • Deep expertise in lakehouse and data mesh architectures: Databricks, Delta Lake, PySpark, Kafka, Spark Structured Streaming, cloud-native data services (AWS, Azure).
    • Hands-on experience with vector stores, semantic models, knowledge graphs, and retrieval infrastructure in production environments.
    • Working knowledge of LLMOps: model serving pipelines, MLflow, CI/CD for AI, automated evaluation, and production monitoring.
    • Strong background in data governance, security, and compliance in regulated industries (financial services, payments, cybersecurity, healthcare).
    • Experience defining data access controls for AI agents and automated systems — not just human users.
     
    Technical Skills
    Primary Skills: Python, SQL, PySpark, Kafka, Databricks, Delta Lake, Snowflake,AWS (S3, Glue, EKS, Bedrock, Kinesis, Redshift), Docker, Kubernetes, Terraform, GitHub Actions, LangChain, LlamaIndex, LLM APIs (OpenAI, AWS Bedrock, Claude, HuggingFace), vector databases (Pinecone, FAISS, ChromaDB, OpenSearch), knowledge graphs (Neo4j).
     
    Secondary Skills: MLflow, FastAPI, CI/CD pipelines, observability tooling (CloudWatch, Grafana, or equivalent), data lineage and metadata management platforms.

What It's Like to Work at 3Pillar:

    At 3Pillar, we create an environment where people can do their best work while maintaining a healthy work-life balance.
  • Flexibility & Well-being – Our remote-first approach gives you the flexibility to work where you perform best, while prioritizing your well-being and personal commitments.
  • Global Community – Collaborate with talented colleagues across the globe in a culture built on connection, support, and shared success.
  • Your Voice Matters – We foster open communication and multiple feedback channels, ensuring every employee has the opportunity to be heard and make an impact.
  • Growth & Development – Gain exposure to diverse clients, industries, and challenges that accelerate learning and career growth.

  • Our culture is guided by four core values: Collaboration, Outperform, Respect, and Evolve—the principles that shape how we work, grow, and succeed together.

Thank you,
Kiran Dhanak
Manager, Talent Acquisition

Similar Jobs

2 Hours Ago
Remote or Hybrid
Senior level
Senior level
Artificial Intelligence • Big Data • Cloud • Information Technology • Software • Big Data Analytics • Automation
As a Lead Customer Success Manager, you will drive customer adoption of Dynatrace, build executive relationships, manage risks, and ensure long-term value for enterprise clients in India.
Top Skills: AWSAzureGCPKubernetes
3 Hours Ago
Remote or Hybrid
India
Senior level
Senior level
Cloud • Computer Vision • Information Technology • Sales • Security • Cybersecurity
Own QA strategy and automation for agentic LLM and RAG workstreams: design AI-specific evaluation frameworks, build Python test suites (unit, integration, system), validate integrations (Salesforce, Slack, third-party APIs), integrate tests into CI/CD, run performance/load and security/adversarial tests, and ensure observability and regression detection for non-deterministic outputs.
Top Skills: Agentic AiApi TestingCi/CdClariCopadoDastDatadogGoogle SuiteGithub ActionsGongGrafanaJenkinsJmeterK6LangsmithLightning Web Components (Lwc)LlmsLocustMarketoOpentelemetryOwaspPlaywrightPostmanPostman/RestPytestPythonRag PipelinesRagasRest-AssuredRobot FrameworkSalesforceSastSeleniumSlackSnowflake
13 Hours Ago
In-Office or Remote
Junior
Junior
Cloud • Information Technology • Productivity • Security • Software • App development • Automation
Design, build, and ship highly scalable backend microservices in a cross-geo distributed environment. Collaborate with engineers, architects, product, and design; conduct code reviews; ensure monitoring, error reporting, reliability, security, and performance; mentor teammates and participate in Agile ceremonies.
Top Skills: AWSAzureCassandraDynamoDBGaeGoIaasJavaKotlinOraclePaasPostgresPythonRestful Micro-ServicesSaaSScala

What you need to know about the Chennai Tech Scene

To locals, it's no secret that South India is leading the charge in big data infrastructure. While the environmental impact of data centers has long been a concern, emerging hubs like Chennai are favored by companies seeking ready access to renewable energy resources, which provide more sustainable and cost-effective solutions. As a result, Chennai, along with neighboring Bengaluru and Hyderabad, is poised for significant growth, with a projected 65 percent increase in data center capacity over the next decade.

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account