Orion Innovation

Senior Data Engineer

Sorry, this job was removed at 06:11 a.m. (IST) on Monday, Oct 27, 2025
In-Office
Chennai, Tamil Nadu

Orion Innovation is a premier, award-winning, global business and technology services firm. Orion delivers game-changing business transformation and product development rooted in digital strategy, experience design, and engineering, with a unique combination of agility, scale, and maturity. We work with a wide range of clients across many industries including financial services, professional services, telecommunications and media, consumer products, automotive, industrial automation, professional sports and entertainment, life sciences, ecommerce, and education.

Job overview:

We are seeking a Senior Data Engineer with strong expertise in building and optimizing large-scale data platforms to support advanced analytics, AI, and Generative AI (GenAI) use cases. The ideal candidate will have hands-on experience with Snowflake and Databricks, combined with the ability to design scalable data pipelines and enable AI/ML integration. This role requires strong data engineering fundamentals, cloud expertise, and collaboration with data scientists, AI architects, and business stakeholders.

Key Responsibilities:

  • Design, build, and maintain large-scale data pipelines for batch and real-time processing.
  • Develop and optimize data models, ETL/ELT processes, and data warehouses/lakes on Snowflake and Databricks.
  • Ensure enterprise data quality, lineage, governance, and security.
  • Apply best practices for cost efficiency, scalability, and observability in cloud-native platforms.
  • Collaborate with AI/ML teams to prepare datasets for training, fine-tuning, and evaluation.
  • Integrate pipelines with AI/GenAI workflows including LLMs, embeddings, vector databases, and RAG.
  • Enable feature engineering and feature store integration for ML applications.
  • Support responsible AI by ensuring datasets are accurate, diverse, and compliant.
  • Work with architects, data scientists, and product owners on end-to-end solutions.
  • Mentor junior engineers, enforce coding standards, and create reusable frameworks.
  • Contribute to design sessions, architecture reviews, and sprint planning.

Key Skills:

  • 7+ years of experience in data engineering with large-scale, enterprise-grade systems.
  • Strong expertise with Snowflake (data modeling, query optimization, performance tuning).
  • Hands-on experience with Databricks (PySpark, Delta Lake, Unity Catalog).
  • Proficiency in SQL, Python, and Spark for data processing and transformation.
  • Experience with cloud platforms (AWS, Azure, GCP) and associated data services.
  • Knowledge of data governance, lineage, metadata management, and security best practices.
  • Familiarity with CI/CD pipelines, orchestration tools (Airflow, Dagster, Prefect), and DevOps practices.

Preferred Qualifications:

  • Exposure to AI/GenAI use cases, including RAG pipelines, embeddings, and model-ready data pipelines.
  • Experience with vector databases (Pinecone, FAISS, Weaviate, Milvus) and feature stores.
  • Familiarity with data quality frameworks (Great Expectations, dbt tests, Monte Carlo).
  • Prior experience supporting real-time streaming data (Kafka, Kinesis, Azure Event Hubs).
  • Certifications in Snowflake, Databricks, or cloud platforms (AWS, Azure, GCP).

Orion is an equal opportunity employer, and all qualified applicants will receive consideration for employment without regard to race, color, creed, religion, sex, sexual orientation, gender identity or expression, pregnancy, age, national origin, citizenship status, disability status, genetic information, protected veteran status, or any other characteristic protected by law.

Candidate Privacy Policy

Orion Systems Integrators, LLC and its subsidiaries and its affiliates (collectively, “Orion,” “we” or “us”) are committed to protecting your privacy. This Candidate Privacy Policy (orioninc.com) (“Notice”) explains:

  • What information we collect during our application and recruitment process and why we collect it;
  • How we handle that information; and
  • How to access and update that information.

Your use of Orion services is governed by any applicable terms in this notice and our general Privacy Policy.


Orion Innovation Chennai, Tamil Nadu, IND Office

Ambit IT Park, Ambit Park Road, Ambattur Industrial Estate, Chennai, India, 600 058

What you need to know about the Chennai Tech Scene

To locals, it's no secret that South India is leading the charge in big data infrastructure. While the environmental impact of data centers has long been a concern, emerging hubs like Chennai are favored by companies seeking ready access to renewable energy resources, which provide more sustainable and cost-effective solutions. As a result, Chennai, along with neighboring Bengaluru and Hyderabad, is poised for significant growth, with a projected 65 percent increase in data center capacity over the next decade.
