
Banyan Software

Senior Data / RAG Engineer

Posted 2 Days Ago
In-Office
Chennai, Tamil Nadu
Senior level
The Senior Data Engineer will design and manage RAG Vector Databases, modernize data ingestion pipelines, ensure data synchronization, and collaborate across teams for AI insights.
The summary above was generated by AI

Banyan Software provides the best permanent home for successful enterprise software companies, their employees, and customers. We are on a mission to acquire, build, and grow great enterprise software businesses all over the world that have dominant positions in niche vertical markets. In recent years, Banyan was named the #1 fastest-growing private software company in the US on the Inc. 5000 and among the top 10 fastest-growing companies by the Deloitte Technology Fast 500. Founded in 2016 with a permanent capital base set up to preserve the legacy of founders, Banyan focuses on a buy-and-hold-for-life strategy for growing software companies that serve specialized vertical markets.

Role Overview:
We’re looking for a Senior Data Engineer with deep expertise in RAG (Retrieval-Augmented Generation) and Vector Database design to build and manage the knowledge backbone for AI compliance and insights. This role focuses on modernizing archival data ingestion and enabling real-time contextual retrieval for AI-driven systems.

Key Responsibilities:

  • Design and implement RAG Vector Databases (e.g., OpenSearch, Pinecone) using archival data from S3 / Glacier, with overall data management handled via MS SQL Server.
  • Modernize existing data ingestion pipelines, replacing legacy OCR-based processes with scalable ETL/ELT frameworks.
  • Ensure data synchronization and consistency between RDS (MS SQL Server) and Vector DB for real-time AI context.
  • Collaborate with AI, backend, and infrastructure teams to optimize retrieval performance and model access.
  • Drive data integrity, schema evolution, and compliance readiness across systems.

Required Skills & Experience:

  • Proven expertise in data engineering pipelines (Kafka / MSK, ETL / ELT).
  • Hands-on experience with Vector Databases and RAG implementations (OpenSearch, Pinecone, FAISS, Chroma).
  • Strong proficiency in SQL, data modeling, and Python / C# / Go.
  • Experience with AWS data ecosystem (S3, RDS, Glue, Lambda and related technologies).
  • 8–10 years of experience in data engineering or AI data platforms.

Diversity, Equity, Inclusion & Equal Employment Opportunity at Banyan: Banyan affirms that inequality is detrimental to our Global Teams, associates, our Operating Companies, and the communities we serve. As a collective, our goal is to impact lasting change through our actions. Together, we unite for equality and equity. Banyan is committed to equal employment opportunities regardless of any protected characteristic, including race, color, genetic information, creed, national origin, religion, sex, affectional or sexual orientation, gender identity or expression, lawful alien status, ancestry, age, marital status, or protected veteran status and will not discriminate against anyone on the basis of a disability. We support an inclusive workplace where associates excel based on personal merit, qualifications, experience, ability, and job performance.


Beware of Recruitment Scams

We have been made aware of individuals fraudulently posing as members of our Talent Acquisition team and extending fake job offers. These scams may involve requests for personal information or payment for equipment. 

Protect yourself by following these steps:

  • Verify that all communications from our recruiting team come from an @banyansoftware.com email address.
  • Remember, employers will never request payment or banking information during the hiring process.
  • If you receive a suspicious message, do not respond — instead, forward it to [email protected] and/or report it to the platform where you received it.

Your safety and security are important to us. Thank you for staying vigilant.

Top Skills

AWS
C#
ELT
ETL
Glue
Go
Kafka
Lambda
MS SQL Server
OpenSearch
Pinecone
Python
RAG
RDS
S3
SQL
Vector Database Design
