Build and maintain GCP-based data pipelines and infrastructure to support generative AI/NLP model training, ensure data privacy and security, integrate backend with web and mobile apps, collaborate with AI and DevOps teams, and implement monitoring and optimization for production performance.
Job Summary
- We are seeking a Data Engineer to help build and integrate a Generative AI-powered conversational assistant into our website and mobile app. This role is central to the data pipelines, model training, and infrastructure setup needed to deliver a seamless, privacy-compliant experience for users seeking personalized health insights. The Data Engineer will work closely with our AI and software development teams to design scalable data solutions within Google Cloud Platform (GCP) to support this next-generation AI service.
- Data Integration & Pipeline Development: Design and implement data pipelines that support model training and finetuning on knowledge base and user data, ensuring data quality, scalability, and efficiency.
- Data Processing & Transformation: Develop data transformation processes to prepare data for Natural Language Processing (NLP) models, facilitating personalized and accurate health recommendations.
- Privacy & Security Compliance: Ensure all data handling practices comply with privacy and security standards, focusing on user data protection within AI model training and deployment.
- Infrastructure Setup & Management: Build and maintain foundational cloud infrastructure on GCP to host, deploy, and scale the assistant securely and efficiently across platforms.
- Collaboration with AI & DevOps Teams: Partner with AI/ML and DevOps teams to finetune, test, and optimize NLP models for production, focusing on deployment performance and user experience.
- Website & Mobile Integration Support: Work alongside frontend developers to ensure smooth data flow and integration between the backend, website and mobile app.
- Monitoring & Optimization: Implement monitoring, logging, and automated alerts to ensure data pipelines, model interactions, and infrastructure meet performance and reliability requirements.
- Education: Bachelor’s or Master’s in Computer Science, Data Engineering, or a related field.
- Experience:
  - 3+ years in data engineering, preferably within Generative AI or NLP-focused projects.
  - Hands-on experience with Google Cloud Platform (GCP), including BigQuery, Dataflow, and Cloud Storage.
  - Proven ability in data pipeline design and data transformations for AI model training.
- Skills:
  - Strong programming skills in Python and familiarity with SQL.
  - Experience with DevOps tools (e.g., Kubernetes, Docker) and CI/CD pipelines in GCP.
  - Proficient in data management practices, data privacy, and security protocols.
  - Familiarity with AI/ML workflows, specifically NLP model training and finetuning.
- Nice to Have:
  - Experience working with Contentful or React Native integrations.
  - Knowledge of MLOps practices to support continuous model training and deployment.
Photon Chennai, Tamil Nadu, IND Office
DLF IT Park 1/124 Mount Poonamallee Road Sivaji Gardens Manapakkam, Chennai, India, 600089
What you need to know about the Chennai Tech Scene
To locals, it's no secret that South India is leading the charge in big data infrastructure. While the environmental impact of data centers has long been a concern, emerging hubs like Chennai are favored by companies seeking ready access to renewable energy resources, which provide more sustainable and cost-effective solutions. As a result, Chennai, along with neighboring Bengaluru and Hyderabad, is poised for significant growth, with a projected 65 percent increase in data center capacity over the next decade.

