NTT DATA

Associate Data Engineer

Job posted 14 days ago; reposted 14 days ago
3 Locations
Senior level
The Associate Data Engineer will design and develop data pipelines for GenAI solutions, manage cloud infrastructure, ensure data security, and collaborate with clients on data needs.
The summary above was generated by AI

Make an impact with NTT DATA
Join a company that is pushing the boundaries of what is possible. We are renowned for our technical excellence and leading innovations, and for making a difference to our clients and society. Our workplace embraces diversity and inclusion – it’s a place where you can grow, belong and thrive.

Your day at NTT DATA

We are seeking an experienced Data Engineer to join our team in delivering cutting-edge Generative AI (GenAI) solutions to clients. The successful candidate will be responsible for designing, developing, and deploying data pipelines and architectures that support the training, fine-tuning, and deployment of LLMs for various industries. This role requires strong technical expertise in data engineering, problem-solving skills, and the ability to work effectively with clients and internal teams.

What you'll be doing

Key Responsibilities:

  • Design, develop, and manage data pipelines and architectures to support GenAI model training, fine-tuning, and deployment.
  • Data Ingestion and Integration: Develop data ingestion frameworks that collect data from various sources, transform it, and integrate it into a unified data platform for GenAI model training and deployment.
  • GenAI Model Integration: Collaborate with data scientists to integrate GenAI models into production-ready applications, ensuring seamless model deployment, monitoring, and maintenance.
  • Cloud Infrastructure Management: Design, implement, and manage cloud-based data infrastructure (e.g., AWS, GCP, Azure) to support large-scale GenAI workloads, ensuring cost-effectiveness, security, and compliance.
  • Write scalable, readable, and maintainable object-oriented code in languages such as Python, using libraries such as Hugging Face Transformers, PyTorch, or TensorFlow.
  • Performance Optimization: Optimize data pipelines, GenAI model performance, and infrastructure for scalability, efficiency, and cost-effectiveness.
  • Data Security and Compliance: Ensure data security, privacy, and compliance with regulatory requirements (e.g., GDPR, HIPAA) across data pipelines and GenAI applications.
  • Client Collaboration: Collaborate with clients to understand their GenAI needs, design solutions, and deliver high-quality data engineering services.
  • Innovation and R&D: Stay up to date with the latest GenAI trends, technologies, and innovations, applying research and development skills to improve data engineering services.
  • Knowledge Sharing: Share knowledge, best practices, and expertise with team members, contributing to the growth and development of the team.

Requirements:

  • Bachelor’s degree in Computer Science, Engineering, or a related field (Master’s recommended)
  • Experience with vector databases (e.g., Pinecone, Weaviate, Faiss, Annoy) for efficient similarity search and storage of dense vectors in GenAI applications
  • 5+ years of experience in data engineering, with a strong emphasis on cloud environments (AWS, GCP, Azure, or Cloud Native platforms)
  • Proficiency in SQL, Python, and PySpark
  • Strong data architecture, data modeling, and data governance skills
  • Experience with big data platforms (Hadoop, Databricks, Hive, Kafka, Apache Iceberg), data warehouses (Teradata, Snowflake, BigQuery), and lakehouses (Delta Lake, Apache Hudi)
  • Knowledge of DevOps practices, including Git workflows and CI/CD pipelines (Azure DevOps, Jenkins, GitHub Actions)
  • Experience with GenAI frameworks and tools (e.g., TensorFlow, PyTorch, Keras)
  • Nice to have:
    • Experience with containerization and orchestration tools like Docker and Kubernetes
    • Experience integrating vector databases and implementing similarity search techniques; a focus on GraphRAG is a plus
    • Familiarity with API gateway and service mesh architectures
    • Experience with low latency/streaming, batch, and micro-batch processing
    • Familiarity with Linux-based operating systems and REST APIs

Location: Delhi or Bangalore

Workplace type: Hybrid Working

About NTT DATA
NTT DATA is a $30+ billion trusted global innovator of business and technology services. We serve 75% of the Fortune Global 100 and are committed to helping clients innovate, optimize and transform for long-term success. We invest over $3.6 billion each year in R&D to help organizations and society move confidently and sustainably into the digital future. As a Global Top Employer, we have diverse experts in more than 50 countries and a robust partner ecosystem of established and start-up companies. Our services include business and technology consulting, data and artificial intelligence, industry solutions, as well as the development, implementation and management of applications, infrastructure, and connectivity. We are also one of the leading providers of digital and AI infrastructure in the world. NTT DATA is part of NTT Group and headquartered in Tokyo.

Equal Opportunity Employer
NTT DATA is proud to be an Equal Opportunity Employer with a global culture that embraces diversity. We are committed to providing an environment free of unfair discrimination and harassment. We do not discriminate based on age, race, colour, gender, sexual orientation, religion, nationality, disability, pregnancy, marital status, veteran status, or any other protected category. Join our growing global team and accelerate your career with us. Apply today.

Top Skills

Apache Hudi
AWS
Azure
BigQuery
Databricks
Delta Lake
Docker
GCP
Hadoop
Hive
Kafka
Keras
Kubernetes
PySpark
Python
PyTorch
Snowflake
SQL
TensorFlow
Teradata

NTT DATA Chennai, Tamil Nadu, IND Office

Chennai, India

What you need to know about the Chennai Tech Scene

To locals, it's no secret that South India is leading the charge in big data infrastructure. While the environmental impact of data centers has long been a concern, emerging hubs like Chennai are favored by companies seeking ready access to renewable energy resources, which provide more sustainable and cost-effective solutions. As a result, Chennai, along with neighboring Bengaluru and Hyderabad, is poised for significant growth, with a projected 65 percent increase in data center capacity over the next decade.