Tech Holding Logo

Tech Holding

ML / AI Data Engineer (Contract)

Posted 4 Days Ago
Be an Early Applicant
Remote
Hiring Remotely in India
Senior level
Remote
Hiring Remotely in India
Senior level
The role involves designing and optimizing large-scale ML data pipelines, ensuring high-throughput data ingestion and processing, collaborating with teams on data workflows, and architecting GPU-based environments.
The summary above was generated by AI

About us:

Working at Tech Holding isn't just a job, it's an opportunity to be a part of something bigger. We are a full-service consulting firm that was founded on the premise of delivering predictable outcomes and high-quality solutions to our clients.  Our founders and team members have industry experience and have held senior positions in a wide variety of companies – from emerging startups to large Fortune 50 firms – and we have taken our combined experiences and developed a unique approach that is supported by the principles of deep expertise, integrity, transparency, and dependability.

We are looking for a highly skilled Senior ML / Data Pipeline Engineer who can translate complex machine learning and multimodal concepts into scalable, production-ready pipelines and workflows.
This role focuses on building and optimising large-scale video and multimodal data systems, enabling high-throughput ingestion, processing, and model training across distributed cloud environments.
Key Responsibilities
  • Design, deploy, and scale large-scale ML and data processing pipelines across cloud infrastructure.
  • Build systems to ingest, process, and serve 250,000+ hours of multimodal data (video, audio, metadata).
  • Architect and optimize GPU-based compute environments (e.g., NVIDIA Tesla clusters) for distributed training and inference.
  • Develop high-throughput backend systems for video ingestion from desktop and mobile platforms.
  • Implement distributed processing workflows, including job scheduling, fault tolerance, and resource allocation.
  • Design and build human-in-the-loop and automated annotation systems to ensure data quality and scalability.
  • Translate ML and multimodal research into scalable, production-grade cloud architectures.
  • Optimize pipelines for performance, reliability, and cost efficiency across compute, storage, and networking layers.
  • Collaborate with ML, data, and engineering teams to deliver end-to-end data workflows.
Requirements
  • 5+ years of experience in data engineering, ML pipelines, or distributed systems.
  • Strong experience building scalable data pipelines for large datasets (video/audio preferred).
  • Hands-on experience with cloud platforms (AWS, Azure, or GCP).
  • Experience working with GPU-based environments and distributed computing.
  • Strong programming skills in Python, Scala, or similar languages.
  • Experience with data processing frameworks (Spark, Ray, Kafka, Airflow, or similar).
  • Understanding of ML workflows, training pipelines, and inference systems.
  • Experience designing fault-tolerant, high-availability systems.
  • Strong knowledge of data storage systems (data lakes, object storage, distributed file systems).
  • Ability to handle high-throughput, large-scale data ingestion and processing.
Good to Have
  • Experience with multimodal AI (video, audio, NLP) systems.
  • Familiarity with annotation tools and data labeling workflows.
  • Experience with containerization and orchestration (Docker, Kubernetes).
  • Knowledge of cost optimization strategies for large-scale cloud workloads.

Tech Holding is proud to be an Equal Opportunity Employer and is committed to fostering a diverse and inclusive workplace. We welcome applicants from all backgrounds and experiences, and we consider qualified applicants without regard to race, color, religion, gender, sexual orientation, gender identity, national origin, disability, veteran status, or any other legally protected characteristic. If you require accommodation in the application process, please contact our HR 

Similar Jobs

2 Hours Ago
Remote or Hybrid
Mid level
Mid level
Artificial Intelligence • Cloud • Information Technology • Sales • Security • Software • Cybersecurity
The Manager of Technical Support Engineering leads a team in resolving customer issues, enhances support processes, and drives customer satisfaction through collaboration and team development.
Top Skills: Salesforce Service Cloud
4 Hours Ago
Remote
India
Senior level
Senior level
Artificial Intelligence • Consumer Web • Edtech • HR Tech • Information Technology • Software • Conversational AI
The Senior Instructional Designer will develop digital learning solutions through visualization and instructional design, collaborate with stakeholders, and ensure compliance with standards.
Top Skills: Instructional Design
12 Hours Ago
Remote
India
Senior level
Senior level
Cloud • Information Technology • Productivity • Software • Automation
As a Senior ServiceNow CRM Developer/Administrator, you will lead the design and implementation of CRM solutions, manage platform governance, and mentor junior developers while ensuring optimal service delivery.
Top Skills: CSSFlow DesignerGlide ApiHTMLIntegration HubJavaScriptRestServicenowSoapXML

What you need to know about the Chennai Tech Scene

To locals, it's no secret that South India is leading the charge in big data infrastructure. While the environmental impact of data centers has long been a concern, emerging hubs like Chennai are favored by companies seeking ready access to renewable energy resources, which provide more sustainable and cost-effective solutions. As a result, Chennai, along with neighboring Bengaluru and Hyderabad, is poised for significant growth, with a projected 65 percent increase in data center capacity over the next decade.

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account