Tech Holding Logo

Tech Holding

ML / AI Data Engineer (Contract)

Posted 24 Days Ago
Be an Early Applicant
Remote
Hiring Remotely in India
Senior level
Remote
Hiring Remotely in India
Senior level
The role involves designing and optimizing large-scale ML data pipelines, ensuring high-throughput data ingestion and processing, collaborating with teams on data workflows, and architecting GPU-based environments.
The summary above was generated by AI

About us:

Working at Tech Holding isn't just a job, it's an opportunity to be a part of something bigger. We are a full-service consulting firm that was founded on the premise of delivering predictable outcomes and high-quality solutions to our clients.  Our founders and team members have industry experience and have held senior positions in a wide variety of companies – from emerging startups to large Fortune 50 firms – and we have taken our combined experiences and developed a unique approach that is supported by the principles of deep expertise, integrity, transparency, and dependability.

We are looking for a highly skilled Senior ML / Data Pipeline Engineer who can translate complex machine learning and multimodal concepts into scalable, production-ready pipelines and workflows.
This role focuses on building and optimising large-scale video and multimodal data systems, enabling high-throughput ingestion, processing, and model training across distributed cloud environments.
Key Responsibilities
  • Design, deploy, and scale large-scale ML and data processing pipelines across cloud infrastructure.
  • Build systems to ingest, process, and serve 250,000+ hours of multimodal data (video, audio, metadata).
  • Architect and optimize GPU-based compute environments (e.g., NVIDIA Tesla clusters) for distributed training and inference.
  • Develop high-throughput backend systems for video ingestion from desktop and mobile platforms.
  • Implement distributed processing workflows, including job scheduling, fault tolerance, and resource allocation.
  • Design and build human-in-the-loop and automated annotation systems to ensure data quality and scalability.
  • Translate ML and multimodal research into scalable, production-grade cloud architectures.
  • Optimize pipelines for performance, reliability, and cost efficiency across compute, storage, and networking layers.
  • Collaborate with ML, data, and engineering teams to deliver end-to-end data workflows.
Requirements
  • 5+ years of experience in data engineering, ML pipelines, or distributed systems.
  • Strong experience building scalable data pipelines for large datasets (video/audio preferred).
  • Hands-on experience with cloud platforms (AWS, Azure, or GCP).
  • Experience working with GPU-based environments and distributed computing.
  • Strong programming skills in Python, Scala, or similar languages.
  • Experience with data processing frameworks (Spark, Ray, Kafka, Airflow, or similar).
  • Understanding of ML workflows, training pipelines, and inference systems.
  • Experience designing fault-tolerant, high-availability systems.
  • Strong knowledge of data storage systems (data lakes, object storage, distributed file systems).
  • Ability to handle high-throughput, large-scale data ingestion and processing.
Good to Have
  • Experience with multimodal AI (video, audio, NLP) systems.
  • Familiarity with annotation tools and data labeling workflows.
  • Experience with containerization and orchestration (Docker, Kubernetes).
  • Knowledge of cost optimization strategies for large-scale cloud workloads.

Tech Holding is proud to be an Equal Opportunity Employer and is committed to fostering a diverse and inclusive workplace. We welcome applicants from all backgrounds and experiences, and we consider qualified applicants without regard to race, color, religion, gender, sexual orientation, gender identity, national origin, disability, veteran status, or any other legally protected characteristic. If you require accommodation in the application process, please contact our HR 

Similar Jobs

27 Minutes Ago
Easy Apply
Remote
India
Easy Apply
Entry level
Entry level
Cloud • Security • Software • Cybersecurity • Automation
As a Business Development Representative, you'll lead outreach to potential accounts, generate qualified meetings, and collaborate with marketing and sales teams to identify prospects and opportunities.
Top Skills: Linkedin Sales NavigatorOutreach.IoSalesforce
4 Hours Ago
In-Office or Remote
Mid level
Mid level
Artificial Intelligence • Fintech • Information Technology • Logistics • Payments • Business Intelligence • Generative AI
The Talent Experience Specialist manages career growth and employee recognition initiatives, supports the Career Experience platform, and coordinates learning programs and metrics at Coupa.
Top Skills: Linkedin LearningLms
5 Hours Ago
Remote or Hybrid
India
Senior level
Senior level
Cloud • Computer Vision • Information Technology • Sales • Security • Cybersecurity
As a Senior Threat Researcher, you will lead initiatives in threat detection, malware analysis, and automation, mentoring team members and enhancing scalable solutions to combat complex cyber threats.
Top Skills: Binary NinjaC++CassandraElasticsearchGhidraGoIda ProMongoDBMySQLPostgresPythonRustSplunkX64Dbg

What you need to know about the Chennai Tech Scene

To locals, it's no secret that South India is leading the charge in big data infrastructure. While the environmental impact of data centers has long been a concern, emerging hubs like Chennai are favored by companies seeking ready access to renewable energy resources, which provide more sustainable and cost-effective solutions. As a result, Chennai, along with neighboring Bengaluru and Hyderabad, is poised for significant growth, with a projected 65 percent increase in data center capacity over the next decade.

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account