KLA

Specialist, HPC Systems Research & Development

Posted 9 Days Ago
Industrial Estate, Mambalam Guindy, Chennai, Tamil Nadu
Entry level
The HPC System R&D Engineer will develop system-level HPC technologies for next-gen clusters used in KLA tools, focusing on deploying AI solutions on on-prem and cloud infrastructures while improving existing frameworks.

Company Overview

KLA is a global leader in diversified electronics for the semiconductor manufacturing ecosystem. Virtually every electronic device in the world is produced using our technologies. No laptop, smartphone, wearable device, voice-controlled gadget, flexible screen, VR device or smart car would have made it into your hands without us. KLA invents systems and solutions for the manufacturing of wafers and reticles, integrated circuits, packaging, printed circuit boards and flat panel displays. The innovative ideas and devices that are advancing humanity all begin with inspiration, research and development. KLA focuses more than average on innovation and we invest 15% of sales back into R&D. Our expert teams of physicists, engineers, data scientists and problem-solvers work together with the world’s leading technology providers to accelerate the delivery of tomorrow’s electronic devices. Life here is exciting and our teams thrive on tackling really hard problems. There is never a dull moment with us.

Group/Division

The mission of KLA Advanced Computing Labs (ACL) in India is to deliver advanced parallel computing research and software architectures for AI + HPC + Cloud solutions that accelerate the performance of KLA’s products. ACL explores high-risk approaches, pioneering technologies, and novel methods to accelerate KLA’s algorithms and contribute to KLA’s HPC technology roadmap. Located in the IIT Madras Research Park in Chennai, India, we engage leading thinkers in academia, industry, and KLA’s business units to create innovative parallel computing methods that enable KLA’s business growth.

Job Description

KLA’s AI Advanced Computing Labs is looking for an extraordinary HPC System R&D Engineer to develop system-level HPC technologies that will form the foundation of next-generation clusters used in KLA tools. These tools leverage AI to push the boundaries of process control for semiconductor manufacturing. The technologies will be developed and demonstrated on on-prem clusters that serve as testbeds for next-generation KLA tools.

 

Your Day-to-day Roles

  • Expose the limitations of existing CPU- and GPU-cluster-based solutions for deploying AI-based solutions at scale on on-prem and cloud infrastructure.
  • Develop distributed frameworks and system-level solutions that scale image-processing and AI workloads out from a single GPU to multi-node clusters with multiple GPUs.
  • Install, benchmark, and evaluate pre-release hardware for early-stage evaluation and prototyping by identifying (or developing) relevant workloads.

Minimum Qualifications

  • Master’s or PhD in Computer Science or a related field; bachelor’s degree holders with relevant experience and an extraordinary track record will also be considered.
  • Deep understanding of operating systems, computer networks, and high-performance applications.
  • Good mental model of the architecture of a modern distributed system comprising CPUs, GPUs, and accelerators.
  • Experience deploying deep-learning frameworks such as TensorFlow and PyTorch on large-scale on-prem or cloud infrastructure.
  • Strong background in modern and advanced C++ concepts.
  • Strong scripting skills in Bash, Python, or similar.
  • Good communication skills.

Things to Make us go Wow!

  • Experience with heterogeneous programming languages such as CUDA and Triton.
  • Experience with model development on DL frameworks such as TensorFlow and PyTorch.
  • Experience building open-source operating systems and software stacks on pre-release hardware.
  • Solid understanding of container infrastructure such as Docker or Singularity, and Kubernetes.
  • Active participation in C++ standards bodies or similar.

We offer a competitive, family-friendly total rewards package. We design our programs to reflect our commitment to an inclusive environment, while ensuring we provide benefits that meet the diverse needs of our employees.

KLA is proud to be an equal opportunity employer.

Be aware of potentially fraudulent job postings or suspicious recruiting activity by persons posing as KLA employees. KLA never asks for any financial compensation to be considered for an interview, to become an employee, or for equipment. Further, KLA does not work with any recruiters or third parties who charge such fees either directly or on behalf of KLA. Please ensure that you have searched KLA’s Careers website for legitimate job postings. KLA follows a recruiting process that involves multiple interviews, in person or by video conference, with our hiring managers. If you are concerned that a communication, an interview, an offer of employment, or an employee is not legitimate, please send an email to [email protected] to confirm that the person you are communicating with is an employee. We take your privacy very seriously and handle your information confidentially.

Top Skills

Bash
C++
Python
