Applied Materials Logo

Applied Materials

High-Performance Computing (HPC) Architect

Job Posted 16 Days Ago Posted 16 Days Ago
Be an Early Applicant
2 Locations
Senior level
2 Locations
Senior level
The HPC Architect designs optimized high-performance computing solutions, collaborates with teams, manages workloads, and enhances systems' efficiency while mitigating risks.
The summary above was generated by AI

Who We Are

Applied Materials is the global leader in materials engineering solutions used to produce virtually every new chip and advanced display in the world. We design, build and service cutting-edge equipment that helps our customers manufacture display and semiconductor chips – the brains of devices we use every day. As the foundation of the global electronics industry, Applied enables the exciting technologies that literally connect our world – like AI and IoT. If you want to work beyond the cutting-edge, continuously pushing the boundaries of science and engineering to make possible the next generations of technology, join us to Make Possible® a Better Future. 

What We Offer

Location:

Bangalore,IND, Chennai,IND

At Applied, we prioritize the well-being of you and your family and encourage you to bring your best self to work. Your happiness, health, and resiliency are at the core of our benefits and wellness programs. Our robust total rewards package makes it easier to take care of your whole self and your whole family. We’re committed to providing programs and support that encourage personal and professional growth and care for you at work, at home, or wherever you may go. Learn more about our benefits

You’ll also benefit from a supportive work culture that encourages you to learn, develop and grow your career as you take on challenges and drive innovative solutions for our customers. We empower our team to push the boundaries of what is possible—while learning every day in a supportive leading global company. Visit our Careers website to learn more about careers at Applied.

About Applied 

 

Applied Materials is the leader in materials engineering solutions used to produce virtually every new chip and advanced display in the world. Our expertise in modifying materials at atomic levels and on an industrial scale enables customers to transform possibilities into reality. At Applied Materials, our innovations make possible the technology shaping the future. 

 

Our Team 

Our team is developing a high-performance computing solution for low-latency and high throughput image processing and deep-learning workloads that will enable our Chip Manufacturing process control equipment to offer differentiated value to our customers. 

Your Opportunity 

As an HPC Architect, you will get the opportunity to architect high-performance computing solutions from scratch and design/optimize all aspects (Compute, Memory, Networking, Storage) for better cost of Ownership. 

Roles and Responsibility 

  • As an architect, you will be responsible for designing HPC infrastructure solutions, including compute, networking, storage, and workload management components. 

  • You will work closely with cross-functional teams, including Hardware, Software, product management, and business stakeholders, to understand compute workload and translate them into Platform architecture and designs that meet business needs. 

  • You will create and maintain detailed system architecture diagrams and specifications.  

  • You will evaluate and select appropriate hardware and software components for HPC environments 

  • You will Install, configure, and maintain HPC systems, including hardware, software, and networking components 

  • You will develop and implement automation scripts for system management and deployment.  

  • You will be a subject Matter expert to unblock dependent teams in the HPC domain. 

  • You will be expected to develop system benchmarks, profile systems to understand bottlenecks, optimize workflows and processes to improve cost of ownership. 

  • Identify and mitigate technical risks and issues throughout the HPC development life cycle. 

  • Ensure that Compute Cluster is resilient, reliable, and maintainable. 

  • You will be expected to stay abreast of the latest HPC technologies, including Hardware, Software and Networking Solutions 

  • Your primary focus will be to understand the compute workload and design HPC cluster with right combination of Nodes, CPU/GPU, Memory, Interconnects and storage to have optimum performance at minimum cost of Ownership. 

 

 

Our Ideal Candidate 

Someone who has the drive and passion to learn quickly, has the ability to multi-task and switch contexts based on business needs. 

Qualifications 

  • In-depth experience with Linux System administration and Hardware/Software Configuration. 

  • Strong knowledge of HPC technologies including cluster computing, high speed interconnects (InfiniBand, RoCE), parallel filesystems (Lustre, GPFS, BeeGFS etc) 

  • Experience in creating, maintaining Operating System images with different installation and boot schemes 

  • Extremely good with automation tools like Ansible, Chef, Salt-Stack and Scripting languages (Python and Bash) 

  • Experience in Creating, maintaining Storage Solutions with different RAID configuration. 

  • Ability to design storage solution for different IOPS, Access patterns (Random vs Sequential RW) and tune storage and filesystems for better performance. 

  • Good of knowledge Networking concepts including IP addressing, routing, protocols and Switch configuration for RDMA, VLAN configuration, network bonding etc. 

  • Good Knowledge Virtualization, Hardware and Software Hypervisors 

  • Good knowledge of containerization technologies like docker, singularity. 

  • Experience in Software Defined Networking and Storage. 

  • Experience in setting-up remote management protocols like IPMI, Redfish etc. 

  • Experience in setting-up and using monitoring systems like Prometheus, Grafana. 

  • Experience System profiling and custom tuning for target workload for higher performance and low cost of ownership 

  • Very good written and verbal communication skills. 

  • Very good in Technical documentation meant to serve as manuals for non-experts in the field. 

 

Additional Qualifications: 

 

  • Experience in HPC Cluster management and Work-load orchestration software (e.g. SLURM, Torque, LSF) 

  • Experience in Setting-up Deep-learning training/inference solutions. 

  • Experience in Private cloud infrastructure like Kubernetes, OpenStack, CloudStack etc. 

  • Experience in Distributed High Performance Computing and Parallel programming frameworks  

  • Good knowledge of Low-latency and high-throughput data transfer technologies (RDMA on RoCE, InfiniBand) 

 

Education: 

Bachelor's Degree or higher in Computer science or related Disciplines. 

Applied Materials is committed to diversity in its workforce including Equal Employment Opportunity for Minorities, Females, Protected Veterans and Individuals with Disabilities. 

Additional Information

Time Type:

Full time

Employee Type:

Assignee / Regular

Travel:

Relocation Eligible:

No

Applied Materials is an Equal Opportunity Employer. Qualified applicants will receive consideration for employment without regard to race, color, national origin, citizenship, ancestry, religion, creed, sex, sexual orientation, gender identity, age, disability, veteran or military status, or any other basis prohibited by law.

Top Skills

Ansible
Bash
Beegfs
Chef
Cloudstack
Gpfs
Grafana
Infiniband
Ipmi
Kubernetes
Linux
Lsf
Lustre
Openstack
Prometheus
Python
Redfish
Roce
Salt-Stack
Slurm
Torque

Applied Materials Chennai, Tamil Nadu, IND Office

X6PW+5GQ, Tharamani, Chennai, Tamil Nadu, India, 600113

Similar Jobs

Yesterday
Chennai, Tamil Nadu, IND
Senior level
Senior level
Cloud • Fintech • Food • Information Technology • Software • Hospitality
As a Staff Engineer, you will develop backend systems for Accounts Payable, enhance user interfaces, automate processes, and collaborate with cross-functional teams to deliver high-quality solutions.
Top Skills: BigQueryC#DynamoDBGraphQLJavaKotlinNoSQLPostgresRedshiftRestful ApisSnowflakeSQLSQL Server
Yesterday
Hybrid
Chennai, Tamil Nadu, IND
Mid level
Mid level
Big Data • Fintech • Information Technology • Business Intelligence • Financial Services • Cybersecurity • Big Data Analytics
The Quality Assurance Sr Analyst is responsible for testing complex enterprise applications, conducting test design and execution, and collaborating with cross-functional teams.
Top Skills: Ab InitioAgileAWSAzureCi/CdETLGherkin LanguageJenkinsRelational DatabaseSQLUnix
Yesterday
Hybrid
2 Locations
Senior level
Senior level
Artificial Intelligence • Healthtech • Machine Learning • Natural Language Processing • Biotech • Pharmaceutical
The role involves leading the development of data engineering processes, ensuring data quality, guiding junior developers, and collaborating with teams to support analytics and AI solutions.
Top Skills: AirflowAWSAzureC++DataikuDockerGCPGitHadoopJavaKafkaKubernetesPythonRedshiftSnowflakeSparkSQL

What you need to know about the Chennai Tech Scene

To locals, it's no secret that South India is leading the charge in big data infrastructure. While the environmental impact of data centers has long been a concern, emerging hubs like Chennai are favored by companies seeking ready access to renewable energy resources, which provide more sustainable and cost-effective solutions. As a result, Chennai, along with neighboring Bengaluru and Hyderabad, is poised for significant growth, with a projected 65 percent increase in data center capacity over the next decade.
By clicking Apply you agree to share your profile information with the hiring company.

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account