Hewlett Packard Enterprise Logo

Hewlett Packard Enterprise

Principal Software Engineer – Scale-Up Networking (GPU-Centric)

Reposted 10 Days Ago
Be an Early Applicant
In-Office
3 Locations
Expert/Leader
In-Office
3 Locations
Expert/Leader
The Principal Software Engineer will architect and develop high-performance GPU-aware networking solutions, optimize data movement, enhance communication stacks, and lead technical contributions in HPC environments.
The summary above was generated by AI
Principal Software Engineer – Scale-Up Networking (GPU-Centric)

  

This role has been designed as ‘Hybrid’ with an expectation that you will work on average 2 days per week from an HPE office.

Who We Are:

Hewlett Packard Enterprise is the global edge-to-cloud company advancing the way people live and work. We help companies connect, protect, analyze, and act on their data and applications wherever they live, from edge to cloud, so they can turn insights into outcomes at the speed required to thrive in today’s complex world. Our culture thrives on finding new and better ways to accelerate what’s next. We know varied backgrounds are valued and succeed here. We have the flexibility to manage our work and personal needs. We make bold moves, together, and are a force for good. If you are looking to stretch and grow your career our culture will embrace you. Open up opportunities with HPE.

Job Description:

   

High Performance Computing, AI and Labs is a critical element of HPE. We are focused on delivering innovative solutions that accelerate our customers’ digital transformation, enabling them to tackle their complex, and data-intensive workloads. Combining deep expertise and the development of the world’s most cutting-edge, high-performance supercomputers, is defining the next era of computing delivering valuable insight & innovation. Join us and redefine what’s next for you.

What you'll do:

Key ResponsibilitiesArchitect & Deliver Scale-Up Networking
  • Design and implement GPU-aware networking paths for high-bandwidth, low-latency intra-node communication.

  • Develop and optimize GPU → NIC → GPU data movement, shared memory models, and DMA pathways.

GPU Ecosystem Integration
  • Work with NVIDIA CUDA, NVLink, NCCL, and AMD ROCm, InfinityFabric, RCCL teams to integrate and optimize scale-up communication semantics.

  • Drive improvements to DMA engines, BAR mappings, ATS/IOMMU, and GPU memory registration workflows.

Runtime & Communication Stack Development
  • Enhance and extend Libfabric, UCX, CXI, SHMEMX, OpenMPI for GPU-accelerated scale-up workflows.

  • Optimize communication collectives, transport layers, and GPU-direct capabilities.

Multi-NIC / NUMA Performance Optimization
  • Characterize and tune multi-NIC per socket, NUMA-zone mapping, GPU locality, CQ/queue design, and CPU/GPU topology optimization.

Upstreaming & Architecture Influence
  • Lead upstream contributions to open-source projects (OFI, UCX, OpenMPI, RCCL/NCCL enablement).

  • Partner with HPC/AI ecosystem teams to shape future architectures.

Debugging, Performance, and Quality
  • Own complex debugging across driver, runtime, GPU, kernel, and user-space boundaries.

  • Develop profiling workflows using Nsight, ROCm tools, eBPF, perf, etc.

What you need to bring:Required Skills & Experience
  • 10–15+ years building high-performance networking, GPU, or kernel-level software.

  • Deep expertise in C/C++, Linux internals, memory management, RDMA, PCIe, IOMMU, ATS, DMA engines.

  • Strong understanding of CUDA, ROCm, GPU memory models, P2P, GDS (GPUDirect Storage), GDR (GPUDirect RDMA).

  • Hands-on experience with MPI, SHMEM, Libfabric, UCX, or similar communication stacks.

  • Proven experience driving architecture, cross-org technical decisions, and upstream contributions.

  • Ability to mentor senior engineers, influence multi-team designs, and own end-to-end delivery.

Preferred Qualifications
  • Experience with NIC architecture (CXI, RoCE, Infiniband, Slingshot, NVLink Switch).

  • Experience optimizing collectives (AllReduce/AllGather) on GPUs.

  • Background contributing to open-source HPC/AI libraries.

  • Familiarity with HPC system architecture, NUMA tuning, and multi-accelerator systems.

Additional Skills:

Cloud Architectures, Cross Domain Knowledge, Design Thinking, Development Fundamentals, DevOps, Distributed Computing, Microservices Fluency, Full Stack Development, Security-First Mindset, Solutions Design, Testing & Automation, User Experience (UX)

What We Can Offer You:

Health & Wellbeing

We strive to provide our team members and their loved ones with a comprehensive suite of benefits that supports their physical, financial and emotional wellbeing.

Personal & Professional Development

We also invest in your career because the better you are, the better we all are. We have specific programs catered to helping you reach any career goals you have — whether you want to become a knowledge expert in your field or apply your skills to another division.

Unconditional Inclusion

We are unconditionally inclusive in the way we work and celebrate individual uniqueness. We know varied backgrounds are valued and succeed here. We have the flexibility to manage our work and personal needs. We make bold moves, together, and are a force for good.

Let's Stay Connected:

Follow @HPECareers on Instagram to see the latest on people, culture and tech at HPE.

#india#highperformancecompute

Job:

Engineering

Job Level:

TCP_05

    

    

HPE is an Equal Employment Opportunity/ Veterans/Disabled/LGBT employer. We do not discriminate on the basis of race, gender, or any other protected category, and all decisions we make are made on the basis of qualifications, merit, and business need. Our goal is to be one global team that is representative of our customers, in an inclusive environment where we can continue to innovate and grow together. Please click here: Equal Employment Opportunity.

Hewlett Packard Enterprise is EEO Protected Veteran/ Individual with Disabilities.

   

HPE will comply with all applicable laws related to employer use of arrest and conviction records, including laws requiring employers to consider for employment qualified applicants with criminal histories.

   

No Fees Notice & Recruitment Fraud Disclaimer

 

It has come to HPE’s attention that there has been an increase in recruitment fraud whereby scammer impersonate HPE or HPE-authorized recruiting agencies and offer fake employment opportunities to candidates.  These scammers often seek to obtain personal information or money from candidates.

 

Please note that Hewlett Packard Enterprise (HPE), its direct and indirect subsidiaries and affiliated companies, and its authorized recruitment agencies/vendors will never charge any candidate a registration fee, hiring fee, or any other fee in connection with its recruitment and hiring process.  The credentials of any hiring agency that claims to be working with HPE for recruitment of talent should be verified by candidates and candidates shall be solely responsible to conduct such verification. Any candidate/individual who relies on the erroneous representations made by fraudulent employment agencies does so at their own risk, and HPE disclaims liability for any damages or claims that may result from any such communication.

Top Skills

Amd Rocm
Ats
C/C++
Cxi
Dma
Infinityfabric
Iommu
Libfabric
Linux
Mpi
Nccl
Nvidia Cuda
Nvlink
Pcie
Rccl
Rdma
Shmem
Ucx

Similar Jobs

4 Hours Ago
Hybrid
Chennai, Tamil Nadu, IND
Senior level
Senior level
Aerospace • Digital Media • Information Technology • Internet of Things • Mobile • Software
Lead orchestration, automation, and service-assurance initiatives across SES IT; manage test environments, vendor deliverables, Level-3 support, deployments, demos, training, and cross-team coordination to ensure highly available production systems and improved operational efficiency.
Top Skills: Amdocs AnnAmdocs ArmAmdocs OdoAmdocs OndAmdocs Oni-SAmdocs OsoAWSAzureBssCRMDataminerKafkaKubernetesMicrosoft AccessExcelMicrosoft OutlookMicrosoft PowerpointMicrosoft VisioMicrosoft WordNetcoolOssPostmanRest ApiTeoco HelixTmf
4 Hours Ago
Hybrid
Chennai, Tamil Nadu, IND
Senior level
Senior level
Aerospace • Digital Media • Information Technology • Internet of Things • Mobile • Software
Develop, test, document and maintain backend operational software for satellite fleet and ground station orchestration. Ensure code quality, automated BDD testing, CI/CD pipelines, and support product transition into operations.
Top Skills: Python,Numpy,Scipy,Pandas,Jupyter,Pytest,Matplotlib,Go,Java,Rust,Git,Linux,Postgresql,Mysql,Mongodb,Nosql,Kafka,Rest,Grpc,Zmq,Tcp/Ip,Apache,Nginx,Virtualization,Containerization,Kubernetes,Aws,Azure,Google Cloud,Ci/Cd,Bdd
4 Hours Ago
In-Office
Chennai, Tamil Nadu, IND
Senior level
Senior level
Cloud • Information Technology • Internet of Things • Machine Learning • Software • Cybersecurity • Infrastructure as a Service (IaaS)
The Senior Software Engineer will design, test, and automate products while ensuring test coverage, document progress, and participate in Agile methodologies.
Top Skills: CkaCkadCloud Native ArchitecturesDockerGerritGitGoHelmJavaJenkinsKubernetesLinuxMavenPython

What you need to know about the Chennai Tech Scene

To locals, it's no secret that South India is leading the charge in big data infrastructure. While the environmental impact of data centers has long been a concern, emerging hubs like Chennai are favored by companies seeking ready access to renewable energy resources, which provide more sustainable and cost-effective solutions. As a result, Chennai, along with neighboring Bengaluru and Hyderabad, is poised for significant growth, with a projected 65 percent increase in data center capacity over the next decade.

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account