NVIDIA Aerial CUDA Accelerated RAN (ACAR) is framework for building high-performance, software-defined, cloud-native Radio Access Network functions over NVIDIA CPU/GPU/DPU based systems. We are seeking a self-motivated senior performance engineer to drive performance and scalability of our platform. This position offers the opportunity to work on cutting-edge technology for 5G and 6G networks, using NVIDIA's world-class compute platforms to advance the field of software-defined digital signal processing stack!
What you'll be doing:
As a member of Aerial RAN team working for 5G and 6G networks, you will be responsible for:
Developing new Features for Aerial L1 and L2 Software
Optimizing CPU, GPU and NIC sub-systems for predictable low-latency and maximum efficiency
Crafting and implementing performance verification tools, frameworks and dashboards
Monitoring and prioritizing performance regressions reported by CI/CD
Collaborating with multi-functional teams to solve performance bottlenecks in CPU, GPU and NIC sub-systems
Benchmarking performance use-cases on different platforms
What we need to see:
BS/MS (or equivalent experience) in a relevant field and 12+ years’ experience or PhD with 7+ years’ experience or equivalent.
Strong software design, development, debugging and testing skills.
Hands-on experience with performance analysis, characterization and optimization.
Experience with programming latency sensitive, real-time, multi-threaded applications on CPUs and one or more of GPUs or DSPs or Vector processors.
Deep knowledge of CPU, DSP or GPU architecture, as well as memory, I/O and networking interfaces.
Familiarity with data science and using visualization tools to summarize large quantities of data.
Experience in one or more programming / scripting languages: C/C++, Python, shell scripting.
Ways to stand out from the crowd
Experience in designing and managing firmware timelines for wireless SoCs used in cellular wireless networks and/or terminals!
Experience in massive MIMO algorithm implementation and optimization for OTA performance
Track record in E2E design/testing of signal processing algorithms at the PHY layer or resource allocation optimization at MAC level.
Experience in field support/Network Optimization in real world deployments
Appetite to learn the details of how next generations of GPU will operate and build an outstanding Software-Radio 5G/6G stack that can fully demonstrate their power.


