AI Inference Performance Engineer - New College Grad 2026
US · onsite · junior
Posted today · via Workday
About this role
We optimize and benchmark GenAI inference on NVIDIA's latest accelerators, defining the industry’s performance standards across language models, video generation, and speech workloads. We work directly within TensorRT-LLM, SGLang, and vLLM, building the tools that evaluate serving performance at scale. This team sits at the intersection of GPU performance engineering and public accountability.
What You Will Be Doing
- Drive industry benchmark results: own the end-to-end optimization pipeline, implement and integrate optimizations in quantization, scheduling, memory management, and distributed inference across TensorRT-LLM, SGLang, and vLLM.…
What we'd score you on
reqspace match rubric
Five dimensions, recruiter-grade. Upload your resume and we'll generate a written explanation of where you fit and where the gaps are.
1. Skills match
For this role: python, c++, k8s, pytorch, openai…
2. Level fit
This role is junior-level. We check your trajectory against it.
3. Domain experience
Your work in the role's domain matters more than your total years of experience. We weight recent and direct experience.
4. Recency
A skill you used last quarter weighs more than one from five years ago. We grade on recency, not lifetime.
5. Location fit
This role is based in the US. We weight your proximity and willingness to relocate.
Skills in this role
Pulled from the job description. These are the keywords we'll weight when scoring your fit.
python, c++, k8s, pytorch, openai, teams
More at NVIDIA
- Senior Software Engineer, Subnet Management · US, CA, Santa Clara
- Senior Silicon Validation and Methodology Engineer · China, Shanghai
- Senior Applied Machine Learning Engineer - VLSI Design · US, CA, Santa Clara
- Senior Firmware Engineer · US, CA, Santa Clara
- Solutions Architect - Top AI Labs · China, Beijing
- Software Engineer, DGX Cloud AI Infrastructure · US, CA, Santa Clara
