ML Runtime Optimization Engineer
mid$159K – $199K
via Ashby
About this role
ABOUT THE ROLE
We are looking for a software engineer with deep experience in optimizing ML models and deploying them on production-grade embedded runtime environments. You’ll work across the entire ML framework stack (e.g. PyTorch, JAX, ONNX, TensorRT, CUDA, XLA, Triton).
AT APPLIED INTUITION, YOU WILL:
- Drive ML performance optimization on multiple technologies for on-road and off-road ADAS / AD stacks targeting deployment on a variety of embedded compute platforms
- Develop compute usage strategies to optimize efficiency and latency of model inference for compute boards selected by our customers
- Work on model pruning and quantization, and support deployment on memory constrained platforms…
What we'd score you on
reqspace match rubricFive dimensions, recruiter-grade. Upload your resume and we'll generate a written explanation of where you fit and where the gaps are.
1
Skills match
For this role: pytorch, jax
2
Level fit
This role is mid-level. We check your trajectory against it.
3
Domain experience
Your work in the role's domain matters more than your years total. We weight recent and direct experience.
4
Recency
A skill you used last quarter weighs more than one from five years ago. We grade on recency, not lifetime.
5
Location fit
This role is based in a specific location. We weight your proximity and willingness to relocate.
Score yourself on this role.
Free · no card · written explanation included
Skills in this role
Pulled from the job description. These are the keywords we'll weight when scoring your fit.
pytorchjax
