ML Runtime Optimization Engineer

mid$159K – $199K

via Ashby

See if I'm a fit →Tailor my resume for this role →Apply on Ashby ↗

About this role

ABOUT THE ROLE We are looking for a software engineer with deep experience in optimizing ML models and deploying them on production-grade embedded runtime environments. You’ll work across the entire ML framework stack (e.g. PyTorch, JAX, ONNX, TensorRT, CUDA, XLA, Triton). AT APPLIED INTUITION, YOU WILL: - Drive ML performance optimization on multiple technologies for on-road and off-road ADAS / AD stacks targeting deployment on a variety of embedded compute platforms - Develop compute usage strategies to optimize efficiency and latency of model inference for compute boards selected by our customers - Work on model pruning and quantization, and support deployment on memory constrained platforms…

Read the full description on Applied's site →

What we'd score you on

reqspace match rubric

Five dimensions, recruiter-grade. Upload your resume and we'll generate a written explanation of where you fit and where the gaps are.

Skills match

For this role: pytorch, jax

Level fit

This role is mid-level. We check your trajectory against it.

Domain experience

Your work in the role's domain matters more than your years total. We weight recent and direct experience.

Recency

A skill you used last quarter weighs more than one from five years ago. We grade on recency, not lifetime.

Location fit

This role is based in a specific location. We weight your proximity and willingness to relocate.

Score yourself on this role.

Free · no card · written explanation included

See if I'm a fit →

Skills in this role

Pulled from the job description. These are the keywords we'll weight when scoring your fit.

pytorchjax

More at Applied

See all open jobs at Applied →