We are now looking for a Senior Machine Learning Engineer for Quantized Training.
NVIDIA is seeking machine learning engineers to support next-generation recipes for mixed-precision training. In this role you will (1) distill LLM research literature into its core, (2) translate literature into experiments at scale, (3) create insights to support or refute the efficacy of a technique, and (4) generate reproducible training recipes.
What you'll be doing:
Review state-of-the-art literature in quantized training
Build robust, reproducible, and portable training recipes
Provide engineering support to customers using HW and SW approaches
Collaborate closely with hardware, software, and research teams to assess and adopt deep learning algorithmic advancements in quantization
Work with production SW teams to realize recipes in production workflows
What we need to see:
Experience with PyTorch or similar frameworks such as jax/xla/etc
Proficient in the math of machine learning
Familiarity with FP8 for training
Published research or significant contributions to the field of AI, particularly in algorithm development for hardware-software co-design
PhD, M.S. degree or equivalent experience in Computer Science or a related field
5+ YoE working in ML / AI
Strong written and oral communication skills
Strong programming skills and ability to debug ML systems
Ways to stand out from the crowd:
Experience in LLM training, fine-tuning and optimization (quantization, sparsity)
Familiarity with MX formats for training
Experience with Transformer Engine, Megatron-LM, or NeMo
GPU computing is the most productive and pervasive platform for deep learning and AI. It begins with the most advanced GPUs and the systems and software we build on top of them. We integrate and optimize every deep learning framework. We work with the major systems companies and every major cloud service provider to make GPUs available in data centers and in the cloud. We craft computers and software to bring AI to edge devices, such as self-driving cars and autonomous robots. AI has the potential to spur a wave of social progress unmatched since the industrial revolution.
NVIDIA is widely considered to be one of the technology world’s most desirable employers. This opportunity offers you the ability to collaborate with some of the most forward-thinking and hard-working people in the world, shaping the future of AI in a creative and autonomous work environment that encourages innovation. Do you love the challenge of influencing the long-term opportunities that expand NVIDIA’s impact on the datacenter and beyond? If so, we want to hear from you!
The base salary range is 180,000 USD - 339,250 USD. Your base salary will be determined based on your location, experience, and the pay of employees in similar positions.You will also be eligible for equity and benefits. NVIDIA accepts applications on an ongoing basis.
NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.Yearly based
US, WA, Seattle