We are now looking for a Senior Machine Learning Engineer for Quantized Training.

NVIDIA is seeking machine learning engineers to support next-generation recipes for mixed-precision training. In this role you will (1) distill LLM research literature into its core, (2) translate literature into experiments at scale, (3) create insights to support or refute the efficacy of a technique, and (4) generate reproducible training recipes.

What you'll be doing:

  • Review state-of-the-art literature in quantized training

  • Build robust, reproducible, and portable training recipes

  • Provide engineering support to customers using HW and SW approaches

  • Collaborate closely with hardware, software, and research teams to assess and adopt deep learning algorithmic advancements in quantization

  • Work with production SW teams to realize recipes in production workflows

What we need to see:

  • Experience with PyTorch or similar frameworks such as jax/xla/etc

  • Proficient in the math of machine learning

  • Familiarity with FP8 for training

  • Published research or significant contributions to the field of AI, particularly in algorithm development for hardware-software co-design

  • PhD, M.S. degree or equivalent experience in Computer Science or a related field

  • 5+ YoE working in ML / AI

  • Strong written and oral communication skills

  • Strong programming skills and ability to debug ML systems

Ways to stand out from the crowd:

  • Experience in LLM training, fine-tuning and optimization (quantization, sparsity)

  • Familiarity with MX formats for training

  • Experience with Transformer Engine, Megatron-LM, or NeMo

GPU computing is the most productive and pervasive platform for deep learning and AI. It begins with the most advanced GPUs and the systems and software we build on top of them. We integrate and optimize every deep learning framework. We work with the major systems companies and every major cloud service provider to make GPUs available in data centers and in the cloud. We craft computers and software to bring AI to edge devices, such as self-driving cars and autonomous robots. AI has the potential to spur a wave of social progress unmatched since the industrial revolution.

NVIDIA is widely considered to be one of the technology world’s most desirable employers. This opportunity offers you the ability to collaborate with some of the most forward-thinking and hard-working people in the world, shaping the future of AI in a creative and autonomous work environment that encourages innovation. Do you love the challenge of influencing the long-term opportunities that expand NVIDIA’s impact on the datacenter and beyond? If so, we want to hear from you!

The base salary range is 180,000 USD - 339,250 USD. Your base salary will be determined based on your location, experience, and the pay of employees in similar positions.

You will also be eligible for equity and benefits. NVIDIA accepts applications on an ongoing basis.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

Salary

$180,000 - $339,250

Yearly based

Location

US, WA, Seattle

Job Overview
Job Posted:
3 months ago
Job Expires:
Job Type
Full Time

Share This Job: