NVIDIA is searching for several outstanding large language model and foundation model research interns to join our Research team. We are passionate about research that pushes boundaries but also has impact in the real world. You will be part of an amazing collaborative research team that consistently publishes at the top venues in machine learning and systems. Our existing expertise includes large language model, generative models, and so forth. Your contributions have the chance to create real impact on our products.

What you'll be doing:

  • Research, design and implement novel large language model and foundation models

  • Perform model optimization, compression and acceleration

  • Publish original research

  • Collaborate with other team members and teams

  • Speak at conferences and events

  • Transfer technology to product groups

  • Collaborate with external researchers

What we need to see:

  • Currently pursuing a Ph.D. in Computer Science/Engineering, Electrical Engineering

  • Strong background of theory and practice of LLM and foundation model, as well as deep learning

  • Excellent knowledge of theory and practice of model compression and acceleration techniques

  • Excellent programming skills in some rapid prototyping environment such as Python; C++ and parallel programming (e.g., CUDA) is a plus

  • Knowledge of common machine learning frameworks, such as PyTorch

  • Outstanding research track record, very good publication record

  • Excellent communication skills

NVIDIA is widely considered to be one of the technology world's most desirable employers. We have some of the most forward-thinking and productive people in the world working for us. If you're creative and autonomous, we want to hear from you!

Location

China, Shenzhen

Job Overview
Job Posted:
3 weeks ago
Job Expires:
Job Type
Full Time Intern

Share This Job: