Responsibilities:

  • Deploy, optimize and accelerate machine learning models on various computing platforms from e.g. Nvidia, TI, Ambarella etc
  • Implement compiler support to maximize inference performance

Required Skills:

  • Proficient in C/C++
  • 3+ years of industry experience
  • Familiar with AI inference computing optimization, quantization, model pruning techniques
  • Familiar with one or more of GPU/TPU/CPU/ML accelerators
  • Familiar with one or more of Machine Learning Compiler e.g. Tensorrt, TVM, XLA, Glow, or MLIR

Preferred Skills:

  • BS/MS/PhD in Computer Science, Computer Engineering, Electrical Engineering or related fields
  • ML applications and ML optimization experience
  • Familiar with one or more of the machine learning frameworks e.g. Pytorch/Tensorflow/AITemplate/JAX
  • ML experience with large transformer models
  • High performance computing experience
  • Understand the principles of operations for sensors such as camera, radar and lidar

Salary Range:

  • $150,000 - $190,000 a year
Our compensations (cash and equity) are determined based on the position, your location, qualifications, and experience.

Salary

$150,000 - $190,000

Yearly based

Location

Santa Clara, CA

Job Overview
Job Posted:
1 week ago
Job Expires:
Job Type
Full Time

Share This Job: