Boson AI is a startup building large language tools for audio understanding, generation, interaction and entertainment. Our founders, Alex Smola, Mu Li, and a team of Deep Learning, Optimization, NLP, AutoML and Statistics scientists and engineers are working on high quality generative AI models for LLM, large Audio models and beyond. We are seeking research scientists and engineers to join our team full-time in our Santa Clara office. As part of your role, you will work on modeling and training LLMs, understanding and interpreting model behavior and aligning models to human values. The ideal candidate will possess a strong background in machine learning, and have motivations for developing state-of-the-art models towards AGI.
Responsibilities
Design and verify novel model architectures and training objectives.
Investigate novel model alignment algorithms.
Write efficient and clean code for ML training.
Conduct large-scale experiments to verify the modeling choices and identify improvement areas.
Experience
Summarize results and clearly communicate the motivations and observations in your work
Proficiency in at least one deep learning framework, such as PyTorch.
Participation in at least one research project related to LLM or multimodal models, e.g. experience in training or fine-tuning them.
Experience in alignment research
Experience in large-scale distributed model training
Experience in writing GPU kernels in CUDA
Qualifications
PhD or Master's degree with solid scientific contributions