Chai is one of the fastest-growing, generative AI startups in Silicon Valley. YouTube but for LLM's - we have over 1 million active users.
Who we are looking for:
We need a relentless engineer with 3+ years of experience overseeing and being responsible for optimizing our LLMs. Ensuring they are performant, scaleable, and cost-efficient. You will work alongside equally talented and driven teammates implementing cutting-edge AI inference engines. We need someone who is reliable and has high standards.
Here's why we might not be the right fit for you: • We work hard and have a high-velocity environment with lots of growth opportunities. • We value exceptional performance and continuous improvement. We believe that if you aren't constantly learning, you aren't growing. • You will be responsible and accountable for making high-impact decisions that determine Chai's future
Here are the top 2 reasons why you should join us: • Exponential growth. 1 Million MAU. Join the team that gets us to 100 million MAU • Craftsmanship. Build something beautiful
Requirements: • Familiar with vLLM, quantization, and current techniques of LLM optimization • 3+ years of experience in software engineering • Bachelor or Master degree from a leading academic institution
Here is our tech stack: • Front end: Python, Flutter, Dart • Back end: Python, GCP, Redis, Kubernetes
Process: Exceptionally fast, application to offer within 7 days 1. Apply here 2. First round video interview, system design interview, then onsite 3. Reference checks, negotiation, and offer
Pay range$250,000—$350,000 USD