Streamline the deployment of Machine Learning models

As a Senior Developer on the Machine Learning team, you’ll play a key role in supporting other ML teams with the deployment and integration of ML models, including Large Language Models (LLMs), into existing infrastructure. Our team has built a mission-critical platform that trains thousands of models and serves over 100M model queries daily.

This is your chance to accelerate AI innovation at Coveo by enhancing our ML platform’s capabilities to safely deploy, serve, and test models at scale, and by supporting the shift toward agentic AI.

Here’s what you’ll be responsible for:

  • Contribute to every stage of the development lifecycle, from design and coding to automated testing and deployment.
  • Design and implement scalable solutions to enhance operational efficiency and streamline automated deployments.
  • Investigate and improve the performance, scalability, and efficiency of our platform infrastructure.
  • Ensure high availability and reliability of services handling millions of requests per day.
  • Contribute to the architecture and evolution of our platform by bringing forward innovative ideas.
  • Collaborate with applied scientists, data engineers, and software developers to integrate models seamlessly into the existing infrastructure.
  • Support the shift toward agentic AI by developing new platform capabilities that let models be used in new ways.

Here is what will qualify you for the role: 

  • 8+ years of experience in backend development in a cloud environment (Java/Spring preferred, AWS an asset).
  • Strong understanding of building scalable and resilient distributed systems, with experience producing reusable code within complex infrastructures for large-scale applications.
  • A problem-solving mindset, with the resourcefulness to analyze, optimize, and debug large-scale systems while continuously embracing a growth-oriented approach.

Here is what would make you stand out:

  • Familiarity with Terraform & Kubernetes for infrastructure automation and container orchestration.
  • Experience with open-source ML serving frameworks.
  • Understanding of generative AI search technologies and how they can enhance search capabilities and user experiences.

Do you think you can bring this role to life? 

Send us your application. We want to get to know you! Join the Coveolife!

We encourage all qualified candidates to apply, regardless of, for example, age, gender, disability, gaps in CV, or national or ethnic background. We know that applying for a new role is a lot of work, and we really appreciate your time.


Location

Montreal (Province of Quebec, Canada)

Job Overview

Job Posted: 1 day ago
Job Type: Full Time
