We are seeking a skilled and experienced Platform Engineer/Architect to lead the setup, advancement and maintenance of a robust on-premise environment for hosting open-source large language models. This role involves designing and implementing scalable, secure, and efficient infrastructure solutions that cater to the specific needs of large-scale AI models.
- Infrastructure Design and Development:
  - Design and architect a scalable and secure on-premise hosting environment for large language models.
  - Develop and implement infrastructure automation tools for efficient management and deployment.
  - Ensure high availability and disaster recovery capabilities.
- Performance Optimization:
  - Optimize the hosting environment for maximum performance and efficiency.
  - Implement monitoring tools to track system performance and resource utilization.
  - Regularly update the infrastructure to incorporate the latest technological advancements.
- Security and Compliance:
  - Establish robust security protocols to protect sensitive data and model integrity.
  - Ensure compliance with data protection regulations and industry standards.
  - Conduct regular security audits and vulnerability assessments.
- Collaboration and Support:
  - Work closely with AI/ML teams to understand their requirements and provide suitable infrastructure solutions.
  - Provide technical guidance and support to internal teams and stakeholders.
  - Stay abreast of emerging trends in AI infrastructure and large language model hosting.
- Resource Management:
  - Manage physical and virtual resources to ensure optimal allocation and utilization.
  - Forecast resource needs and plan for future expansion and upgrades.
- Bachelor's or Master's degree in Computer Science, Information Technology, or a related field with 7-12 years of experience.
- Proven experience in infrastructure architecture, with exposure to AI/ML environments.
- Experience with inference frameworks such as TGI, TEI, LoRAX, and S-LoRA.
- Experience with training frameworks such as PyTorch and TensorFlow.
- Proven experience running open-source models such as Llama 3 and Mistral on-premises.
- Strong knowledge of networking, storage, and computing technologies.
- Experience working with container orchestration tools (e.g., Kubernetes - Red Hat OS).
- Proficiency in Python programming.
- Familiarity with open-source large language models and their hosting requirements.
- Excellent problem-solving and analytical skills.
- Strong communication and collaboration abilities.
Come create the technology that helps the world act together
Nokia is committed to innovation and technology leadership across mobile, fixed and cloud networks. Your career here will have a positive impact on people’s lives and will help us build the capabilities needed for a more productive, sustainable, and inclusive world.
We challenge ourselves to create an inclusive way of working where we are open to new ideas, empowered to take risks and fearless to bring our authentic selves to work.
What we offer
Nokia offers continuous learning opportunities, well-being programs to support you mentally and physically, opportunities to join and get supported by employee resource groups, mentoring programs and highly diverse teams with an inclusive culture where people thrive and are empowered.
Nokia is committed to inclusion and is an equal opportunity employer.
Nokia has received the following recognitions for its commitment to inclusion & equality:
- One of the World’s Most Ethical Companies by Ethisphere
- Gender-Equality Index by Bloomberg
- Workplace Pride Global Benchmark
At Nokia, we act inclusively and respect the uniqueness of people. Nokia’s employment decisions are made regardless of race, color, national or ethnic origin, religion, gender, sexual orientation, gender identity or expression, age, marital status, disability, protected veteran status or other characteristics protected by law.
We are committed to a culture of inclusion built upon our core value of respect.
Join us and be part of a company where you will feel included and empowered to succeed.