Huawei’s TTE RAMS Lab is a corporate competence center responsible for researching high-reliability and high-safety architectures and technologies for complex intelligent systems. Our goal is to provide Huawei products with cutting-edge research and advanced technical solutions for intelligent reliability and safety in carrier-grade ICT and safety-critical systems such as autonomous driving, so that our products offer customers the best user experience and performance.
We are seeking a highly motivated and talented student intern to join our research team focused on large-scale, reliable AI infrastructure. This position emphasizes ensuring the robustness and reliability of training and inference for large language models (LLMs). The ideal candidate will engage in both theoretical and practical research aimed at overcoming the challenges of scaling AI systems while maintaining reliability, resilience, and efficiency across various AI workflows.
As part of the team, you will have the opportunity to work at the forefront of AI infrastructure, addressing critical issues like fault tolerance, data and model consistency, distributed AI system infrastructure, and the optimization of machine learning pipelines.
Huawei is a leading global information and communications technology (ICT) solutions provider. Driven by a commitment to operations, ongoing innovation, and open collaboration, we have established a competitive ICT portfolio of end-to-end solutions in telecom and enterprise networks, devices, and cloud technology and services. Our ICT solutions, products, and services are used in more than 170 countries and regions, serving over one-third of the world's population. With 197,000 employees, Huawei is committed to developing the future information society and building a Better Connected World.
Please send your application and CV (incl. cover letter and reference letters) in English.