Do you want to innovate an industry leading developer cloud? The cloud development team within Software and Advanced Technology Group (SATG) is developing and shaping the way people think about computing by focusing on developers, ecosystem partners, academia etc. We are redefining the space with cutting-edge software technologies along with the industry leading CPU, GPU, AI accelerator and IPU hardware. Join us if you want to make history, provide innovative solutions to challenging engineering problems, and challenge the state-of-the-art in cloud technologies.
We are looking to hire a Cloud Solution Engineer with specialized knowledge in Intel technologies, GPU or Gaudi AI accelerators, and large language models. In this role, you will be responsible for designing and implementing cloud solutions that optimize the capabilities of Intel hardware for AI-driven applications, particularly those involving large language models (LLMs). This position requires a strategic thinker with a strong technical background who can deliver innovative and effective cloud solutions.
Design and implement cloud solutions that leverage Intel technologies and Gaudi AI accelerators or GPU, with a focus on LLM applications.
Work closely with AI researchers and developers to understand their cloud infrastructure needs for LLM development and deployment.
Keep up-to-date with advancements in LLMs, including training techniques, optimization, and deployment in cloud environments.
Integrate Intel's development tools and libraries to support the development and scaling of LLMs in the cloud.
Advocate for the Intel Developer Cloud services and the use of Gaudi AI accelerators through technical discussions and developer support.
Provide technical leadership and mentorship within project teams, promoting best practices in cloud architecture and AI.
Analyze emerging trends in cloud computing and AI, especially LLMs, to guide the development of Intel Developer Cloud services.
Provides a White Glove support service experience to customers.
Works effectively with Site Reliability Engineering to drive requirements for monitoring and alerting to meet service uptime requirements.
You must possess the below minimum qualifications to be initially considered for this position. Preferred qualifications are in addition to the minimum requirements and are considered a plus factor in identifying top candidates.
Minimum qualifications:
The candidate must have a Bachelor's degree in Electrical/Computer Engineering or Computer Science and 4+ years of experience -OR- a Master's degree in Electrical/Computer Engineering or Computer Science and 3+ years of experience -OR- a PhD in Electrical/Computer Engineering or Computer Science and 1+ year of experience in:
4+ years building cloud or cluster computing infrastructure; hands-on debugging of live production systems
1+ years background in cloud architecture OR 1+ years in AI performance engineering
Preferred Qualifications:
1+ years on customer-facing solution engineering related functions
1+ years on systematically optimizing workloads in the large cluster infrastructure
Experience with Intel hardware and software, including Gaudi AI accelerators, and familiarity with LLMs.
A track record of designing and deploying cloud solutions for AI applications, emphasizing scalability, performance, and security.
Knowledge of current LLM frameworks and tools, and their implementation in cloud platforms.
Proficiency in cloud service models, virtualization, container orchestration, and cloud-native technologies.
Skilled in scripting, automation, and Infrastructure as Code (IaC) tools.
Excellent communication skills and the ability to work collaboratively within a team environment.
Relevant certifications in cloud computing, AI, or machine learning are desirable.
Benefits:
We offer a total compensation package that ranks among the best in the industry. It consists of competitive pay, stock, bonuses, as well as, benefit programs which include health, retirement, and vacation. Find more information about all of our Amazing Benefits here: https://www.intel.com/content/www/us/en/jobs/benefits.html
Annual Salary Range for jobs which could be performed in
US, California:$162,082.00-$243,222.00Salary range dependent on a number of factors including location and experience.
Work Model for this Role
This role will be eligible for our hybrid work model which allows employees to split their time between working on-site at their assigned Intel site and off-site. In certain circumstances the work model may change to accommodate business needs.