Job Details:
Job Description:
As part of Intel's DCAI Software AI Solutions Team, we focus on building an end-to-end AI ecosystem across Intel platforms to deliver cutting-edge AI solutions and software optimizations. In this role, you will lead full-stack RnD for AI model optimization, deployment, and acceleration, collaborating closely with architects, hardware teams, and global clients to enable advanced AI technologies for cloud service providers (CSPs), enterprise and government clients, OEM/ODM partners, and ISVs worldwide. This internship aims to cultivate future professionals who deeply understand internet AI workload characteristics while mastering Intel's AI acceleration technologies, ensuring seamless support for deploying Intel's new platforms across diverse customer environments.Core
ResponsibilitiesOptimize model training and inference performance on mainstream AI frameworks (TensorFlow/PyTorch) using Intel's heterogeneous hardware, including Xeon processors, GPUs, and NPUs.Conduct AI workload characterization and design targeted optimization strategies (e.g., quantization, pruning, dynamic resource scheduling) to enhance compute efficiency and energy
performance.Collaborate cross-functionally to validate the feasibility of emerging AI technologies (e.g., large language model distributed training, multimodal inference) on Intel platforms.
Qualifications:
Optimize model training and inference performance on mainstream AI frameworks (TensorFlow/PyTorch) using Intel's heterogeneous hardware, including Xeon processors, GPUs, and NPUs.
Conduct AI workload characterization and design targeted optimization strategies (e.g., quantization, pruning, dynamic resource scheduling) to enhance compute efficiency and energy performance.
Collaborate cross-functionally to validate the feasibility of emerging AI technologies (e.g., large language model distributed training, multimodal inference) on Intel platforms.
Technical Focus Areas
Model Optimization: Quantization (INT8/BF16/FP6), model distillation, sparsification
Hardware Acceleration: AVX-512, VNNI, AMX instruction sets; GPU parallel computing
Deployment Frameworks: TensorRT, ONNX Runtime, OpenVINO
Job Type:
Student / Intern
Shift:
Shift 1 (China)
Primary Location:
PRC, Beijing
Additional Locations:
Business group:
The Data Center & Artificial Intelligence Group (DCAI) is at the heart of Intel’s transformation from a PC company to a company that runs the cloud and billions of smart, connected computing devices. The data center is the underpinning for every data-driven service, from artificial intelligence to 5G to high-performance computing, and DCG delivers the products and
technologies—spanning software, processors, storage, I/O, and networking solutions—that fuel cloud, communications, enterprise, and government data centers around the world.
Posting Statement:
All qualified applicants will receive consideration for employment without regard to race, color, religion, religious creed, sex, national origin, ancestry, age, physical or mental disability, medical condition, genetic information, military and veteran status, marital status, pregnancy, gender, gender expression, gender identity, sexual orientation, or any other characteristic protected by local law, regulation, or ordinance.
Position of Trust
N/A
Work Model for this Role
This role will require an on-site presence. * Job posting details (such as work model, location or time type) are subject to change.