Senior AI and Machine Learning EngineerThis role has been designated as ‘Remote/Teleworker’, which means you will primarily work from home.

Who We Are:

Hewlett Packard Enterprise is the global edge-to-cloud company advancing the way people live and work. We help companies connect, protect, analyze, and act on their data and applications wherever they live, from edge to cloud, so they can turn insights into outcomes at the speed required to thrive in today’s complex world. Our culture thrives on finding new and better ways to accelerate what’s next. We know diverse backgrounds are valued and succeed here. We have the flexibility to manage our work and personal needs. We make bold moves, together, and are a force for good. If you are looking to stretch and grow your career our culture will embrace you. Open up opportunities with HPE.

Job Description:

Job Description: 

High Performance Computing, AI and Labs is a critical element of HPE. We are focused on delivering innovative solutions that accelerate our customers’ digital transformation, enabling them to tackle their complex, and data-intensive workloads. The next era of computing combines deep learning and machine learning expertise with the development of the world’s most cutting-edge, high-performance supercomputers. Industries are rapidly changing to deliver valuable insight & innovation using ML/DL. Join our team and redefine what’s next for you.

The HPC & AI Performance Engineering team at HPE is building the industry’s highest performing HPC & AI servers and clusters for our customers. We do this by designing, benchmarking, proving, and improving ML/DL application performance on the world’s fastest supercomputers and enabling customers to make quicker and better data-driven decisions.

What you’ll do:

Responsibilities:

  • Studies and improves performance of Large Language Models running on HPE GPU servers
  • Performs system level analysis of HPC & AI workloads on various HPE platforms
  • Runs ML/DL code on accelerated hardware like NVIDIA and AMD GPUs and high-speed networks like InfiniBand
  • Develops software and scripts to automate AI workloads and analyze performance data
  • Installs and configures complex IT infrastructure components (servers, storage, network)
  • Writes white papers and other guidance documents for AI workload and model selection
  • Captures and reviews system performance data, logs, traces to understand workload behavior
  • Communicates technical work well and presents work to non-technical colleagues
  • Works with software and hardware partners in optimizing systems and resolving performance issues
  • Documents and reports issues when testing and evaluating systems
  • Communicates project status and concerns to management in a timely manner
  • Mentors less-experienced staff members

What you need to bring:

Education and Experience Required:

  • Master's degree or PhD in Computer Science, Engineering, Information Technology or Systems, or relevant field.
  • Typically 3+ years of experience.

Knowledge and Skills:

  • 3+ years of experience in Machine Learning/Artificial Intelligence
  • Proficiency in one or more AI & Machine Learning frameworks or libraries (TensorFlow, PyTorch, ONNX, DeepSpeed, Horovod, TensorRT, NeMo)
  • Experience with containers and distributed deep learning and neural networks, including transformers used in generative AI projects
  • Experience with High Performance Computer Servers, High Performance Networking, and associated software
  • Experience with Weka I/O, NTFS and Lustre File Systems
  • Programming experience in Python or C/C++ is strongly desired
  • Strong analytical and critical thinking skills
  • Must be a self-starter, able to work with minimum supervision in a semi-remote setting

Additional Skills:

Artificial Intelligence Technologies and performance benchmarking, Cross Domain Knowledge, Data Engineering, Data Science, Design Thinking, Development Fundamentals, Full Stack Development, IT Performance, Machine Learning Operations, Scalability Testing, Security-First Mindset.

#unitedstates #AIML #frameworks #libraries #TensorFlow #PyTorch, #ONNX #DeepSpeed #Horovod, #TensorRT, #NeMo #hpc #filesystems #python #C #C++ #containers #generativeai

Additional Skills:

Artificial Intelligence Technologies, Cross Domain Knowledge, Data Engineering, Data Science, Design Thinking, Development Fundamentals, Full Stack Development, IT Performance, Machine Learning Operations, Scalability Testing, Security-First Mindset

What We Can Offer You:

Health & Wellbeing

We strive to provide our team members and their loved ones with a comprehensive suite of benefits that supports their physical, financial and emotional wellbeing.

Personal & Professional Development

We also invest in your career because the better you are, the better we all are. We have specific programs catered to helping you reach any career goals you have — whether you want to become a knowledge expert in your field or apply your skills to another division.

Diversity, Inclusion & Belonging

We are unconditionally inclusive in the way we work and celebrate individual uniqueness. We know diverse backgrounds are valued and succeed here. We have the flexibility to manage our work and personal needs. We make bold moves, together, and are a force for good.

Let's Stay Connected:

Follow @HPECareers on Instagram to see the latest on people, culture and tech at HPE.

#unitedstates

#highperformancecompute

Job:

Engineering

Job Level:

TCP_04

    

States with Pay Range Requirement

The expected salary/wage range for a U.S.-based hire filling this position is provided below. Actual offer may vary from this range based upon geographic location, work experience, education/training, and/or skill level. If this is a sales role, then the listed salary range reflects combined base salary and target-level sales compensation pay. If this is a non-sales role, then the listed salary range reflects base salary only. Variable incentives may also be offered. Information about employee benefits offered can be found at https://myhperewards.com/main/new-hire-enrollment.html.

USD Annual Salary: $128,000.00 - $295,000.00Estimated job application period closure is November 2024. While this is the expected application time frame, there are many factors which may result in a change. If this position is still open beyond the anticipated closure time frame, it is likely HPE is still actively recruiting for this role and all qualified and interested candidates are encouraged to apply.

HPE is an Equal Employment Opportunity/ Veterans/Disabled/LGBT and Affirmative Action employer. We are committed to diversity and building a team that represents a variety of backgrounds, perspectives, and skills. We do not discriminate and all decisions we make are made on the basis of qualifications, merit, and business need. Our goal is to be one global diverse team that is representative of our customers, in an inclusive environment where we can continue to innovate and grow together. Please click here: Equal Employment Opportunity.

Hewlett Packard Enterprise is EEO F/M/Protected Veteran/ Individual with Disabilities.

HPE will comply with all applicable laws related to employer use of arrest and conviction records, including laws requiring employers to consider for employment qualified applicants with criminal histories. .

Salary

$128,000 - $295,000

Yearly based

Location

All, Texas, United States of America

Job Overview
Job Posted:
2 months ago
Job Expires:
Job Type
Full Time

Share This Job: