Join a cutting-edge and well-funded hardware startup in Silicon Valley as an Deep Learning and Large Language Model Performance Architect. Our mission is to reimagine silicon and create Risc-V based Accelerated computing platforms that will transform the industry. You will have the opportunity to work with some of the most talented and passionate engineers in the world to create designs that push the envelope on performance, energy efficiency and scalability. We offer a fun, creative and flexible work environment, with a shared vision to build products to change the world. Job Responsibility* Workload Analysis - Analyzing the performance of important workloads, tuning our current software, and proposing improvements for future software.* Performance modeling and analysis - develop analytical model for target systems and analyze the performance bottleneck. make recommendations to the implementation teams. Working with cross-collaborative teams of deep learning software engineers and hardware architects to develop innovative solutions. Adapting to the constantly evolving AI industry by being agile and excited to contribute across the codebase.* Pre-silicon and post-silicon performance validationQualification* MS or PhD in relevant discipline (CS, EE, Math) or equivalent experience with 5+ years working experiences* In-depth knowledge of deep learning models or large language models* Strong background in computer architecture or AI software stack/compilers* Strong C/C++ programming and hardware modeling skills* Strong problem solving and analytical thinking skills* Performance modeling and analysis background a plus* GPU programming experience (CUDA) a plus* LLVM/MLIR development experience a plus* Good communication and organizational skills
Location
(US&Taiwan) Santa Clara CA , Austin TX, PORtland OR OR FORt Collins CO, HsinChu Taiwan