We are CARIAD, an automotive software development team with the Volkswagen Group. Our mission is to make the automotive experience safer, more sustainable, more comfortable, more digital, and more fun. To achieve that we are building the leading tech stack for the automotive industry and creating a unified software platform for over 10 million new vehicles per year. We’re looking for talented, digital minds like you to help us create code that moves the world. Together with you, we’ll build outstanding digital experiences and products for all Volkswagen Group brands that will transform mobility. Join us as we shape the future of the car and everyone around it.Role Summary:The Machine Learning Engineer II will design, implement, and test AI Application for CARIAD’s software development platform.This role will support the software and integration team by building AI-driven applications that automate test case generation and analyze code and testing logs for error prediction and self-diagnosis. Additionally, the role will focus on improving QA processes through task automation, identifying patterns in testing failures, and delivering proactive solutions to enhance the overall engineering workflow and software quality.Role Responsibilities:AI Application Development Research and develop AI application to improve engineering efficiency and productivityDesign and create RAG and RL application pipelinesDefine and create vector datastores for different application useDesign and develop application frontend and backend servicesPrompt EngineeringDevelop application specific prompt tuning pipelineCreate and design prompt engineering architecture for different applicationsDevelop scripts to process data and generate ground truth dataset for fine-tuningModel Fine-tuning Fine tune base models with different quantization techniques for application specific usageOptimize models for performance and sizeEvaluate and improve the efficacy of the fine-tuned model Define and create multi-model and multi-adapter inferencing workflowModel Evaluation & DeploymentDefine and validate model evaluation requirementDefine and execute stress testing for inferencing scalabilityGenerate and publish evaluation result Define and create model deployment pipelineCollaboration and Communication Work with infrastructure team for model and application deploymentCollaborate effectively with cross-functional teams including software, testing, and program management teams. Communicate project status, technical challenges, and solutions in a clear and concise manner. General Skills:Analytical and debugging skills. Effective written and oral communication skills. Experience in project and technical leadership. Ability to lead technical meetings and provide feedback. Collaborate and work with multiple teams across geographies and time zones. Required Specialized Skills:Experience with LLM models Llama 3, Gemini, ChatGPT-4o, Mistral, Claude 2, Grok-1 Experience fine-tuning LLM models with PEFT, Quantization, Tokenization for LoRA/QLoRA Strong understanding on embeddings, vectorization, custom tokensExperience generating model dataset using parquet, defining schema of the dataset Proficient in Python, AI-related training and inferencing tools such as PyTorch, vLLM, Ray, Gradio, FastAPI, Streamlit Experience on prompting techniques such as Iterative refinement, Feedback loops, Zero-shot, Few-shot, CoT Experience with different model evaluation tasks such as HellaSwag, TruthfulQA, MMLU Good understanding of natural language processing, machine learning, AI generated content development Experience using LLM tools Ollama, ComfyUIExperience developing RAG, RL applicationsExperience with developing ML pipelines and MLOps for deploymentDesired Skills:Experience with Vision AI models GAN, Stable DiffusionExperience with Image generation pipeline with segmentation, image detection, image masking and processingWorkplace Flexibility:Calls, (virtual) meetings & workshops (overlapping with German/US business hours as needed) to align with leadership, development teams and partners. Occasional international and domestic travel to provide on-site support and planning/integration workshops with our internal and external stakeholders. Years of Relevant Experience:2+ year’s experience with Machine Learning and Deep Learning Models2+ year’s experience in Python and PyTorchRequired Education:MS in Computer Science, Data Science, or Computer EngineeringDesired Education: PhD in Computer Science, Data Science, or Computer EngineeringCompensationSalary range is dependent on factors such as geographical differentials, credentials or certifications, industry-based experience, qualification and training. In the city of Mountain View, California, the salary range for this position is $101,800 - 147,700.CARIAD, Inc. provides performance based merits and annual bonus along with a competitive benefits package. Benefits include medical, dental, vision, 401k with employer match and defined contribution plan, short and long term disability, basic life and AD&D insurance, employee assistance program, tuition reimbursement and student loan repayment plans, maternity and non-primary caregiver leave, adoption assistance, employee referral program and vacation and paid holidays. We also offer a unique vehicle lease program that covers registration and insurance fees. CARIAD is an Equal Opportunity Employer. We welcome and encourage applicants from all backgrounds, and do not discriminate based on race, sex, age, disability, sexual orientation, national origin, religion, color, gender identity/expression, marital status, veteran status, or any other characteristics protected by applicable laws.