Lamini enables every enterprise to safely, quickly, and cost-effectively build its own Expert AI. Our customers own their own models, trained on their data. Lamini optimizes for Expert AI workloads with minimal hallucination, enterprise-grade security, and enterprise flexibility, running on any infrastructure. Our team is made up of highly committed engineers, researchers, and tech industry veterans excited by our mission and technology. We’re backed by leading VCs as well as computing and technology companies.
About the Role:
We are seeking a highly skilled and motivated DevOps Engineer to join our team at a Senior or Staff level. The ideal candidate will be instrumental in managing cloud infrastructure, improving internal development workflows, and ensuring seamless release and delivery of the Lamini Platform to enterprise customers. This role involves collaborating with cross-functional teams and leveraging cutting-edge technologies to enhance our platform's reliability, scalability, and performance.
Key Responsibilities:
Software Deployment and Delivery on Kubernetes:
Design and implement robust software deployment processes for delivering high-quality platforms to enterprise customers.
Work with on-prem and cloud-managed Kubernetes environments to drive product architecture design.
Internal Infrastructure Support:
Maintain and enhance internal ML infrastructure on GCP Vertex AI, AWS Bedrock, and private data center GPU servers.
Support the engineering team by improving the development environment (GitHub, Cloud, local setups).
Customer Support and Troubleshooting:
Diagnose and resolve issues related to deploying the Lamini Platform in customer on-prem environments.
Ensure the reliability and performance of the platform and contribute to its continuous improvement.
Data Center Server Management:
Collaborate with data center vendors to manage GPU servers.
Utilize Infrastructure as Code (IaC) principles to automate provisioning and configuration management.
Team Collaboration:
Partner with cross-functional teams to ensure reliability and scalability are embedded in the design of new features and services.
Document systems, processes, and findings to maintain transparency and knowledge sharing.
Desired Fit:
Continuous Improvement: Proactively identify and address issues in the Lamini Platform so that customers have a delightful experience deploying it.
Principled Approach: Advocate and implement best practices like Infrastructure as Code (IaC) to ensure system reliability and consistency.
Collaborative Mindset: Work seamlessly across teams, supporting colleagues and contributing to team success.
Ownership: Take initiative to own problems end-to-end, learning new skills as needed to deliver solutions.
Technical Savvy: Experience using AI-assisted programming tools such as Copilot and Cursor is a plus.
Qualifications:
Bachelor’s degree in Computer Science or a related field (or equivalent work experience).
Proven expertise in DevOps tools and platforms, with hands-on experience building workflows and pipelines.
Deep knowledge of Docker, Kubernetes, observability, CI/CD, and cloud platforms (AWS/GCP), along with related tools (Helm, Prometheus, Git, Terraform, etc.).
Proficiency in programming languages such as Python and Go, and in shell scripting (e.g., Bash, Awk).
Strong problem-solving skills with the ability to thrive in a fast-paced environment.
Excellent communication skills for engaging with stakeholders and documenting technical processes.
If you are passionate about enhancing infrastructure reliability, driving platform excellence, and working with cutting-edge technologies, we invite you to apply for this exciting opportunity!
At Lamini AI, we are committed to providing an environment of mutual respect where equal employment opportunities are available to all applicants without regard to race, color, religion, sex, pregnancy (including childbirth, lactation and related medical conditions), national origin, age, physical and mental disability, marital status, sexual orientation, gender identity, gender expression, genetic information (including characteristics and testing), military and veteran status, and any other characteristic protected by applicable law. Lamini AI believes that diversity and inclusion among our employees are critical to our success as a company, and we seek to recruit, develop and retain the most talented people from a diverse candidate pool. Selection for employment is decided on the basis of qualifications, merit, and business need.