AI Engineer

at Weekday

Full Time Remote

This role is for one of Weekday's clients.

Salary range: Rs 20,00,000 - Rs 60,00,000 (i.e., INR 20-60 LPA)

Min Experience: 3 years

Job Type: Full-time

We are seeking a skilled and motivated AI Engineer with hands-on experience in Retrieval-Augmented Generation (RAG) to join our growing team. In this role, you will design, build, and optimize AI systems that leverage both large language models (LLMs) and structured/unstructured knowledge bases to deliver intelligent and contextually accurate responses. You will be at the forefront of applied AI, working on cutting-edge solutions that bridge the gap between traditional NLP pipelines and next-generation generative AI models.

Requirements

Key Responsibilities:

Design and develop RAG-based architectures integrating vector databases, document retrieval systems, and LLMs.
Implement end-to-end pipelines that perform document ingestion, chunking, embedding generation, indexing, and retrieval.
Fine-tune and optimize retrieval mechanisms using tools such as FAISS, Weaviate, Pinecone, or Elasticsearch.
Integrate foundation-model APIs (e.g., from OpenAI, Cohere, or Hugging Face) with retrieval systems to produce context-rich outputs.
Work with cross-functional teams including ML engineers, data scientists, and product teams to deliver AI-powered features.
Evaluate model performance using both qualitative and quantitative metrics and iteratively refine system behavior.
Ensure the accuracy, relevance, and safety of model-generated responses by refining prompts and improving knowledge grounding.
Stay updated with the latest research in generative AI, RAG systems, and prompt engineering.

Required Skills and Qualifications:

3 to 9 years of experience in AI/ML/NLP with a strong focus on building and deploying generative AI models.
Deep understanding of Retrieval-Augmented Generation (RAG) concepts and architecture.
Proficiency with vector databases (e.g., FAISS, Pinecone, Weaviate, Vespa) and embedding generation (e.g., SentenceTransformers, OpenAI Embeddings).
Strong programming skills in Python and experience with libraries such as LangChain, Hugging Face Transformers, PyTorch, or TensorFlow.
Familiarity with LLM APIs such as those from OpenAI (GPT-3.5/4), Anthropic, or Cohere.
Experience working with unstructured data sources (e.g., PDFs, HTML, text files) and knowledge of chunking and context window optimization.
Ability to design scalable pipelines for real-time or batch inference using cloud platforms (AWS, Azure, GCP).
Excellent problem-solving, debugging, and analytical skills.
Strong written and verbal communication skills with the ability to explain technical concepts to non-technical stakeholders.

Good to Have:

Experience with LangChain, LlamaIndex, or other retrieval frameworks.
Exposure to knowledge graphs, search ranking algorithms, and hybrid retrieval methods.
Familiarity with prompt engineering and few-shot learning strategies.
Contributions to open-source AI projects or published research in the field of generative AI.

Location

India - Remote

Job Overview

Job Posted:

2 weeks ago
