Python & API Development:
Design, develop, and maintain scalable web services using FastAPI or Flask.
Write efficient, reusable, and modular Python code to support API-driven LLM applications.
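The modular style described above usually means keeping route handlers thin and pushing validation and business logic into plain, testable functions. A minimal sketch of that service-layer pattern, with hypothetical `CompletionRequest`/`complete` names (in a real FastAPI app the request model would be a Pydantic model validated by the framework):

```python
from dataclasses import dataclass

# Hypothetical request/response models -- in FastAPI these would be
# Pydantic models validated automatically at the route boundary.
@dataclass
class CompletionRequest:
    prompt: str
    max_tokens: int = 256

@dataclass
class CompletionResponse:
    text: str
    tokens_used: int

def validate(req: CompletionRequest) -> None:
    # Reject obviously bad input before touching the (expensive) LLM.
    if not req.prompt.strip():
        raise ValueError("prompt must be non-empty")
    if req.max_tokens <= 0:
        raise ValueError("max_tokens must be positive")

def complete(req: CompletionRequest) -> CompletionResponse:
    # Stand-in for the real LLM call; kept behind this function so the
    # route handler stays a thin, easily tested wrapper.
    validate(req)
    text = req.prompt.upper()  # placeholder "generation"
    return CompletionResponse(text=text, tokens_used=len(text.split()))

# In FastAPI this function would back a route, roughly:
#   @app.post("/complete")
#   def complete_endpoint(req: CompletionRequest) -> CompletionResponse:
#       return complete(req)
```

Separating `complete` from the route means the same logic can be unit-tested, reused from a Flask app, or called from a batch job without spinning up an HTTP server.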
LLM & LangChain Implementation:
Build and optimize LLM-based applications using LangChain and related frameworks.
Develop custom pipelines for document indexing, retrieval, and summarization.
Integrate RAG capabilities with vector stores and retrievers for real-time querying.
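The index-retrieve-summarize pipeline above can be sketched end to end without any framework. This toy version uses bag-of-words vectors and cosine similarity in place of a real embedding model and vector store; the class and method names are illustrative, not a LangChain API:

```python
import math
from collections import Counter

def embed(text: str) -> Counter:
    # Toy bag-of-words "embedding"; a real pipeline would call an
    # embedding model (e.g. via LangChain's Embeddings interface).
    return Counter(text.lower().split())

def cosine(a: Counter, b: Counter) -> float:
    dot = sum(a[t] * b[t] for t in a)
    na = math.sqrt(sum(v * v for v in a.values()))
    nb = math.sqrt(sum(v * v for v in b.values()))
    return dot / (na * nb) if na and nb else 0.0

class MiniIndex:
    """Index documents, retrieve by similarity, 'summarize' the best hit."""

    def __init__(self):
        self.docs: list[tuple[str, Counter]] = []

    def add(self, text: str) -> None:
        self.docs.append((text, embed(text)))

    def retrieve(self, query: str, k: int = 2) -> list[str]:
        q = embed(query)
        ranked = sorted(self.docs, key=lambda d: cosine(q, d[1]), reverse=True)
        return [text for text, _ in ranked[:k]]

    def summarize(self, query: str) -> str:
        # Placeholder summarization: first sentence of the top hit.
        # A real chain would pass the retrieved context to an LLM prompt.
        best = self.retrieve(query, k=1)
        return best[0].split(".")[0] if best else ""
```

Swapping `embed` for a model-backed embedding and `MiniIndex` for a vector store is exactly the substitution a production RAG pipeline makes; the control flow stays the same.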
Retrieval-Augmented Generation (RAG) Pipelines:
Architect and deploy RAG systems for chatbots, knowledge systems, and generative AI applications.
Optimize RAG pipelines for latency, retrieval accuracy, and scalability.
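One common latency optimization in a RAG system is memoizing the query-embedding call, since chatbot users often repeat or rephrase identical questions. A minimal sketch with a hypothetical `embed_query` standing in for the embedding-service call:

```python
from functools import lru_cache

CALLS = {"embed": 0}  # instrumentation to show the cache working

@lru_cache(maxsize=1024)
def embed_query(query: str) -> tuple[float, ...]:
    # Stand-in for an expensive embedding-model call; returns a tuple
    # (hashable) so lru_cache can memoize the result.
    CALLS["embed"] += 1
    return tuple(float(ord(c)) for c in query[:8])

# Repeated identical queries now hit the in-process cache instead of
# the embedding service, trading a little memory for latency.
```

The same idea extends to caching retrieved document sets keyed by the embedded query, at the cost of serving slightly stale results until the cache is invalidated.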
Vector Stores & Retrievers:
Work with vector databases (Pinecone, FAISS, Chroma, Milvus) to store and manage embeddings.
Implement retrievers and re-rankers to improve query efficiency and response relevance.
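Retrieve-then-re-rank is a two-stage pattern: a cheap, recall-oriented first pass narrows the corpus, then a more expensive, precision-oriented scorer reorders the survivors. A stdlib sketch with placeholder scoring functions (a real first stage would be the vector store's ANN search, and a real re-ranker would score query-document pairs with a cross-encoder):

```python
def first_stage(query: str, docs: list[str], k: int = 3) -> list[str]:
    # Cheap recall-oriented stage: keyword overlap stands in for the
    # approximate-nearest-neighbour search a vector store provides.
    q = set(query.lower().split())
    scored = sorted(
        docs,
        key=lambda d: len(q & set(d.lower().split())),
        reverse=True,
    )
    return scored[:k]

def rerank(query: str, candidates: list[str]) -> list[str]:
    # Precision-oriented stage: here we simply prefer shorter documents
    # among the candidates; a real re-ranker would use a cross-encoder
    # model to score each (query, candidate) pair.
    return sorted(candidates, key=len)

def retrieve(query: str, docs: list[str], k: int = 3) -> list[str]:
    return rerank(query, first_stage(query, docs, k))
```

The point of the split is cost: the first stage touches every document but does almost no work per document, while the re-ranker does heavy work but only on `k` candidates.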
AWS Cloud Deployment & Optimization:
Deploy and manage LLM-based applications on AWS (Lambda, EC2, S3, EKS, RDS).
Optimize model inference performance using quantization, distillation, and fine-tuning techniques.
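Of the optimization techniques listed, quantization is the most mechanical: map floating-point weights onto a small integer range with a shared scale factor. Production deployments use libraries (e.g. bitsandbytes, ONNX Runtime, TensorRT) rather than hand-rolled code, but the arithmetic core of symmetric int8 quantization can be sketched in a few lines:

```python
def quantize_int8(weights: list[float]) -> tuple[list[int], float]:
    # Symmetric 8-bit quantization: a single per-tensor scale maps
    # every weight into [-127, 127]. The `or 1.0` guards an all-zero
    # tensor against division by zero.
    scale = max(abs(w) for w in weights) / 127 or 1.0
    q = [round(w / scale) for w in weights]
    return q, scale

def dequantize(q: list[int], scale: float) -> list[float]:
    # Recover approximate floats; the rounding error per weight is
    # bounded by scale / 2.
    return [v * scale for v in q]
```

Storing weights as int8 plus one float scale cuts memory roughly 4x versus float32, which is why quantization is a first resort for fitting LLM inference onto smaller (and cheaper) AWS instances.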
Monitoring & Experimentation:
Implement real-time monitoring dashboards using Grafana, Prometheus, or Datadog.
Research and integrate cutting-edge generative AI advancements into production environments.
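The dashboards above are fed by metrics the application exports, most commonly in the Prometheus text exposition format (`name value`, one metric per line). A minimal in-process sketch, with hypothetical metric names, showing the counter-plus-decorator pattern a Prometheus client library provides:

```python
import time
from collections import defaultdict

class Metrics:
    """Minimal Prometheus-style counters with text exposition."""

    def __init__(self):
        self.counters: dict[str, float] = defaultdict(float)

    def inc(self, name: str, value: float = 1.0) -> None:
        self.counters[name] += value

    def render(self) -> str:
        # Prometheus text exposition format: one "name value" per line,
        # typically served on a /metrics endpoint for scraping.
        return "\n".join(f"{k} {v}" for k, v in sorted(self.counters.items()))

METRICS = Metrics()

def timed(fn):
    # Decorator recording call counts and cumulative latency -- the raw
    # series a Grafana dashboard would plot as request rate and mean
    # duration per handler.
    def wrapper(*args, **kwargs):
        start = time.perf_counter()
        try:
            return fn(*args, **kwargs)
        finally:
            METRICS.inc(f"{fn.__name__}_requests_total")
            METRICS.inc(f"{fn.__name__}_seconds_total",
                        time.perf_counter() - start)
    return wrapper
```

In practice the official `prometheus_client` library replaces `Metrics`, but the shape is the same: instrument handlers, expose counters, and let the scraper and dashboard do the rest.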