Generative AI/LLM Engineer

Company: Soothsayer Analytics

Working Hours: Full-Time

No. of Positions: 4

Locations: Hyderabad

About the Role:

Position Overview:

We are seeking a talented Generative AI/LLM Engineer with a strong background in building and deploying AI models using state-of-the-art technologies such as Azure OpenAI GPT-4, GPT-4 Vision, or GPT-4 Omni. Experience with Retrieval-Augmented Generation (RAG) and Vector Databases is essential. Fine-tuning large language models (LLMs) is a plus but not mandatory; a general understanding of how deep learning models are trained or fine-tuned is required. The ideal candidate can quickly learn and implement advanced techniques, even without initially having all of the listed experience.

Key Responsibilities:

Design, develop, and deploy generative AI models using GPT-4 variants, including GPT-4 Vision and GPT-4 Turbo, tailored to address specific business needs.
Implement and optimize Retrieval-Augmented Generation (RAG) techniques for enhanced data-driven solutions.
Build and manage AI services using Python frameworks such as LangChain or LlamaIndex, and develop APIs with FastAPI or Quart for efficient integration.
Focus on scalability, performance, and optimization of AI solutions across cloud environments, particularly with Azure and AWS.
Work with Vector Databases (mandatory) and optionally Graph Databases for enhanced data management.
Utilize Cosmos DB and SQL for robust data storage and management solutions.
Apply MLOps or LLMOps practices to automate and streamline the AI model lifecycle, including CI/CD pipelines, monitoring, and maintenance.
Implement and manage Azure Pipelines for continuous integration and deployment.
Continuously research and adopt the latest advancements in AI, with a focus on quick learning and implementation of emerging technologies.

Required Skills and Qualifications:

Bachelor’s or Master’s degree in Computer Science, AI, Data Science, or a related field.
At least 1 year of experience specifically in Generative AI/LLM technologies, and at least 5 years of experience in related fields.
Proficiency in Python and experience with frameworks like LangChain, LlamaIndex, FastAPI, or Quart.
Expertise in Retrieval-Augmented Generation (RAG) and experience with Vector Databases (mandatory).
Experience with Cosmos DB and SQL.
Fine-tuning LLMs and experience with Graph Databases are good to have but not mandatory.
Proven experience in MLOps, LLMOps, or DevOps with a strong understanding of CI/CD processes, automation, and pipeline management.
Familiarity with containers, Docker, or Kubernetes is a plus.
Familiarity with cloud platforms, particularly Azure or AWS, and experience with cloud-native AI services.
Strong problem-solving abilities and a proactive approach to learning new AI trends and best practices quickly.
