Generative AI/LLM Engineer

Company : Soothsayer Analytics

Working Hours : Full-Time

No. of Positions : 4

Locations : Hyderabad

apply now

apply now
About the Role:

Position Overview:

We are seeking a talented Generative AI/LLM Engineer with a strong background in building and deploying AI models, focusing on leveraging state-of-the-art technologies like Azure OpenAI GPT-4, GPT-4 Vision, or GPT-4 Omni. Experience with Retrieval-Augmented Generation (RAG) and working with Vector Databases is essential. While fine-tuning large language models (LLMs) is a plus, it is not mandatory. A general understanding of how deep learning models are trained or fine-tuned is required. The ideal candidate should be able to quickly learn and implement advanced techniques, even if they do not initially possess all the required experience.

Key Responsibilities:

  • Design, develop, and deploy generative AI models using GPT-4 variants, including GPT-4 Vision and GPT-4 Turbo, tailored to address specific business needs.

  • Implement and optimize Retrieval-Augmented Generation (RAG) techniques for enhanced data-driven solutions.

  • Build and manage AI services using Python frameworks such as LangChain or LlamaIndex, and develop APIs with FastAPI or Quart for efficient integration.

  • Focus on scalability, performance, and optimization of AI solutions across cloud environments, particularly with Azure and AWS.

  • Work with Vector Databases (mandatory) and optionally Graph Databases for enhanced data management.

  • Utilize Cosmos DB and SQL for robust data storage and management solutions.

  • Apply MLOps or LLMOps practices to automate and streamline the AI model lifecycle, including CI/CD pipelines, monitoring, and maintenance.

  • Implement and manage Azure Pipelines for continuous integration and deployment.

  • Continuously research and adopt the latest advancements in AI, with a focus on quick learning and implementation of emerging technologies.

Required Skills and Qualifications:

  • Bachelor’s or Master’s degree in Computer Science, AI, Data Science, or a related field.

  • A minimum of 1+ years of experience specifically in Generative AI/LLM technologies, and 5+ years of experience in related fields.

  • Proficiency in Python and experience with frameworks like LangChain, LlamaIndex, FastAPI, or Quart.

  • Expertise in Retrieval-Augmented Generation (RAG) and experience with Vector Databases (mandatory).

  • Experience with Cosmos DB and SQL.

  • Fine-tuning LLMs and experience with Graph Databases are good to have but not mandatory.

  • Proven experience in MLOps, LLMOps, or DevOps with a strong understanding of CI/CD processes, automation, and pipeline management.

  • Familiarity with containers, Docker, or Kubernetes is a plus.

  • Familiarity with cloud platforms, particularly Azure or AWS, and experience with cloud-native AI services.

  • Strong problem-solving abilities and a proactive approach to learning new AI trends and best practices quickly.

 

Apply for this job