Position Overview
This role involves building and deploying advanced AI solutions, particularly focusing
on Generative AI and Large Language Models (LLMs). You will contribute to the research,
development, and implementation of innovative technologies for business applications.
Key Areas of Expertise
? Experience with both open-source and closed-source LLMs is essential.
? Skilled in orchestration tools like LangChain and LlamaIndex and have hands-on
experience with serving and inferencing frameworks such as Text Generation
Inference (TGI) and vLLM.
? Experience in developing, evaluating, deploying, and monitoring LLM
applications, including Retrieval-Augmented Generation (RAG), Agents, or LLM
Fine-tuning, is crucial. Familiarity with vector databases and semantic search
technique.
? A strong background in MLOps/LLMOps practices on major cloud platforms like
Azure with specific expertise in Azure OpenAI and Azure DevOps, is expected.
Key Responsibilities
? Conduct research in Generative AI, leveraging public and private datasets to
solve business problems. Act as a domain expert in large language models and
NLP.
? Collaborate with domain experts and AI modelers to develop models for
investment-related applications. Leverage prompt engineering, agents, and finetuning
techniques for LLM development.
? Validate models through benchmarking analysis and user engagement, and
document models for internal validation.
? Build and maintain MLOps/LLMOps pipelines for continuous monitoring and
optimization of LLM applications. Explore synthetic data generation techniques
and other advanced methodologies.
? Stay updated on the latest advancements in Generative AI and propose new
innovative solutions.
Generative AI Development Responsibilities
? Responsible to design and implement LLM models, tune hyperparameters, and
evaluate model performance using held-out datasets. You will deploy LLM
models to production, including packaging, creating REST APIs, and deploying
on cloud platforms.
? Responsible to create development and deployment pipelines to transform
Generative AI proof-of-concepts into production-grade systems.