We are seeking a Senior Generative AI Application Engineer who will be responsible for designing, implementing, and optimizing generative AI solutions for applications.
Responsibilities
- Design applications utilizing large language models (LLMs) and generative AI technologies
- Implement prompt engineering strategies and Retrieval-Augmented Generation (RAG) architectures
- Build frameworks to evaluate model performance, safety, and alignment
- Create interfaces between generative AI models and production systems
- Develop mechanisms for responsible AI and guardrails for deployment
- Stay updated on advancements in generative AI research and best practices
Requirements
- Background in developing applications with generative AI models
- Expertise in prompt engineering and RAG architecture
- Proficiency in Python, with added value for experience in asynchronous applications
- Knowledge of modern AI/ML frameworks (e.g., LangChain)
- Familiarity with vector databases and embedding models
- Understanding of responsible AI practices and mitigation strategies for hallucinations
- Competency in working with cloud-based AI services (e.g., OpenAI API, Azure AI)
Nice to have
- Background in developing solutions for specific industries such as life sciences, finance, or insurance
- Familiarity with LLM tracing, evaluation datasets, and experiment management
- Understanding of techniques for evaluating RAG systems or other LLM architectures
- Skills in automated prompt optimization tools (e.g., dspy)
- Expertise in creating rapid prototypes