We are seeking a Senior AI Engineer to spearhead the design and development of cutting-edge AI solutions, leveraging the latest advancements in generative AI and LLM technologies to deliver seamless and scalable applications for our clients.
Responsibilities
- Design, implement, and maintain end-to-end AI applications, including chatbots, Q&A platforms and agent workflows
- Collaborate directly with clients to understand their needs, identify opportunities and recommend LLM-driven solutions
- Develop and manage robust data pipelines, prompt strategies, and datasets to ensure effective and accurate AI models (Will be a plus)
- Evaluate and refine AI system performance, ensuring outputs are accurate, secure, scalable and compliant with industry regulations (Will be a plus)
- Conduct research and rapid prototyping to validate technical feasibility and demonstrate the business value of AI solutions
- Stay current with evolving LLM technologies, frameworks, and methodologies to continuously improve solutions and client outcomes
Requirements
- Strong proficiency in Python, experience with web frameworks like FastAPI or similar
- Understanding of the AI application development lifecycle
- Experience with rapid UI prototyping using Streamlit, Gradio or similar frameworks
- Familiarity with major LLM platforms and APIs (OpenAI, Anthropic, Amazon Bedrock, Gemini) and related frameworks (LangGraph, LlamaIndex, Strands Agents, etc.)
- Knowledge of advanced AI integration patterns (e.g., RAG, Agents)
- Experience deploying AI solutions at scale, with considerations for performance, cost-efficiency and maintainability
- Proven ability to evaluate generative AI quality using metrics such as retrieval and classification scores, as well as LLM-based evaluation methods
- Proven experience in AI engineering and delivering ML-based solutions
- Strong problem-solving skills and attention to detail
- Excellent communication, collaboration, and interpersonal skills
Nice to have
- Experience designing experiments, conducting A/B tests, and iterating on models based on user feedback
- Understanding of retrieval systems (keyword search, vector search, embeddings) and ranking algorithms
- Familiarity with emerging protocols such as MCP, A2A, ACP, etc.
- Experience deploying to cloud AI platforms (Azure OpenAI, Amazon Bedrock, GCP Vertex AI) or on-premise solutions (e.g., vLLM)
- Experience with enterprise AI platforms such as AWS AgentCore, Databricks AgentBricks or Google Agents Space, or Azure AI Foundry
- Experience with observability and monitoring tools and frameworks