Senior AI Engineer

EPAM·Kazakhstan, Uzbekistan, Armenia, Georgia·Удалённо, Офис·3д. назад

We are seeking a Senior AI Engineer to design, deploy and optimize cutting-edge AI infrastructure powering large-scale GenAI applications. In this role, you will work with vector databases, LLM frameworks and cloud-native technologies to build robust, production-grade systems that drive intelligent solutions across the organization.

Responsibilities

Deploy and manage Milvus vector databases including schema design and index tuning with HNSW and IVF-FLAT
Build embedding and LLM framework pipelines leveraging OpenAI API, Hugging Face or Cohere
Manage Kubernetes clusters, Helm charts and containerized microservices for scalable orchestration
Implement Docker containerization with multi-stage builds and registry management
Develop production-level applications in Python along with Go, Java or C++
Integrate object storage systems including AWS S3, MinIO or Google Cloud Storage
Support large-scale RAG applications and multi-agent platforms
Optimize compute and inference through GPU scheduling, resource optimization and inference acceleration
Drive search optimization with hybrid search, metadata filtering and index tuning
Collaborate effectively with the team to deliver high-quality solutions

Requirements

B.Tech/B.E in Engineering with 5+ years of relevant experience
Expertise in Milvus deployment, schema design and index tuning (HNSW, IVF-FLAT)
Familiarity with Qdrant, Pinecone, Weaviate, PGVector or Chroma
Proficiency in OpenAI API, Hugging Face or Cohere for embeddings and LLMs
Skills in Kubernetes cluster management, Helm charts and containerized microservices
Competency in Docker containerization, multi-stage builds and registry management
Production-level proficiency in Python along with Go, Java or C++
Knowledge of object storage integration including AWS S3, MinIO or Google Cloud Storage
Excellent verbal and written communication skills

Nice to have

Background in supporting large-scale RAG applications and multi-agent platforms
Familiarity with LangChain, LlamaIndex or custom LLM orchestration pipelines
Understanding of AI observability through LLM evaluation, governance, tracing and monitoring tools
Knowledge of CI/CD pipelines, Infrastructure-as-Code and cloud-native deployment practices
Prior work experience in the Oil and Gas industry along with Dataiku DSS and SRE practices