Skip To Main Content
backBack to Search

Senior AI Engineer

Hybrid in Armenia, & 4 others
Infrastructure Automation and Orchestration& 12 others
Looking for something else?

Find a vacancy that works for you. Send us your CV to receive a personalized offer.

Find me a job

We are seeking a Senior AI Engineer to design, deploy and optimize cutting-edge AI infrastructure powering large-scale GenAI applications. In this role, you will work with vector databases, LLM frameworks and cloud-native technologies to build robust, production-grade systems that drive intelligent solutions across the organization.

Responsibilities
  • Deploy and manage Milvus vector databases including schema design and index tuning with HNSW and IVF-FLAT
  • Build embedding and LLM framework pipelines leveraging OpenAI API, Hugging Face or Cohere
  • Manage Kubernetes clusters, Helm charts and containerized microservices for scalable orchestration
  • Implement Docker containerization with multi-stage builds and registry management
  • Develop production-level applications in Python along with Go, Java or C++
  • Integrate object storage systems including AWS S3, MinIO or Google Cloud Storage
  • Support large-scale RAG applications and multi-agent platforms
  • Optimize compute and inference through GPU scheduling, resource optimization and inference acceleration
  • Drive search optimization with hybrid search, metadata filtering and index tuning
  • Collaborate effectively with the team to deliver high-quality solutions
Requirements
  • B.Tech/B.E in Engineering with 5+ years of relevant experience
  • Expertise in Milvus deployment, schema design and index tuning (HNSW, IVF-FLAT)
  • Familiarity with Qdrant, Pinecone, Weaviate, PGVector or Chroma
  • Proficiency in OpenAI API, Hugging Face or Cohere for embeddings and LLMs
  • Skills in Kubernetes cluster management, Helm charts and containerized microservices
  • Competency in Docker containerization, multi-stage builds and registry management
  • Production-level proficiency in Python along with Go, Java or C++
  • Knowledge of object storage integration including AWS S3, MinIO or Google Cloud Storage
  • Excellent verbal and written communication skills
Nice to have
  • Background in supporting large-scale RAG applications and multi-agent platforms
  • Familiarity with LangChain, LlamaIndex or custom LLM orchestration pipelines
  • Understanding of AI observability through LLM evaluation, governance, tracing and monitoring tools
  • Knowledge of CI/CD pipelines, Infrastructure-as-Code and cloud-native deployment practices
  • Prior work experience in the Oil and Gas industry along with Dataiku DSS and SRE practices