Colombia
Join our team as a Senior DevOps Engineer, where you will manage and optimize cloud infrastructure for advanced Generative AI applications using GCP, GKE, and Python.
Your expertise will play a vital role in ensuring the performance and scalability of our AI platform. If you're passionate about AI and cloud technologies, we encourage you to apply.
Responsibilities
- Design, deploy, and manage scalable, secure cloud infrastructure on GCP
- Integrate and support Python-based AI tools and frameworks
- Build and maintain automated CI/CD pipelines for efficient operations
- Implement monitoring, logging, and alerting solutions for AI services
- Ensure adherence to security best practices and governance standards
Requirements
- 3+ years of experience in DevOps or cloud infrastructure
- Hands-on expertise with Google Kubernetes Engine (GKE) and VertexAI
- Proficiency in Python and experience with AI tools like LiteLLM and Dify.AI
- Working knowledge of AI governance and security best practices
- Strong knowledge of cloud platforms, particularly GCP
- Fluent English communication skills at a B2+ level
Nice to have
- Familiarity with containerization technologies such as Docker
- Experience with orchestration tools like Kubernetes
- Knowledge of monitoring systems like Prometheus or Grafana
- Understanding of BigQuery and other GCP services
- Experience with GenAI or Agentic AI frameworks
Benefits
- International projects with top brands
- Work with global teams of highly skilled, diverse peers
- Healthcare benefits
- Employee financial programs
- Paid time off and sick leave
- Upskilling, reskilling and certification courses
- Unlimited access to the LinkedIn Learning library and 22,000+ courses
- Global career opportunities
- Volunteer and community involvement opportunities
- EPAM Employee Groups
- Award-winning culture recognized by Glassdoor, Newsweek and LinkedIn