Skip To Main Content
backBack to Search

Lead Cloud Platform Developer with DevOps experience

Remote in Chile, Colombia
Site Reliability Engineering
& 14 others
warning.png
Sorry, this position is no longer available

We are seeking a remote Lead Cloud Platform Developer with DevOps experience to ensure optimal performance, reliability, and scalability of our cloud systems.

This role involves working on a highly scalable, fault-tolerant, and distributed cloud platform, with a focus on Site Reliability Engineering for Microservices. As a Lead, you will take ownership of the platform's reliability, performance, and scalability, leading the DevOps team towards achieving the highest levels of operational excellence. You will work closely with the development, infrastructure, and operations teams to ensure the smooth functioning of the cloud platform.

Responsibilities
  • Take ownership of the cloud platform's reliability, performance, and scalability, ensuring the highest levels of operational excellence
  • Design, develop, and maintain the cloud platform's infrastructure, tools, and frameworks, ensuring the smooth functioning of the platform
  • Lead the DevOps team, mentoring them towards achieving the platform's goals
  • Collaborate closely with the development, infrastructure, and operations teams, ensuring the smooth functioning of the cloud platform
  • Design and implement best practices for instrumentation, monitoring, alerting, and incident management for the cloud platform
  • Identify areas for improvement in the cloud platform and implement solutions to enhance its performance and reliability
  • Implement automation and CI/CD pipelines as per industry best practices
Requirements
  • Proven experience in Site Reliability Engineering for Microservices with a minimum of 5 years of experience
  • At least 1 year of experience in leadership roles with a track record of being a role model for the team
  • Solid experience in Microsoft Azure, including its various cloud services
  • Proven experience in using DevOps practices and tools such as Azure DevOps and Jenkins
  • Strong experience in Instrumentation, Monitoring, and Alerting for highly scalable cloud platforms
  • Knowledge of Python
  • Proficiency with Terraform
  • Excellent communication and leadership skills, with the ability to lead and mentor cross-functional teams
  • Proficient in the English language, with an Upper-Intermediate or higher level of proficiency
Nice to have
  • Good experience in Datadog, Splunk, and other similar monitoring and logging tools
  • Experience in Incident Management in large-scale cloud environments
  • Strong experience in Ansible, with a focus on automation and configuration management
  • Experience in Apache Cassandra and other NoSQL Databases
  • Familiarity with GitHub and Jira
Benefits
  • International projects with top brands
  • Work with global teams of highly skilled, diverse peers
  • Healthcare benefits
  • Employee financial programs
  • Paid time off and sick leave
  • Upskilling, reskilling and certification courses
  • Unlimited access to the LinkedIn Learning library and 22,000+ courses
  • Global career opportunities
  • Volunteer and community involvement opportunities
  • EPAM Employee Groups
  • Award-winning culture recognized by Glassdoor, Newsweek and LinkedIn

These jobs are for you