Senior Site Reliability Engineer

Remote in Mexico
Site Reliability Engineering

We are seeking a highly skilled Senior Site Reliability Engineer to join our remote team and work on a distributed system project that requires proficiency with a wide range of tools and technologies.

As a Senior Site Reliability Engineer, you will need to understand how all components of the system integrate, and you will be responsible for its reliability and performance. If you are passionate about site reliability engineering and have a proven track record, we welcome you to join our team.

Responsibilities
  • Design infrastructure and services that support the distributed system
  • Monitor system performance to identify and address issues, ensuring high reliability and availability
  • Collaborate with cross-functional teams to devise solutions that meet business and user requirements
  • Automate infrastructure deployment and configuration to achieve efficient and streamlined processes
  • Contribute to code reviews and recommend best practices in site reliability engineering
  • Create and maintain documentation for infrastructure and services to ensure knowledge sharing and team alignment
  • Adapt to emerging technologies and trends in site reliability engineering, expanding skills and expertise
Requirements
  • A minimum of 3 years of experience in Site Reliability Engineering with a background in designing, building, and maintaining complex distributed systems
  • Proficiency in containerization and orchestration technologies such as Docker and Kubernetes for scalable service deployment
  • Skill in using monitoring and logging tools such as Grafana
  • Familiarity with cloud platforms like Microsoft Azure and Google Cloud Platform for implementing cloud infrastructure
  • Competency in scripting languages such as PowerShell and Python, and in infrastructure-as-code tools such as Terraform, for automating workflows
  • Experience with web technologies such as PHP and Angular for developing robust web solutions
  • Excellent communication and collaboration abilities to foster teamwork in cross-functional environments
  • Autonomy in decision-making and leadership qualities to manage and advance projects independently
  • Upper-Intermediate or higher level fluency in spoken and written English for seamless communication
Nice to have
  • Understanding of JavaScript programming concepts
  • Expertise in working with the Go programming language
Benefits
  • International projects with top brands
  • Work with global teams of highly skilled, diverse peers
  • Healthcare benefits
  • Employee financial programs
  • Paid time off and sick leave
  • Upskilling, reskilling and certification courses
  • Unlimited access to the LinkedIn Learning library and 22,000+ courses
  • Global career opportunities
  • Volunteer and community involvement opportunities
  • EPAM Employee Groups
  • Award-winning culture recognized by Glassdoor, Newsweek and LinkedIn