Skip To Main Content
backBack to Search

Senior Site Reliability Engineer

Remote in Mexico
Site Reliability Engineering
& 10 others

We are seeking a highly skilled Senior Site Reliability Engineer to join our remote team, contributing to a distributed system project that demands broad expertise across various tools and skills.

As a Senior Site Reliability Engineer, you will analyze and understand how all elements of the system operate together to ensure its reliability and performance. If you are driven by a passion for site reliability engineering with a history of success, we encourage you to join us.

Responsibilities
  • Design and build infrastructure and services that support the distributed system
  • Oversee system performance and address issues to maintain reliability and availability
  • Partner with cross-functional teams to create and implement solutions aligned with user and business needs
  • Automate infrastructure deployment and configuration to enhance productivity and streamline operations
  • Conduct code reviews and uphold best practices for site reliability engineering
  • Create and update comprehensive documentation for infrastructure and services, ensuring clarity and team alignment
  • Explore emerging technologies and trends in site reliability engineering to continuously develop skills and knowledge
Requirements
  • At least 3 years of experience in Site Reliability Engineering, showcasing a background in designing, building, and maintaining large-scale distributed systems
  • Proficiency in containerization technologies such as Docker and Kubernetes to support scalable and reliable service deployment and management
  • Competency in monitoring and logging tools such as Grafana
  • Familiarity with cloud platforms such as Microsoft Azure and Google Cloud Platform to architect and deploy cloud infrastructure effectively
  • Skills in scripting languages like PowerShell, Python, and Terraform to automate deployment and configuration
  • Understanding of web technologies, including PHP and Angular, to develop and sustain web applications
  • Strong communication and collaboration abilities to work efficiently with cross-functional teams
  • Confidence in decision-making and self-reliance, enabling ownership and progress in projects
  • Fluent spoken and written English at an Upper-Intermediate level or higher for effective communication
Nice to have
  • Proficiency in JavaScript and Go language
Benefits
  • International projects with top brands
  • Work with global teams of highly skilled, diverse peers
  • Healthcare benefits
  • Employee financial programs
  • Paid time off and sick leave
  • Upskilling, reskilling and certification courses
  • Unlimited access to the LinkedIn Learning library and 22,000+ courses
  • Global career opportunities
  • Volunteer and community involvement opportunities
  • EPAM Employee Groups
  • Award-winning culture recognized by Glassdoor, Newsweek and LinkedIn