Senior Site Reliability Engineer
Remote in Mexico
Site Reliability Engineering
We are seeking a highly skilled Senior Site Reliability Engineer to join our remote team, contributing to a distributed system project that demands broad expertise across various tools and skills.
As a Senior Site Reliability Engineer, you will analyze and understand how all elements of the system operate together to ensure its reliability and performance. If you are passionate about site reliability engineering and have a track record of success, we encourage you to join us.
Responsibilities
- Design and build infrastructure and services that support the distributed system
- Oversee system performance and address issues to maintain reliability and availability
- Partner with cross-functional teams to create and implement solutions aligned with user and business needs
- Automate infrastructure deployment and configuration to enhance productivity and streamline operations
- Conduct code reviews and uphold best practices for site reliability engineering
- Create and update comprehensive documentation for infrastructure and services, ensuring clarity and team alignment
- Explore emerging technologies and trends in site reliability engineering to continuously develop skills and knowledge
Requirements
- At least 3 years of experience in Site Reliability Engineering, showcasing a background in designing, building, and maintaining large-scale distributed systems
- Proficiency in containerization and orchestration technologies such as Docker and Kubernetes to support scalable and reliable service deployment and management
- Competency in monitoring and logging tools such as Grafana
- Familiarity with cloud platforms such as Microsoft Azure and Google Cloud Platform to architect and deploy cloud infrastructure effectively
- Skills in scripting languages such as PowerShell and Python, and in infrastructure-as-code tools such as Terraform, to automate deployment and configuration
- Understanding of web technologies, including PHP and Angular, to develop and sustain web applications
- Strong communication and collaboration abilities to work efficiently with cross-functional teams
- Confidence in decision-making and self-reliance, enabling ownership and progress in projects
- Spoken and written English at an Upper-Intermediate level or higher for effective communication
Nice to have
- Proficiency in JavaScript and Go
Benefits
- International projects with top brands
- Work with global teams of highly skilled, diverse peers
- Healthcare benefits
- Employee financial programs
- Paid time off and sick leave
- Upskilling, reskilling and certification courses
- Unlimited access to the LinkedIn Learning library and 22,000+ courses
- Global career opportunities
- Volunteer and community involvement opportunities
- EPAM Employee Groups
- Award-winning culture recognized by Glassdoor, Newsweek and LinkedIn