Senior Site Reliability Engineer
Remote in Mexico
Site Reliability Engineering
& 10 others
Mexico
We are seeking a highly skilled Senior Site Reliability Engineer to join our remote team, working on a distributed system project requiring proficiency in a wide range of tools and competencies.
As a Senior Site Reliability Engineer, your role involves understanding how all components of the system integrate seamlessly, ensuring reliability and excellent performance. If you're passionate about site reliability engineering with a proven history of success, we welcome you to our team.
Responsibilities
- Design infrastructure and services that support the distributed system
- Monitor system performance to identify and address issues, ensuring high reliability and availability
- Collaborate with cross-functional teams to devise solutions that meet business and user requirements
- Automate infrastructure deployment and configuration to achieve efficient and streamlined processes
- Contribute to code reviews and recommend best practices in site reliability engineering
- Create and maintain documentation for infrastructure and services to ensure knowledge sharing and team alignment
- Adapt to emerging technologies and trends in site reliability engineering, expanding skills and expertise
Requirements
- A minimum of 3 years of experience in Site Reliability Engineering with a background in designing, building, and maintaining complex distributed systems
- Proficiency in containerization technologies such as Docker and Kubernetes for scalable service deployment
- Skills in using monitoring and logging tools such as Grafana effectively
- Familiarity with cloud platforms like Microsoft Azure and Google Cloud Platform for implementing cloud infrastructure
- Competency in scripting languages such as PowerShell, Python, and Terraform for automating workflows
- Experience with web technologies such as PHP and Angular for developing robust web solutions
- Excellent communication and collaboration abilities to foster teamwork in cross-functional environments
- Autonomy in decision-making and leadership qualities to manage and advance projects independently
- Upper-Intermediate or higher level fluency in spoken and written English for seamless communication
Nice to have
- Understanding of JavaScript programming concepts
- Expertise in working with the Go programming language
Benefits
- International projects with top brands
- Work with global teams of highly skilled, diverse peers
- Healthcare benefits
- Employee financial programs
- Paid time off and sick leave
- Upskilling, reskilling and certification courses
- Unlimited access to the LinkedIn Learning library and 22,000+ courses
- Global career opportunities
- Volunteer and community involvement opportunities
- EPAM Employee Groups
- Award-winning culture recognized by Glassdoor, Newsweek and LinkedIn