Back to Search
Senior Site Reliability Engineer (Azure)
Microsoft Azure, Azure Application Insights, Azure Cosmos DB, Azure Monitor, Azure DevOps, Azure Kubernetes Service, Bash, PowerShell, Troubleshooting and tracing in distributed systems
Sorry, this position is no longer available
We are hiring a Senior Site Reliability Engineer (Azure) for a remote position.
In this role, you will be responsible for analyzing and discovering how all components of a distributed system work together using a broad range of skills and tools. You must be autonomous and capable of making decisions, while also collaborating with multiple teams. This position involves monitoring applications, configuring and deploying Azure cloud resources, and identifying and troubleshooting issues. You must have experience with DevOps best practices and be able to set up and maintain CI/CD implementation using Azure DevOps or equivalent.
Responsibilities
- Analyze and discover how all components of a distributed system work together using a broad range of skills and tools
- Monitor applications, gather telemetry, set up alerting, and define SLOs using tools such as Azure Monitor or equivalent
- Configure and deploy Azure cloud resources such as AKS, CosmosDB, Key Vault, Redis Cache, Storage, ServiceBus, App Gateway, etc
- Set up and maintain CI/CD implementation using Azure DevOps or equivalent to adhere to DevOps best practices
- Troubleshoot issues and perform log queries and aggregation to identify issues
- Collaborate with multiple teams to ensure seamless project completion
- Continuously enhance skills and knowledge through learning and development opportunities
Requirements
- At least 3 years of work experience as a Site Reliability Engineer or DevOps Engineer
- Expertise in Microsoft Azure, including Azure Application Insights, Azure Cosmos DB, and Azure Monitor
- Proficiency in Bash and PowerShell programming, as well as troubleshooting and tracing in distributed systems
- Experience configuring and deploying Azure cloud resources like AKS, Key Vault, Redis Cache, Storage, ServiceBus, App Gateway, etc.
- Familiarity with DevOps best practices and setting up and maintaining CI/CD implementation using Azure DevOps or equivalent
- Strong communication and collaboration skills to work with multiple teams
- Upper-Intermediate level of English
Nice to have
- Ability to work on-call during weekends
Benefits
- International projects with top brands
- Work with global teams of highly skilled, diverse peers
- Healthcare benefits
- Employee financial programs
- Paid time off and sick leave
- Upskilling, reskilling and certification courses
- Unlimited access to the LinkedIn Learning library and 22,000+ courses
- Global career opportunities
- Volunteer and community involvement opportunities
- EPAM Employee Groups
- Award-winning culture recognized by Glassdoor, Newsweek and LinkedIn