We are seeking a highly skilled and experienced Senior DevOps Engineer to join our dynamic team.
The successful candidate will be responsible for managing and optimizing our cloud-based systems and platforms, ensuring seamless deployment, maintenance, and monitoring of our Azure and AWS infrastructure.
Responsibilities
- Collaborate with product teams to manage Azure system deployments, lifecycle maintenance, capacity planning, and advising stakeholders
- Triage and resolve incidents and requests in the service management system
- Monitor applications, perform data manipulation for widgets, generate reports, and manage problems
- Tune agents and collectors for desired system data manipulation
- Provide occasional consultation and customer support both internally and externally
- Oversee Azure and AWS infrastructure management
- Create and manage check-in policies, including installation, configuration, and troubleshooting
- Automate scripts for report generation using Terraform and Python
- Maintain applications within Amazon Elastic Kubernetes Service (EKS) and Azure Kubernetes Service (AKS)
- Manage network protocols and cloud network security
- Monitor cloud infrastructure components
- Manage virtual machines, virtual networks, autoscaling, and storage solutions across various cloud services
Requirements
- At least 4 years of IT industry experience with comprehensive knowledge of AWS and Azure including a bachelor's degree in computer science or information technology
- Minimum of 3 years of experience with cloud-based development platforms like AWS or Azure
- 3+ years of experience in automated build scripts for release management with Terraform, Ansible, and PowerShell
- Proficiency in Linux system administration, ideally with Red Hat Linux or CentOS
- At least 2 years of experience setting up Kubernetes clusters and implementing EKS and AKS with tools like Docker, Prometheus, and Nginx
- Adaptability to work in 24x7 operational support on a rotational shift basis
- Proficiency in research and translating collected data into actionable insights
Nice to have
- Experience with Java
- Certification as an Azure Certified Solutions Architect or SysOps Administrator
Benefits
- International projects with top brands
- Work with global teams of highly skilled, diverse peers
- Healthcare benefits
- Employee financial programs
- Paid time off and sick leave
- Upskilling, reskilling and certification courses
- Unlimited access to the LinkedIn Learning library and 22,000+ courses
- Global career opportunities
- Volunteer and community involvement opportunities
- EPAM Employee Groups
- Award-winning culture recognized by Glassdoor, Newsweek and LinkedIn