Skip To Main Content
backBack to Search

Senior AWS DevOps Engineer

DevOps, Amazon Web Services, Grafana, Kubernetes, Prometheus, Python, Terraform, Apache Kafka, Cortex, Datadog, Elasticsearch, Fluentbit, Go Language, Grafana Loki, New Relic Analytics, OpenTelemetry
warning.png
Sorry, this position is no longer available

We are looking for a Senior DevOps Engineer to join our remote team.

You will manage Amazon Web Services infrastructure, including EKS version upgrades, scaling, and right-sizing. Additionally, you will be setting up, tuning, modernizing, migrating, and decommissioning various observability services that we provide, such as Cortex/Mimir, Loki, Tempo, OpenTelemetry, Grafana, and Alertmanager. You will also build Docker images for multiple architectures and troubleshoot issues involving microservices in Kubernetes, AWS connectivity, services performance, Lambda functions, and Kafka. You will participate in hypercare events and on-call shifts as needed.

Responsibilities
  • Manage AWS infrastructure using Terraform/CloudFormation, including EKS version upgrades, scaling, and right-sizing
  • Set up, tune, modernize, migrate, and decommission various observability services, such as Cortex/Mimir, Loki, Tempo, OpenTelemetry, Grafana, and Alertmanager
  • Programmatically automate operations using Python/Golang, Gitlab CI, or custom self-service solutions, based on the AWS Service Catalog
  • Build Docker images for multiple architectures (arm64, amd64)
  • Troubleshoot issues involving microservices in Kubernetes, AWS connectivity, services performance, Lambda functions, and Kafka
  • Participate in hypercare events and on-call shifts as needed
  • Collaborate with cross-functional teams in a fast-paced environment to achieve project goals
  • Enhance technical and soft skills through continuous learning and development with your mentor
Requirements
  • Minimum of 3 years of experience in DevOps, with a focus on AWS infrastructure management and observability services
  • Expertise in Infrastructure as Code (IaC) using Terraform or CloudFormation
  • Experience in scripting using Python and/or Bash
  • Hands-on experience with Kubernetes, Docker, Grafana, Prometheus, and Alertmanager
  • Familiarity with Helm and GitLab CI
  • Excellent analytical, troubleshooting, and problem-solving skills
  • Ability to work independently and collaboratively with cross-functional teams
  • Proven experience in managing services on AWS, including EKS, ECS on Fargate, Lambda, ECR, Load Balancing, VPC Endpoint, Route53, and CloudWatch
  • Fluent in English with an Upper-Intermediate level of proficiency
Nice to have
  • Experience with Cortex, Tempo, and Promtail/FluentBit
  • Familiarity with Monitoring Mixins, New Relic, DataDog, and Kafka
  • Experience in Elasticsearch
  • Knowledge of Golang
Benefits
  • International projects with top brands
  • Work with global teams of highly skilled, diverse peers
  • Healthcare benefits
  • Employee financial programs
  • Paid time off and sick leave
  • Upskilling, reskilling and certification courses
  • Unlimited access to the LinkedIn Learning library and 22,000+ courses
  • Global career opportunities
  • Volunteer and community involvement opportunities
  • EPAM Employee Groups
  • Award-winning culture recognized by Glassdoor, Newsweek and LinkedIn

These jobs are for you