
Senior DevOps Engineer

DevOps, Datadog, Kubernetes, Observability and troubleshooting in distributed systems, Report and dashboard development, Go, Grafana, JavaScript, Prometheus, Python
Sorry, this position is no longer available

We are looking for a Senior DevOps Engineer to join our remote team and work on a project that involves observability and troubleshooting in distributed systems.

In this role, you will have the opportunity to work with cutting-edge technologies such as Datadog, Kubernetes, Prometheus, Grafana, and more. Your primary responsibility will be to develop reports and dashboards to visualize service availability and work with multiple teams to help with intake requests. As a senior engineer, you will be expected to provide technical leadership and mentorship to junior members of the team, and work directly with the client to ensure we meet their expectations.

Responsibilities
  • Update code repositories to add metadata to our telemetry in support of tooling
  • Create dashboards to visualize service availability, and work with multiple teams to help with intake requests
  • Work directly with the client to ensure we meet their expectations
  • Provide technical leadership and mentorship to junior members of the team
  • Collaborate with other teams to ensure observability and troubleshooting best practices are followed
  • Continuously learn and stay up-to-date with new technologies and trends in the DevOps field
Requirements
  • At least 3 years of experience as a DevOps Engineer, with a focus on observability and troubleshooting in distributed systems
  • Familiarity with Datadog or any open-source observability tools
  • Strong understanding of Kubernetes
  • Experience in developing reports and dashboards to visualize service availability
  • Understanding of the value of telemetry for production monitoring, including logs, traces, and metrics
  • Ability to work as an individual contributor directly with the client
  • English communication skills at Upper-Intermediate (B2) level or higher
Nice to have
  • Proficiency in Prometheus and Grafana
  • Ability to understand code in Golang, Python, and JavaScript
Benefits
  • International projects with top brands
  • Work with global teams of highly skilled, diverse peers
  • Healthcare benefits
  • Employee financial programs
  • Paid time off and sick leave
  • Upskilling, reskilling and certification courses
  • Unlimited access to the LinkedIn Learning library and 22,000+ courses
  • Global career opportunities
  • Volunteer and community involvement opportunities
  • EPAM Employee Groups
  • Award-winning culture recognized by Glassdoor, Newsweek and LinkedIn