Skip To Main Content
backBack to Search

Senior Observability Engineer

DevOps, Datadog, Kubernetes, Observability and troubleshooting in distributed systems, Reports and Dashboard developing, Terraform, Amazon Web Services, Go Language, Google Cloud Platform, Grafana, JavaScript, Prometheus, Python
warning.png
Sorry, this position is no longer available

We are seeking a highly skilled and experienced Senior Observability Engineer to join our team.

In this role, you will work directly with the client as an independent contributor, ensuring seamless integration of observability tools and practices into their systems without supervision.

Your expertise will drive the enhancement of monitoring and observability capabilities, providing valuable insights for system optimization and reliability.

Responsibilities
  • Update code repositories to include metadata for enhanced telemetry tooling
  • Design and create dashboards to visualize service availability and performance
  • Collaborate with multiple teams to facilitate intake requests and address observability needs
  • Work independently with the client to understand their specific requirements
  • Develop and implement monitoring solutions using tools like Prometheus, Grafana, or Datadog
  • Optimize existing observability frameworks to improve data collection and analysis
  • Proactively identify and troubleshoot issues in production environments
  • Enhance logging, tracing, and metric collection systems
  • Train and mentor team members on best practices in observability engineering
  • Continuously research and recommend improvements to observability infrastructure
Requirements
  • 3+ years of experience with Kubernetes and Terraform
  • Proficiency in coding with Golang, Python, and JavaScript
  • Familiarity with Datadog or similar open-source observability tools
  • Experience with Prometheus and/or Grafana
  • Prior experience in Observability engineering
  • Understanding of the value of telemetry for production monitoring (Logs, Traces, Metrics)
  • Proven ability to work independently directly with clients
  • Capable of managing work, resolving roadblocks, and meeting deadlines
  • Adaptability to work in dynamic environments
  • Excellent communication skills
  • Proficiency in English (B2 level or higher)
Nice to have
  • Experience with AWS (GCP acceptable)
  • Advanced data analytics skills
Benefits
  • International projects with top brands
  • Work with global teams of highly skilled, diverse peers
  • Healthcare benefits
  • Employee financial programs
  • Paid time off and sick leave
  • Upskilling, reskilling and certification courses
  • Unlimited access to the LinkedIn Learning library and 22,000+ courses
  • Global career opportunities
  • Volunteer and community involvement opportunities
  • EPAM Employee Groups
  • Award-winning culture recognized by Glassdoor, Newsweek and LinkedIn

These jobs are for you