We are seeking a seasoned Senior Platform Engineer to join our Software, System, and Test Engineering group.
You will design, implement, and optimize complex distributed systems with a focus on message fabric technologies. Join us to lead transformational projects and enhance our cloud-based development infrastructure.
Responsibilities
- Design and optimize large-scale Apache Kafka or Pulsar clusters
- Deploy and configure message fabric technologies to ensure reliability and scalability
- Troubleshoot and resolve issues in distributed systems and message-driven applications
- Build and maintain Infrastructure as Code (IaC) using tools such as Terraform
- Develop and maintain CI/CD pipelines using GitHub Actions, ArgoCD, or similar
- Implement containerization and orchestration solutions using Kubernetes and Helm
- Monitor system performance and maintain observability using Datadog or equivalent tools
- Coordinate migration of message-driven applications from RabbitMQ to Kafka or Pulsar
- Communicate complex technical concepts to both technical and non-technical stakeholders
- Mentor engineers and evangelize best practices in platform engineering and message-driven architectures
- Lead transformational projects to enhance software systems at scale
- Collaborate with cross-functional teams to ensure platform reliability and efficiency
- Document platform architecture and operational procedures
- Evaluate and recommend tools and technologies to improve platform capabilities
Requirements
- Extensive experience (4+ years) in platform engineering and cloud infrastructure development
- Strong expertise in Apache Kafka or Apache Pulsar architecture and operations
- Proven experience with message fabric technologies, including RabbitMQ
- Demonstrated leadership in building and delivering transformational engineering projects
- Solid understanding of Infrastructure as Code (IaC) principles and tools such as Terraform
- Proficiency in CI/CD pipeline development and automation
- Experience with containerization and orchestration using Kubernetes and Helm
- Skill in observability and in troubleshooting distributed systems using Datadog or similar tools
- Excellent communication skills for technical and non-technical audiences
- Ability to mentor and guide engineering teams in best practices
- Deep knowledge of distributed systems design and architectural tradeoffs
- Strong problem-solving and analytical skills
- Proficiency with source code management systems
- Fluency in English (B2 Upper-Intermediate)
Nice to have
- Experience with Apache Pulsar as an alternative to Kafka
- Knowledge of ArgoCD for continuous delivery
- Hands-on experience migrating applications from RabbitMQ to Kafka or Pulsar
- Familiarity with Helm chart development
- Certifications in cloud platforms or Kubernetes