Chief Data Software Engineer
Data Software Engineering, Amazon Web Services, CI/CD, Python, SQL, Amazon DynamoDB, Apache Hive, Apache Spark, Databricks
We are seeking an experienced and skilled Chief Data Software Engineer to lead the development of robust data solutions, drive stakeholder collaboration, and guide a high-performing team in creating scalable applications using cutting-edge technologies.
In this role, you will pair technical expertise with leadership skills to deliver impactful solutions while working in US time zones.
Responsibilities
- Design scalable data pipelines using Python and SQL
- Lead application development with microservices architecture, CI/CD, and unit testing frameworks
- Implement cloud-based solutions leveraging AWS components like EC2, S3, Lambda, and RDS
- Manage data processing workflows with tools such as Hive and Spark
- Ensure system designs apply sound data structures, algorithms, and best practices
- Collaborate with stakeholders to define requirements and produce technical solutions
- Explore database solutions like DynamoDB and Elasticsearch to optimize performance
- Guide the team in establishing test-driven development practices that ensure code integrity
- Support data orchestration tasks using tools like Airflow and data warehousing techniques
- Create, automate, and troubleshoot CI/CD pipelines for seamless software delivery
- Mentor teams of 5-10 developers, promoting professional development and growth
- Enforce documentation standards and oversee clear, well-organized technical writing across the team
Requirements
- 7+ years of professional experience in software engineering or data engineering roles
- Proficiency in Python and SQL for data and application development
- Background in cloud technologies including AWS (EC2, S3, Lambda, DynamoDB, SQS)
- Expertise in writing unit test cases with mock frameworks to support robust CI/CD processes
- Knowledge of data processing frameworks such as Spark and Hive
- Competency in data structures, algorithms, and system design principles
- Familiarity with Flask, microservices, and pipeline automation workflows
- Understanding of data modeling concepts and orchestration tools like Airflow
- Ability to manage stakeholder communications effectively
- Experience with test-driven development methods and scalable pipeline construction
Nice to have
- Understanding of Elasticsearch for boosting data retrieval and management efficiency
- Background in DynamoDB database integration within application workflows
- Skills in advanced data modeling strategies and schema refinement techniques
Benefits
- International projects with top brands
- Work with global teams of highly skilled, diverse peers
- Healthcare benefits
- Employee financial programs
- Paid time off and sick leave
- Upskilling, reskilling and certification courses
- Unlimited access to the LinkedIn Learning library and 22,000+ courses
- Global career opportunities
- Volunteer and community involvement opportunities
- EPAM Employee Groups
- Award-winning culture recognized by Glassdoor, Newsweek and LinkedIn