Back to Search
Lead Data Engineer PySpark and Python
Data Software Engineering, Python, Apache Spark, Python.Core, Agile, Databricks, PySpark, SQL, Cloud Pipeline, Gradle, DevOps
Sorry, this position is no longer available
We are seeking a remote Lead Data Engineer with expertise in PySpark and Python to join our team. The successful candidate will be working primarily with big data engineering. The main project focuses on pipeline design and development in the FS-Insurance sector.
Responsibilities
- Design and develop data pipelines
- Collaborate with cross-functional teams to develop and implement data solutions
- Ensure data quality and reliability
- Lead and mentor junior data engineers
- Utilize Agile/Scrum software development methodologies
Requirements
- At least 5 years of experience in Big Data engineering
- Expertise in DSE Python, AWS and Databricks
- Strong proficiency in PySpark, Spark and Python programming
- Good communication skills
- Proficiency in SQL (Spark SQL preferred)
- Experience with distributed computing on enterprise data platforms
- Upper-Intermediate English level (B2+)
Nice to have
- Knowledge of Java and Gradle
- Familiarity with JavaScript/HTML/CSS
Benefits
- International projects with top brands
- Work with global teams of highly skilled, diverse peers
- Healthcare benefits
- Employee financial programs
- Paid time off and sick leave
- Upskilling, reskilling and certification courses
- Unlimited access to the LinkedIn Learning library and 22,000+ courses
- Global career opportunities
- Volunteer and community involvement opportunities
- EPAM Employee Groups
- Award-winning culture recognized by Glassdoor, Newsweek and LinkedIn