Lead Data Engineer PySpark and Python

Remote in India

Data Software Engineering

Sorry, this position is no longer available

Location-specific conditions & benefits*

India

We are seeking a remote Lead Data Engineer with expertise in PySpark and Python to join our team. The successful candidate will be working primarily with big data engineering. The main project focuses on pipeline design and development in the FS-Insurance sector.

Responsibilities

Design and develop data pipelines
Collaborate with cross-functional teams to develop and implement data solutions
Ensure data quality and reliability
Lead and mentor junior data engineers
Utilize Agile/Scrum software development methodologies

Requirements

At least 5 years of experience in Big Data engineering
Expertise in DSE Python, AWS and Databricks
Strong proficiency in PySpark, Spark and Python programming
Good communication skills
Proficiency in SQL (Spark SQL preferred)
Experience with distributed computing on enterprise data platforms
Upper-Intermediate English level (B2+)

Nice to have

Knowledge of Java and Gradle
Familiarity with JavaScript/HTML/CSS

Benefits

International projects with top brands
Work with global teams of highly skilled, diverse peers
Healthcare benefits
Employee financial programs
Paid time off and sick leave
Upskilling, reskilling and certification courses
Unlimited access to the LinkedIn Learning library and 22,000+ courses
Global career opportunities
Volunteer and community involvement opportunities
EPAM Employee Groups
Award-winning culture recognized by Glassdoor, Newsweek and LinkedIn

Lead Data Engineer PySpark and Python

These jobs are for you