Middle Big Data Software Engineer for a Biotechnology Company

Sorry, this position is no longer available
India
We are currently looking for a remote Middle Big Data Software Engineer with 2+ years of production experience with Spark (PySpark) to join our team.
The customer is a biotechnology company, which engages in the discovery, invention, development, manufacture, and commercialization of medicines.
The main goal is to work out a solution that consumes and stores data from multiple customer’s domains.
Responsibilities
- Implement pipeline processing application using PySpark and Airflow
- Integrate required database structure
- Apply data marts in Hive and PostgreSQL
- Create analytical SQL Scripts in PostgreSQL or any other DB
- Communicate with English speaking colleagues and customer representatives
Requirements
- 2+ years of production experience with Spark (PySpark)
- Strong skills in Apache Airflow
- Knowledge of Apache Hive
- English level B2+
Nice to have
- Working experience within AWS services: S3, Athena, EC2
Benefits
- International projects with top brands
- Work with global teams of highly skilled, diverse peers
- Healthcare benefits
- Employee financial programs
- Paid time off and sick leave
- Upskilling, reskilling and certification courses
- Unlimited access to the LinkedIn Learning library and 22,000+ courses
- Global career opportunities
- Volunteer and community involvement opportunities
- EPAM Employee Groups
- Award-winning culture recognized by Glassdoor, Newsweek and LinkedIn