Skip To Main Content
backBack to Search

Middle Big Data Software Engineer for a Biotechnology Company

Remote in India
Data Software Engineering, Big Data
warning.png
Sorry, this position is no longer available

We are currently looking for a remote Middle Big Data Software Engineer with 2+ years of production experience with Spark (PySpark) to join our team.

The customer is a biotechnology company, which engages in the discovery, invention, development, manufacture, and commercialization of medicines.

The main goal is to work out a solution that consumes and stores data from multiple customer’s domains.

Responsibilities
  • Implement pipeline processing application using PySpark and Airflow
  • Integrate required database structure
  • Apply data marts in Hive and PostgreSQL
  • Create analytical SQL Scripts in PostgreSQL or any other DB
  • Communicate with English speaking colleagues and customer representatives
Requirements
  • 2+ years of production experience with Spark (PySpark)
  • Strong skills in Apache Airflow
  • Knowledge of Apache Hive
  • English level B2+
Nice to have
  • Working experience within AWS services: S3, Athena, EC2
Benefits
  • International projects with top brands
  • Work with global teams of highly skilled, diverse peers
  • Healthcare benefits
  • Employee financial programs
  • Paid time off and sick leave
  • Upskilling, reskilling and certification courses
  • Unlimited access to the LinkedIn Learning library and 22,000+ courses
  • Global career opportunities
  • Volunteer and community involvement opportunities
  • EPAM Employee Groups
  • Award-winning culture recognized by Glassdoor, Newsweek and LinkedIn

These jobs are for you