Skip To Main Content
backBack to Search

Lead Data Software Engineer

Hybrid in Colombia
Data Software Engineering
& 6 others

We are seeking an experienced and hands-on Lead Data Software Engineer to guide a team of talented data engineers and help shape the foundation of our data ecosystem. You will collaborate closely with the Head of Data/AI to modernize legacy assets, build scalable new data solutions, and enable the organization to leverage its data infrastructure effectively for advanced analytics and AI initiatives.

Responsibilities
  • Lead and mentor a small team of data engineers, ensuring best practices and high-quality deliverables
  • Oversee the cleaning, transformation, and optimization of legacy data assets to ensure data accuracy and usability
  • Collaborate with stakeholders to understand their data needs and translate those requirements into scalable, efficient workflows
  • Design and architect data pipelines, ensuring seamless ETL/ELT processes that meet performance needs
  • Work extensively with cloud platforms, particularly Google Cloud Platform, leveraging tools like BigQuery, Vertex AI, Dataform, and Dataplex
  • Execute hands-on coding and data engineering work, contributing to pipeline development and tuning processes
  • Build and maintain large-scale datasets in a cloud environment, ensuring reliability and optimization
  • Develop advanced SQL queries for data manipulation, reporting, and insights
  • Incorporate robust programming practices using Python to automate processes and drive efficiencies
  • Coordinate with cross-functional teams, including data scientists and business leaders, to align on strategic data initiatives
  • Ensure documentation and thorough testing of all processes and deliverables to ensure scalability and maintainability
  • Investigate and troubleshoot data pipeline performance issues, implementing improvements as needed
Requirements
  • 5+ years of software development experience
  • 1+ years in a leadership or mentoring capacity
  • Proven expertise working with the Google Cloud Platform, including BigQuery, Vertex AI, Dataform, or comparable tools in AWS/Azure
  • Advanced proficiency in SQL and data transformation techniques
  • Strong programming skills in Python or other languages for data engineering tasks
  • Background in working with large-scale datasets in cloud environments, coupled with experience in building scalable and efficient ETL/ELT pipelines
  • Excellent written and verbal communication skills in English (B2+ level), enabling effective collaboration within and between teams in a distributed environment
Nice to have
  • Familiarity with distributed data systems and tools such as Dataplex or similar platforms across cloud providers
  • Showcase of previous successful projects in modernizing legacy data systems or building data foundations from scratch
  • Knowledge of machine learning tools and workflows, particularly in integrating data for AI applications
Benefits
  • International projects with top brands
  • Work with global teams of highly skilled, diverse peers
  • Healthcare benefits
  • Employee financial programs
  • Paid time off and sick leave
  • Upskilling, reskilling and certification courses
  • Unlimited access to the LinkedIn Learning library and 22,000+ courses
  • Global career opportunities
  • Volunteer and community involvement opportunities
  • EPAM Employee Groups
  • Award-winning culture recognized by Glassdoor, Newsweek and LinkedIn