Lead Data Software Engineer
Hybrid in Colombia
Data Software Engineering
& 6 others
Colombia
We are seeking an experienced and hands-on Lead Data Software Engineer to guide a team of talented data engineers and help shape the foundation of our data ecosystem. You will collaborate closely with the Head of Data/AI to modernize legacy assets, build scalable new data solutions, and enable the organization to leverage its data infrastructure effectively for advanced analytics and AI initiatives.
Responsibilities
- Lead and mentor a small team of data engineers, ensuring best practices and high-quality deliverables
- Oversee the cleaning, transformation, and optimization of legacy data assets to ensure data accuracy and usability
- Collaborate with stakeholders to understand their data needs and translate those requirements into scalable, efficient workflows
- Design and architect data pipelines, ensuring seamless ETL/ELT processes that meet performance needs
- Work extensively with cloud platforms, particularly Google Cloud Platform, leveraging tools like BigQuery, Vertex AI, Dataform, and Dataplex
- Execute hands-on coding and data engineering work, contributing to pipeline development and tuning processes
- Build and maintain large-scale datasets in a cloud environment, ensuring reliability and optimization
- Develop advanced SQL queries for data manipulation, reporting, and insights
- Incorporate robust programming practices using Python to automate processes and drive efficiencies
- Coordinate with cross-functional teams, including data scientists and business leaders, to align on strategic data initiatives
- Ensure documentation and thorough testing of all processes and deliverables to ensure scalability and maintainability
- Investigate and troubleshoot data pipeline performance issues, implementing improvements as needed
Requirements
- 5+ years of software development experience
- 1+ years in a leadership or mentoring capacity
- Proven expertise working with the Google Cloud Platform, including BigQuery, Vertex AI, Dataform, or comparable tools in AWS/Azure
- Advanced proficiency in SQL and data transformation techniques
- Strong programming skills in Python or other languages for data engineering tasks
- Background in working with large-scale datasets in cloud environments, coupled with experience in building scalable and efficient ETL/ELT pipelines
- Excellent written and verbal communication skills in English (B2+ level), enabling effective collaboration within and between teams in a distributed environment
Nice to have
- Familiarity with distributed data systems and tools such as Dataplex or similar platforms across cloud providers
- Showcase of previous successful projects in modernizing legacy data systems or building data foundations from scratch
- Knowledge of machine learning tools and workflows, particularly in integrating data for AI applications
Benefits
- International projects with top brands
- Work with global teams of highly skilled, diverse peers
- Healthcare benefits
- Employee financial programs
- Paid time off and sick leave
- Upskilling, reskilling and certification courses
- Unlimited access to the LinkedIn Learning library and 22,000+ courses
- Global career opportunities
- Volunteer and community involvement opportunities
- EPAM Employee Groups
- Award-winning culture recognized by Glassdoor, Newsweek and LinkedIn