Chief AI Platform Engineer
Hybrid in Mexico
Cloud Native Development
& 7 others
Mexico
We are looking for a talented and ambitious Chief AI Platform Engineer to join our forward-thinking team.
You will play a vital role in building and scaling AI/ML infrastructure to address groundbreaking challenges in drug discovery and healthcare. This role provides the chance to craft advanced platforms and systems for data scientists, enhancing global healthcare initiatives and positively influencing millions of lives.
Responsibilities
- Provide infrastructure and platform support for deploying and monitoring machine learning solutions in production
- Ensure high performance and scalability for large-scale systems by optimizing solutions
- Develop cutting-edge AI/ML environments and workflows in collaboration with data science teams using AWS
- Partner with R&D data scientists to transform machine learning pipelines, models, and algorithms into production-ready solutions
- Own all facets of software engineering, including design, implementation, testing, and maintenance
- Lead technology initiatives from initial development to successful project completion
- Upgrade the technological stack with the latest advancements in data processing and artificial intelligence
- Manage an enterprise-level platform, addressing customer requirements and feature requests effectively
- Introduce and implement DevOps best practices and modern toolchains to drive automation and efficiency
- Scale MLOps environments to meet production-grade requirements
- Align platform and processes with GxP standards when necessary
Requirements
- Extensive experience (7+ years) with AWS cloud environments, including knowledge of SageMaker, Athena, S3, EC2, RDS, Glue, Lambda, Step Functions, EKS, and ECS
- Proficiency in infrastructure-as-code tools such as Terraform, Ansible, or CloudFormation
- Strong programming skills, especially in Python, though expertise in other languages will also be considered
- Competency in working with containers and microservices architectures, including Kubernetes or Docker systems
- Advanced understanding of Continuous Integration and Continuous Delivery pipelines, such as CodePipeline, CodeBuild, or CodeDeploy
- Background handling large-scale enterprise platforms and end-user feature requests
- Practical knowledge of DevOps tools and practices, including Docker and Git
- Familiarity with GxP compliance standards
- Strong analytical, communication, and problem-solving capabilities
Nice to have
- Proficiency in building extensive data processing pipelines using Hadoop, Spark, or SQL
- Competency in data science modeling tools and environments such as R, Python, or Jupyter Notebooks
- Understanding of managing multi-cloud platforms, including AWS, Azure, or GCP
- Background mentoring and assisting less experienced colleagues or clients in a professional capacity
- Familiarity with applying SAFe Agile principles and methodologies
Benefits
- International projects with top brands
- Work with global teams of highly skilled, diverse peers
- Healthcare benefits
- Employee financial programs
- Paid time off and sick leave
- Upskilling, reskilling and certification courses
- Unlimited access to the LinkedIn Learning library and 22,000+ courses
- Global career opportunities
- Volunteer and community involvement opportunities
- EPAM Employee Groups
- Award-winning culture recognized by Glassdoor, Newsweek and LinkedIn