Lead AI Platform Engineer
Hybrid in Mexico
Cloud Native Development
& 7 others
Mexico
We are seeking a talented and motivated Lead AI Platform Engineer to join our team.
In this position, you will be responsible for designing and building large-scale AI/ML infrastructure to address complex challenges in healthcare and drug discovery. This role offers a unique opportunity to develop advanced platforms that empower data scientists to create impactful solutions, driving innovation in global healthcare.
Responsibilities
- Develop and manage infrastructure and platforms to support the deployment and monitoring of machine learning solutions in production settings
- Optimize system scalability and performance to handle the demands of large-scale operations
- Collaborate with data science teams to create and implement AI/ML workflows and environments on AWS
- Work closely with R&D data scientists to productionize machine learning models, algorithms, and pipelines
- Oversee the complete software engineering lifecycle, including design, development, testing, and maintenance
- Lead technology initiatives from initial concept through successful delivery and implementation
- Upgrade the existing technology stack by incorporating advancements in artificial intelligence and data processing
- Manage an enterprise platform and service, effectively addressing customer needs and feature requests
- Implement DevOps practices and utilize modern tools to improve efficiency and automation processes
- Scale MLOps environments to meet production-grade standards
- Ensure systems adhere to GxP compliance standards when required
Requirements
- At least 5 years of experience working in AWS cloud environments, with expertise in services such as SageMaker, Athena, S3, EC2, RDS, Glue, Lambda, Step Functions, EKS, and ECS
- Proficiency in infrastructure-as-code tools like Terraform, Ansible, or CloudFormation
- Strong programming skills in Python, with flexibility to work with other programming languages
- Experience with containerization and microservices architectures, using platforms like Kubernetes or Docker
- Extensive knowledge of Continuous Integration and Continuous Delivery pipelines, including tools such as CodePipeline, CodeBuild, or CodeDeploy
- Proven track record of managing large-scale enterprise platforms and addressing end-user feature needs
- Hands-on experience with DevOps tools and practices, including Docker and Git
- Awareness of GxP compliance standards
- Strong problem-solving, analytical, and communication skills
Nice to have
- Experience building large-scale data processing pipelines using technologies such as Hadoop, Spark, or SQL
- Proficiency with data science modeling tools and platforms, including R, Python, or Jupyter Notebooks
- Knowledge of multi-cloud environments, such as AWS, Azure, and GCP
- Experience mentoring and guiding team members or clients in a professional capacity
- Familiarity with SAFe Agile methodologies and processes
We offer/Benefits
- International projects with top brands
- Work with global teams of highly skilled, diverse peers
- Healthcare benefits
- Employee financial programs
- Paid time off and sick leave
- Upskilling, reskilling and certification courses
- Unlimited access to the LinkedIn Learning library and 22,000+ courses
- Global career opportunities
- Volunteer and community involvement opportunities
- EPAM Employee Groups
- Award-winning culture recognized by Glassdoor, Newsweek and LinkedIn