Systems Engineer Jobs
EPAM is looking for Systems Engineers.
DevOps
GitHub Actions, Grafana, Istio
40 hrs/week
12+ months
- Development of Terraform and Terragrunt configurations for infrastructure as code
- Management and creation of GitHub Actions workflows for CI/CD pipelines
- Resolution of data access permission issues in AWS S3 and AWS IAM
- Troubleshooting of Kubeflow ML pipeline issues related to CPU, memory, GPU, and permissions
- Scripting using Python for platform automation tasks
- Team collaboration to boost the CVML platform's reliability and efficiency
- Involvement in architecture and design discussions for system enhancements
- Keeping abreast of the latest DevOps tools and practices for continuous improvement
- At least 5 years of practical experience in DevOps roles
- Leadership experience of 1 year or more
- Deep understanding of Kubernetes and its ecosystem, particularly AWS EKS and KubeSpray
- Proficiency in using Terraform and Terragrunt for infrastructure as code
- Experience with Prometheus and Grafana for monitoring and observability
- Solid knowledge of Istio for service mesh and its basic components, such as sidecars, mTLS, and ingress gateway
- Proficiency in Python for scripting and automation tasks
- Hands-on experience with GitHub and GitHub Actions for CI/CD pipelines
- Sound understanding of AWS services, including networking, load balancing, and IAM
- Excellent troubleshooting skills for data access permission issues in AWS S3 and AWS IAM
- Ability to develop and troubleshoot Kubeflow ML pipeline issues
- Familiarity with distributed tracing tools such as Zipkin and Istio
- Knowledge of Golang, Kubeflow, and Pulumi
DevOps
Amazon Web Services, Git, Kubernetes
40 hrs/week
12+ months
- Assist in engineering tasks for our Gaming Client's observability infrastructure used by online services
- Operate independently based on broad objectives
- Communicate effectively in English with team members and stakeholders
- Sustain, support, and upgrade existing infrastructure
- Oversee the integration of external systems
- Engage in scheduled rotations to maintain continuous system operations
- Be available for on-call shifts as needed, with extra compensation provided
- Cooperate with product engineering teams to improve service observability
- Emphasize best practices for compiling logs and observability metrics
- Contribute to the development of new products and the upgrading of current systems
- Minimum of 5 years in a Systems Engineer, DevOps Engineer, or Cloud Engineer role
- 1+ years of relevant leadership experience
- Extensive experience with cloud environments
- Skilled in scripting languages including Python and Bash
- Proficiency in programming languages like Golang
- Comprehensive knowledge of observability concepts such as counters, gauges, and alert design
- Understanding of PromQL and NRQL
- Familiar with distributed tracing
- Familiar with basic networking principles
- Exceptional problem-solving and analytical abilities
- Capable of working both independently and collaboratively
- Strong teamwork skills
- Experienced in continuous integration and continuous deployment (CI/CD) practices
- Knowledgeable in version control systems like Git
- Experienced with containerization and orchestration technologies (e.g., Docker, Kubernetes)
- Acquainted with infrastructure as code tools (e.g., Terraform, Ansible)
- Bachelor’s degree in Computer Science, Engineering, or a related field
- Fluent English communication skills at a B2+ level
- Understanding of OTEL
- Background in the gaming industry
- Advanced cloud platform certifications (AWS, Azure, GCP)
DevOps
GitHub Actions, Grafana, Istio
40 hrs/week
12+ months
- Develop Terraform and Terragrunt configurations for infrastructure as code
- Create and manage GitHub Actions workflows for CI/CD pipelines
- Troubleshoot data access permission issues in AWS S3 and AWS IAM
- Troubleshoot Kubeflow ML pipeline issues related to CPU, memory, GPU, and permissions
- Develop scripts using Python for platform automation tasks
- Collaborate with the team to enhance the reliability and efficiency of the CVML platform
- Participate in architecture and design discussions for system improvements
- Stay updated with the latest DevOps tools and practices for continuous improvement
- Minimum of 3 years of practical experience in DevOps roles
- In-depth knowledge of Kubernetes and its ecosystem, particularly AWS EKS and KubeSpray
- Proficiency in Terraform and Terragrunt for infrastructure as code
- Experience in using Prometheus and Grafana for monitoring and observability
- Solid understanding of Istio for service mesh and its basic components, such as sidecars, mTLS, and ingress gateway
- Proficiency in Python for scripting and automation tasks
- Hands-on experience with GitHub and GitHub Actions for CI/CD pipelines
- Strong understanding of AWS services, including networking, load balancing, and IAM
- Excellent troubleshooting skills for data access permission issues in AWS S3 and AWS IAM
- Ability to develop and troubleshoot Kubeflow ML pipeline issues
- Familiarity with distributed tracing tools such as Zipkin and Istio
- Knowledge of Golang, Kubeflow, and Pulumi
DevOps
Amazon Web Services, Git, Kubernetes
40 hrs/week
12+ months
- Provide engineering assistance for our Gaming Client's observability infrastructure used by online services
- Work independently from open-ended objectives
- Communicate effectively in English with team members and stakeholders
- Support, maintain, and improve existing infrastructure
- Manage the integration of third-party systems
- Participate in a rotation to ensure systems are continuously operational
- Respond to on-call shifts as required, with additional compensation
- Collaborate with product engineering teams to enhance service observability
- Focus on best practices for aggregating logs and observability metrics
- Contribute to the development of new products and the improvement of existing systems
- 3+ years of relevant experience in a Systems Engineer, DevOps Engineer, or Cloud Engineer role
- Strong experience with cloud environments
- Proficient in scripting languages such as Python and Bash
- Experience with programming languages like Golang
- In-depth understanding of observability concepts including counters, gauges, and alert design
- Knowledge of PromQL and NRQL
- Experience with distributed tracing
- Familiarity with basic networking principles
- Excellent problem-solving and analytical skills
- Ability to work independently and in a team environment
- Strong communication and collaboration skills
- Experience with continuous integration and continuous deployment (CI/CD) practices
- Understanding of version control systems like Git
- Experience with containerization and orchestration technologies (e.g., Docker, Kubernetes)
- Familiarity with infrastructure as code tools (e.g., Terraform, Ansible)
- Bachelor’s degree in Computer Science, Engineering, or a related field
- Knowledge of OTEL
- Previous experience in the gaming industry
- Advanced certifications in cloud platforms (AWS, Azure, GCP)
Business Analysis
BA Stakeholders management, Microsoft Azure, Requirements and Change management
40 hrs/week
12+ months
- Collaborate with both onshore and offshore teams
- Elicit requirements from stakeholders and analyze the current state of systems and business processes
- Design and analyze potential solutions, and help drive the client and team towards defining clear requirements and conceptual solutions
- Summarize and present analysis results to stakeholders
- Write user stories in Jira/Confluence and support development phases and UAT
- Coordinate between onshore and offshore teams
- Manage project scope and user requirements to ensure timely delivery of the project
- 3+ years of experience in Business Analysis
- Strong skills in stakeholder management
- Experience in requirements and change management
- Proficient in Microsoft Azure
- Proficient in Azure Data Factory
- Excellent written and verbal communication skills
- Ability to work collaboratively in a distributed team environment
- Previous experience working directly with clients
- Strong analytical and problem-solving skills
- Ability to summarize and present information effectively
- C1 level proficiency in English
- Additional cloud technology experience
- Experience in Agile methodologies
- Advanced certification in business analysis
Business Analysis
BA Stakeholders management, Microsoft Azure, Requirements and Change management
40 hrs/week
12+ months
- Work in collaboration with onshore and offshore teams
- Derive requirements from stakeholders and evaluate current systems and business processes
- Develop and scrutinize possible solutions, facilitating clients and teams in establishing clear requirements and conceptual solutions
- Present summarized analysis outcomes to stakeholders
- Craft user stories in Jira/Confluence and facilitate development phases and UAT
- Liaise between onshore and offshore teams
- Oversee project scope and user requirements to secure project delivery within timelines
- Minimum of 5 years’ experience in Business Analysis
- Over 1 year of relevant leadership experience
- Proficiency in stakeholder management
- Expertise in requirements and change management
- Skilled in Microsoft Azure and Azure Data Factory
- Exceptional written and verbal communication abilities
- Capability to collaborate in a distributed team setting
- Direct client interaction experience
- Strong analytical and problem-solving capabilities
- Ability to effectively summarize and present information
- English proficiency at C1 level
- Experience with additional cloud technologies
- Familiarity with Agile methodologies
- Advanced business analysis certifications
Java
Amazon Web Services, Apache Lucene Core, Search Engines
40 hrs/week
12+ months
- Lead the design and implementation of scalable search solutions using Java and search engine technologies
- Optimize search functionalities to improve relevance, accuracy, and performance
- Utilize Generative AI to enhance search relevance and user experience
- Collaborate with cross-functional teams to integrate search solutions into existing systems
- Mentor and guide a team of search engineers in developing innovative search features
- Conduct performance analysis and optimization of search engine operations
- Research and implement advanced search techniques to improve user search experience
- Ensure the scalability and efficiency of search operations on Amazon Web Services
- Drive the adoption of best practices in search engine design and implementation
- Regularly communicate and collaborate with stakeholders to understand and address search requirements
- Minimum of 7 years of experience in Java development and search engine technologies
- 2+ years of relevant leadership experience
- Hands-on experience with Apache Lucene Core, Solr, or similar search engines
- Proven expertise in designing and implementing search solutions on Amazon Web Services (AWS)
- Strong understanding of search engine optimization and performance tuning
- Capability to conceptualize and implement advanced search functionalities
- In-depth knowledge of search indexing and retrieval processes
- Excellent verbal and written communication skills in English at a B2+ level
- Experience with Elasticsearch for advanced search functionalities
- Knowledge of Generative AI for natural language processing and search relevance enhancement
- Experience with OpenSearch for scalable and efficient search operations
Java
Amazon Web Services, Apache Lucene Core, Search Engines
40 hrs/week
12+ months
- Lead the design and implementation of scalable search solutions using Java and search engine technologies
- Optimize search functionalities to improve relevance, accuracy, and performance
- Mentor and guide a team of engineers in developing innovative search features
- Collaborate with cross-functional teams to integrate search solutions into existing systems
- Conduct performance analysis and optimization of search engine operations
- Research and implement advanced search techniques to improve user search experience
- Ensure the scalability and efficiency of search operations on Amazon Web Services
- Regularly communicate and collaborate with stakeholders to understand and address search requirements
- Utilize Generative AI to enhance search relevance and user experience
- 5+ years of experience in Java development and search engine technologies
- 1+ year of relevant leadership experience
- Experience with Apache Lucene Core, Solr, or similar search engines
- Proven expertise in designing and implementing search solutions on Amazon Web Services (AWS)
- Capability to conceptualize and implement advanced search functionalities
- Strong understanding of search engine optimization and performance tuning
- In-depth knowledge of search indexing and retrieval processes
- Upper-Intermediate English proficiency
- Knowledge of Generative AI for natural language processing and search relevance enhancement
- Experience with Elasticsearch for advanced search functionalities
- Proficiency with OpenSearch for scalable and efficient search operations
Java
Amazon Web Services, Apache Lucene Core, Search Engines
40 hrs/week
12+ months
- Participate in the design and implementation of scalable search solutions using Java and search engine technologies
- Collaborate with cross-functional teams to integrate search solutions into existing systems
- Research and implement advanced search techniques to improve user search experience
- Conduct performance analysis and optimization of search engine operations
- Ensure the scalability and efficiency of search operations on Amazon Web Services
- Communicate and collaborate with stakeholders to understand and address search requirements
- Optimize search functionalities to improve relevance, accuracy, and performance
- At least 3 years of experience in Java development and search engine technologies
- Proficiency with Apache Lucene Core, Solr, or similar search engines
- Proven expertise in designing and implementing search solutions on Amazon Web Services
- Ability to conceptualize and implement advanced search functionalities
- Strong understanding of search engine optimization and performance tuning
- Deep knowledge of search indexing and retrieval processes
- English proficiency at a B2+ level
- Knowledge of Generative AI for natural language processing and search relevance enhancement
- Proficiency with OpenSearch for scalable and efficient search operations
- Experience with Elasticsearch for advanced search functionalities
Functional Testing
Testing Fundamentals, Financial Services
40 hrs/week
12+ months
- Overseeing the creation and maintenance of test plans, test cases, and test scripts
- Collaboration with interdisciplinary teams to gather requirements and ensure thorough test evaluations
- Leading functional testing efforts, analyzing test results, and pinpointing issues
- Providing mentorship to junior testing engineers and advising on best testing practices
- Management of test environments, test data, and testing tools essential for functional testing
- At least 2 years' experience in QA
- Deep knowledge of testing principles and functional testing methods
- Proficiency in test management tools and defect tracking systems
- Experience with automated testing tools such as Selenium, TestComplete, or UFT
- Familiarity with software development methodologies including Agile and Waterfall
- Understanding of continuous integration and continuous deployment processes
- Ability to craft detailed test cases from system requirements
- Prior experience in software development, quality assurance, or related roles
- Background in the Financial Services sector